CN106095915A - The processing method and processing device of user identity - Google Patents

The processing method and processing device of user identity Download PDF

Info

Publication number
CN106095915A
CN106095915A CN201610405268.2A CN201610405268A CN106095915A CN 106095915 A CN106095915 A CN 106095915A CN 201610405268 A CN201610405268 A CN 201610405268A CN 106095915 A CN106095915 A CN 106095915A
Authority
CN
China
Prior art keywords
identity
user
behavioral data
setting
characteristic vector
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201610405268.2A
Other languages
Chinese (zh)
Inventor
王超
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Baidu Netcom Science and Technology Co Ltd
Original Assignee
Beijing Baidu Netcom Science and Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Baidu Netcom Science and Technology Co Ltd filed Critical Beijing Baidu Netcom Science and Technology Co Ltd
Priority to CN201610405268.2A priority Critical patent/CN106095915A/en
Publication of CN106095915A publication Critical patent/CN106095915A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/951Indexing; Web crawling techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F21/00Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
    • G06F21/30Authentication, i.e. establishing the identity or authorisation of security principals
    • G06F21/31User authentication

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Computer Security & Cryptography (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Databases & Information Systems (AREA)
  • Computer Hardware Design (AREA)
  • Software Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The present invention provides the processing method and processing device of a kind of user identity.Its method includes: obtain the behavioral data of pending user;According to identity converter, the behavioral data of described pending user being converted into identity characteristic vector, described identity characteristic vector is for unique behavioral data identifying described pending user;According to identity processor, the identity characteristic vector of described pending user is processed, whether be to set identity to determine the identity of described pending user.By using the technique scheme of the present invention, the behavioral data according to pending user can be realized the identity of pending user is determined, compared with prior art, can effectively ensure the accuracy of the determination of user identity, and it is effectively improved the determination efficiency of user identity, carry out the information pushing relevant to this user identity according to the user identity determined so that follow-up, and then be effectively improved the efficiency of information pushing.

Description

The processing method and processing device of user identity
[technical field]
The present invention relates to big technical field of data processing, particularly relate to the processing method and processing device of a kind of user identity.
[background technology]
Along with popularizing that scientific and technological fast development, mobile terminal use, the various application along with mobile terminal the most more come The most, greatly facilitate the life of people.
Various application service providers, in order to more efficiently promote various service related information to user, need first to determine user Identity, the most just can ensure that the accuracy of pushed information, to improve pushing efficiency.In prior art, user identity is really Determining process relatively complicated, such as a lot of application service providers use the mode of excitation user to allow user at some application of registration or clothes The when of business, actively provide identity information.But the mode of this user identity still cannot ensure the user identity got Accuracy.
Therefore, existing technique scheme determines the inefficient of user identity.
[summary of the invention]
The invention provides the processing method and processing device of a kind of user identity, to improve the determination efficiency of user identity.
The present invention provides the processing method of a kind of user identity, described method to include:
Obtain the behavioral data of pending user;
According to identity converter, the behavioral data of described pending user being converted into identity characteristic vector, described identity is special Levy vector for unique behavioral data identifying described pending user;
According to identity processor, the identity characteristic vector of described pending user is processed, described pending to determine Whether the identity of user is to set identity.
The present invention also provides for the processing means of a kind of user identity, and described device includes:
Acquisition module, for obtaining the behavioral data of pending user;
Conversion module, for according to identity converter the behavioral data of described pending user is converted into identity characteristic to Amount, described identity characteristic vector is for unique behavioral data identifying described pending user;
Processing module, for the identity characteristic vector of described pending user being processed according to identity processor, with Whether the identity determining described pending user is to set identity.
The processing method and processing device of the user identity of the present invention, by using technique scheme, it is achieved according to pending The identity of pending user is determined by the behavioral data of user, compared with prior art, can effectively ensure user's body The accuracy of the determination of part, and it is effectively improved the determination efficiency of user identity, in order to follow-up according to the user identity determined Carry out the information pushing relevant to this user identity, and then be effectively improved the efficiency of information pushing.
[accompanying drawing explanation]
Fig. 1 is the flow chart of the processing method embodiment one of the user identity of the present invention.
Fig. 2 is the flow chart of the processing method embodiment two of the user identity of the present invention.
Fig. 3 is the structure chart of the processing means embodiment one of the user identity of the present invention.
Fig. 4 is the structure chart of the processing means embodiment two of the user identity of the present invention.
[detailed description of the invention]
In order to make the object, technical solutions and advantages of the present invention clearer, below in conjunction with the accompanying drawings with specific embodiment pair The present invention is described in detail.
Fig. 1 is the flow chart of the processing method embodiment one of the user identity of the present invention.As it is shown in figure 1, the present embodiment The processing method of user identity, specifically may include steps of:
100, the behavioral data of pending user is obtained;
The behavioral data of the pending user of the present embodiment can be that the historical behavior to this pending user is acquired Obtaining, the behavioral data of pending user is specifically as follows related data, can include a lot in this data Field.In actual application, the when of gathering the behavioral data of this pending user, can carry out targetedly in conjunction with setting identity Acquisition.Such as set identity as car owner, when obtaining the behavioral data of this pending user, can tend to obtain this and wait to locate The behavioral data relevant to car owner of reason user.Such as in these behavioral datas can include a period of time of this pending user Map retrieval number of times and use natural law, essential information point (point of interest;Poi) retrieval number of times and natural law, public transport Route retrieval, driving route retrieval, Walking Route inspection and subway line retrieval number of times, retrieval number of times accounting and natural law, user ground The retrieval number of times of figure use time cumulation duration, user to use each term that map natural law is relevant to car, retrieve number of times with Accounting in car coordinate indexing total degree and natural law etc..The most each poi can comprise four directions surface information: title, classification, warp Information such as degree latitude and neighbouring retail shop of restaurant of hotel etc., this poi information can also be referred to as " navigation map information ".Wherein with The term that car is relevant comprises: gas station, automobile services, sale of automobile, 4S shop, car detailing, Motor Maintenance, automobile decoration, Carwash, auto parts machinery, Vehicle Inspection field etc..
101, according to identity converter, the behavioral data of pending user is converted into identity characteristic vector;
Owing to the behavioral data of pending user cannot be processed by follow-up identity processor, the present embodiment needs First converting the behavioral data of this pending user, the identity characteristic vector after conversion identifies pending user for unique Behavioral data, the identity characteristic vector of such as the present embodiment can use numeral 0 and 1 constitute.
102, according to identity processor, the identity characteristic vector of pending user is processed, to determine pending user Identity be whether to set identity.
The identity processor of the present embodiment can process with the identity characteristic vector of pending user, it is achieved to pending Whether the identity of user is identified, be to set identity to determine the identity of pending user.The setting identity of the present embodiment can Think car owner, or can also be other such as drawing practitioner or certain art speciality practitioner or other machinery or Practitioner of Engineering etc..
The processing method of the user identity of the present embodiment, by using technique scheme, it is achieved according to pending user Behavioral data the identity of pending user is determined, compared with prior art, can effectively ensure user identity The accuracy determined, and be effectively improved the determination efficiency of user identity, in order to follow-up carry out according to the user identity determined The information pushing relevant to this user identity, and then it is effectively improved the efficiency of information pushing.
Still optionally further, on the basis of the technical scheme shown in above-mentioned Fig. 1, before step 101, it is also possible to include as Lower step a: according to the first behavior data set and the second behavioral data collection of non-setting identity user of setting identity user, generate Identity converter.
Specifically, the first behavior data set of the setting identity user of the present embodiment and second row of non-setting identity user Can also collect in advance for data set, such as, can obtain from each user access record on the internet.First Behavioral data concentrates the behavioral data including multiple setting identity user, permissible in the behavioral data of each setting identity user Data including multiple fields.The data of the multiple fields in the behavioral data of each setting identity user are relevant as one Connection data store, so, in the first behavior data set the behavioral data of the user of multiple setting identity be exactly according to Family stores one by one.In like manner the second behavioral data concentrates the behavioral data including multiple non-setting identity user, its storage Mode is with reference to the behavioral data storage mode in above-mentioned first behavior data set, the behavioral data of the user of multiple non-setting identity Also it is to store one by one according to user.The first behavior data set according to setting identity user sets the row of identity user For the behavioral data of the non-setting identity user that the second behavioral data of data and non-setting identity user is concentrated, genus can be found out In setting some general character of behavioral data of identity user, and it is not belonging to the individual character of specific identity user, as such, it is possible to according to First behavior data set and the second behavioral data collection, generate identity converter.This identity converter is possible not only to according to behavior number It is identified according to the identity of pending user, identity characteristic vector can also be generated according to recognition result simultaneously.
The quantity of the user of the setting identity that the first behavior data set includes and the second behavioral data collection in the present embodiment The quantity of the user of the non-setting identity included can be identical, it is also possible to differs;Quantity specifically can be 20,500 Or any number such as 800;And the quantity of the user of setting identity that includes of the first behavior data set and the second behavior number According to concentrating the quantity of the user of non-setting identity included, the identity converter obtained and identity processor carry out identity convert and The accuracy that identity processes is the highest.
The setting identity of such as the present embodiment can be car owner, and non-setting identity is non-car owner.According to car owner first All behavioral datas that second behavioral data of behavioral data collection and non-car owner is concentrated, can train an identity converter, should Identity converter is capable of the behavioral data of user being converted into a corresponding identity characteristic vector, to simplify behavioral data Representation, facilitate the follow-up identification carrying out user identity according to identity characteristic vector.
Such as this step a is specifically as follows: use each in the first behavior data set to set the behavior of identity user The behavioral data of each non-setting identity user that data and the second behavioral data are concentrated, the identity that training is preset converts mould Type, obtains identity converter.
The identity transformation model of the present embodiment specifically can use random forest (random forest) or gradient to promote Decision tree (Gradient Boost Decision Tree;GBDT) realize.
For example, it is possible to the behavioral data of the user of the setting identity that the first behavior data set is included and the second behavior number According to the behavioral data of the user concentrating the non-setting identity included, it is referred to as training data.Then these training datas are utilized to instruct Practicing random forest or GBDT model, can obtain N decision tree respectively, wherein a tree N of decision tree is by training process The decision tree number state modulator of middle configuration, can configure according to the actual requirements before training.Every in N decision tree One tree leaf node be all numbered, the m-th leaf node of such as n-th decision tree be (n, m).To every training data, It is the probability setting identity user that every tree all can have a leaf node to export this user, if probability is more than specifying threshold value, and can With by result queue for 1, no person is 0.And owing to the identity of the user of every training data determines that, when output result with When the identity of the user known is inconsistent, random forest or the GBDT model of correspondence can be adjusted, to adjust final acquisition N decision tree, improve N the decision tree accuracy to user identity identification.All known by use in training data Random forest or GBDT model are trained by the behavioral data of user identity, finally give N decision tree and are The identity converter obtained eventually, the identity of user can be identified according to behavioral data, the most also may be used by this identity converter To generate only one identity characteristic vector according to recognition result.Such as, for the behavioral data of each user, this N tree Leaf node can export one N*M vector of acquisition, wherein M is the leaf node of the most decision tree of leaf node number Number.If n-th tree m-th leaf node discriminative training data be labeled as 1, then ((n-1) * M+m) bit element is 1, No person is 0, and the element not exporting the leaf node of result corresponding is 0.If the leaf node number of certain decision tree is less than M, exceed Leaf node number element below is with 0 leaf node supplying disappearance.Due to each behavioral data pair in training data The identity of the user answered all determines that, so, according to the behavioral data of the user that these identity determine, can train this N certainly Plan tree is capable of identify that the ability of user identity, and simultaneously for the behavioral data of each user, this N decision tree can be corresponding defeated Go out a corresponding identity characteristic vector.N decision tree parameter after training all secures, and can accurately realize pending The identification of user identity, now this N decision tree can form an identity converter.
Still optionally further, on the basis of the technical scheme of above-mentioned embodiment illustrated in fig. 1, before step 102, also may be used To include step b: according to the first behavior data set, the second behavioral data collection and identity converter, generate identity processor.
Such as, following two steps can be included when this step b implements:
(b1) each in the first behavior data set is set behavioral data and the second behavioral data collection of identity user In the behavioral data of each non-setting identity user, be separately input in identity converter, obtain the first behavior data set In each set identity user identity characteristic vector sum second behavioral data concentrate each non-setting identity user Identity characteristic vector;
(b2) each in the first behavior data set is used to set identity characteristic vector sum second behavior of identity user The identity characteristic vector of the non-setting identity user of each in data set, the identity that training is preset processes model, obtains identity Processor.
Owing to identity converter can realize the behavioral data conversion to corresponding identity characteristic vector of user, this enforcement In example after generating identity converter, each in the first behavior data set can be set the behavioral data of identity user With the behavioral data of each non-setting identity user that the second behavioral data is concentrated, respectively by identity converter, it is converted into Characteristic vector, as such, it is possible to obtain first eigenvector set corresponding in the first behavior data set and the second behavioral data collection Corresponding second feature vector set.Then the characteristic vector in first eigenvector set and second feature vector set is divided Not as training data, the identity that training is preset processes model, to obtain identity processor.Such as, this identity preset processes Model is specifically as follows Logic Regression Models.Owing to the characteristic vector in first eigenvector set is the feature of setting user Vector, so, inputs the characteristic vector in first eigenvector set and processes in model to the identity preset, can calculate Whether user corresponding to this feature vector is the probability of the user presetting identity, if probability is more than or equal to predetermined probabilities threshold Value, it may be determined that this user is the user presetting identity;Probability is less than predetermined probabilities threshold value else if, then explanation needs to adjust This identity preset processes model so that the probit of calculating is more than this predetermined probabilities threshold value.By using first eigenvector All characteristic vectors in set and second feature vector set, constantly train this identity preset to process model, the most constantly Ground adjusts this identity preset and processes model, obtains this identity processor.
For example with GBDT model training be decision tree number be set to 3 tree time, the decision tree at training is the most permissible There are 4 leaf nodes.For the behavioral data of certain user a, leaf node (1,1), (2,2), the differentiation result that (3,4) are given is Be respectively 1,0,1, the identity characteristic vector of the 0-1 attribute that the behavioral data of the most pending user is corresponding be (1,0,0,0,0,0, 0,0,0,0,0,1), the thick differentiation end value corresponding for providing the leaf node differentiating result is marked.
Still optionally further, on the basis of the technical scheme of above-mentioned embodiment illustrated in fig. 1, step 102 specifically can be wrapped Include following steps:
(c1) the pending user corresponding to identity characteristic vector of pending user is calculated according to identity processor for setting The probit of the user of identity;
(c2) judge that whether probit is more than or equal to predetermined probabilities threshold value;If so, determine that pending user is for setting The user of identity;That otherwise determine this pending user is not the user setting identity.
The probit of the present embodiment can the most rule of thumb be chosen.
The technical scheme of above-described embodiment, compared with prior art, can ensure the standard of the determination of user identity effectively Really property, and be effectively improved the determination efficiency of user identity, in order to follow-up carry out and this user according to the user identity determined The information pushing that identity is relevant, and then it is effectively improved the efficiency of information pushing.
Fig. 2 is the flow chart of the processing method embodiment two of the user identity of the present invention.The present embodiment combines above-mentioned reality Executing the technical scheme of example, to set identity as car owner, the non-identity that sets, as a example by non-car owner, describes technical scheme. As in figure 2 it is shown, the processing method of the user identity of the present embodiment, specifically may include steps of:
200, the first behavior data set and the second behavioral data collection of non-car owner of car owner are gathered;
Such as, specifically the user data from the Internet can gather this first behavior data set and the second behavioral data Collection.Wherein the first behavior data set includes that the behavioral data of multiple car owner, the second behavioral data are concentrated and includes multiple non-car owner Behavioral data.
The behavioral data of the present embodiment includes the behavioral data of the multiple fields relevant to car, is referred to above-mentioned in detail The record of embodiment, does not repeats them here.
201, the behavioral data that the first behavior data set of car owner and second behavioral data of non-car owner are concentrated is utilized, right Random forest or GBDT model are trained, and N the decision tree obtained is as identity converter;
Concrete training principle is referred to the record of above-described embodiment, does not repeats them here.N the decision-making finally given The leaf node number setting the most decision tree of middle period son node number corresponding is M.
202, the behavioral data first behavior data set of car owner and second behavioral data of non-car owner concentrated, adopts respectively Change with identity converter, obtain the first identity characteristic vector set corresponding to the first behavior data set of car owner and non-car owner Second Identity of Local vector set corresponding to the second behavioral data collection;
Owing to each behavioral data can uniquely change a corresponding identity characteristic vector by identity transducer.So In the behavioral data of each car owner in the first behavior data set of car owner and the first identity characteristic vector of car owner one Identity characteristic vector one_to_one corresponding, the behavioral data of each non-car owner that second behavioral data of non-car owner is concentrated and non-car owner Second Identity of Local vector in an identity characteristic vector one_to_one corresponding.
203, the first identity characteristic vector set and second row of non-car owner that the first behavior data set of car owner is corresponding are utilized For the characteristic vector in the Second Identity of Local vector set that data set is corresponding, Logic Regression Models is trained, obtains identity Processor;
Concrete training principle is referred to the record of above-described embodiment, does not repeats them here.
204, the behavioral data of pending user is obtained;
205, the behavioral data of pending user is inputted to identity processor, calculate this pending user for car owner's Probit;
206, judge that whether probit is more than or equal to predetermined probabilities threshold value;If so, determine that pending user is car owner; Otherwise determine that this pending user is non-car owner.
The said method using the present embodiment can also realize the identification of the user to other default identity.
Along with the development of the Internet, going on a journey with car, carwash, the Internet service relevant to car such as Motor Maintenance is gradually by people Accept.In the internet, applications that these are relevant to car, car owner is the core of all services, therefore obtains car owner's body of the unknown Part becomes extremely important, especially during promoting service and ownership.Use the said method of the present embodiment, permissible Get whether pending user is car owner accurately and efficiently, in order to follow-up car owner's identity according to user carries out vehicle phase The information pushing closed, and then it is effectively improved the efficiency of information pushing.
Fig. 3 is the structure chart of the processing means embodiment one of the user identity of the present invention.As it is shown on figure 3, the present embodiment The processing means of user identity, specifically may include that acquisition module 10, conversion module 11 and processing module 12.
Wherein acquisition module 10 is for obtaining the behavioral data of pending user;Conversion module 11 is for converting according to identity The behavioral data of the pending user that acquisition module 10 is obtained by device converts identity characteristic vector, and identity characteristic vector is for unique Identify the behavioral data of pending user;Processing module 12 is pending for convert conversion module 11 according to identity processor Whether the identity characteristic vector of user processes, be to set identity to determine the identity of pending user.
The processing means of the user identity of the present embodiment, realizes the realization of the process of user identity by the above-mentioned module of employing Principle and technique effect, with above-mentioned related method embodiment realize identical, be referred to the record of above-described embodiment in detail, Do not repeat them here.
Fig. 4 is the structure chart of the processing means embodiment two of the user identity of the present invention.As shown in Figure 4, the present embodiment The processing means of user identity, on the basis of the technical scheme of above-mentioned embodiment illustrated in fig. 3, further comprises following technology Scheme.
As shown in Figure 4, the processing means of the user identity of the present embodiment also includes identity converter generation module 13.This body Part converter generation module 13 is for according to the first behavior data set and the second of non-setting identity user setting identity user Behavioral data collection, generates identity converter.
Still optionally further, the identity converter generation module 13 in the processing means of the user identity of the present embodiment is concrete Every for use each behavioral data setting identity user in the first behavior data set and the second behavioral data to concentrate The behavioral data of one non-setting identity user, the identity transformation model that training is preset, obtain identity converter.Wherein convert mould Block 11 is specifically used for according to the identity converter generated with identity converter generation module 13, by treating that acquisition module 10 obtains The behavioral data processing user converts identity characteristic vector.
Still optionally further, as shown in Figure 4, the processing means of the user identity of the present embodiment also includes identity processor Generation module 14.Wherein identity processor generation module 14 is for according to the first behavior data set, the second behavioral data collection and body The identity converter that part converter generation module 13 generates, generates identity processor.
Still optionally further, identity processor generation module 14 is specifically for setting each in the first behavior data set Determine the behavioral data of identity user and the behavioral data of each non-setting identity user of the second behavioral data concentration, the most defeated Enter to the identity converter of identity converter generation module 13 generation, obtain each in the first behavior data set and set body The identity characteristic vector of each non-setting identity user that identity characteristic vector sum second behavioral data of part user is concentrated;Adopt With in the first behavior data set each set identity user identity characteristic vector sum the second behavioral data concentrate each The identity characteristic vector of individual non-setting identity user, the identity that training is preset processes model, obtains identity processor.
The identity preset of such as the present embodiment processes model and includes Logic Regression Models.
Still optionally further, as shown in Figure 4, in the processing means of the user identity of the present embodiment, processing module 12 is concrete Including: computing unit 121 and judging unit 122.
Wherein computing unit 121 calculates for the identity processor generated according to identity processor generation module 14 and converts mould The probit of the user that pending user be setting identity that the identity characteristic vector of pending user that block 11 converts is corresponding;Sentence Whether the probit that disconnected unit 122 calculates for judging computing unit 121 is more than or equal to predetermined probabilities threshold value;The most true Fixed pending user is the user setting identity.
The processing means of the user identity of the present embodiment, realizes the realization of the process of user identity by the above-mentioned module of employing Principle and technique effect, with above-mentioned related method embodiment realize identical, be referred to the record of above-described embodiment in detail, Do not repeat them here.
In several embodiments provided by the present invention, it should be understood that disclosed system, apparatus and method are permissible Realize by another way.Such as, device embodiment described above is only schematically, such as, and described unit Dividing, be only a kind of logic function and divide, actual can have other dividing mode when realizing.
The described unit illustrated as separating component can be or may not be physically separate, shows as unit The parts shown can be or may not be physical location, i.e. may be located at a place, or can also be distributed to multiple On NE.Some or all of unit therein can be selected according to the actual needs to realize the mesh of the present embodiment scheme 's.
It addition, each functional unit in each embodiment of the present invention can be integrated in a processing unit, it is also possible to It is that unit is individually physically present, it is also possible to two or more unit are integrated in a unit.Above-mentioned integrated list Unit both can realize to use the form of hardware, it would however also be possible to employ hardware adds the form of SFU software functional unit and realizes.
The above-mentioned integrated unit realized with the form of SFU software functional unit, can be stored in an embodied on computer readable and deposit In storage media.Above-mentioned SFU software functional unit is stored in a storage medium, including some instructions with so that a computer Equipment (can be personal computer, server, or the network equipment etc.) or processor (processor) perform the present invention each The part steps of method described in embodiment.And aforesaid storage medium includes: USB flash disk, portable hard drive, read only memory (Read- Only Memory, ROM), random access memory (Random Access Memory, RAM), magnetic disc or CD etc. various The medium of program code can be stored.
The foregoing is only presently preferred embodiments of the present invention, not in order to limit the present invention, all essences in the present invention Within god and principle, any modification, equivalent substitution and improvement etc. done, within should be included in the scope of protection of the invention.

Claims (14)

1. the processing method of a user identity, it is characterised in that described method includes:
Obtain the behavioral data of pending user;
According to identity converter, the behavioral data of described pending user is converted into identity characteristic vector, described identity characteristic to Amount is for unique behavioral data identifying described pending user;
According to identity processor, the identity characteristic vector of described pending user is processed, to determine described pending user Identity be whether to set identity.
Method the most according to claim 1, it is characterised in that according to identity converter by the behavior of described pending user Before data are converted into identity characteristic vector, described method also includes:
The first behavior data set according to setting identity user and the second behavioral data collection of non-setting identity user, generate described Identity converter.
Method the most according to claim 2, it is characterised in that according to setting the first behavior data set of identity user and non- Set the second behavioral data collection of identity user, generate described identity converter, specifically include:
Use the behavioral data of the described setting identity user of each in described first behavior data set and described second behavior The behavioral data of the described non-setting identity user of each in data set, the identity transformation model that training is preset, obtain described Identity converter.
Method the most according to claim 2, it is characterised in that described identity characteristic vector is carried out according to identity processor Processing, before determining whether the identity of described pending user is setting identity, described method also includes:
According to described first behavior data set, described second behavioral data collection and described identity converter, generate at described identity Reason device.
Method the most according to claim 4, it is characterised in that according to described first behavior data set, described second behavior Data set and described identity converter, generate described identity processor, specifically include:
By behavioral data and the described second behavior number of described for each in described first behavior data set setting identity user According to the behavioral data of each the described non-setting identity user concentrated, it is separately input in described identity converter, obtains institute State the second behavioral data collection described in the identity characteristic vector sum of the described setting identity user of each in the first behavior data set In each described non-setting identity user identity characteristic vector;
Use described in the identity characteristic vector sum of the described setting identity user of each in described first behavior data set second The identity characteristic vector of each described non-setting identity user that behavioral data is concentrated, the identity that training is preset processes model, Obtain described identity processor.
Method the most according to claim 5, it is characterised in that described default identity processes model and includes logistic regression mould Type.
7. according to the arbitrary described method of claim 1-6, it is characterised in that described according to identity processor to described pending Whether the identity characteristic vector of user processes, be to set identity to determine the identity of described pending user, specifically include:
Described pending user corresponding to identity characteristic vector calculating described pending user according to described identity processor is The probit of the user of described setting identity;
Judge that whether described probit is more than or equal to predetermined probabilities threshold value;If so, determine that described pending user is for described Set the user of identity.
8. the processing means of a user identity, it is characterised in that described device includes:
Acquisition module, for obtaining the behavioral data of pending user;
Conversion module, for the behavioral data of described pending user being converted into identity characteristic vector according to identity converter, Described identity characteristic vector is for unique behavioral data identifying described pending user;
Processing module, for processing the identity characteristic vector of described pending user according to identity processor, to determine Whether the identity of described pending user is to set identity.
Device the most according to claim 8, it is characterised in that described device also includes:
Identity converter generation module, for according to setting the first behavior data set of identity user and non-setting identity user Second behavioral data collection, generates described identity converter.
Device the most according to claim 9, it is characterised in that described identity converter generation module, specifically for using Behavioral data and described second behavioral data of each the described setting identity user in described first behavior data set are concentrated The behavioral data of each described non-setting identity user, the identity transformation model that training is preset, obtain described identity and convert Device.
11. devices according to claim 9, it is characterised in that described device also includes:
Identity processor generation module, for according to described first behavior data set, described second behavioral data collection and described body Part converter, generates described identity processor.
12. devices according to claim 11, it is characterised in that described identity processor generation module, specifically for:
By behavioral data and the described second behavior number of described for each in described first behavior data set setting identity user According to the behavioral data of each the described non-setting identity user concentrated, it is separately input in described identity converter, obtains institute State the second behavioral data collection described in the identity characteristic vector sum of the described setting identity user of each in the first behavior data set In each described non-setting identity user identity characteristic vector;
Use described in the identity characteristic vector sum of the described setting identity user of each in described first behavior data set second The identity characteristic vector of each described non-setting identity user that behavioral data is concentrated, the identity that training is preset processes model, Obtain described identity processor.
13. devices according to claim 12, it is characterised in that described default identity processes model and includes logistic regression Model.
14.-13 arbitrary described devices according to Claim 8, it is characterised in that described processing module, specifically include:
Computing unit, for according to described identity processor calculate described pending user identity characteristic vector corresponding described in Pending user is the probit of the user of described setting identity;
Judging unit, is used for judging that whether described probit is more than or equal to predetermined probabilities threshold value;If so, wait to locate described in determining Reason user is the user of described setting identity.
CN201610405268.2A 2016-06-08 2016-06-08 The processing method and processing device of user identity Pending CN106095915A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610405268.2A CN106095915A (en) 2016-06-08 2016-06-08 The processing method and processing device of user identity

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610405268.2A CN106095915A (en) 2016-06-08 2016-06-08 The processing method and processing device of user identity

Publications (1)

Publication Number Publication Date
CN106095915A true CN106095915A (en) 2016-11-09

Family

ID=57227723

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610405268.2A Pending CN106095915A (en) 2016-06-08 2016-06-08 The processing method and processing device of user identity

Country Status (1)

Country Link
CN (1) CN106095915A (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107330445A (en) * 2017-05-31 2017-11-07 北京京东尚科信息技术有限公司 The Forecasting Methodology and device of user property
CN107688865A (en) * 2017-07-31 2018-02-13 上海恺英网络科技有限公司 Identify the method and apparatus of potential high consumption user in online game
CN111753266A (en) * 2019-03-29 2020-10-09 阿里巴巴集团控股有限公司 User authentication method, multimedia content pushing method and device
US10904724B2 (en) 2017-08-04 2021-01-26 Beijing Didi Infinity Technology And Development Co., Ltd. Methods and systems for naming a pick up location

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103530540A (en) * 2013-09-27 2014-01-22 西安交通大学 User identity attribute detection method based on man-machine interaction behavior characteristics
CN104462320A (en) * 2014-12-01 2015-03-25 中国联合网络通信集团有限公司 Method and device for realizing classification of network users
CN104778283A (en) * 2015-05-11 2015-07-15 苏州大学 User occupation classification method and system based on microblog
CN105095324A (en) * 2014-05-23 2015-11-25 富士通株式会社 User classification apparatus, user classification method and electronic device
CN105224681A (en) * 2015-10-23 2016-01-06 武汉大学 Based on the customer requirement retrieval method and system of family's place of working context environmental
CN105335390A (en) * 2014-07-09 2016-02-17 阿里巴巴集团控股有限公司 Object classification method, business pushing method and server
CN105378717A (en) * 2013-03-12 2016-03-02 国际商业机器公司 Method for user categorization in social media, computer program, and computer
CN105512938A (en) * 2016-02-03 2016-04-20 宜人恒业科技发展(北京)有限公司 Online credit risk assessment method based on long-term using behavior of user

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105378717A (en) * 2013-03-12 2016-03-02 国际商业机器公司 Method for user categorization in social media, computer program, and computer
CN103530540A (en) * 2013-09-27 2014-01-22 西安交通大学 User identity attribute detection method based on man-machine interaction behavior characteristics
CN105095324A (en) * 2014-05-23 2015-11-25 富士通株式会社 User classification apparatus, user classification method and electronic device
CN105335390A (en) * 2014-07-09 2016-02-17 阿里巴巴集团控股有限公司 Object classification method, business pushing method and server
CN104462320A (en) * 2014-12-01 2015-03-25 中国联合网络通信集团有限公司 Method and device for realizing classification of network users
CN104778283A (en) * 2015-05-11 2015-07-15 苏州大学 User occupation classification method and system based on microblog
CN105224681A (en) * 2015-10-23 2016-01-06 武汉大学 Based on the customer requirement retrieval method and system of family's place of working context environmental
CN105512938A (en) * 2016-02-03 2016-04-20 宜人恒业科技发展(北京)有限公司 Online credit risk assessment method based on long-term using behavior of user

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107330445A (en) * 2017-05-31 2017-11-07 北京京东尚科信息技术有限公司 The Forecasting Methodology and device of user property
CN107330445B (en) * 2017-05-31 2020-06-05 北京京东尚科信息技术有限公司 User attribute prediction method and device
CN107688865A (en) * 2017-07-31 2018-02-13 上海恺英网络科技有限公司 Identify the method and apparatus of potential high consumption user in online game
US10904724B2 (en) 2017-08-04 2021-01-26 Beijing Didi Infinity Technology And Development Co., Ltd. Methods and systems for naming a pick up location
CN111753266A (en) * 2019-03-29 2020-10-09 阿里巴巴集团控股有限公司 User authentication method, multimedia content pushing method and device

Similar Documents

Publication Publication Date Title
JP6568240B2 (en) Method and system for determining transportation service charges
CN107038478B (en) Road condition prediction method and device, computer equipment and readable medium
CN106095915A (en) The processing method and processing device of user identity
Wang et al. Detecting transportation modes based on LightGBM classifier from GPS trajectory data
CN108537326A (en) For the method for automatic driving vehicle, medium and system
US20190179312A1 (en) Method and Apparatus for Generating Information
CN107316048A (en) Point cloud classifications method and device
CN111442778A (en) Travel scheme recommendation method, device and equipment and computer readable storage medium
CN110443288B (en) Trajectory similarity calculation method based on sequencing learning
CN114450557B (en) Route deviation quantification and vehicle route learning based thereon
CN104599002B (en) Method and equipment for predicting order value
WO2019047645A1 (en) Information acquisition method and device
CN107341166A (en) A kind of vehicle-mounted APP recommendation method and system
CN106767835A (en) Localization method and device
CN111104585B (en) Question recommending method and device
CN110879960A (en) Method and computing device for generating image data set for convolutional neural network learning
CN111415024A (en) Arrival time estimation method and estimation device
CN115018039A (en) Neural network distillation method, target detection method and device
Yang et al. Feature selection in conditional random fields for map matching of GPS trajectories
Dalumpines et al. Making mode detection transferable: extracting activity and travel episodes from GPS data using the multinomial logit model and Python
CN112380918A (en) Road vehicle state identification method and device, electronic equipment and storage medium
CN112101156A (en) Target identification method and device and electronic equipment
CN113888867B (en) Parking space recommendation method and system based on LSTM (least squares) position prediction
WO2019049141A1 (en) A system and method for using knowledge gathered by a vehicle
CN106204404A (en) The method of adjustment of a kind of public transport transport power and system

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20161109