CN103383685A - Method and device for keyword attribute quantification based on user click data - Google Patents

Publication number
CN103383685A
Authority
CN
China
Prior art keywords
keyword
search results
score
link
quantification
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN 201210133409
Other languages
Chinese (zh)
Inventor
刘建柱
贾自艳
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen Shiji Guangsu Information Technology Co Ltd
Original Assignee
Tencent Technology Shenzhen Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

An embodiment of the invention discloses a method for keyword attribute quantification based on user click data. The method comprises: acquiring a set of candidate keywords for attribute quantification; determining a quantification scoring algorithm for the keywords and for their corresponding search result links, in which the scores of the search result links are an influencing factor of the keyword scores and the scores of the keywords are an influencing factor of the search result link scores, thereby creating an iterative quantification model for the keywords; performing quantification scoring of the keywords and their corresponding search result links according to the click data between them and the quantification model; and obtaining quantification data of the keyword attribute through iterative computation. The invention further discloses a device for keyword attribute quantification based on user click data. The method and the device yield more accurate quantification data for keyword attributes.

Description

Method and device for keyword attribute quantification based on user click data
Technical field
The present invention relates to the field of web search technology, and in particular to a method and device for keyword attribute quantification based on user click data.
Background
Returning search results for the keywords a user enters is the primary function of a web search engine, and ranking those results sensibly depends on quantification data for the searched keywords. A typical approach is to score a particular attribute of the searched keyword and rank on that attribute. For example, scoring the commerciality of a keyword enables commercial relevance ranking of search results, advertising support, identification of cheating sites, and adjustment of the ranking of commercially relevant displays.
Existing keyword attribute quantification methods mainly match keywords against a set of string rules and score them by degree of match, thereby obtaining quantification data for a given attribute of the keyword, which is then used for ranking.
However, because existing methods do not analyze user search behavior, they discriminate poorly between keywords on the quantified attribute. They also recognize poorly query strings that are complexly worded, long, or not covered by the keyword rules, and the resulting quantification data are not readily comparable across keywords. As a result, they are of little help to downstream applications such as ranking and do not achieve a good quantification effect.
Summary of the invention
In view of this, the present invention provides a method and device for keyword attribute quantification based on user click data, which can obtain more accurate quantification data for keyword attributes.
To achieve the above object, the technical solution of the present invention is implemented as follows:
A method for keyword attribute quantification based on user click data, the method comprising:
Acquiring a set of candidate keywords for attribute quantification;
Determining a quantification scoring algorithm for the keywords and for their corresponding search result links respectively, taking the scores of the search result links as an influencing factor of the quantification scoring result of the keywords, and taking the scores of the keywords as an influencing factor of the quantification scoring result of the search result links, thereby establishing an iterative quantification model for the keywords;
Performing quantification scoring of the keywords and their corresponding search result links according to the click data between the keywords and the corresponding search result links and the quantification model, and then obtaining the quantification data of the keyword attribute through iterative computation.
Preferably, the attribute is commerciality (business degree).
Preferably, acquiring the set of candidate keywords for attribute quantification comprises:
Crawling e-commerce site titles and product vertical-channel keywords as candidate keywords;
Selecting, from the candidate keywords, the N keywords with the highest frequency of occurrence as the candidate keywords for commerciality quantification;
Where N is a positive integer.
Preferably, the iterative quantification model comprises:
$T(x) = \sum_i (x \to y_i)$
$T(x)$ denotes the total number of user clicks related to $x$, where $x$ may be a keyword or a corresponding search result link;
When $x$ is a keyword, $x \to y_i$ denotes the number of clicks on a particular link $y_i$ among the search result links returned when users search for keyword $x$; $T(x)$ is then the total number of clicks on all search result links clicked when users search for keyword $x$;
When $x$ is a search result link, $x \to y_i$ denotes the number of clicks on that link by users who reached it by searching keyword $y_i$; $T(x)$ is then the total number of clicks on that same link by users who reached it through all the different keywords;
$\mathrm{score}(q) = (1-\gamma)\,\mathrm{score}_0(q) + \gamma \sum_i \frac{\mathrm{click}(u_i \to q)\,\mathrm{score}(u_i)}{T(q)}$, which scores a keyword;
$\mathrm{score}(u) = (1-\gamma)\,\mathrm{score}_0(u) + \gamma \sum_i \frac{\mathrm{click}(q_i \to u)\,\mathrm{score}(q_i)}{T(u)}$, which scores a search result link;
$\mathrm{score}(q)$ denotes the quantification score of a keyword and $\mathrm{score}(u)$ the quantification score of a search result link; $\mathrm{click}(u_i \to q)$ denotes the number of clicks associating search result link $u_i$ with keyword $q$, and $\mathrm{click}(q_i \to u)$ the number of clicks associating keyword $q_i$ with search result link $u$; γ denotes the weight-adjustment coefficient, and $\mathrm{score}_0$ the initial commerciality score.
Preferably, the click data between the keywords and the corresponding links comprise:
The number of clicks on a particular link among the search result links returned when users search for a keyword; the total number of clicks on the same search result link reached by users through all the different keywords; and the total number of clicks on all search result links clicked when users search for a keyword.
A device for keyword attribute quantification based on user click data, the device comprising:
A selection module, configured to acquire a set of candidate keywords for attribute quantification;
A quantification model building module, configured to determine a quantification scoring algorithm for the keywords and for their corresponding search result links respectively, taking the scores of the search result links as an influencing factor of the quantification scoring result of the keywords, and taking the scores of the keywords as an influencing factor of the quantification scoring result of the search result links, thereby establishing an iterative quantification model for the keywords;
A quantification computing module, configured to perform quantification scoring of the keywords and their corresponding search result links according to the click data between the keywords and the corresponding search result links and the quantification model, and then to obtain the quantification data of the keyword attribute through iterative computation.
Preferably, the selection module comprises:
A crawling unit, configured to crawl e-commerce site titles and product vertical-channel keywords as candidate keywords;
A selection unit, configured to select, from the candidate keywords, the N keywords with the highest frequency of occurrence as the seed keywords for commerciality quantification, where N is a positive integer.
Preferably, the quantification model built by the quantification model building module comprises:
$T(x) = \sum_i (x \to y_i)$
$T(x)$ denotes the total number of user clicks related to $x$, where $x$ may be a keyword or a corresponding search result link;
When $x$ is a keyword, $x \to y_i$ denotes the number of clicks on a particular link $y_i$ among the search result links returned when users search for keyword $x$; $T(x)$ is then the total number of clicks on all search result links clicked when users search for keyword $x$;
When $x$ is a search result link, $x \to y_i$ denotes the number of clicks on that link by users who reached it by searching keyword $y_i$; $T(x)$ is then the total number of clicks on that same link by users who reached it through all the different keywords;
$\mathrm{score}(q) = (1-\gamma)\,\mathrm{score}_0(q) + \gamma \sum_i \frac{\mathrm{click}(u_i \to q)\,\mathrm{score}(u_i)}{T(q)}$, which scores a keyword;
$\mathrm{score}(u) = (1-\gamma)\,\mathrm{score}_0(u) + \gamma \sum_i \frac{\mathrm{click}(q_i \to u)\,\mathrm{score}(q_i)}{T(u)}$, which scores a search result link;
$\mathrm{score}(q)$ denotes the quantification score of a keyword and $\mathrm{score}(u)$ the quantification score of a search result link; $\mathrm{click}(u_i \to q)$ denotes the number of clicks associating search result link $u_i$ with keyword $q$, and $\mathrm{click}(q_i \to u)$ the number of clicks associating keyword $q_i$ with search result link $u$; γ denotes the weight-adjustment coefficient, and $\mathrm{score}_0$ the initial commerciality score.
Preferably, the quantification computing module comprises:
A data acquisition unit, configured to obtain the number of clicks on a particular link among the search result links returned when users search for a keyword, the total number of clicks on the same search result link reached by users through all the different keywords, and the total number of clicks on all search result links clicked when users search for a keyword;
A computing unit, configured to use the quantification model built by the quantification model building module to perform quantification scoring of the keywords and their corresponding links, and then to obtain the quantification data of the keyword attribute through iterative computation.
As can be seen from the above technical solution, the method and device for keyword attribute quantification based on user click data of the present invention incorporate an analysis of user search behavior by introducing user click data and the relationship between keywords and links into the quantification model. This markedly improves discrimination between keywords on the quantified attribute, improves recognition of complexly worded strings, long strings and query strings that the keyword rules do not cover, and improves the comparability of the quantification data. It greatly helps downstream applications such as ranking and achieves a good quantification effect.
Description of drawings
Fig. 1 is a flowchart of the quantification method of an embodiment of the present invention;
Fig. 2 is a schematic diagram of the iterative computation principle of an embodiment of the present invention;
Fig. 3 is a structural diagram of the quantification device of an embodiment of the present invention;
Fig. 4 is a structural diagram of the selection module of an embodiment of the present invention;
Fig. 5 is a structural diagram of the quantification computing module of an embodiment of the present invention.
Embodiments
To make the objects, technical solutions and advantages of the present invention clearer, the invention is described in further detail below with reference to the accompanying drawings and embodiments.
The core idea of the present invention is as follows. On the Internet, if a web page is pointed to by links from many other web pages, the content it describes is generally recognized and trusted, carries higher authority, and should rank higher. We model user search behavior on this idea. The keyword a user searches carries the user's need, and that need shows up directly in the search engine: the user tends to click the search result links that match the need behind the keyword. Taking commercial need as an example, if a searched keyword carries commercial need, the search result links users tend to click after searching it have some commerciality; conversely, if the search result links users tend to click after searching a keyword all have some commerciality, it can be inferred that the keyword itself also has some commerciality.
Based on this idea, and taking commerciality (business degree) as an example, the following describes how the present invention iteratively mines the set of keywords with commercial need and the set of links with commercial need, builds a commerciality quantification model for keywords, and finally measures the commerciality of keywords under a unified framework using that model and user click data. Other keyword attributes can be quantified by analogy with the commerciality method, so their quantification is not described separately here.
Fig. 1 is a flowchart of the keyword commerciality quantification method of the present invention. As shown in Fig. 1, it comprises the following steps:
Step 101: obtain the seed keywords for quantification.
Seed keywords for quantification are the set of candidate keywords used to quantify a given attribute. Taking commerciality as an example, they can be obtained from the keyword strings in the titles of e-commerce web pages or in product-search vertical channels, from strings with commercial characteristics on other commercial web pages, or by manual specification.
For seed keywords obtained in these ways, such as the e-commerce page titles or product-search vertical-channel keywords above, the N keywords with the highest frequency of occurrence can further be extracted as the final seed keywords, to reduce the subsequent quantification workload.
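The top-N selection in step 101 can be sketched as follows. This is a minimal illustration, not the patent's implementation: the function name `select_seed_keywords`, the sample keywords, and the lowercasing and whitespace normalization are all assumptions introduced here.

```python
from collections import Counter


def select_seed_keywords(candidates, n):
    """Return the n most frequent candidate keywords (hypothetical helper)."""
    if n <= 0:
        raise ValueError("n must be a positive integer")
    # Assumption: keywords are compared after trimming and lowercasing.
    counts = Counter(kw.strip().lower() for kw in candidates if kw.strip())
    return [kw for kw, _ in counts.most_common(n)]


# Illustrative candidates, e.g. crawled page titles or channel keywords.
candidates = [
    "buy phone", "cheap flights", "buy phone",
    "weather", "cheap flights", "buy phone",
]
seeds = select_seed_keywords(candidates, 2)
```

With these sample candidates, `seeds` holds the two most frequent keywords, "buy phone" and "cheap flights".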
Step 102: build the keyword quantification model.
A quantification scoring algorithm is determined for the keywords and for their corresponding search result links respectively, taking the scores of the search result links as an influencing factor of the quantification scoring result of the keywords, and taking the scores of the keywords as an influencing factor of the quantification scoring result of the search result links, thereby establishing an iterative quantification model for the keywords.
Taking commerciality as an example, a commerciality scoring algorithm can be used to build the commerciality quantification model for keywords.
In the present invention, following the core idea described above, the commerciality scoring algorithm can be expressed as follows:
$T(x) = \sum_i (x \to y_i)$ (Formula 1)
$T(x)$ denotes the total number of user clicks related to $x$, where $x$ may be a keyword or a search result link corresponding to a keyword;
When $x$ is a keyword, $x \to y_i$ denotes the number of clicks on a particular link $y_i$ among the search result links returned when users search for keyword $x$; $T(x)$ is then the total number of clicks on all search result links clicked when users search for keyword $x$;
When $x$ is a search result link, $x \to y_i$ denotes the number of clicks on that link by users who reached it by searching keyword $y_i$; $T(x)$ is then the total number of clicks on that same link by users who reached it through all the different keywords.
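As an illustration of formula 1, the sketch below computes the click totals for both readings of x from a small click table. The data layout (a dictionary keyed by keyword and link pairs), the function name `click_totals`, and the sample URLs are assumptions made for this example only.

```python
from collections import defaultdict


def click_totals(clicks):
    """Compute T(x) for every keyword and every link.

    clicks maps (keyword, link) -> number of clicks (hypothetical layout).
    Returns (t_keyword, t_link).
    """
    t_keyword = defaultdict(int)  # T(x) when x is a keyword
    t_link = defaultdict(int)     # T(x) when x is a search result link
    for (keyword, link), count in clicks.items():
        t_keyword[keyword] += count
        t_link[link] += count
    return dict(t_keyword), dict(t_link)


# Illustrative click data; not taken from the source.
clicks = {
    ("buy phone", "shop.example/phones"): 8,
    ("buy phone", "news.example/review"): 2,
    ("phone review", "news.example/review"): 5,
}
t_keyword, t_link = click_totals(clicks)
```

Here the keyword "buy phone" has a total of 10 clicks over its two links, while the link "news.example/review" has a total of 7 clicks over the two keywords that lead to it.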
From Formula 1, we obtain the following quantification formulas:
$\mathrm{score}(q) = (1-\gamma)\,\mathrm{score}_0(q) + \gamma \sum_i \frac{\mathrm{click}(u_i \to q)\,\mathrm{score}(u_i)}{T(q)}$ (Formula 2)
$\mathrm{score}(u) = (1-\gamma)\,\mathrm{score}_0(u) + \gamma \sum_i \frac{\mathrm{click}(q_i \to u)\,\mathrm{score}(q_i)}{T(u)}$ (Formula 3)
Formula 2 scores a keyword, and Formula 3 scores a search result link;
$q$ denotes the query, i.e. the keyword, and $u$ denotes the URL, i.e. the search result link;
$\mathrm{score}(q)$ denotes the quantification score of a keyword and $\mathrm{score}(u)$ the quantification score of a search result link; $\mathrm{click}(u_i \to q)$ denotes the number of clicks associating search result link $u_i$ with keyword $q$, and $\mathrm{click}(q_i \to u)$ the number of clicks associating keyword $q_i$ with search result link $u$; γ denotes the weight-adjustment coefficient, and $\mathrm{score}_0$ the initial commerciality score. Usually the same initial score is assigned throughout, for example 1 point. This is a relative score: what matters is relative magnitude, not absolute value.
$i$ distinguishes the different search result links or keywords; $T(u)$ and $T(q)$ are as defined in Formula 1.
Formulas 2 and 3 each consist of two parts: the first is the initial commerciality score, and the second is the score propagated iteratively through user click behavior data. The two are combined through the weight-adjustment coefficient γ to give the final commerciality score.
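The two-part structure of formula 2 can be checked on a single keyword. The sketch below applies the formula once; the function name `update_keyword_score`, the value of γ, the link scores and the handling of a keyword with no clicks are all assumptions chosen for illustration, since the source does not specify them.

```python
def update_keyword_score(gamma, initial_score, link_clicks, link_scores):
    """Apply the keyword-scoring formula once for a single keyword q.

    link_clicks: link -> clicks recorded between q and that link
    link_scores: link -> current score(u) of that link
    """
    total_clicks = sum(link_clicks.values())  # T(q)
    if total_clicks == 0:
        # Assumption: with no click data, only the initial part contributes.
        return (1 - gamma) * initial_score
    # Click-weighted average of the link scores (the propagated part).
    propagated = sum(
        clicks * link_scores[link] for link, clicks in link_clicks.items()
    ) / total_clicks
    return (1 - gamma) * initial_score + gamma * propagated


# Illustrative values only; none of these numbers come from the source.
new_score = update_keyword_score(
    gamma=0.5,
    initial_score=1.0,
    link_clicks={"shop.example/phones": 8, "news.example/review": 2},
    link_scores={"shop.example/phones": 2.0, "news.example/review": 0.5},
)
```

In this example the propagated part is (8 × 2.0 + 2 × 0.5) / 10 = 1.7, so the combined score is 0.5 × 1.0 + 0.5 × 1.7 = 1.35. A larger γ would shift the result toward the click-derived part.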
The quantification model is constructed by iterative mining computation based on Formulas 2 and 3. The principle of the iterative computation is shown in Fig. 2: scores are passed between keywords and their corresponding search result links, and as the iterations proceed the scores on both sides change together until the iteration stabilizes, at which point each score reflects relative commerciality.
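A minimal sketch of this iteration is given below, under several stated assumptions that are not in the source: seed keywords start at 1.0 and every other keyword and link at 0.0 (so that seeds and non-seeds can be told apart), γ defaults to 0.5, and the loop stops when no score moves by more than a small tolerance. The name `iterate_scores` and the sample data are hypothetical.

```python
def iterate_scores(clicks, seed_keywords, gamma=0.5, max_iter=100, tol=1e-9):
    """Iterate the link and keyword scoring formulas until scores stabilize.

    clicks: dict mapping (keyword, link) -> click count
    seed_keywords: keywords given initial score 1.0 (an assumption)
    Returns (keyword_scores, link_scores).
    """
    clicks = {pair: c for pair, c in clicks.items() if c > 0}
    keywords = {q for q, _ in clicks}
    links = {u for _, u in clicks}

    # Click totals T(q) and T(u).
    t_q = dict.fromkeys(keywords, 0)
    t_u = dict.fromkeys(links, 0)
    for (q, u), c in clicks.items():
        t_q[q] += c
        t_u[u] += c

    # Initial scores: 1.0 for seeds, 0.0 elsewhere (an assumption).
    s0_q = {q: 1.0 if q in seed_keywords else 0.0 for q in keywords}
    s0_u = dict.fromkeys(links, 0.0)
    score_q, score_u = dict(s0_q), dict(s0_u)

    for _ in range(max_iter):
        # Link scores from the current keyword scores.
        new_u = {u: (1 - gamma) * s0_u[u] for u in links}
        for (q, u), c in clicks.items():
            new_u[u] += gamma * c * score_q[q] / t_u[u]
        # Keyword scores from the refreshed link scores.
        new_q = {q: (1 - gamma) * s0_q[q] for q in keywords}
        for (q, u), c in clicks.items():
            new_q[q] += gamma * c * new_u[u] / t_q[q]
        # Largest change of any score in this pass.
        delta = max(
            [abs(new_q[q] - score_q[q]) for q in keywords]
            + [abs(new_u[u] - score_u[u]) for u in links],
            default=0.0,
        )
        score_q, score_u = new_q, new_u
        if delta < tol:
            break
    return score_q, score_u


# Illustrative click data; not taken from the source.
clicks = {
    ("buy phone", "shop.example/phones"): 8,
    ("buy phone", "news.example/review"): 2,
    ("phone review", "news.example/review"): 5,
}
keyword_scores, link_scores = iterate_scores(clicks, {"buy phone"})
```

In this toy data the non-seed keyword "phone review" ends with a small positive score because it shares a clicked link with the seed, and the link clicked mostly from the seed ends above the shared link. Only the relative order of the scores is meaningful.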
Of course, Formulas 1, 2 and 3 are only one example. What they embody is the process of scoring keywords and their corresponding search result links according to the click data between them, then passing scores iteratively, and finally obtaining the quantification data of the keywords. The key point is the iterative passing of scores between keywords and their corresponding search result links: Formulas 2 and 3 score the keyword and the corresponding search result link respectively, but Formula 2 takes the scores of the search result links as an influencing factor of the keyword's own score, and Formula 3 takes the scores of the keywords as an influencing factor of the link's own score. Keywords and their corresponding search result links are thus considered together rather than in isolation, which yields a more accurate quantification model.
Step 103: perform iterative quantification computation on the keywords according to the click data of their corresponding search result links and the quantification model, and obtain the quantification data of the keywords.
That is, quantification scoring of the keywords and their corresponding search result links is performed according to the click data between the keywords and the corresponding search result links and the quantification model, and the quantification data of the keyword attribute are then obtained through iterative computation.
The click data of the search result links corresponding to a keyword can be obtained from users' keyword-to-search-result-link click data. As described above, these include the number of clicks on a particular link among the results returned when users search for a keyword, the total number of clicks on the same search result link reached by users through all the different keywords, the total number of clicks on all search result links clicked when users search for a keyword, and so on. Substituting the user click data into Formula 3 of the quantification model gives the commerciality score of each search result link, and Formula 2 then gives the commerciality score of the keyword.
Once the commerciality score of a keyword has been obtained in step 103, it can be used in a variety of downstream applications, including commerciality ranking, commercial web page identification, commercial web page recommendation, advertising support, and identification of cheating sites.
For other keyword attributes, it suffices to follow the same steps: choose seed keywords that carry the attribute, and set the weight-adjustment coefficient γ and the initial score $\mathrm{score}_0$ in the quantification model according to actual needs. This yields the required quantification model, from which the quantification score of the attribute is computed; the details are not repeated here.
In addition, the present invention also provides a device for keyword attribute quantification based on user click data. As shown in Fig. 3, the device comprises:
A selection module 301, configured to select seed keywords that carry the attribute to be quantified;
A quantification model building module 302, configured to determine a quantification scoring algorithm for the keywords and for their corresponding search result links respectively, taking the scores of the search result links as an influencing factor of the quantification scoring result of the keywords, and taking the scores of the keywords as an influencing factor of the quantification scoring result of the search result links, thereby establishing an iterative quantification model for the keywords;
A quantification computing module 303, configured to perform quantification scoring of the keywords and their corresponding search result links according to the click data between the keywords and the corresponding search result links and the quantification model, and then to obtain the quantification data of the keyword attribute through iterative computation.
The selection module 301, as shown in Fig. 4, comprises:
A crawling unit 401, configured to crawl e-commerce site titles and product vertical-channel keywords as candidate keywords;
A selection unit 402, configured to select, from the candidate keywords, the N keywords with the highest frequency of occurrence as the seed keywords for commerciality quantification, where N is a positive integer.
The quantification model built by the quantification model building module 302 comprises:
$T(x) = \sum_i (x \to y_i)$
$T(x)$ denotes the total number of user clicks related to $x$, where $x$ may be a keyword or a corresponding search result link;
When $x$ is a keyword, $x \to y_i$ denotes the number of clicks on a particular link $y_i$ among the search result links returned when users search for keyword $x$; $T(x)$ is then the total number of clicks on all search result links clicked when users search for keyword $x$;
When $x$ is a search result link, $x \to y_i$ denotes the number of clicks on that link by users who reached it by searching keyword $y_i$; $T(x)$ is then the total number of clicks on that same link by users who reached it through all the different keywords;
$\mathrm{score}(q) = (1-\gamma)\,\mathrm{score}_0(q) + \gamma \sum_i \frac{\mathrm{click}(u_i \to q)\,\mathrm{score}(u_i)}{T(q)}$, which scores a keyword;
$\mathrm{score}(u) = (1-\gamma)\,\mathrm{score}_0(u) + \gamma \sum_i \frac{\mathrm{click}(q_i \to u)\,\mathrm{score}(q_i)}{T(u)}$, which scores a search result link;
$q$ denotes the query, i.e. the keyword, and $u$ denotes the URL, i.e. the search result link;
$\mathrm{score}(q)$ denotes the quantification score of a keyword and $\mathrm{score}(u)$ the quantification score of a search result link; $\mathrm{click}(u_i \to q)$ denotes the number of clicks associating search result link $u_i$ with keyword $q$, and $\mathrm{click}(q_i \to u)$ the number of clicks associating keyword $q_i$ with search result link $u$;
γ denotes the weight-adjustment coefficient, and $\mathrm{score}_0$ the initial commerciality score.
As shown in Fig. 5, the quantification computing module 303 comprises:
A data acquisition unit 501, configured to obtain the number of clicks on a particular link among the search result links returned when users search for a keyword, the total number of clicks on the same search result link reached by users through all the different keywords, and the total number of clicks on all search result links clicked when users search for a keyword;
A computing unit 502, configured to use the quantification model built by the quantification model building module 302 to perform quantification scoring of the keywords and their corresponding links, and then to obtain the quantification data of the keyword attribute through iterative computation.
As the above embodiments show, taking commerciality scoring as an example application, the method and device for keyword attribute quantification based on user click data of the present invention produce commercial keyword scores that are accurate, discriminating and comparable. This can greatly improve search relevance, and greatly improve effects such as the removal of cheating pages from spam sites and the boosting of commercial pages.
The above are only preferred embodiments of the present invention and are not intended to limit it. Any modification, equivalent replacement or improvement made within the spirit and principles of the present invention shall fall within its scope of protection.

Claims (9)

1. A method for keyword attribute quantification based on user click data, characterized in that the method comprises:
Acquiring a set of candidate keywords for attribute quantification;
Determining a quantification scoring algorithm for the keywords and for their corresponding search result links respectively, taking the scores of the search result links as an influencing factor of the quantification scoring result of the keywords, and taking the scores of the keywords as an influencing factor of the quantification scoring result of the search result links, thereby establishing an iterative quantification model for the keywords;
Performing quantification scoring of the keywords and their corresponding search result links according to the click data between the keywords and the corresponding search result links and the quantification model, and then obtaining the quantification data of the keyword attribute through iterative computation.
2. The method of claim 1, characterized in that the attribute is commerciality (business degree).
3. The method of claim 2, characterized in that acquiring the set of candidate keywords for attribute quantification comprises:
Crawling e-commerce site titles and product vertical-channel keywords as candidate keywords;
Selecting, from the candidate keywords, the N keywords with the highest frequency of occurrence as the candidate keywords for commerciality quantification;
Where N is a positive integer.
4. The method of claim 2, characterized in that the iterative quantification model comprises:
$T(x) = \sum_i (x \to y_i)$
$T(x)$ denotes the total number of user clicks related to $x$, where $x$ may be a keyword or a corresponding search result link;
When $x$ is a keyword, $x \to y_i$ denotes the number of clicks on a particular link $y_i$ among the search result links returned when users search for keyword $x$; $T(x)$ is then the total number of clicks on all search result links clicked when users search for keyword $x$;
When $x$ is a search result link, $x \to y_i$ denotes the number of clicks on that link by users who reached it by searching keyword $y_i$; $T(x)$ is then the total number of clicks on that same link by users who reached it through all the different keywords;
$\mathrm{score}(q) = (1-\gamma)\,\mathrm{score}_0(q) + \gamma \sum_i \frac{\mathrm{click}(u_i \to q)\,\mathrm{score}(u_i)}{T(q)}$, which scores a keyword;
$\mathrm{score}(u) = (1-\gamma)\,\mathrm{score}_0(u) + \gamma \sum_i \frac{\mathrm{click}(q_i \to u)\,\mathrm{score}(q_i)}{T(u)}$, which scores a search result link;
$\mathrm{score}(q)$ denotes the quantification score of a keyword and $\mathrm{score}(u)$ the quantification score of a search result link; $\mathrm{click}(u_i \to q)$ denotes the number of clicks associating search result link $u_i$ with keyword $q$, and $\mathrm{click}(q_i \to u)$ the number of clicks associating keyword $q_i$ with search result link $u$; γ denotes the weight-adjustment coefficient, and $\mathrm{score}_0$ the initial commerciality score.
5. method as claimed in claim 2, is characterized in that, the click data between the link of described keyword and correspondence comprises:
During the user search keyword, in the Search Results link that obtains, the number of clicks of certain link that the user clicked, the number of clicks sum that the same Search Results that the user arrives by all different keyword searches links, during the user search keyword, the number of clicks sum of all Search Results links of clicking.
6. A keyword attribute quantification device based on user click data, characterized in that the device comprises:
a selecting module, configured to obtain a set of candidate keywords for attribute quantification;
a quantification model establishing module, configured to determine the quantification scoring algorithms of the keywords and of their corresponding search result links respectively, taking the scores of the search result links as an influencing factor of the quantification scoring result of the keywords, and taking the scores of the keywords as an influencing factor of the quantification scoring result of the search result links, so as to establish an iterative quantification model of the keywords;
a quantification computing module, configured to perform quantification scoring on the keywords and their corresponding search result links according to the quantification model and the click data between the keywords and their corresponding search result links, and then to obtain the quantified data of the keyword attribute by iterative computation.
7. The device as claimed in claim 6, characterized in that the selecting module comprises:
a crawling unit, configured to crawl e-commerce site titles and product vertical-channel keywords as candidate keywords;
a selecting unit, configured to select, from the candidate keywords, the N keywords with the highest frequency of occurrence as quantification keyword seeds for the commercial-degree attribute, where N is a positive integer.
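The seed selection performed by the selecting unit amounts to a top-N frequency count. A minimal sketch, assuming the crawled candidates arrive as a flat list of strings (the function name `select_seed_keywords` is hypothetical):

```python
from collections import Counter


def select_seed_keywords(candidates, n):
    """Return the n most frequent candidate keywords, most frequent first."""
    if n <= 0:
        raise ValueError("n must be a positive integer")
    # Counter.most_common sorts by count; ties keep first-seen order.
    return [keyword for keyword, _ in Counter(candidates).most_common(n)]
```

For example, `select_seed_keywords(["phone", "laptop", "phone", "camera", "laptop", "phone"], 2)` returns `["phone", "laptop"]`.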
8. The device as claimed in claim 6, characterized in that the quantification model established by the quantification model establishing module comprises:
$$T(x) = \sum_i (x \to y_i)$$
T(x) denotes the sum of all user click counts related to x, where x may be a keyword or a corresponding search result link;
When x is a keyword, x → y_i denotes the number of clicks on a given link that the user clicked among the search result links returned when the user searches keyword x; in this case T(x) denotes the total number of clicks on all search result links clicked when the user searches keyword x;
When x is a search result link, x → y_i denotes the number of clicks on that same search result link reached by users through different keyword searches; in this case T(x) denotes the total number of clicks on that search result link across all the different keyword searches;
$$\mathrm{score}(q) = (1-\gamma)\,\mathrm{score}_0(q) + \gamma \sum_i \frac{\mathrm{click}(u_i \to q)\,\mathrm{score}(u_i)}{T(q)},$$ which expresses the quantification scoring of a keyword;
$$\mathrm{score}(u) = (1-\gamma)\,\mathrm{score}_0(u) + \gamma \sum_i \frac{\mathrm{click}(q_i \to u)\,\mathrm{score}(q_i)}{T(u)},$$ which expresses the quantification scoring of a search result link;
score(q) denotes the quantification score of a keyword, and score(u) denotes the quantification score of a search result link; click(u_i → q) denotes the number of clicks associating a given search result link u_i with the corresponding keyword q, and click(q_i → u) denotes the number of clicks associating a given keyword q_i with the corresponding search result link u; γ denotes the transfer weight coefficient, and score_0 denotes the initial commercial-degree score.
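A single update of the keyword score can be worked through by hand. All numbers below are hypothetical and chosen only for illustration, and the fraction is read with T(q) as the denominator of the click-weighted sum: a keyword q leads to two links with click(u_1 → q) = 3 and click(u_2 → q) = 1, so T(q) = 4; the current link scores are score(u_1) = 0.8 and score(u_2) = 0.4; the initial score is score_0(q) = 0.5; and γ = 0.5.

```latex
\begin{aligned}
\mathrm{score}(q)
  &= (1-\gamma)\,\mathrm{score}_0(q)
     + \gamma\,\frac{\mathrm{click}(u_1 \to q)\,\mathrm{score}(u_1)
                   + \mathrm{click}(u_2 \to q)\,\mathrm{score}(u_2)}{T(q)} \\
  &= (1-0.5)(0.5) + 0.5 \cdot \frac{3(0.8) + 1(0.4)}{4} \\
  &= 0.25 + 0.5 \cdot 0.7 \\
  &= 0.6
\end{aligned}
```

The keyword's score moves from its initial 0.5 toward the click-weighted average of its links' scores (0.7), with γ controlling how far it moves.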
9. The device as claimed in claim 8, characterized in that the quantification computing module comprises:
a data acquiring unit, configured to obtain: the number of clicks on a given link that the user clicked among the search result links returned when the user searches a keyword; the total number of clicks on the same search result link reached by users through all the different keyword searches; and the total number of clicks on all search result links clicked when the user searches the keyword;
a computing unit, configured to perform quantification scoring on the keywords and their corresponding links using the quantification model established by the quantification model establishing module, and then to obtain the quantified data of the keyword attribute by iterative computation.
CN 201210133409 2012-05-02 2012-05-02 Method and device for keyword attribute quantification based on user click data Pending CN103383685A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN 201210133409 CN103383685A (en) 2012-05-02 2012-05-02 Method and device for keyword attribute quantification based on user click data

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN 201210133409 CN103383685A (en) 2012-05-02 2012-05-02 Method and device for keyword attribute quantification based on user click data

Publications (1)

Publication Number Publication Date
CN103383685A true CN103383685A (en) 2013-11-06

Family

ID=49491475

Family Applications (1)

Application Number Title Priority Date Filing Date
CN 201210133409 Pending CN103383685A (en) 2012-05-02 2012-05-02 Method and device for keyword attribute quantification based on user click data

Country Status (1)

Country Link
CN (1) CN103383685A (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104750752A (en) * 2013-12-31 2015-07-01 ***通信集团公司 Determination method and device of user community with internet-surfing preference
CN104994424A (en) * 2015-06-30 2015-10-21 北京奇艺世纪科技有限公司 Method and device for constructing audio/video standard data set
CN105635117A (en) * 2015-12-22 2016-06-01 北京奇虎科技有限公司 Method and device for identifying user clicking link
CN107038609A (en) * 2017-04-24 2017-08-11 广州华企联信息科技有限公司 Commodity recommendation method and system based on deep learning
WO2018032789A1 (en) * 2016-08-16 2018-02-22 武汉斗鱼网络科技有限公司 Method and system for calculating search precision of internet keywords
CN115169337A (en) * 2022-08-24 2022-10-11 中教畅享(北京)科技有限公司 Method for calculating keyword score in commodity title optimization

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104750752A (en) * 2013-12-31 2015-07-01 ***通信集团公司 Determination method and device of user community with internet-surfing preference
CN104750752B (en) * 2013-12-31 2018-06-15 ***通信集团公司 Determination method and device of user community with internet-surfing preference
CN104994424A (en) * 2015-06-30 2015-10-21 北京奇艺世纪科技有限公司 Method and device for constructing audio/video standard data set
CN104994424B (en) * 2015-06-30 2018-03-20 北京奇艺世纪科技有限公司 Method and device for constructing audio/video standard data set
CN105635117A (en) * 2015-12-22 2016-06-01 北京奇虎科技有限公司 Method and device for identifying user clicking link
WO2018032789A1 (en) * 2016-08-16 2018-02-22 武汉斗鱼网络科技有限公司 Method and system for calculating search precision of internet keywords
CN107038609A (en) * 2017-04-24 2017-08-11 广州华企联信息科技有限公司 Commodity recommendation method and system based on deep learning
CN115169337A (en) * 2022-08-24 2022-10-11 中教畅享(北京)科技有限公司 Method for calculating keyword score in commodity title optimization

Similar Documents

Publication Publication Date Title
CN104615767B (en) Training method, search processing method and the device of searching order model
CN101320375B (en) Digital book search method based on user click action
CN103955842B (en) Online advertisement recommendation system and method for mass media data
CN103383685A (en) Method and device for keyword attribute quantification based on user click data
CN105095187A (en) Search intention identification method and device
CN105138558B (en) Real-time personalized information collection method based on user-accessed content
CN103838754B (en) Information retrieval device and method
CN102651011B (en) Method and system for determining document characteristic and user characteristic
CN103186574A (en) Method and device for generating searching result
CN105488024A (en) Webpage topic sentence extraction method and apparatus
US20200026759A1 (en) Artificial intelligence engine for generating semantic directions for websites for automated entity targeting to mapped identities
CN105045901A (en) Search keyword push method and device
CN102754094A (en) Ranking user generated web content
CN103064945A (en) Situation searching method based on body
CN104361115B (en) Method and device for determining entry weights based on co-clicks
CN103077190A (en) Hot event ranking method based on order learning technology
CN108763321A (en) A kind of related entities recommendation method based on extensive related entities network
CN102402566A (en) Web user behavior analysis method based on Chinese webpage automatic classification technology
CN102737021B (en) Search engine and realization method thereof
CN105159930A (en) Search keyword pushing method and apparatus
CN105975596A (en) Query expansion method and system of search engine
CN103150663A (en) Method and device for placing network placement data
CN101127046A (en) Method and system for sequencing to blog article
CN106991160A (en) A kind of microblogging propagation prediction method based on user force and content
CN107133282B (en) Improved evaluation object identification method based on bidirectional propagation

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
ASS Succession or assignment of patent right

Owner name: SHENZHEN SHIJI LIGHT SPEED INFORMATION TECHNOLOGY

Free format text: FORMER OWNER: TENGXUN SCI-TECH (SHENZHEN) CO., LTD.

Effective date: 20131022

C41 Transfer of patent application or patent right or utility model
COR Change of bibliographic data

Free format text: CORRECT: ADDRESS; FROM: 518044 SHENZHEN, GUANGDONG PROVINCE TO: 518057 SHENZHEN, GUANGDONG PROVINCE

TA01 Transfer of patent application right

Effective date of registration: 20131022

Address after: 518057 Tencent Building, 16, Nanshan District hi tech park, Guangdong, Shenzhen

Applicant after: Shenzhen Shiji Guangsu Information Technology Co., Ltd.

Address before: Shenzhen Futian District City, Guangdong province 518044 Zhenxing Road, SEG Science Park 2 East Room 403

Applicant before: Tencent Technology (Shenzhen) Co., Ltd.

C02 Deemed withdrawal of patent application after publication (patent law 2001)
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20131106