CN102999576B - Method and apparatus for determining the page description information corresponding to a target page - Google Patents

Method and apparatus for determining the page description information corresponding to a target page

Info

Publication number
CN102999576B
CN102999576B CN201210452843.6A CN201210452843A CN102999576B CN 102999576 B CN102999576 B CN 102999576B CN 201210452843 A CN201210452843 A CN 201210452843A CN 102999576 B CN102999576 B CN 102999576B
Authority
CN
China
Prior art keywords
information
page
target pages
user
determines
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201210452843.6A
Other languages
Chinese (zh)
Other versions
CN102999576A (en
Inventor
唐振江
董冰峰
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Baidu Netcom Science and Technology Co Ltd
Original Assignee
Beijing Baidu Netcom Science and Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Baidu Netcom Science and Technology Co Ltd filed Critical Beijing Baidu Netcom Science and Technology Co Ltd
Priority to CN201210452843.6A priority Critical patent/CN102999576B/en
Publication of CN102999576A publication Critical patent/CN102999576A/en
Application granted granted Critical
Publication of CN102999576B publication Critical patent/CN102999576B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

An object of the present invention is to provide a method and apparatus for determining the page description information corresponding to a target page. Specifically, the classification-related information corresponding to a target page to be processed is determined; according to the classification-related information, the candidate description information corresponding to the target page is adjusted accordingly to obtain the page description information corresponding to the target page. Compared with the prior art, adjusting the candidate description information according to the determined classification-related information makes the page description information of the target page more accurate, which not only improves the efficiency with which users obtain information, but also improves the user's browsing and reading experience and saves resources on the user device.

Description

Method and apparatus for determining the page description information corresponding to a target page
Technical field
The present invention relates to the field of Internet technology, and in particular to a technique for determining the page description information corresponding to a target page.
Background
At present, with the development of Internet technology and the penetration of Internet applications into users' study, work and life, people increasingly obtain information over the network, for example by browsing pages or by searching for page results on a particular topic. Accordingly, if the page description information of a target page can be determined accurately, the efficiency with which users obtain information can be greatly improved, for example by providing search users with more suitable page results, or by pushing more relevant additional information to users who are browsing a page. However, the prior art usually determines the description information of a page only by segmenting the page into words and then counting word frequencies, and page description information obtained in this way often contains large errors. For example, suppose a user interested in "composition" is browsing a composition-writing page; if the page includes a model essay about "zongzi" (rice dumplings), the prior art obtains "zongzi", rather than "composition", as the description information of the page. In particular, with the current proliferation of search engine optimization and website optimization techniques, page description information obtained with the prior art is increasingly unreliable, which seriously affects the efficiency and experience with which people obtain information.
Summary of the invention
An object of the present invention is to provide a method and apparatus for determining the page description information corresponding to a target page.
According to one aspect of the present invention, a method for determining the page description information corresponding to a target page is provided, wherein the method comprises the following steps:
a. determining the classification-related information corresponding to a target page to be processed;
b. according to the classification-related information, adjusting the candidate description information corresponding to the target page accordingly, to obtain the page description information corresponding to the target page.
According to another aspect of the present invention, an information determining device for determining the page description information corresponding to a target page is also provided, wherein the information determining device includes:
a classification device, for determining the classification-related information corresponding to a target page to be processed;
a determining device, for adjusting, according to the classification-related information, the candidate description information corresponding to the target page accordingly, to obtain the page description information corresponding to the target page.
According to yet another aspect of the present invention, a computer device is also provided, which includes the aforementioned information determining device, according to another aspect of the present invention, for determining the page description information corresponding to a target page.
Compared with the prior art, the present invention adjusts the candidate description information corresponding to the target page according to the determined classification-related information of the target page, to obtain the page description information corresponding to the target page, so that the page description information of the target page is more accurate; this not only improves the efficiency with which users obtain information, but also improves the user's browsing and reading experience and saves resources on the user device. Moreover, the present invention can also determine, according to the page description information, the presentation information corresponding to the target page, thereby further improving the efficiency of information provision and the efficiency with which users obtain information. Further, the present invention can also determine content sensitivity information of the target page, and determine the presentation information corresponding to the target page according to the page description information in combination with the content sensitivity information, thereby further improving the efficiency of information provision and the efficiency with which users obtain information, and in turn the user's browsing and reading experience. In addition, the present invention can also perform subsequent processing on search results according to the matching degree information between the query sequence and the page description information of the pages corresponding to the search results, which further shortens the time users spend on web search, reduces users' access traffic, improves the efficiency with which users obtain information, and improves the user's search and browsing experience.
Brief description of the drawings
Other features, objects and advantages of the present invention will become more apparent upon reading the detailed description of non-limiting embodiments made with reference to the following drawings:
Fig. 1 shows a schematic diagram of a device for determining the page description information corresponding to a target page according to one aspect of the present invention;
Fig. 2 shows a schematic diagram of a device for determining the page description information corresponding to a target page according to a preferred embodiment of the present invention;
Fig. 3 shows a flowchart of a method for determining the page description information corresponding to a target page according to another aspect of the present invention;
Fig. 4 shows a flowchart of a method for determining the page description information corresponding to a target page according to a preferred embodiment of the present invention.
In the drawings, the same or similar reference numerals denote the same or similar components.
Detailed description
The present invention is described in further detail below with reference to the accompanying drawings.
Fig. 1 shows an information determining device 1 for determining the page description information corresponding to a target page according to one aspect of the present invention, wherein information determining device 1 includes classification device 11 and determining device 12. Specifically, classification device 11 determines the classification-related information corresponding to a target page to be processed; determining device 12 adjusts, according to the classification-related information, the candidate description information corresponding to the target page accordingly, to obtain the page description information corresponding to the target page. Here, information determining device 1 includes but is not limited to a network device, a user device, or a device formed by integrating a network device and a user device through a network. Here, information determining device 1 may be implemented by a network device, such as a network host, a single network server, a set of multiple network servers, or a set of computers based on cloud computing; or it may be implemented by a user device. Here, the cloud consists of a large number of hosts or network servers based on cloud computing (Cloud Computing), where cloud computing is a form of distributed computing: a super virtual computer composed of a group of loosely coupled computers. Here, the user device may be any electronic product capable of human-computer interaction with the user through a keyboard, mouse, touch pad, touch screen, handwriting device or the like, such as a computer, mobile phone, PDA, pocket PC (PPC) or tablet computer. The network includes but is not limited to the Internet, a wide area network, a metropolitan area network, a local area network, a VPN, a wireless ad hoc network (Ad Hoc network), and so on. Those skilled in the art will understand that the above information determining device 1 is only an example; other network devices or user devices, existing or appearing in the future, that are applicable to the present invention should also be included within the scope of protection of the present invention and are incorporated herein by reference. Here, both the network device and the user device include an electronic device that can automatically perform numerical computation and information processing according to instructions set or stored in advance, whose hardware includes but is not limited to a microprocessor, an application-specific integrated circuit (ASIC), a field-programmable gate array (FPGA), a digital signal processor (DSP), an embedded device, and so on.
Specifically, classification device 11 first obtains the target page to be processed through an application programming interface (API) provided by a third-party device such as a browser or a search engine; or it obtains, through dynamic web page technologies such as ASP or JSP, the query sequence entered by the user on a user device, submits the query sequence to a search engine, and receives the search results fed back by the search engine for that query sequence as the target pages to be processed; or it obtains the target page to be processed through an agreed communication method such as http or https. Classification device 11 then determines the classification-related information corresponding to the target page. Here, the classification-related information includes but is not limited to at least any one of the following:
1) Virtual theme. Here, a virtual theme denotes the access intent, as reflected by the main body content of the target page, of the users who access the target page. For example, suppose the main body content of a target page such as "boat race composition model essay" (http://www.qc99.com/xiaoxue/sinj/101176.Html) is a model essay about a boat race, and the users browsing the page wish to learn about composition writing; the classification-related information corresponding to this target page is then a virtual theme such as "composition". As another example, suppose the main body content of a target page such as "flower material download" (http://sucai.redocn.com/category/260/) consists of pictures of flowers, and the users browsing the page wish to obtain material about flowers for artistic creation; the classification-related information corresponding to this target page is then a virtual theme such as "art material".
2) Exact-match object. Here, an exact-match object denotes that the target page contains content information that fully coincides with the user's demand, and that the demand is not substitutable. For example, suppose a target page such as "Beijing oral ulcer experts - Haodf Online" (http://www.haodf.com/jibing/kouqiangkuiyang/daifu.htm?Province=beijing) contains information about hospitals and attending doctors for the disease "oral ulcer", and the users browsing the page wish to find pages about treating "oral ulcer" rather than other diseases such as "rhinitis"; the classification-related information corresponding to this target page is then an exact-match object. As another example, suppose a target page such as "IBM minicomputer IBM POWER720" (http://www.xinhuigroup.com/Product/10026/11479.html) contains the product introduction, specifications and similar information for the IBM minicomputer IBM POWER720, and the users browsing the page wish to find pages about the IBM POWER720 rather than other models such as the "IBM POWER 550"; the classification-related information corresponding to this target page is then an exact-match object.
3) Broad-match object. Here, a broad-match object denotes that the content information of the target page is related to the user's demand. For example, suppose the target page is "iphone5 pink protective case with a heart pattern on the back" (http://www.vipshop.com/show-0-48369-0.html?), and the users browsing the page may also be interested in other accessories for the iphone5, such as an "Apple data cable", and in other brands of the same product type (smartphones) as the "iphone5", such as "nokia" smartphones; the classification-related information corresponding to this target page is then a broad-match object.
4) Non-matching object. Here, a non-matching object denotes that the content of the target page is not suitable for presenting to the user any information other than the content information of the target page itself. For example, when a user reads a news report such as "Expert says Obama will be both rival and friend, deepening the return-to-Asia-Pacific strategy" (http://news.sina.com.cn/w/sd/2012-11-08/021925532469.shtml), the user pays attention to the content of the news report and not to other content in the page; the classification-related information corresponding to this page is then a non-matching object, such as a news report.
Those skilled in the art will understand that the above classification-related information is only an example; other classification-related information, existing or appearing in the future, that is applicable to the present invention should also be included within the scope of protection of the present invention and is incorporated herein by reference.
For example, the user enters the URL http://news.sina.com.cn/ in the browser address bar and presses the Enter key; classification device 11 then obtains the web page corresponding to http://news.sina.com.cn/ through an application programming interface (API) provided by a third-party device such as the news website. As another example, the user enters the keyword "iphone accessories" in a search box on a user device such as a PC and clicks the search button; classification device 11 then obtains the query sequence entered by the user from the user device through dynamic web page technologies such as JSP or ASP, submits a search request to a search engine based on this query sequence, and obtains, through the application programming interface (API) provided by the search engine, the one or more search results that the search engine has matched to the keyword "iphone accessories", such as "iphone accessories [market prices, reviews, genuine and imitation goods]" and "iphone accessories - Apple Store (China)", as the target pages to be processed.
Those skilled in the art will understand that the above ways of obtaining the target page to be processed are only examples; other ways of obtaining the target page to be processed, existing or appearing in the future, that are applicable to the present invention should also be included within the scope of protection of the present invention and are incorporated herein by reference.
Classification device 11 then determines the classification-related information corresponding to the target page to be processed. Here, the ways in which classification device 11 determines the classification-related information corresponding to the target page include but are not limited to at least any one of the following:
1) Determining the classification-related information corresponding to the target page according to the main body content of the target page. Specifically, classification device 11 first extracts the main body content of the target page by, for example, analyzing the HTML tags of the page; or it divides the target page into blocks according to the VIPS (Vision-based Page Segmentation) algorithm, using visual features such as the foreground color, background color, font color and size, borders, logical blocks and the spacing between them, and element positions, to obtain the main body content block of the target page. Classification device 11 then determines the classification-related information corresponding to the target page according to its main body content. For example, suppose the target page obtained by classification device 11 is a news report such as "Expert says Obama will be both rival and friend, deepening the return-to-Asia-Pacific strategy" (http://news.sina.com.cn/w/sd/2012-11-08/021925532469.shtml); classification device 11 extracts, for example by HTML tag analysis, the main body content of this target page, namely the news report itself, and then determines that the classification-related information corresponding to this target page is a non-matching object. As another example, suppose the target page obtained by classification device 11 is the page "Beijing oral ulcer experts - Haodf Online" (http://www.haodf.com/jibing/kouqiangkuiyang/daifu.htm?Province=beijing), which concerns the treatment of a disease such as "oral ulcer" and contains content information that fully coincides with the user's demand; classification device 11 then determines that the classification-related information corresponding to this target page is an exact-match object.
2) Determining the classification-related information corresponding to the target page according to the page access records of the users who access the target page. For example, a user is browsing a page such as "iphone accessories, Vipshop low-price rush! Digital accessories limited-time sale" (http://www.vipshop.com/show-0-48369-0.html?), and the user's page access records show that this user is also interested in other accessories for the iphone5, such as an "Apple data cable", and in other brands of the same product type (smartphones) as the "iphone5", such as "nokia" smartphones; classification device 11 then determines that the classification-related information corresponding to this target page is a broad-match object.
Those skilled in the art will understand that the above ways of determining the classification-related information are only examples; other ways of determining the classification-related information, existing or appearing in the future, that are applicable to the present invention should also be included within the scope of protection of the present invention and are incorporated herein by reference.
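The HTML-tag-analysis route to the main body content mentioned in way 1) can be illustrated with a minimal Python sketch. It is not the patent's implementation: the class name BodyTextExtractor, the function extract_body_text, and the choice of which tags count as non-body content are all assumptions made for illustration, and the VIPS-based route is not shown.

```python
from html.parser import HTMLParser


class BodyTextExtractor(HTMLParser):
    """Collects visible text while skipping tags assumed to hold non-body content."""

    # Assumption: these tags are treated as boilerplate; the patent does not list any.
    SKIPPED = {"script", "style", "nav", "header", "footer"}

    def __init__(self):
        super().__init__()
        self._skip_depth = 0   # > 0 while inside a skipped tag
        self.chunks = []       # text fragments kept as body content

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIPPED:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIPPED and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        text = data.strip()
        if text and self._skip_depth == 0:
            self.chunks.append(text)


def extract_body_text(html: str) -> str:
    """Return the text of the page with the skipped tags removed."""
    parser = BodyTextExtractor()
    parser.feed(html)
    parser.close()
    return " ".join(parser.chunks)
```

The extracted text would then be handed to whatever classifier decides among the four kinds of classification-related information; that decision step is deliberately left out here because the text above does not fix a rule for it.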
Determining device 12 adjusts, according to the classification-related information, the candidate description information corresponding to the target page accordingly, to obtain the page description information corresponding to the target page. Here, the candidate description information includes but is not limited to, for example, a description of the main body content of the target page and a description of the classification-related information corresponding to the target page. Specifically, determining device 12 first obtains the candidate description information corresponding to the target page by, for example, performing word frequency statistics on the page content of the target page, or by calling an application programming interface (API) for page candidate description information provided by the third-party website to which the target page belongs; determining device 12 then adjusts the candidate description information corresponding to the target page according to the classification-related information determined by classification device 11, to obtain the page description information corresponding to the target page. Those skilled in the art will understand that the above candidate description information is only an example; other candidate description information, existing or appearing in the future, that is applicable to the present invention should also be included within the scope of protection of the present invention and is incorporated herein by reference. Here, the corresponding adjustment operation includes at least any one of the following:
- when the classification-related information includes the virtual theme, performing a matching query in a virtual theme database according to the candidate description information, and using the corresponding matching query result as the page description information;
- when the classification-related information includes the exact-match object, using the candidate description information as the page description information;
- when the classification-related information includes the broad-match object, performing a matching query in a broad object database according to the candidate description information, and using the candidate description information together with its corresponding matching query result as the page description information;
- when the classification-related information includes the non-matching object, emptying the candidate description information and using the result as the page description information.
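The four adjustment operations listed above amount to a dispatch on the classification-related information. The following Python sketch shows one way this could look. It is only an illustration under stated assumptions: the Category enum, the two in-memory dictionaries standing in for the virtual theme database and the broad object database, the substring-based lookup, and the choice to return an empty description when no virtual theme matches are all hypothetical and are not specified by the text.

```python
from enum import Enum
from typing import Dict, List, Optional


class Category(Enum):
    VIRTUAL_THEME = "virtual theme"
    EXACT_MATCH = "exact-match object"
    BROAD_MATCH = "broad-match object"
    NON_MATCHING = "non-matching object"


# Hypothetical stand-ins for the virtual theme database and the broad object database.
VIRTUAL_THEME_DB: Dict[str, str] = {"composition model essay": "composition"}
BROAD_OBJECT_DB: Dict[str, List[str]] = {
    "digital accessories": ["iphone digital accessories", "nokia digital accessories"],
}


def lookup(db: dict, candidate: str) -> Optional[object]:
    """Assumed matching query: value of the first key contained in the candidate."""
    for key, value in db.items():
        if key in candidate:
            return value
    return None


def adjust(category: Category, candidate: str) -> List[str]:
    """Turn a candidate description into page description information."""
    if category is Category.VIRTUAL_THEME:
        theme = lookup(VIRTUAL_THEME_DB, candidate)
        # Assumption: no match yields an empty description.
        return [theme] if theme is not None else []
    if category is Category.EXACT_MATCH:
        return [candidate]                    # candidate is used unchanged
    if category is Category.BROAD_MATCH:
        related = lookup(BROAD_OBJECT_DB, candidate) or []
        return [candidate, *related]          # candidate plus its query results
    return []                                 # non-matching object: emptied
```

For instance, adjust(Category.EXACT_MATCH, "oral ulcer treatment") keeps the candidate as-is, while adjust(Category.NON_MATCHING, "news report") yields an empty description.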
For example, suppose classification device 11 determines that the classification-related information corresponding to a target page to be processed, such as "boat race composition model essay" (http://www.qc99.com/xiaoxue/sinj/101176.Html), is the virtual theme. Determining device 12 first calls the application programming interface (API) for page candidate description information provided by the third-party website qc99 to which the target page http://www.qc99.com/xiaoxue/sinj/101176.Html belongs, and obtains candidate description information for this target page that includes content such as "boat race composition model essay". Determining device 12 then performs a matching query in the virtual theme database according to this candidate description information, obtains a matching query result such as "page body content: boat race composition model essay - corresponding classification-related information: virtual theme (composition)", and uses this matching query result as the page description information. Here, the virtual theme database stores multiple virtual themes; it may be located in information determining device 1, or in a server connected to information determining device 1 through a network.
As another example, suppose classification device 11 determines that the classification-related information of a target page to be processed, such as the page "Beijing oral ulcer experts - Haodf Online" (http://www.haodf.com/jibing/kouqiangkuiyang/daifu.htm?Province=beijing) concerning the treatment of a disease such as "oral ulcer", is an exact-match object. Determining device 12 first performs word frequency statistics on the page content of this target page and obtains candidate description information that includes "treatment of the disease "oral ulcer" - corresponding classification-related information: exact-match object" and the like; determining device 12 then uses this candidate description information as the page description information.
As a further example, suppose classification device 11 determines that the classification-related information of a target page to be processed, such as "iphone accessories, Vipshop low-price rush! Digital accessories limited-time sale" (http://www.vipshop.com/show-0-48369-0.html?), is a broad-match object. Determining device 12 first performs word frequency statistics on the page content of this target page and obtains candidate description information that includes "digital accessories sale" and the like. Determining device 12 then performs a matching query in the broad object database according to this candidate description information, obtains a matching query result such as "iphone digital accessories (protective cases, chargers, etc.) - nokia digital accessories - ...", and uses the candidate description information together with its corresponding matching query result as the page description information. Here, the broad object database contains a classified collection of broad objects, each of which can be further classified; it may be located in information determining device 1, or in a server connected to information determining device 1 through a network.
As yet another example, suppose classification device 11 determines that the classification-related information of a target page to be processed, the news report "Expert says Obama will be both rival and friend, deepening the return-to-Asia-Pacific strategy" (http://news.sina.com.cn/w/sd/2012-11-08/021925532469.shtml), is a non-matching object. Determining device 12 first calls the application programming interface (API) for page candidate description information provided by the third-party website sina to which the target page belongs, and obtains candidate description information that includes "news report - corresponding classification-related information: non-matching object". Determining device 12 then empties this candidate description information and uses the result as the page description information; that is, the page description information corresponding to this target page is empty.
Those skilled in the art will understand that the above ways of adjusting the candidate description information corresponding to the target page are only examples; other ways of adjusting the candidate description information corresponding to the target page, existing or appearing in the future, that are applicable to the present invention should also be included within the scope of protection of the present invention and are incorporated herein by reference.
Those skilled in the art will understand that the above ways of obtaining the page description information corresponding to the target page are only examples; other ways of obtaining the page description information corresponding to the target page, existing or appearing in the future, that are applicable to the present invention should also be included within the scope of protection of the present invention and are incorporated herein by reference.
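Several of the examples above obtain the candidate description information by word frequency statistics over the page content. A minimal Python sketch of that step is given below. The function name candidate_description, the stop-word list and the regular-expression tokenizer are assumptions for illustration; Chinese page text would additionally require a word segmentation step, which is not shown.

```python
import re
from collections import Counter
from typing import List

# Assumed stop-word list; a real system would use a much larger one.
STOPWORDS = {"the", "a", "an", "of", "and", "to", "in", "is", "for"}


def candidate_description(text: str, top_n: int = 3) -> List[str]:
    """Return the top_n most frequent non-stop-words of the page text."""
    words = [
        word
        for word in re.findall(r"[a-z0-9]+", text.lower())
        if word not in STOPWORDS
    ]
    return [word for word, _count in Counter(words).most_common(top_n)]
```

On its own this step reproduces the weakness described in the background: a composition page dominated by a model essay about one subject would be described by that subject. This is why the result is treated only as a candidate and is adjusted afterwards according to the classification-related information.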
Information determines between each device of equipment 1 it is constant work.Specifically, classification Device 11 persistently determines the classification relevant information corresponding to pending target pages;Determine device 12 continue according to described classification relevant information, and the candidate corresponding to described target pages is described information Adjust accordingly process, to obtain the page-describing information corresponding to described target pages.Here, " continuing " information of referring to that skilled artisan would appreciate that determines each device of equipment 1 respectively Constantly carry out determination and the acquisition of page-describing information of classification relevant information, until information is true Locking equipment 1 stops the determination of classification relevant information in a long time.
Preferably, the information determining device 1 further includes a model establishing device (not shown). Specifically, the model establishing device performs machine learning on a plurality of training pages labeled with classification information, so as to obtain a page classification model for classifying pages; the classification device 11 then determines the classification-related information according to the page classification model, based on the page-related information of the target page.
Specifically, the model establishing device performs machine learning on a plurality of training pages labeled with classification information, so as to obtain a page classification model for classifying pages. For example, suppose the training pages labeled with classification information are as follows:
I: Sample essay on a rowing race, http://www.qc99.com/xiaoxue/sinj/101176.Html, virtual theme
II: sina / Reading / Fiction / World Classics / "The Count of Monte Cristo", http://vip.book.sina.com.cn/book/index_81300.html, virtual theme
III: Beijing oral medicine experts - Good Doctor Online, http://www.haodf.com/jibing/kouqiangkuiyang/daifu.htm?province=beijing, exact match object
IV: sina sports news, http://sports.sina.com.cn/, mismatch object
V: sina financial news, http://finance.sina.com.cn/, mismatch object
VI: Vipshop digital accessories, http://www.vipshop.com/show-0-48369-0.html?, broad match object
VII: Dangdang skin care products, http://cosmetic.dangdang.com/, broad match object
The model establishing device then performs machine learning on these training pages labeled with classification information, for example by performing linear regression analysis or non-linear regression analysis on the training set, to obtain a page classification model for classifying pages, such as a decision tree. Each node of the decision tree corresponds to a page category, and each page category contains training pages: for example, the virtual theme category contains pages I and II, the exact match object category contains page III, the mismatch object category contains pages IV and V, and the broad match object category contains pages VI and VII.
Then, the classification device 11 determines the classification-related information according to the page classification model, based on the page-related information of the target page. Here, the page-related information includes but is not limited to, e.g., the category of the page's body content and the structural features of the page. For example, suppose the target page to be processed that the classification device 11 first obtains is "Sample essay on a rowing race" (http://www.qc99.com/xiaoxue/sinj/101176.Html). The classification device 11 can then, according to the page classification model obtained by the model establishing device and based on page-related information of this target page such as its body content information, compare the body content category of this target page with the body content categories of the training pages included in each page category of the page classification model. If, say, the body content category of this target page is determined to be the essay type, which is consistent with the content category of the training pages included in the virtual theme page category, the classification device 11 determines that the classification-related information of this target page is "virtual theme".
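The comparison step described above can be sketched in Python as follows. This is only a minimal illustration under stated assumptions: the patent mentions a decision tree obtained by regression analysis, whereas this sketch substitutes a simple majority-vote lookup table, and the body-content category strings ("essay", "news", and so on) and all function names are hypothetical.

```python
from collections import Counter, defaultdict

# Hypothetical labelled training pages: (body-content category, page category).
# The content-category strings are illustrative assumptions, not from the patent.
TRAINING_PAGES = [
    ("essay", "virtual theme"),             # page I
    ("novel", "virtual theme"),             # page II
    ("doctor listing", "exact match object"),  # page III
    ("news", "mismatch object"),            # page IV
    ("news", "mismatch object"),            # page V
    ("product listing", "broad match object"),  # page VI
    ("product listing", "broad match object"),  # page VII
]


def build_model(training_pages):
    """Map each body-content category to the page category most often labelled with it."""
    votes = defaultdict(Counter)
    for content_category, label in training_pages:
        votes[content_category][label] += 1
    return {category: counts.most_common(1)[0][0] for category, counts in votes.items()}


def classify(model, content_category):
    """Return the page category for a target page, or None if the content category is unseen."""
    return model.get(content_category)


MODEL = build_model(TRAINING_PAGES)
```

With this stand-in model, a target page whose body content category is "essay" is assigned "virtual theme", mirroring the rowing-race essay example; a real implementation would derive the content category and the model from actual page features.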
Preferably, the information determining device 1 further includes a search processing device (not shown). Specifically, the search processing device first obtains one or more search results corresponding to a query sequence; then, according to matching degree information between the page description information of the pages corresponding to the search results and the query sequence, it performs subsequent processing on the one or more search results; then, it provides at least one of the one or more search results that have undergone subsequent processing to the application corresponding to the query sequence.
Specifically, the search processing device first obtains, through dynamic page technologies such as ASP or JSP, the query request in which a user enters a query sequence in the search box of a search engine via user equipment, then submits this query sequence to the search engine and receives the one or more search results corresponding to this query sequence that the search engine returns, thereby obtaining the one or more search results corresponding to the query sequence. For example, suppose a user enters the keywords "iphone protective case accessories" in the search box of a search engine on a PC and then clicks the search button. The search processing device can then obtain the query sequence entered by the user through dynamic page technologies such as ASP or JSP, submit a page search request to the search engine based on this query sequence, and receive the one or more search results corresponding to the query sequence "iphone protective case accessories" that the search engine returns, such as search result A "Home - Miduoduo Apple digital accessories genuine discount store", search result B "... 3C Apple accessories iphone shell phone case wholesale and retail protective case", and search result C "Unique protective cases, iphone4s accessory recommendations - Mobile phones - Technology Times - Sina".
Those skilled in the art will understand that the above manners of obtaining one or more search results corresponding to a query sequence are merely examples; other existing or future manners of obtaining one or more search results corresponding to a query sequence, if applicable to the present invention, should also fall within the scope of protection of the present invention and are incorporated herein by reference.
Then, the search processing device performs subsequent processing on the one or more search results according to the matching degree information between the page description information of the pages corresponding to the search results and the query sequence. Specifically, the search processing device first performs semantic analysis on the page description information of the page corresponding to each search result, and determines the matching degree information between that page description information and the query sequence from the proportion of the words corresponding to the query sequence among the total words included in the page description information. For example, when the proportion is greater than 0.95, the matching degree information is determined to be a high match; when the proportion is between 0.7 and 0.95, it is determined to be a moderate match; and when the proportion is less than 0.7, it is determined to be a low match. The search processing device then performs subsequent processing on the one or more search results according to this matching degree information, such as adjusting the order of the search results or screening them. For example, continuing the example above, suppose the page description information of the page corresponding to search result A matches the query sequence "iphone protective case accessories" more closely than that of search result B, which in turn matches it more closely than that of search result C. The search processing device then determines from the matching degree information that search results A, B and C are to be arranged in the order A, B, C; that is, when the user obtains the search results corresponding to the query sequence "iphone protective case accessories", search result A is placed before search result B, and search result B before search result C. As another example, the search processing device may also screen search results A, B and C according to the matching degree information, for instance filtering the search results so that search result C, whose matching degree is low, is not provided to the user.
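The ratio-and-threshold scheme above can be illustrated with a short Python sketch. It assumes the page description information has already been split into words (the semantic analysis step is not modelled), and it treats the boundary values 0.7 and 0.95 as belonging to the moderate band, which the text leaves open. The function names and sample word lists are hypothetical.

```python
def match_ratio(query_terms, description_terms):
    """Share of the description's words that also occur in the query sequence."""
    if not description_terms:
        return 0.0
    query = set(query_terms)
    hits = sum(1 for term in description_terms if term in query)
    return hits / len(description_terms)


def match_level(ratio):
    """Thresholds from the text; handling of the exact boundaries is an assumption."""
    if ratio > 0.95:
        return "high"
    if ratio >= 0.7:
        return "moderate"
    return "low"


def rank_and_filter(query_terms, results, drop_low=True):
    """Order results by match ratio (best first) and optionally drop low matches.

    `results` is a list of (result_id, description_terms) pairs.
    """
    scored = [(match_ratio(query_terms, terms), rid) for rid, terms in results]
    if drop_low:
        scored = [(ratio, rid) for ratio, rid in scored if match_level(ratio) != "low"]
    # sorted() is stable, so results with equal ratios keep their input order.
    scored = sorted(scored, key=lambda pair: pair[0], reverse=True)
    return [rid for _, rid in scored]
```

For a query split into `["iphone", "case", "accessories"]`, a description consisting of exactly those three words scores 1.0 (high), one with a fourth unrelated word scores 0.75 (moderate), and one sharing a single word out of four scores 0.25 (low) and would be filtered out.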
Those skilled in the art will understand that the above manners of performing subsequent processing on the one or more search results are merely examples; other existing or future manners of performing subsequent processing on the one or more search results, if applicable to the present invention, should also fall within the scope of protection of the present invention and are incorporated herein by reference.
Then, the search processing device provides at least one of the one or more search results that have undergone subsequent processing to the application corresponding to the query sequence, by means of dynamic web page technologies such as ASP, JSP or PHP, or other agreed communication methods, e.g. communication protocols such as http or https, so that the application can provide the processed search results to the user corresponding to the query sequence. Here, the application includes but is not limited to, e.g., a search engine or a browser. For example, continuing the example above, the search processing device provides search results A, B and C, after subsequent processing, to the user in the order A, B, C given by the matching degree information, for the user to browse; or, among search results A, B and C, it does not provide to the user those search results whose matching degree information is below a predetermined threshold.
Fig. 2 shows a schematic diagram of a device for determining the page description information corresponding to a target page according to a preferred embodiment of the present invention, in which the information determining device 1 includes a classification device 11', a determining device 12' and a matching device 13'. Specifically, the classification device 11' determines the classification-related information corresponding to a target page to be processed; the determining device 12' adjusts, according to the classification-related information, the candidate description information corresponding to the target page, so as to obtain the page description information corresponding to the target page; the matching device 13' determines, according to the page description information, presentation information corresponding to the target page, wherein the presentation information matches the page description information. Here, the classification device 11' and the determining device 12' are the same as or similar to the corresponding devices shown in Fig. 1, so they are not described again here and are incorporated herein by reference.
Specifically, the matching device 13' determines, according to the page description information, presentation information corresponding to the target page, wherein the presentation information matches the page description information. Here, the presentation information includes but is not limited to content that is to be displayed in the page in some carrier, such as a link, text, picture, video or animation, and that is used to convey information to the user; it includes but is not limited to, e.g., the content information of the page description information and the style information corresponding to the page description information. Specifically, the matching device 13' determines the presentation information corresponding to the page description information by querying a presentation information database for the presentation information corresponding to the description information; or it determines the presentation information corresponding to the page description information by querying the presentation information database for the resource allocation content information of the presenting user of the target page corresponding to the page description information, or of a user associated with that presenting user; wherein the presentation information matches the page description information. Here, the presentation information database may be located in the information determining device 1, or in a database connected to the information determining device 1 via a network.
For example, suppose the classification device 11' determines that the classification-related information of the target page to be processed, e.g. "iphone accessories: snap them up at low prices on Vipshop! Digital accessories special, limited-time offer" (http://www.vipshop.com/show-0-48369-0.html?), is "broad match object", and the determining device 12' determines that the page description information of this target page http://www.vipshop.com/show-0-48369-0.html? includes "iphone digital accessories (protective case accessories, chargers, etc.) - nokia digital accessories - ...". The matching device 13' can then take this page description information as the presentation information corresponding to this target page http://www.vipshop.com/show-0-48369-0.html?. As another example, continuing the example above, the matching device 13' can take as the presentation information both the content information of the page description information "iphone digital accessories (protective case accessories, chargers, etc.) - nokia digital accessories - ..." and other resource allocation content information of the presenting user corresponding to this page description information, such as "iphone sales information".
Those skilled in the art will understand that the above manners of determining the presentation information corresponding to the target page are merely examples; other existing or future manners of determining the presentation information corresponding to the target page, if applicable to the present invention, should also fall within the scope of protection of the present invention and are incorporated herein by reference.
Preferably, the information determining device 1 further includes a sensitivity device (not shown). Specifically, the sensitivity device determines content sensitivity information of the target page; the matching device 13' then determines, according to the page description information and in combination with the content sensitivity information, presentation information corresponding to the target page, wherein the presentation information matches both the page description information and the content sensitivity information.
Specifically, the sensitivity device obtains the page content information of the target page, e.g. by parsing the html source code of the target page, and determines the content sensitivity information of the target page by looking up predetermined content sensitivity information in this page content information. Here, the content sensitivity information includes but is not limited to, e.g., content suitable only for a particular group of viewers, such as adult information, and content related to accidents such as death, disease, injury, damage or other losses. For example, suppose the target page to be processed that the classification device 11' obtains is the news report "Chanel No. 5 faces a sales ban by the European Union" (http://news.163.com/12/1109/05/8FRIGU8300014AED.html). By parsing the html source code of this page, the sensitivity device finds that the page content information of this page includes words such as "sales ban" and "allergy", and thus determines that the content sensitivity information of this target page is "sales ban" and "allergy".
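The parse-then-look-up procedure above might be sketched in Python with the standard-library `html.parser` module. The list of predetermined sensitive terms, the decision to ignore script and style content, and the function names are assumptions made for illustration only.

```python
from html.parser import HTMLParser

# Hypothetical list of predetermined sensitive terms (an assumption for this sketch).
SENSITIVE_TERMS = ("sales ban", "allergy")


class _TextExtractor(HTMLParser):
    """Collects visible text, skipping the contents of <script> and <style>."""

    def __init__(self):
        super().__init__()
        self.chunks = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth:
            self._skip_depth -= 1

    def handle_data(self, data):
        if not self._skip_depth:
            self.chunks.append(data)


def content_sensitivity(html_source, terms=SENSITIVE_TERMS):
    """Return the predetermined sensitive terms found in the page's visible text."""
    parser = _TextExtractor()
    parser.feed(html_source)
    parser.close()
    text = " ".join(parser.chunks).lower()
    return [term for term in terms if term in text]
```

Plain substring matching is used here for brevity; a production system would more likely use tokenization or a trained classifier to decide what counts as sensitive content.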
Those skilled in the art will understand that the above content sensitivity information is merely an example; other existing or future content sensitivity information, if applicable to the present invention, should also fall within the scope of protection of the present invention and is incorporated herein by reference.
Those skilled in the art will understand that the above manners of determining the sensitivity information are merely examples; other existing or future manners of determining the sensitivity information, if applicable to the present invention, should also fall within the scope of protection of the present invention and are incorporated herein by reference.
Then, the matching device 13' determines, according to the page description information and in combination with the content sensitivity information, presentation information corresponding to the target page, wherein the presentation information matches both the page description information and the content sensitivity information. For example, continuing the example above, suppose the determining device 12' determines that the page description information of the target page "Chanel No. 5 faces a sales ban by the European Union" (http://news.163.com/12/1109/05/8FRIGU8300014AED.html) is vacant, i.e. the classification-related information of this target page is "mismatch object". The matching device 13' then determines, according to this page description information and in combination with the content sensitivity information "sales ban" and "allergy", that no presentation information is suitable to be provided on this page, or that the presentation information is perfume of other brands, wherein the presentation information matches both the page description information and the content sensitivity information. As another example, suppose the determining device 12' determines that the page description information of the target page "iphone accessories: snap them up at low prices on Vipshop! Digital accessories special, limited-time offer" (http://www.vipshop.com/show-0-48369-0.html?) is "iphone digital accessories (protective case accessories, chargers, etc.) - nokia digital accessories - ...", and the sensitivity device determines that the content sensitivity information of this target page indicates content suitable only for a particular group of viewers, such as adult information. The matching device 13' then determines, according to this page description information and in combination with the content sensitivity information, that the presentation information corresponding to this target page includes this page description information together with a statement prohibiting children from browsing this page, wherein the presentation information matches both the page description information and the content sensitivity information.
Those skilled in the art will understand that the above manners of determining the presentation information in combination with the content sensitivity information are merely examples; other existing or future manners of determining the presentation information in combination with the content sensitivity information, if applicable to the present invention, should also fall within the scope of protection of the present invention and are incorporated herein by reference.
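The two examples of combining page description information with content sensitivity information can be condensed into a small Python sketch. It is a simplification under stated assumptions: it models only the "present nothing" branch of the first example (not the alternative of presenting other brands), represents the sensitivity information as a list of sensitive terms plus an adult-only flag, and uses hypothetical names and notice wording throughout.

```python
def choose_presentation(page_description, sensitive_terms=(), adult_only=False):
    """Pick presentation information from a page description and sensitivity data.

    Returns None when nothing is suitable to present, otherwise a dict holding
    the content and an optional viewing notice.
    """
    # Vacant description or accident-type sensitive terms: present nothing.
    if not page_description or sensitive_terms:
        return None
    # Content suited only to a particular audience: keep it, add a notice.
    notice = "Children must not browse this page" if adult_only else None
    return {"content": page_description, "notice": notice}
```

Called with an empty description and the terms "sales ban" and "allergy", the function returns `None`; called with the iphone accessories description and `adult_only=True`, it returns the description together with the notice.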
In a preferred embodiment (with reference to Fig. 2), the information determining device 1 includes a classification device 11', a determining device 12', a matching device 13', a generating device (not shown) and a providing device (not shown), wherein the classification device 11' includes an acquisition unit 111' (not shown) and a classification unit 112' (not shown). This preferred embodiment is described below with reference to Fig. 2. Specifically, the acquisition unit 111' obtains the accessed page visited by a user and takes it as the target page; the classification unit 112' determines the classification-related information corresponding to the target page; the determining device 12' adjusts, according to the classification-related information, the candidate description information corresponding to the target page, so as to obtain the page description information corresponding to the target page; the matching device 13' determines, according to the page description information, presentation information corresponding to the target page, wherein the presentation information matches the page description information; the generating device updates the target page according to the presentation information, so as to generate a corresponding result page, wherein the result page includes the presentation information; the providing device provides the result page to the user. Here, the determining device 12' is the same as or similar to the corresponding device shown in Fig. 1, and the matching device 13' is the same as or similar to the corresponding device shown in Fig. 2, so they are not described again here and are incorporated herein by reference.
Specifically, the acquisition unit 111' first obtains the user's page access request and takes the page corresponding to the page access request as the target page; or it obtains the accessed page visited by the user through an application programming interface (API) provided by a third-party device such as a browser or search engine, and takes it as the target page. For example, the user enters http://news.sina.com.cn/ in the browser address bar and presses the Enter key; the acquisition unit 111' then obtains the user's page access request through the application programming interface (API) provided by the browser. Next, the acquisition unit 111' sends a corresponding page access request to the page server according to this page URL, obtains the page http://news.sina.com.cn/ corresponding to the page access request from the HTML response returned by the page server, and takes the page http://news.sina.com.cn/ as the target page. As another example, suppose the user enters the keywords "iphone protective case accessories" in the search box of a search engine and then clicks the search button. The acquisition unit 111' then obtains the user's page access request through the application programming interface (API) provided by the search engine, submits a page search request to the search engine based on this query sequence, and receives the one or more search results corresponding to the query sequence "iphone protective case accessories" that the search engine returns, such as search result A "Home - Miduoduo Apple digital accessories genuine discount store", search result B "... 3C Apple accessories iphone shell phone case wholesale and retail protective case", and search result C "Unique protective cases, iphone4s accessory recommendations - Mobile phones - Technology Times - Sina". The acquisition unit 111' then takes the search results page containing these search results as the target page.
The classification unit 112' determines the classification-related information corresponding to the target page. Here, the manner in which the classification unit 112' determines the classification-related information corresponding to the target page is the same as the manner in which the classification device 11 in Fig. 1 does so; for brevity, it is not described again here and is incorporated herein by reference.
Preferably, the classification unit 112' may also determine the classification-related information corresponding to the target page in combination with user operation information of the user;
wherein the user operation information includes at least any one of the following:
- page access session information of the user concerning the accessed page;
- page access record information of the user;
- the page search record corresponding to the accessed page.
For example, when the user operation information includes the page access session information of the user concerning the accessed page (here, the page access session information includes but is not limited to, e.g., a user's consecutive access operations on accessed pages), suppose that while browsing the page corresponding to a search result such as "iphone accessories: snap them up at low prices on Vipshop! Digital accessories special, limited-time offer" (http://www.vipshop.com/show-0-48369-0.html?), the user also searches for other information about accessories he or she needs, such as "Apple data cable, white"; the classification unit 112' then determines that the classification-related information corresponding to this target page is "broad match object". As another example, when the user operation information includes the page access record information of the user, suppose the acquisition unit 111' obtains the page access request submitted by the user for an accessed page such as "Sample essay on a rowing race" (http://www.qc99.com/xiaoxue/sinj/101176.Html), and the user frequently visits pages about, e.g., how to write essays; the classification unit 112' then determines that the classification-related information corresponding to the accessed page "Sample essay on a rowing race" (http://www.qc99.com/xiaoxue/sinj/101176.Html) is a virtual theme, such as writing.
Those skilled in the art will understand that the above manners of determining the classification-related information in combination with the user operation information of the user are merely examples; other existing or future manners of determining the classification-related information in combination with the user operation information of the user, if applicable to the present invention, should also fall within the scope of protection of the present invention and are incorporated herein by reference.
The generating device updates the target page according to the presentation information, e.g. by embedding the presentation information in the target page, so as to generate a corresponding result page, wherein the result page includes the presentation information. For example, suppose the matching device 13' determines that the presentation information corresponding to the target page "iphone accessories: snap them up at low prices on Vipshop! Digital accessories special, limited-time offer" (http://www.vipshop.com/show-0-48369-0.html?) includes the page description information "iphone digital accessories (protective case accessories, chargers, etc.) - nokia digital accessories - ...". The generating device can then update this target page according to this presentation information, embedding the presentation information in this target page, e.g. in the navigation block region of this target page, wherein the presentation information matches the page description information.
The providing device provides the result page to the user by means of dynamic web page technologies such as ASP, JSP or PHP, or other agreed communication methods, e.g. communication protocols such as http or https.
Preferably, the information determining device 1 further includes a position determining device (not shown). Specifically, the position determining device determines target position information corresponding to the presentation information in the target page; the generating device then updates the target page according to the presentation information and in combination with the target position information, so as to generate the corresponding result page, wherein the result page includes the presentation information at the position corresponding to the target position information.
Specifically, the position determining device determines the target position information corresponding to the presentation information in the target page. Here, the target position information indicates the position in the target page at which the presentation information is to be embedded, e.g. a position in the target page that is convenient for the user to browse, or the navigation block region of the target page. Here, the manners in which the position determining device determines the target position information include but are not limited to at least any one of the following:
1) According to the page layout information of the target page, determine the target position information, e.g. take a blank region of the target page, such as the right sidebar of the page, as the target position information, or take a region of the target page that readily attracts the user's attention, such as the area around the search box on a search page, as the target position information. For example, suppose the target page to be processed that the acquisition unit 111' obtains is "iphone accessories: snap them up at low prices on Vipshop! Digital accessories special, limited-time offer" (http://www.vipshop.com/show-0-48369-0.html?), and the position determining device parses this target page, e.g. by html tag parsing or according to the VIPS (Vision-based Page Segmentation) algorithm, to obtain the style information of this target page, such as its page layout information, in which the right sidebar of the page is a blank region. The position determining device can then take the right sidebar region of this target page as the target position information.
2) According to the page content information of the target page, take the content position region of the target page that matches the content of the presentation information as the target position information. For example, suppose the target page that the acquisition unit 111' obtains is the page http://www.vipshop.com/show-0-48369-0.html?, and the presentation information determined by the matching device 13' includes content such as "iphone digital accessories (protective case accessories, chargers, etc.) - nokia digital accessories - ...". By parsing this target page, the position determining device finds that it contains multiple content channels, such as "luxury accessories", "Vipshop Group" and "Vipshop Shang". The position determining device then takes the content position region of this target page that matches the content of the presentation information as the target position information, i.e. it takes the "Vipshop Shang" channel region of this target page as the target position information for the presentation information.
3) According to the page-related information of the target page, combined with the user's page access record information, determine the target position information corresponding to the presentation information in the target page. For example, suppose the pending target page obtained by acquisition unit 111' is the page http://www.vipshop.com/show-0-48369-0.html?, and the presentation information determined by matching means 13' includes content such as "iphone digital accessories (protective cases, chargers, etc.) - nokia digital accessories - ...". Suppose also that user user frequently clicks content links in the top region of this target page. The position determination means then combines the page access record information of user user and uses the position that user user frequently accesses in this target page, here the top region of the page, as the target position information corresponding to the presentation information in this target page.
Those skilled in the art will understand that the above ways of determining the target position information are merely examples; other existing or future ways of determining the target position information, if applicable to the present invention, should also fall within the scope of protection of the present invention and are incorporated herein by reference.
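The three ways of determining the target position information can be illustrated with a minimal Python sketch. It assumes the page has already been segmented into named regions (for example by HTML tag analysis or VIPS, neither of which is implemented here); the region dictionary, its "blank" and "text" fields, and the function names are hypothetical and do not come from the patent.

```python
from collections import Counter

def position_from_layout(regions):
    """Way 1: return the first region the layout marks as blank (assumed field)."""
    for name, info in regions.items():
        if info.get("blank"):
            return name
    return None

def position_from_content(regions, presentation_text):
    """Way 2: return the region sharing the most words with the presentation info."""
    words = set(presentation_text.lower().split())
    best, best_score = None, 0
    for name, info in regions.items():
        score = len(words & set(info.get("text", "").lower().split()))
        if score > best_score:
            best, best_score = name, score
    return best

def position_from_history(click_log):
    """Way 3: return the region the user clicked most often."""
    if not click_log:
        return None
    return Counter(click_log).most_common(1)[0][0]

# Hypothetical segmentation result for the example page.
regions = {
    "top": {"text": "luxury watches", "blank": False},
    "right_sidebar": {"text": "", "blank": True},
    "fashion_channel": {"text": "iphone accessories cases chargers", "blank": False},
}
print(position_from_layout(regions))
print(position_from_content(regions, "iphone accessories chargers"))
print(position_from_history(["top", "top", "right_sidebar"]))
```

Word overlap is only a stand-in for whatever content matching an actual implementation would use.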
Then, the generating means updates the target page according to the presentation information, combined with the target position information, so as to embed the presentation information at the target position of the target page and generate the corresponding result page, wherein the result page includes the presentation information at the position corresponding to the target position information. For example, continuing the example above, suppose the position determination means determines that the target position information of the presentation information "iphone digital accessories (protective cases, chargers, etc.) - nokia digital accessories - ..." in the target page http://www.vipshop.com/show-0-48369-0.html? is the upper-right region of the page. The generating means then embeds the determined presentation information at that target position of the target page to generate the corresponding result page.
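As a rough illustration of this embedding step, the sketch below inserts the presentation information into a container element identified by a hypothetical id. The marker format, the `presentation` class name and the function name are assumptions made for this example; a real implementation would more likely operate on a parsed DOM than on raw strings.

```python
import html

def embed_presentation(page_html, target_id, presentation_text):
    """Insert the presentation info right after the opening tag of the target region.

    Returns the page unchanged if the target region is not found.
    """
    marker = f'<div id="{target_id}">'
    if marker not in page_html:
        return page_html
    # Escape the text so it cannot break the surrounding markup.
    block = f'{marker}<p class="presentation">{html.escape(presentation_text)}</p>'
    return page_html.replace(marker, block, 1)

page = '<body><div id="main">cases</div><div id="right-sidebar"></div></body>'
result = embed_presentation(page, "right-sidebar", "iphone accessories & chargers")
print(result)
```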
Preferably, information determination device 1 further includes a style determination means (not shown). Specifically, the style determination means determines the target style information corresponding to the presentation information in the target page; the generating means then updates the target page according to the presentation information, combined with the target style information, to generate the corresponding result page, wherein the result page includes the presentation information rendered in accordance with the target style information.
Specifically, the style determination means determines the target style information corresponding to the presentation information in the target page. Here, the ways in which the style determination means determines this target style information include, but are not limited to, any of the following:
1) Determine the target style information corresponding to the presentation information in the target page according to the style-related information of the target page. Specifically, the style determination means first determines the style-related information of the target page; then, according to that style-related information, it either extracts one or more style settings from it as the target style information of the presentation information, or directly uses the style-related information of the target page as the target style information of the presentation information. For example, suppose acquisition unit 111' obtains the target page "Vipshop brand fashion discount store" (http://www.vipshop.com/show-0-48369-0.html?), and the presentation information determined by matching means 13' includes content such as "iphone digital accessories (protective cases, chargers, etc.) - nokia digital accessories - ...". The style determination means may first parse the target page, for example by HTML tag analysis or by the VIPS (Vision-based Page Segmentation) algorithm, and obtain style-related information of the target page that includes block features, such as the top navigation block, breadcrumb navigation, the body-text block, the left-column content block, the right-column content block and the bottom information-link block, as well as page style settings, such as a gray font color and a pink page tone. The style determination means may then determine the target style information of the presentation information according to this style-related information, for example by setting the page tone and font color of the presentation information to be consistent with those of the target page, that is, a pink page tone and a gray font color.
2) According to the application category information of the presentation information, perform a matching query in a page style database to obtain the page style information corresponding to that application category information and use it as the target style information, wherein the page style database contains the mapping between application categories and page styles. Here, the application category information includes, but is not limited to, the industry category of the page corresponding to the page access request, such as food, environmental protection, news, cosmetics, flowers, automobiles or novels. For example, suppose the application category information of the presentation information belongs to the food industry; the style determination means then performs a matching query in the page style database and obtains page style information corresponding to that application category information, including breadcrumb navigation, a body-summary block, a green page background and a black page font color. As another example, suppose the application category information of the presentation information belongs to the cosmetics industry; the style determination means then performs a matching query in the page style database and obtains page style information corresponding to that application category information, including breadcrumb navigation, a body-summary block, a warm-toned page background such as pink, and a white page font color. Here, the page style database may be located in information determination device 1, or in a server connected to information determination device 1 via a network.
Those skilled in the art will understand that the above ways of determining the target style information corresponding to the presentation information in the target page are merely examples; other existing or future ways of determining this target style information, if applicable to the present invention, should also fall within the scope of protection of the present invention and are incorporated herein by reference.
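The two ways of determining the target style information can be sketched as follows. The in-memory dictionary stands in for the page style database, and its keys, field names and the function names are assumptions made for illustration; only the food and cosmetics values mirror the examples in the text.

```python
# Hypothetical stand-in for the page style database (application category -> page style).
STYLE_DB = {
    "food": {"background": "green", "font_color": "black"},
    "cosmetics": {"background": "pink", "font_color": "white"},
}

def style_from_page(page_style, keys=("background", "font_color")):
    """Way 1: reuse selected style settings extracted from the target page."""
    return {k: page_style[k] for k in keys if k in page_style}

def style_from_category(category, style_db=STYLE_DB):
    """Way 2: look up the style mapped to the application category, or None."""
    return style_db.get(category)

# Style-related info as it might be parsed from the example target page.
target_page_style = {
    "background": "pink",
    "font_color": "gray",
    "blocks": ["top_navigation", "breadcrumb", "body"],
}
print(style_from_page(target_page_style))
print(style_from_category("cosmetics"))
```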
Then, the generating means updates the target page according to the presentation information, combined with the target style information, to generate the corresponding result page, wherein the result page includes the presentation information rendered in accordance with the target style information. For example, continuing the example above, suppose the style determination means determines that the target style information corresponding to the presentation information "iphone digital accessories (protective cases, chargers, etc.) - nokia digital accessories - ..." in the target page http://www.vipshop.com/show-0-48369-0.html? includes breadcrumb navigation, a body-summary block, a warm-toned page background such as pink, and a white page font color. The generating means then embeds the presentation information into the target page in the display form given by this target style information to generate the corresponding result page.
Those skilled in the art will understand that the above way of generating the result page in combination with the target style information is merely an example; other existing or future ways of generating the result page in combination with the target style information, if applicable to the present invention, should also fall within the scope of protection of the present invention and are incorporated herein by reference.
Fig. 3 shows a flowchart of a method for determining the page description information corresponding to a target page according to another aspect of the present invention.
Specifically, in step S1, information determination device 1 determines the classification-related information corresponding to the pending target page; in step S2, information determination device 1 adjusts, according to the classification-related information, the candidate description information corresponding to the target page, to obtain the page description information corresponding to the target page. Here, information determination device 1 includes, but is not limited to, a network device, a user device, or a device formed by integrating a network device and a user device via a network. The network device includes, but is not limited to, an implementation such as a network host, a single network server, a set of multiple network servers, or a cloud-computing-based set of computers; it may also be implemented by a user device. Here, a cloud consists of a large number of hosts or network servers based on cloud computing, where cloud computing is a form of distributed computing: a super virtual computer composed of a group of loosely coupled computers. The user device may be any electronic product capable of human-computer interaction with a user via a keyboard, mouse, touch pad, touch screen, handwriting device or the like, such as a computer, mobile phone, PDA, palmtop computer (PPC) or tablet computer. The network includes, but is not limited to, the Internet, a wide area network, a metropolitan area network, a local area network, a VPN, or a wireless ad hoc network. Those skilled in the art will understand that the above information determination device 1 is merely an example; other existing or future network devices or user devices, if applicable to the present invention, should also fall within the scope of protection of the present invention and are incorporated herein by reference. Here, both the network device and the user device include an electronic device capable of automatically performing numerical computation and information processing according to preset or stored instructions, whose hardware includes, but is not limited to, a microprocessor, an application-specific integrated circuit (ASIC), a field-programmable gate array (FPGA), a digital signal processor (DSP), an embedded device and the like.
Specifically, in step S1, information determination device 1 first obtains the pending target page, for example via an application programming interface (API) provided by a third-party device such as a browser or search engine; or it obtains, via dynamic web page technologies such as ASP or JSP, the query sequence entered by the user through the user device, submits this query sequence to a search engine, and receives the search results fed back by the search engine for this query sequence as the pending target pages; or it obtains the pending target page via an agreed communication protocol such as http or https. Then, in step S1, information determination device 1 determines the classification-related information corresponding to the target page. Here, the classification-related information includes, but is not limited to, at least any one of the following:
1) Virtual theme. A virtual theme means that the body content of the target page reflects the access intent of the users who visit it. For example, suppose the body content of the target page "Rowing race essay sample" (http://www.qc99.com/xiaoxue/sinj/101176.Html) is a sample essay about a rowing race, and users browsing this page want to learn about essay writing; the classification-related information corresponding to this target page is then a virtual theme such as essay writing. As another example, suppose the body content of the target page "Flower material download" (http://sucai.redocn.com/category/260/) is pictures of flowers, and users browsing this page want to obtain flower material for graphic design work; the classification-related information corresponding to this target page is then a virtual theme such as graphic-design material.
2) Exact-match object. An exact-match object means that the target page contains content information fully consistent with the user's need, and that this need is not substitutable. For example, suppose the target page "Beijing oral-medicine experts - Good Doctor Online" (http://www.haodf.com/jibing/kouqiangkuiyang/daifu.htm?Province=beijing) contains information about hospitals and attending doctors for the disease "oral ulcer", and users browsing this page want a page with information about treating "oral ulcer" rather than other diseases such as "rhinitis"; the classification-related information corresponding to this target page is then an exact-match object. As another example, suppose the target page "IBM minicomputer IBM POWER720" (http://www.xinhuigroup.com/Product/10026/11479.html) contains information such as the product description and specifications of the IBM minicomputer IBM POWER720, and users browsing this page want a page with information about the IBM POWER720 rather than other models such as the "IBM POWER 550"; the classification-related information corresponding to this target page is then an exact-match object.
3) Broad-match object. A broad-match object means that the content information of the target page is related to the user's need. For example, suppose the target page is "iphone5 pink protective case with heart pattern on the back" (http://www.vipshop.com/show-0-48369-0.html?), and users browsing this page may also be interested in other accessories for the iphone5, such as an "Apple data cable", and in other brands of the same product type as the "iphone5", namely smartphones, such as "nokia" smartphones; the classification-related information corresponding to this target page is then a broad-match object.
4) Non-match object. A non-match object means that the target page is not suited to presenting the user with any presentation information beyond the content information contained in the page itself. For example, when a user browses a news report such as "Expert: Obama is both foe and friend, will deepen the return-to-Asia-Pacific strategy" (http://news.sina.com.cn/w/sd/2012-11-08/021925532469.shtml), the user attends to the content of the report and pays no attention to other content in the page; the classification-related information corresponding to this page is then a non-match object, such as a news report.
Those skilled in the art will understand that the above classification-related information is merely an example; other existing or future classification-related information, if applicable to the present invention, should also fall within the scope of protection of the present invention and is incorporated herein by reference.
For example, a user enters the address http://news.sina.com.cn/ in the browser address bar and presses the Enter key; in step S1, information determination device 1 obtains the web page corresponding to http://news.sina.com.cn/ via an application programming interface (API) provided by a third-party device such as the news website. As another example, a user enters the keyword "iphone accessories" in the search box on a user device such as a PC and clicks the search button; classification means 11 then obtains the query sequence entered by the user from the user device via a dynamic web page technology such as JSP or ASP, submits a search request based on this query sequence to a search engine, and obtains, via the application programming interface (API) provided by the search engine, one or more search results that the search engine matched against the keyword "iphone accessories", such as "iphone accessories [quotes, prices, reviews, genuine, licensed goods]" and "iphone accessories - Apple Store (China)", as the pending target pages.
Those skilled in the art will understand that the above ways of obtaining the pending target page are merely examples; other existing or future ways of obtaining the pending target page, if applicable to the present invention, should also fall within the scope of protection of the present invention and are incorporated herein by reference.
Then, in step S1, information determination device 1 determines the classification-related information corresponding to the pending target page. Here, the ways in which information determination device 1 determines the classification-related information corresponding to the target page in step S1 include, but are not limited to, at least any one of the following:
1) Determine the classification-related information corresponding to the target page according to the body content of the target page. Specifically, in step S1, information determination device 1 first extracts the body content of the target page, for example by HTML tag analysis; or it partitions the target page into blocks according to the VIPS (Vision-based Page Segmentation) algorithm, using visual features such as the foreground color, background color, font color and size, borders, the spacing between logical blocks, and element positions, to obtain the body-content block of the target page. Then, in step S1, information determination device 1 determines the classification-related information corresponding to the target page according to its body content. For example, suppose that in step S1 the target page first obtained by information determination device 1 is a news report such as "Expert: Obama is both foe and friend, will deepen the return-to-Asia-Pacific strategy" (http://news.sina.com.cn/w/sd/2012-11-08/021925532469.shtml). In step S1, information determination device 1 extracts, for example by HTML tag analysis, the body content of this target page, namely the news report "Obama is both foe and friend, will deepen the return-to-Asia-Pacific strategy", and then determines that the classification-related information corresponding to this target page is a non-match object. As another example, suppose that in step S1 the target page first obtained by information determination device 1 is the page "Beijing oral-medicine experts - Good Doctor Online" (http://www.haodf.com/jibing/kouqiangkuiyang/daifu.htm?Province=beijing), which concerns treatment of a disease such as "oral ulcer" and contains content information fully consistent with the user's need. In step S1, information determination device 1 then determines that the classification-related information corresponding to this target page is an exact-match object.
2) Determine the classification-related information corresponding to the target page according to the page access record information of the users who visit the target page. For example, user user is browsing a page such as "iPhone accessories: Vipshop low-price flash sale! Digital accessories limited-time special" (http://www.vipshop.com/show-0-48369-0.html?), and this user is also interested in other accessories for the iphone5, such as an "Apple data cable", and in other brands of the same product type as the "iphone5", namely smartphones, such as "nokia" smartphones. In step S1, information determination device 1 then determines that the classification-related information corresponding to this target page is a broad-match object.
Those skilled in the art will understand that the above ways of determining the classification-related information are merely examples; other existing or future ways of determining the classification-related information, if applicable to the present invention, should also fall within the scope of protection of the present invention and are incorporated herein by reference.
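The content-based way of determining the classification-related information can be sketched as body-text extraction followed by a classification rule. The text does not specify the rule, so the keyword table below is purely a hypothetical placeholder, and skipping `script`, `style`, `nav`, `header` and `footer` elements is only a crude approximation of HTML tag analysis, not the VIPS algorithm.

```python
from html.parser import HTMLParser

class BodyTextExtractor(HTMLParser):
    """Collect visible text, skipping elements unlikely to hold body content."""
    SKIP = {"script", "style", "nav", "header", "footer"}

    def __init__(self):
        super().__init__()
        self._skip_depth = 0
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIP:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIP and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        if self._skip_depth == 0 and data.strip():
            self.chunks.append(data.strip())

def extract_body_text(page_html):
    parser = BodyTextExtractor()
    parser.feed(page_html)
    parser.close()
    return " ".join(parser.chunks)

# Hypothetical keyword rules; a trained page classification model could replace them.
RULES = [
    ("non-match object", ("news", "report")),
    ("exact-match object", ("doctor", "hospital", "specification")),
    ("virtual theme", ("essay", "material")),
]

def classify(body_text, default="broad-match object"):
    """Return the label of the first rule with a keyword in the text, else the default."""
    text = body_text.lower()
    for label, keywords in RULES:
        if any(k in text for k in keywords):
            return label
    return default

page = "<html><nav>Home</nav><script>var a=1;</script><p>News report: expert comments</p></html>"
body = extract_body_text(page)
print(body, "->", classify(body))
```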
In step S2, information determination device 1 adjusts, according to the classification-related information, the candidate description information corresponding to the target page, to obtain the page description information corresponding to the target page. Here, the candidate description information includes, but is not limited to, a description of the body content of the target page and a description of the classification-related information corresponding to the target page. Specifically, in step S2, information determination device 1 first obtains the candidate description information corresponding to the target page, for example by performing word-frequency statistics on the page content of the target page, or by calling a page candidate description application programming interface (API) provided by the third-party website to which the target page belongs. Then, in step S2, information determination device 1 adjusts the candidate description information corresponding to the target page according to the classification-related information determined by the classification means, to obtain the page description information corresponding to the target page. Those skilled in the art will understand that the above candidate description information is merely an example; other existing or future candidate description information, if applicable to the present invention, should also fall within the scope of protection of the present invention and is incorporated herein by reference. Here, the adjustment operation includes at least any one of the following:
- when the classification-related information includes the virtual theme, performing a matching query in a virtual theme database according to the candidate description information, and using the corresponding matching query result as the page description information;
- when the classification-related information includes the exact-match object, using the candidate description information as the page description information;
- when the classification-related information includes the broad-match object, performing a matching query in a broad-object database according to the candidate description information, and using the candidate description information together with its corresponding matching query result as the page description information;
- when the classification-related information includes the non-match object, emptying the candidate description information and using the result as the page description information.
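The four adjustment operations amount to a dispatch on the classification-related information, sketched below in Python. The plain dictionaries stand in for the virtual theme database and the broad-object database, the label strings and function name are hypothetical, and returning an empty string when a virtual-theme lookup finds nothing is an assumption the text does not address.

```python
def adjust_description(category, candidate, theme_db=None, broad_db=None):
    """Adjust the candidate description according to the page's classification."""
    theme_db = theme_db or {}
    broad_db = broad_db or {}
    if category == "virtual theme":
        # Replace the candidate with the matching query result.
        return theme_db.get(candidate, "")
    if category == "exact-match object":
        # Keep the candidate description unchanged.
        return candidate
    if category == "broad-match object":
        # Keep the candidate and append the related entries found for it.
        related = broad_db.get(candidate, [])
        return " - ".join([candidate, *related])
    if category == "non-match object":
        # Empty the description.
        return ""
    raise ValueError(f"unknown classification: {category!r}")

# Hypothetical database contents for illustration.
themes = {"rowing race essay sample": "essay writing"}
broad = {"digital accessories special": ["iphone digital accessories", "nokia digital accessories"]}

print(adjust_description("virtual theme", "rowing race essay sample", theme_db=themes))
print(adjust_description("broad-match object", "digital accessories special", broad_db=broad))
```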
For example, suppose that in step S1 information determination device 1 determines that the classification-related information corresponding to a pending target page such as "Rowing race essay sample" (http://www.qc99.com/xiaoxue/sinj/101176.Html) is the virtual theme. In step S2, information determination device 1 first calls the page candidate description application programming interface (API) provided by qc99, the third-party website to which the target page http://www.qc99.com/xiaoxue/sinj/101176.Html belongs, and obtains candidate description information for this target page that includes content such as "Rowing race essay sample". Then, in step S2, information determination device 1 performs a matching query in the virtual theme database according to this candidate description information and obtains a matching query result such as "page body content: rowing race essay sample - corresponding classification-related information: virtual theme (essay writing)", and uses this matching query result as the page description information. Here, the virtual theme database stores multiple virtual themes; it may be located in information determination device 1, or in a server connected to information determination device 1 via a network.
As another example, suppose that in step S1 information determination device 1 determines that the classification-related information of a pending target page such as "Beijing oral-medicine experts - Good Doctor Online" (http://www.haodf.com/jibing/kouqiangkuiyang/daifu.htm?Province=beijing), which concerns treatment of a disease such as "oral ulcer", is an exact-match object. In step S2, information determination device 1 first performs word-frequency statistics on the page content of this target page and obtains candidate description information for it that includes "treatment of the disease 'oral ulcer' - corresponding classification-related information: exact-match object" and the like. Then, in step S2, information determination device 1 uses this candidate description information as the page description information.
As a further example, suppose that in step S1 information determination device 1 determines that the classification-related information of a pending target page such as "iPhone accessories: Vipshop low-price flash sale! Digital accessories limited-time special" (http://www.vipshop.com/show-0-48369-0.html?) is a broad-match object. In step S2, information determination device 1 first performs word-frequency statistics on the page content of the target page http://www.vipshop.com/show-0-48369-0.html? and obtains candidate description information for it that includes "digital accessories special" and the like. Then, in step S2, information determination device 1 performs a matching query in the broad-object database according to this candidate description information, obtains a matching query result such as "iphone digital accessories (protective cases, chargers, etc.) - nokia digital accessories - ...", and uses the candidate description information together with its corresponding matching query result as the page description information. Here, the broad-object database includes a set of categories of broad objects, and each broad object may itself be further classified; the database may be located in information determination device 1, or in a server connected to information determination device 1 via a network.
As yet another example, suppose that in step S1 information determination device 1 determines that the classification-related information of a pending target page that is a news report, such as "Expert: Obama is both foe and friend, will deepen the return-to-Asia-Pacific strategy" (http://news.sina.com.cn/w/sd/2012-11-08/021925532469.shtml), is a non-match object. In step S2, information determination device 1 first calls the page candidate description application programming interface (API) provided by sina, the third-party website to which this target page belongs, and obtains candidate description information for this target page that includes "news report - corresponding classification-related information: non-match object". Then, in step S2, information determination device 1 empties this candidate description information and uses the result as the page description information; that is, the page description information corresponding to this target page is empty.
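Obtaining candidate description information by word-frequency statistics, as in the examples above, might look like the following minimal sketch. The stopword list, the ASCII-only tokenizer and the `top_n` cutoff are assumptions made for this illustration; Chinese page content would need a word segmenter in place of the regular expression.

```python
import re
from collections import Counter

# Small illustrative stopword list (an assumption, not from the patent).
STOPWORDS = {"the", "a", "and", "of", "for", "to", "in"}

def candidate_description(page_text, top_n=3):
    """Return the top_n most frequent non-stopword terms of the page text."""
    words = [w for w in re.findall(r"[a-z0-9]+", page_text.lower())
             if w not in STOPWORDS]
    return [word for word, _ in Counter(words).most_common(top_n)]

text = ("Digital accessories sale: iphone accessories, iphone cases "
        "and iphone chargers for sale")
terms = candidate_description(text)
print(terms)
```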
Those skilled in the art will understand that the above ways of adjusting the candidate description information corresponding to the target page are merely examples; other existing or future ways of adjusting the candidate description information corresponding to the target page, if applicable to the present invention, should also fall within the scope of protection of the present invention and are incorporated herein by reference.
Those skilled in the art will understand that the above ways of obtaining the page description information corresponding to the target page are merely examples; other existing or future ways of obtaining the page description information corresponding to the target page, if applicable to the present invention, should also fall within the scope of protection of the present invention and are incorporated herein by reference.
The steps performed by information determining device 1 operate continuously with one another. Specifically, in step S1, information determining device 1 continuously determines the classification-related information corresponding to the pending target page; in step S2, information determining device 1 continuously adjusts, according to the classification-related information, the candidate description information corresponding to the target page, so as to obtain the page description information corresponding to the target page. Here, those skilled in the art will understand that "continuously" means that the steps of information determining device 1 keep determining classification-related information and obtaining page description information, until information determining device 1 stops determining classification-related information for an extended period of time.
Preferably, the method performed by information determining device 1 further includes step S4 (not shown). Specifically, in step S4, information determining device 1 performs machine learning on a plurality of training pages annotated with classification information, so as to obtain a page classification model for classifying pages; in step S1, information determining device 1 then determines the classification-related information according to the page classification model, based on page-related information of the target page.
Specifically, in step S4, information determining device 1 performs machine learning on a plurality of training pages annotated with classification information, so as to obtain a page classification model for classifying pages. For example, assume that the training pages annotated with classification information are as follows:
I: Rowing regatta composition model essay, http://www.qc99.com/xiaoxue/sinj/101176.Html, virtual theme
II: sina / reading / novels / world masterpieces / "The Count of Monte Cristo", http://vip.book.sina.com.cn/book/index_81300.html, virtual theme
III: Beijing oral health experts - Good Doctor Online, http://www.haodf.com/jibing/kouqiangkuiyang/daifu.htm?province=beijing, exact match object
IV: sina sports news, http://sports.sina.com.cn/, mismatch object
V: sina financial news, http://finance.sina.com.cn/, mismatch object
VI: Vipshop digital accessories, http://www.vipshop.com/show-0-48369-0.html?, broad match object
VII: Dangdang.com skincare products, http://cosmetic.dangdang.com/, broad match object
Then, in step S4, information determining device 1 performs machine learning on these training pages annotated with classification information, for example by applying linear regression analysis or non-linear regression analysis to the training set, and obtains a page classification model for classifying pages, such as a decision tree. Each node of the decision tree corresponds to a page classification, and each page classification includes one or more of the training pages: the virtual theme classification includes pages I and II, the exact match object classification includes page III, the mismatch object classification includes pages IV and V, and the broad match object classification includes pages VI and VII.
Then, in step S1, information determining device 1 determines the classification-related information according to the page classification model, based on page-related information of the target page. Here, the page-related information includes, but is not limited to, the page body content category, page structure features, and the like. For example, assume that in step S1 the pending target page obtained by information determining device 1 is "Rowing regatta composition model essay" (http://www.qc99.com/xiaoxue/sinj/101176.Html). Then, in step S1, information determining device 1 may, according to the page classification model obtained in step S4 and based on page-related information of the target page such as its page body content information, compare the page body content category of the target page with the page body content categories of the training pages included in each page classification of the model. If, for instance, the page body content category of the target page is determined to be the composition type, which is consistent with the page content category of the training pages included in the virtual theme classification, then in step S1 information determining device 1 determines that the classification-related information of the target page is virtual theme.
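The comparison described above can be illustrated with a minimal sketch. It is not the decision tree of the embodiment: a plain lookup table stands in for the learned model, and the body content types ("essay", "news" and so on), the function names and the shortened page titles are assumptions introduced only for illustration.

```python
from collections import defaultdict

# Hypothetical labelled training pages: (title, body content type, category).
# The body content types are illustrative assumptions, not values from the text.
TRAINING_PAGES = [
    ("Rowing regatta composition model essay", "essay", "virtual theme"),
    ("The Count of Monte Cristo", "novel", "virtual theme"),
    ("Beijing oral health experts", "doctor listing", "exact match object"),
    ("sina sports news", "news", "mismatch object"),
    ("sina financial news", "news", "mismatch object"),
    ("Vipshop digital accessories", "product listing", "broad match object"),
    ("Dangdang.com skincare products", "product listing", "broad match object"),
]


def build_model(training_pages):
    """Group the body content types seen in training under each category."""
    model = defaultdict(set)
    for _title, content_type, category in training_pages:
        model[category].add(content_type)
    return dict(model)


def classify(model, content_type):
    """Return the category whose training pages share this content type."""
    for category, content_types in model.items():
        if content_type in content_types:
            return category
    return None  # no category has a training page of this type


model = build_model(TRAINING_PAGES)
print(classify(model, "essay"))
```

A target page whose body content type matches no training page yields `None` here; how such a page should be handled is left open by the text.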
Preferably, the method performed by information determining device 1 further includes step S5 (not shown). Specifically, in step S5, information determining device 1 first obtains one or more search results corresponding to a query sequence; then, according to matching degree information between the page description information of the page corresponding to each search result and the query sequence, it performs subsequent processing on the one or more search results; finally, it provides at least one of the post-processed search results to the application corresponding to the query sequence.
Specifically, in step S5, information determining device 1 first obtains, through dynamic web page technologies such as ASP or JSP, the query request that a user submits by entering a query sequence in the search box of a search engine via user equipment; it then submits the query sequence to the search engine and receives the one or more search results, corresponding to the query sequence, that the search engine returns. For example, assume that a user uses a PC to enter the keywords "iphone protective case accessories" in the search box of a search engine and then clicks the search button. Then, in step S5, information determining device 1 obtains the query sequence entered by the user through dynamic web page technologies such as ASP or JSP, submits a page search request to the search engine based on the query sequence, and receives the one or more search results corresponding to the query sequence "iphone protective case accessories" returned by the search engine, such as search result A "Home - Miyue Apple digital accessories genuine discount store", search result B "... 3C Apple accessories iphone case phone cover wholesale and retail protective shell", search result C "One-of-a-kind protective cases, iphone4s accessory recommendations, Mobile phones, Technology Times, Sina.com", and so on.
Those skilled in the art will understand that the above manner of obtaining one or more search results corresponding to a query sequence is merely an example; other existing or future manners of obtaining one or more search results corresponding to a query sequence, if applicable to the present invention, shall also fall within the scope of protection of the present invention and are incorporated herein by reference.
Then, in step S5, information determining device 1 performs subsequent processing on the one or more search results according to the matching degree information between the page description information of the page corresponding to each search result and the query sequence. Specifically, in step S5, information determining device 1 first performs semantic analysis on the page description information corresponding to each search result and, according to the proportion that the words of the query sequence account for among the total words included in that page description information, determines the matching degree information between the page description information and the query sequence: if the proportion is greater than 0.95, the matching degree information is determined to be a high match; if the proportion is between 0.95 and 0.7, it is determined to be a moderate match; if the proportion is less than 0.7, it is determined to be a low match. Information determining device 1 then performs subsequent processing on the one or more search results according to this matching degree information, for example by adjusting the order of the search results or by screening them. For example, continuing the example above, assume that the matching degree between the page description information of the page corresponding to search result A and the query sequence "iphone protective case accessories" is higher than that of search result B, and that the matching degree of search result B is in turn higher than that of search result C. Then, in step S5, information determining device 1 determines, according to the matching degree information, that search results A, B and C are to be ordered as A, B, C; that is, when the user obtains the search results corresponding to the query sequence "iphone protective case accessories", search result A is placed before search result B, and search result B before search result C. As another example, in step S5, information determining device 1 may also screen search results A, B and C according to the matching degree information, for example by filtering the search results so that search result C, whose matching degree is low, is not provided to the user.
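The matching-degree computation and the two kinds of subsequent processing (reordering and screening) can be sketched as follows. This is a minimal illustration under stated assumptions: whitespace tokenisation replaces the semantic analysis mentioned in the text, the boundary values 0.95 and 0.7 are treated as a moderate match because the text leaves them open, and all function names and sample descriptions are hypothetical.

```python
def match_ratio(query, description):
    """Share of the description's words that also occur in the query."""
    query_words = set(query.lower().split())
    desc_words = description.lower().split()
    if not desc_words:
        return 0.0  # a vacant description matches nothing
    hits = sum(1 for word in desc_words if word in query_words)
    return hits / len(desc_words)


def match_level(ratio):
    """Map a ratio to the three levels named in the text."""
    if ratio > 0.95:
        return "high"
    if ratio >= 0.7:  # assumption: both boundary values count as moderate
        return "moderate"
    return "low"


def rank_and_filter(query, results, drop_low=False):
    """Sort results by descending ratio; optionally drop low matches."""
    scored = [(match_ratio(query, desc), name) for name, desc in results]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [name for ratio, name in scored
            if not (drop_low and match_level(ratio) == "low")]


# Hypothetical page descriptions for search results A, B and C.
QUERY = "iphone protective case accessories"
RESULTS = [
    ("C", "sina technology news mobile phone"),
    ("A", "iphone protective case accessories"),
    ("B", "iphone case accessories wholesale"),
]
print(rank_and_filter(QUERY, RESULTS))
print(rank_and_filter(QUERY, RESULTS, drop_low=True))
```

With these sample descriptions the ratios are 1.0, 0.75 and 0.0 for A, B and C, so the first call orders the results A, B, C and the second call screens out C.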
Those skilled in the art will understand that the above manner of performing subsequent processing on the one or more search results is merely an example; other existing or future manners of performing subsequent processing on the one or more search results, if applicable to the present invention, shall also fall within the scope of protection of the present invention and are incorporated herein by reference.
Then, in step S5, information determining device 1 provides at least one of the post-processed search results to the application corresponding to the query sequence, by means of dynamic web page technologies such as ASP, JSP or PHP, or by other agreed communication means such as the http or https communication protocols, so that the application can provide the processed search results to the user corresponding to the query sequence. Here, the application includes, but is not limited to, a search engine, a browser, and the like. For example, continuing the example above, in step S5 information determining device 1 provides the post-processed search results A, B and C to the user in the order A, B, C according to the matching degree information, for the user to browse; or it does not provide to the user those of search results A, B and C whose matching degree information is below a predetermined threshold.
Fig. 4 illustrates a flow chart of a method for determining the page description information corresponding to a target page in accordance with a preferred embodiment of the present invention.
Specifically, in step S1', information determining device 1 determines the classification-related information corresponding to the pending target page; in step S2', information determining device 1 adjusts, according to the classification-related information, the candidate description information corresponding to the target page, so as to obtain the page description information corresponding to the target page; in step S3', information determining device 1 determines, according to the page description information, presentation information corresponding to the target page, wherein the presentation information matches the page description information. Here, steps S1' and S2' are respectively identical or similar to the corresponding steps shown in Fig. 3, and are therefore not described again here but are incorporated herein by reference.
Specifically, in step S3', information determining device 1 determines, according to the page description information, presentation information corresponding to the target page, wherein the presentation information matches the page description information. Here, the presentation information includes, but is not limited to, content that is displayed in a page by means of some carrier, such as a link, text, picture, video or animation, in order to convey information to the user; it includes, but is not limited to, the content information of the page description information, style sheet information corresponding to the page description information, and the like. Specifically, in step S3', information determining device 1 determines the presentation information corresponding to the page description information by querying a presentation information database for the presentation information corresponding to that description information; or by querying the presentation information database for resource configuration content information of the presenting user of the target page corresponding to the page description information, or of a user associated with that presenting user, wherein the presentation information matches the page description information. Here, the presentation information database may be located in information determining device 1, or in a database connected to information determining device 1 via a network.
For example, assume that in step S1' information determining device 1 determines that the classification-related information of the pending target page "iphone accessories at low prices on Vipshop! Digital accessories special sale, limited-time offer" (http://www.vipshop.com/show-0-48369-0.html?) is broad match object, and that in step S2' information determining device 1 determines that the page description information of the target page http://www.vipshop.com/show-0-48369-0.html? includes "iphone digital accessories (protective case accessories, chargers, etc.) - nokia digital accessories - ..." and the like. Then, in step S3', information determining device 1 may take this page description information as the presentation information corresponding to the target page http://www.vipshop.com/show-0-48369-0.html?. As another example, continuing the example above, in step S3' information determining device 1 may take as the presentation information both the content information of the page description information "iphone digital accessories (protective case accessories, chargers, etc.) - nokia digital accessories - ..." and other resource configuration content information of the presenting user corresponding to this page description information, such as "iphone sales information".
Those skilled in the art will understand that the above manner of determining the presentation information corresponding to the target page is merely an example; other existing or future manners of determining the presentation information corresponding to the target page, if applicable to the present invention, shall also fall within the scope of protection of the present invention and are incorporated herein by reference.
Preferably, the method performed by information determining device 1 further includes step S6' (not shown). Specifically, in step S6', information determining device 1 determines content sensitivity information of the target page; in step S3', information determining device 1 then determines the presentation information corresponding to the target page according to the page description information in combination with the content sensitivity information, wherein the presentation information matches both the page description information and the content sensitivity information.
Specifically, in step S6', information determining device 1 obtains the page content information of the target page, for example by parsing the HTML source code of the target page, and looks up predetermined content sensitivity information in this page content information, so as to determine the content sensitivity information of the target page. Here, the content sensitivity information includes, but is not limited to, content suitable for browsing only by a particular group, such as adult information, and content relating to accidents that cause death, disease, injury, damage or other loss. For example, assume that in step S1' the pending target page obtained by information determining device 1 is the news report "Chanel No. 5 faces EU sales ban" (http://news.163.com/12/1109/05/8FRIGU8300014AED.html). Then, in step S6', information determining device 1 parses the HTML source code of this page and finds that the page content information of this page includes words such as "sales ban" and "allergy"; it thus determines that the content sensitivity information of the target page is "sales ban" and "allergy".
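The lookup of predetermined sensitive terms in a parsed page can be sketched with the Python standard library. The page source, the term list and the function names below are hypothetical and serve only to illustrate the idea; plain substring matching is an assumption, since the text does not say how the lookup is performed.

```python
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Collect the text nodes of an HTML document."""

    def __init__(self):
        super().__init__()
        self.chunks = []

    def handle_data(self, data):
        self.chunks.append(data)


def content_sensitivity(html_source, sensitive_terms):
    """Return the predetermined sensitive terms found in the page text."""
    parser = TextExtractor()
    parser.feed(html_source)
    parser.close()
    text = " ".join(parser.chunks).lower()
    return [term for term in sensitive_terms if term.lower() in text]


# Hypothetical page source and term list, for illustration only.
PAGE = ("<html><body><h1>Perfume faces sales ban</h1>"
        "<p>Some ingredients may cause allergy.</p></body></html>")
TERMS = ["sales ban", "allergy", "injury"]
print(content_sensitivity(PAGE, TERMS))  # ['sales ban', 'allergy']
```

An empty result means that none of the predetermined terms occurs in the page text.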
Those skilled in the art will understand that the above content sensitivity information is merely an example; other existing or future content sensitivity information, if applicable to the present invention, shall also fall within the scope of protection of the present invention and is incorporated herein by reference.
Those skilled in the art will understand that the above manner of determining the sensitivity information is merely an example; other existing or future manners of determining the sensitivity information, if applicable to the present invention, shall also fall within the scope of protection of the present invention and are incorporated herein by reference.
Then, in step S3', information determining device 1 determines the presentation information corresponding to the target page according to the page description information in combination with the content sensitivity information, wherein the presentation information matches both the page description information and the content sensitivity information. For example, continuing the example above, assume that in step S2' information determining device 1 determines that the page description information of the target page "Chanel No. 5 faces EU sales ban" (http://news.163.com/12/1109/05/8FRIGU8300014AED.html) is vacant, that is, the classification-related information of this target page is mismatch object. Then, in step S3', information determining device 1, according to this page description information and in combination with the content sensitivity information "sales ban" and "allergy", determines that no presentation information is suitable for this page, or that the presentation information is perfume of other brands, wherein the presentation information matches both the page description information and the content sensitivity information. As another example, assume that in step S2' information determining device 1 determines that the page description information of the target page "iphone accessories at low prices on Vipshop! Digital accessories special sale, limited-time offer" (http://www.vipshop.com/show-0-48369-0.html?) is "iphone digital accessories (protective case accessories, chargers, etc.) - nokia digital accessories - ...", and that in step S6' information determining device 1 determines that the content sensitivity information of this target page includes content suitable for browsing only by a particular group, such as adult information. Then, in step S3', information determining device 1, according to this page description information and in combination with the content sensitivity information, determines that the presentation information corresponding to the target page includes the page description information together with a statement that children are forbidden to browse this page, wherein the presentation information matches both the page description information and the content sensitivity information.
Those skilled in the art will understand that the above manner of determining the presentation information in combination with the content sensitivity information is merely an example; other existing or future manners of determining the presentation information in combination with content sensitivity information, if applicable to the present invention, shall also fall within the scope of protection of the present invention and are incorporated herein by reference.
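One possible reading of the two examples above is sketched below. The dictionary standing in for the presentation information database, the "adult" label, the wording of the notice and the function name are all assumptions; the alternative of presenting other brands for a page with a vacant description is not modelled.

```python
# Hypothetical stand-in for the presentation information database,
# keyed by page description information.
PRESENTATION_DB = {
    "iphone digital accessories": [
        "iphone digital accessories",
        "iphone sales information",
    ],
}


def determine_presentation(description, sensitivity):
    """Choose presentation information from description and sensitivity.

    `sensitivity` is a set of labels; "adult" is a hypothetical label for
    content suitable for browsing only by a particular group.
    """
    if not description:
        # Vacant description (mismatch object): nothing suitable to present.
        return []
    # Fall back to the description itself when the database has no entry.
    items = list(PRESENTATION_DB.get(description, [description]))
    if "adult" in sensitivity:
        items.append("Notice: children are not permitted to browse this page")
    return items


print(determine_presentation("", {"sales ban", "allergy"}))
print(determine_presentation("iphone digital accessories", {"adult"}))
```

The first call returns an empty list, matching the case in which no presentation information suits the page; the second returns the stored items followed by the notice.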
In a preferred embodiment (with reference to Fig. 4), the method performed by information determining device 1 includes step S1', step S2', step S3', step S7' (not shown) and step S8' (not shown), wherein step S1' includes step S11' (not shown) and step S12' (not shown). The preferred embodiment is described below with reference to Fig. 4. Specifically, in step S11', information determining device 1 obtains the accessed page that a user accesses, as the target page; in step S12', information determining device 1 determines the classification-related information corresponding to the target page; in step S2', information determining device 1 adjusts, according to the classification-related information, the candidate description information corresponding to the target page, so as to obtain the page description information corresponding to the target page; in step S3', information determining device 1 determines, according to the page description information, presentation information corresponding to the target page, wherein the presentation information matches the page description information; in step S7', information determining device 1 updates the target page according to the presentation information, so as to generate a corresponding result page, wherein the result page includes the presentation information; in step S8', information determining device 1 provides the result page to the user. Here, step S2' is identical or similar to the corresponding step shown in Fig. 3, and step S3' is identical or similar to the corresponding step shown in Fig. 4; they are therefore not described again here but are incorporated herein by reference.
Specifically, in step S11', information determining device 1 first obtains the page access request of a user and takes the page corresponding to the page access request as the target page; or it obtains the accessed page that the user accesses through an application programming interface (API) provided by a third-party device such as a browser or a search engine, and takes that page as the target page. For example, a user enters http://news.sina.com.cn/ in the address bar of a browser and presses the enter key. Then, in step S11', information determining device 1 obtains the page access request of the user through the application programming interface (API) provided by the browser; information determining device 1 then sends a corresponding page access request to the page server according to the page URL, obtains the page http://news.sina.com.cn/ corresponding to the page access request from the corresponding HTML response returned by the page server, and takes the page http://news.sina.com.cn/ as the target page. As another example, assume that a user enters the keywords "iphone protective case accessories" in the search box of a search engine and then clicks the search button. Then, in step S11', information determining device 1 obtains the page access request of the user through the application programming interface (API) provided by the search engine, submits a page search request to the search engine based on the query sequence, and receives the one or more search results corresponding to the query sequence "iphone protective case accessories" returned by the search engine, such as search result A "Home - Miyue Apple digital accessories genuine discount store", search result B "... 3C Apple accessories iphone case phone cover wholesale and retail protective shell", search result C "One-of-a-kind protective cases, iphone4s accessory recommendations, Mobile phones, Technology Times, Sina.com", and so on. In step S11', information determining device 1 then takes the search results page that includes these search results as the target page.
In step S12', information determining device 1 determines the classification-related information corresponding to the target page. Here, the manner in which information determining device 1 determines the classification-related information corresponding to the target page in step S12' is the same as the manner in which it does so in step S1 of Fig. 3; for the sake of brevity, it is not described again here and is incorporated herein by reference.
Preferably, in step S12', information determining device 1 may also determine the classification-related information corresponding to the target page in combination with user operation information of the user;
wherein the user operation information includes at least any one of the following:
- page access session information of the user regarding the accessed page;
- page access record information of the user;
- page search records corresponding to the accessed page.
For example, consider the case in which the user operation information includes the page access session information of the user regarding the accessed page; here, the page access session information includes, but is not limited to, a user's consecutive access operations on the accessed page. Assume that, while browsing the page corresponding to a search result such as "iphone accessories at low prices on Vipshop! Digital accessories special sale, limited-time offer" (http://www.vipshop.com/show-0-48369-0.html?), the user also queries for other information that he or she needs, such as the accessory "Apple data cable, white". Then, in step S12', information determining device 1 determines that the classification-related information corresponding to the target page is broad match object. As another example, consider the case in which the user operation information includes the page access record information of the user. Assume that in step S11' information determining device 1 obtains the page access request submitted by the user for an accessed page such as "Rowing regatta composition model essay" (http://www.qc99.com/xiaoxue/sinj/101176.Html), and that the user frequently accesses pages about, for example, how to write answers to current-affairs examination questions. Then, in step S12', information determining device 1 determines that the classification-related information corresponding to the accessed page "Rowing regatta composition model essay" (http://www.qc99.com/xiaoxue/sinj/101176.Html) is a virtual theme, such as writing.
Those skilled in the art will understand that the above manner of determining the classification-related information in combination with the user operation information of the user is merely an example; other existing or future manners of determining the classification-related information in combination with the user operation information of the user, if applicable to the present invention, shall also fall within the scope of protection of the present invention and are incorporated herein by reference.
In step S7', information determining device 1 updates the target page according to the presentation information, for example by embedding the presentation information in the target page, so as to generate a corresponding result page, wherein the result page includes the presentation information. For example, assume that in step S3' information determining device 1 determines that the presentation information corresponding to the target page "iphone accessories at low prices on Vipshop! Digital accessories special sale, limited-time offer" (http://www.vipshop.com/show-0-48369-0.html?) includes the page description information "iphone digital accessories (protective case accessories, chargers, etc.) - nokia digital accessories - ...". Then, in step S7', information determining device 1 may update the target page according to this presentation information, for example by embedding the presentation information in the target page at the navigation block region of the target page, wherein the presentation information matches the page description information.
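Embedding presentation information at a navigation region can be sketched with simple string handling. The anchor tag `<div id="nav">`, the `presentation` class name and the function name are hypothetical; a real page would need its own way of locating the navigation block.

```python
import html


def embed_presentation(page_html, presentation, anchor='<div id="nav">'):
    """Insert the presentation information right after the anchor tag."""
    position = page_html.find(anchor)
    if position == -1:
        return page_html  # anchor not found: leave the page unchanged
    insert_at = position + len(anchor)
    # Escape the text so that it cannot break the surrounding markup.
    snippet = '<p class="presentation">' + html.escape(presentation) + "</p>"
    return page_html[:insert_at] + snippet + page_html[insert_at:]


# Hypothetical target page with a navigation block.
TARGET = '<html><body><div id="nav"></div><p>content</p></body></html>'
RESULT = embed_presentation(TARGET, "iphone digital accessories")
print(RESULT)
```

The returned string plays the role of the result page: it is the target page with the presentation information added inside the navigation block.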
In step S8', information determining device 1 provides the result page to the user by means of dynamic web page technologies such as ASP, JSP or PHP, or by other agreed communication means such as the http or https communication protocols.
Preferably, the method performed by information determining device 1 further includes step S9' (not shown). Specifically, in step S9', information determining device 1 determines target position information corresponding to the presentation information in the target page; in step S7', information determining device 1 then updates the target page according to the presentation information in combination with the target position information, so as to generate the corresponding result page, wherein the result page includes the presentation information at the position corresponding to the target position information.
Specifically, in step S9 ' in, information determines that equipment 1 presents information described described in determining Target position information corresponding in target pages.Here, described target position information include by Described present which position that information is embedded in described target pages, present information as will be described embedding Enter the position that in described target pages, user preferably browses, or, by described information to be presented The navigation segmented areas etc. being embedded in described target pages.Here, in step S9 ' in, information Determine that equipment 1 determines that the mode of described target position information includes but not limited to following arbitrary :
1) According to the page layout information of the target page, determine the target position information, for example by taking a blank area of the target page, such as the right sidebar of the page, as the target position information, or by taking an area of the target page that readily attracts the user's attention, such as the area around the search box in a search page, as the target position information. For example, suppose that in step S11' the target page to be processed obtained by information determination device 1 is "iphone accessories, Vipshop low-price flash sale! Digital accessories special, limited-time offer" (http://www.vipshop.com/show-0-48369-0.html?), and that in step S9' information determination device 1 parses this target page by, for example, an html tag analysis method or the VIPS (Vision-based Page Segmentation) algorithm and obtains the page style information of the target page, such as its page layout information, in which the right sidebar of the page is a blank area. Then, in step S9', information determination device 1 may take the right sidebar area of this target page as the target position information.
2) According to the page content information of the target page, take the content region of the target page that matches the content of the presentation information as the target position information. For example, suppose that in step S11' the target page obtained by information determination device 1 is the page http://www.vipshop.com/show-0-48369-0.html?, and that in step S3' information determination device 1 determines that the presentation information includes content such as "iphone digital accessories (protective cases, chargers, etc.) - nokia digital accessories - ...". In step S9', information determination device 1 parses this target page and finds that it contains multiple channels such as "Luxury Accessories", "Vipshop Group" and "Vipshop Fashion". Information determination device 1 then takes the content region of the target page that matches the content of the presentation information as the target position information, for example taking the region of the "Vipshop Fashion" channel in this target page as the target position information of the presentation information.
3) According to the page-related information of the target page, in combination with the page access record information of the user, determine the target position information corresponding to the presentation information in the target page. For example, suppose that in step S11' the target page to be processed obtained by information determination device 1 is the page http://www.vipshop.com/show-0-48369-0.html?, that in step S3' information determination device 1 determines that the presentation information includes content such as "iphone digital accessories (protective cases, chargers, etc.) - nokia digital accessories - ...", and that the user often clicks links in the top region of this target page. Then, in step S9', information determination device 1, in combination with the page access record information of the user, takes the position in the target page of the content that the user frequently accesses, such as the top region of the page, as the target position information corresponding to the presentation information in this target page.
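The position-determination ways listed above can be illustrated with a minimal Python sketch. It is not the patented implementation: the function name pick_target_region, the region labels, and the priority order (click records first, then blank layout regions, then the navigation block) are assumptions made only for illustration, since the text does not rank the three ways.

```python
from collections import Counter

def pick_target_region(blank_regions, click_log, default="navigation"):
    """Return the label of the page region where presentation information goes.

    blank_regions: region labels found blank by layout analysis (way 1).
    click_log: region labels the user clicked in past visits (way 3).
    All labels are hypothetical; the source does not define a region format.
    """
    clicks = Counter(click_log)
    if clicks:
        # Way 3: prefer the region the user clicks most often.
        return clicks.most_common(1)[0][0]
    if blank_regions:
        # Way 1: otherwise use the first blank region reported by layout analysis.
        return blank_regions[0]
    # Assumed fallback: the navigation block area mentioned in the description.
    return default
```

For instance, a user who mostly clicks links at the top of the page yields the top region, while an empty click log falls back to a blank right sidebar if one exists.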
Those skilled in the art will understand that the above ways of determining the target position information are merely examples; other existing or future ways of determining the target position information, if applicable to the present invention, should also fall within the scope of protection of the present invention and are incorporated herein by reference.
Then, in step S7', information determination device 1 updates the target page according to the presentation information in combination with the target position information, for example by embedding the presentation information at the position of the target page indicated by the target position information, to generate the corresponding result page, wherein the result page includes the presentation information at the position corresponding to the target position information. Continuing the example above, suppose that in step S9' information determination device 1 determines that the target position information of the presentation information "iphone digital accessories (protective cases, chargers, etc.) - nokia digital accessories - ..." in the target page http://www.vipshop.com/show-0-48369-0.html? is the right sidebar area of the page. Then, in step S7', information determination device 1 embeds the determined presentation information at the position of this target page indicated by the target position information, to generate the corresponding result page.
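The embedding in step S7' can be sketched as follows, under the assumption (not stated in the source) that each page region is an HTML element carrying an id attribute. The function name embed_presentation, the id value and the CSS class name are hypothetical, and a regular expression is used only to keep the sketch short; a production system would more likely use a real HTML parser.

```python
import html
import re

def embed_presentation(page_html, region_id, presentation_text):
    """Insert presentation text just inside the element whose id is region_id."""
    # Match the opening tag of the element that carries the given id attribute.
    pattern = re.compile(r'(<[a-zA-Z][^>]*\bid="%s"[^>]*>)' % re.escape(region_id))
    # Escape the text so it cannot inject markup into the result page.
    snippet = '<div class="presentation-info">%s</div>' % html.escape(presentation_text)
    result, count = pattern.subn(lambda m: m.group(1) + snippet, page_html, count=1)
    if count == 0:
        raise ValueError("region %r not found in page" % (region_id,))
    return result
```

Raising an error when the region is missing, rather than silently returning the unchanged page, is a design choice of this sketch; the source does not say how a missing position should be handled.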
Preferably, the method performed by information determination device 1 further includes step S10' (not shown). Specifically, in step S10', information determination device 1 determines the target style information corresponding to the presentation information in the target page; in step S7', information determination device 1 then updates the target page according to the presentation information in combination with the target style information, to generate the corresponding result page, wherein the result page includes the presentation information corresponding to the target style information.
Specifically, in step S10', information determination device 1 determines the target style information corresponding to the presentation information in the target page. Here, the ways in which information determination device 1 determines this target style information in step S10' include, but are not limited to, at least any one of the following:
1) According to the style-related information of the target page, determine the target style information corresponding to the presentation information in the target page. Specifically, in step S10', information determination device 1 first determines the style-related information of the target page; it then either extracts one or more style settings from this style-related information to serve as the target style information of the presentation information, or directly takes the style-related information of the target page as the target style information of the presentation information. For example, suppose that in step S11' information determination device 1 obtains the target page "Vipshop brand fashion discount store" (http://www.vipshop.com/show-0-48369-0.html?), and that in step S3' information determination device 1 determines that the presentation information includes content such as "iphone digital accessories (protective cases, chargers, etc.) - nokia digital accessories - ...". The style determination device may first parse the target page by, for example, an html tag analysis method or the VIPS (Vision-based Page Segmentation) algorithm, and obtain style-related information of the target page that includes page block features, such as the top navigation block, breadcrumb navigation, body text block, left-column content block, right-column content block and bottom information link block, as well as page style settings, such as a gray font color and a pink page tone. Then, in step S10', information determination device 1 may determine the target style information of the presentation information according to the style-related information of the target page, for example by setting the page tone, font color and so on of the presentation information to be consistent with those of this target page, that is, setting the page tone to pink and the font color to gray.
2) According to the application classification information of the presentation information, perform a matching query in a page style database to obtain the page style information corresponding to the application classification information, and take it as the target style information, wherein the page style database includes mapping relations between application classifications and page styles. Here, the application classification information includes, but is not limited to, the industry classification of the page corresponding to the access page request, such as food, environmental protection, news, cosmetics, flowers, automobiles, novels and so on. For example, suppose that the application classification information of the presentation information belongs to the food industry; then, in step S10', information determination device 1 performs a matching query in the page style database and obtains page style information corresponding to the application classification information that includes breadcrumb navigation, a body text summary block, a green page background, a black page font color and so on. As another example, suppose that the application classification information of the presentation information belongs to the cosmetics industry; then, in step S10', information determination device 1 performs a matching query in the page style database and obtains page style information corresponding to the application classification information that includes breadcrumb navigation, a body text summary block, a warm-toned page background such as pink, a white page font color and so on. Here, the page style database may be located in information determination device 1, or in a server connected to information determination device 1 through a network.
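The two style-determination ways above can be combined in a short sketch. The page style database is modeled here as an in-memory dictionary, which is an assumption; the source only says it maps application classifications to page styles and may live on the device or on a networked server. The key names ("background", "font_color") and the function name target_style are hypothetical, while the food and cosmetics entries reuse the colors given in the examples.

```python
# Hypothetical page style database: application classification -> page style.
STYLE_DB = {
    "food": {"background": "green", "font_color": "black"},
    "cosmetics": {"background": "pink", "font_color": "white"},
}

def target_style(app_class, page_style, style_db=STYLE_DB):
    """Return the style to apply to the presentation information.

    Way 2: look the application classification up in the style database.
    Way 1 (assumed fallback): reuse the target page's own tone and font color.
    """
    style = style_db.get(app_class)
    if style is not None:
        # Copy so callers cannot mutate the shared database entry.
        return dict(style)
    return {k: page_style[k] for k in ("background", "font_color") if k in page_style}
```

Trying the database first and falling back to the page's own style is a choice of this sketch; the description presents the two ways as alternatives without ordering them.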
Those skilled in the art will understand that the above ways of determining the target style information corresponding to the presentation information in the target page are merely examples; other existing or future ways of determining this target style information, if applicable to the present invention, should also fall within the scope of protection of the present invention and are incorporated herein by reference.
Then, in step S7', information determination device 1 updates the target page according to the presentation information in combination with the target style information, to generate the corresponding result page, wherein the result page includes the presentation information corresponding to the target style information. Continuing the example above, suppose that in step S10' information determination device 1 determines that the target style information corresponding to the presentation information "iphone digital accessories (protective cases, chargers, etc.) - nokia digital accessories - ..." in the target page http://www.vipshop.com/show-0-48369-0.html? includes breadcrumb navigation, a body text summary block, a warm-toned page background such as pink, a white page font color and so on. Then, in step S7', information determination device 1 embeds the presentation information in this target page in the display format given by the target style information, to generate the corresponding result page, wherein the result page includes the presentation information corresponding to the target style information.
Those skilled in the art will understand that the above way of generating the result page in combination with the target style information is merely an example; other existing or future ways of generating the result page in combination with the target style information, if applicable to the present invention, should also fall within the scope of protection of the present invention and are incorporated herein by reference.
It should be noted that the present invention can be implemented in software and/or in a combination of software and hardware, for example using an application-specific integrated circuit (ASIC), a general-purpose computer or any other similar hardware device. In one embodiment, the software program of the present invention can be executed by a processor to implement the steps or functions described above. Similarly, the software program of the present invention (including related data structures) can be stored in a computer-readable recording medium, for example a RAM memory, a magnetic or optical drive, a floppy disk or a similar device. In addition, some steps or functions of the present invention may be implemented in hardware, for example as a circuit that cooperates with a processor to perform each step or function.
In addition, part of the present invention may be applied as a computer program product, for example computer program instructions which, when executed by a computer, can invoke or provide the method and/or technical solution according to the present invention through the operation of that computer. The program instructions that invoke the method of the present invention may be stored in a fixed or removable recording medium, and/or transmitted as a data stream in a broadcast or other signal-bearing medium, and/or stored in the working memory of a computer device that runs according to the program instructions. Here, one embodiment of the present invention includes a device comprising a memory for storing computer program instructions and a processor for executing the program instructions, wherein, when the computer program instructions are executed by the processor, the device is triggered to run the methods and/or technical solutions based on the foregoing embodiments of the present invention.
It is obvious to those skilled in the art that the present invention is not limited to the details of the above exemplary embodiments, and that the present invention can be implemented in other specific forms without departing from its spirit or essential characteristics. Therefore, from whatever point of view, the embodiments should be regarded as exemplary and non-limiting. The scope of the present invention is defined by the appended claims rather than by the above description, and all changes that fall within the meaning and range of equivalents of the claims are therefore intended to be embraced in the present invention. Any reference sign in the claims should not be construed as limiting the claim concerned. Furthermore, the word "comprising" does not exclude other units or steps, and the singular does not exclude the plural. Multiple units or devices recited in a device claim may also be implemented by a single unit or device through software or hardware. Words such as "first" and "second" are used to denote names and do not denote any particular order.

Claims (20)

1. A method for determining page description information corresponding to a target page, wherein the method comprises the following steps:
a. determining classification-related information corresponding to a target page to be processed;
b. according to the classification-related information, performing corresponding adjustment processing on candidate description information corresponding to the target page, to obtain the page description information corresponding to the target page;
wherein the classification-related information includes at least any one of the following:
- a virtual theme, meaning that the main content of the target page can reflect the access intention of a user accessing the target page;
- an exact-match object, meaning that the target page contains content information fully consistent with a user need, and that the user need is irreplaceable;
- a broad-match object, meaning that the content information of the target page is relevant to a user need;
- a mismatch object, meaning that the target page is not suitable for containing presentation information, beyond the content information of the target page itself, for the user to obtain.
2. The method according to claim 1, wherein the method further comprises:
- performing machine learning processing according to a plurality of training pages annotated with classification information, to obtain a page classification model for page classification;
wherein step a comprises:
- determining the classification-related information according to the page classification model and based on page-related information of the target page.
3. The method according to claim 1, wherein the method further comprises:
c. determining, according to the page description information, presentation information corresponding to the target page, wherein the presentation information matches the page description information.
4. The method according to claim 3, wherein step a comprises:
- obtaining an accessed page that a user accesses, as the target page;
a1. determining the classification-related information corresponding to the target page;
wherein the method further comprises:
d. updating the target page according to the presentation information, to generate a corresponding result page, wherein the result page includes the presentation information;
- providing the result page to the user.
5. The method according to claim 4, wherein step a1 comprises:
- determining the classification-related information corresponding to the target page in combination with user operation information of the user;
wherein the user operation information includes at least any one of the following:
- page access session information of the user regarding the accessed page;
- page access record information of the user;
- a page search record corresponding to the accessed page.
6. The method according to claim 4 or 5, wherein the method further comprises:
- determining target position information corresponding to the presentation information in the target page;
wherein step d comprises:
- updating the target page according to the presentation information in combination with the target position information, to generate the corresponding result page, wherein the result page includes the presentation information at the position corresponding to the target position information.
7. The method according to claim 4 or 5, wherein the method further comprises:
- determining target style information corresponding to the presentation information in the target page;
wherein step d comprises:
- updating the target page according to the presentation information in combination with the target style information, to generate the corresponding result page, wherein the result page includes the presentation information corresponding to the target style information.
8. The method according to claim 3, wherein the method further comprises:
- determining content sensitivity information of the target page;
wherein step c comprises:
- determining, according to the page description information in combination with the content sensitivity information, the presentation information corresponding to the target page, wherein the presentation information matches the page description information and the content sensitivity information.
9. The method according to claim 1, wherein the corresponding adjustment processing includes at least any one of the following:
- when the classification-related information includes the virtual theme, performing a matching query in a virtual theme database according to the candidate description information, and taking the corresponding matching query result as the page description information;
- when the classification-related information includes the exact-match object, taking the candidate description information as the page description information;
- when the classification-related information includes the broad-match object, performing a matching query in a broad-match object database according to the candidate description information, and taking the candidate description information together with its corresponding matching query result as the page description information;
- when the classification-related information includes the mismatch object, setting the candidate description information to empty and taking it as the page description information.
10. The method according to claim 1, wherein the method further comprises:
- obtaining one or more search results corresponding to a query sequence;
- performing subsequent processing on the one or more search results according to matching degree information between the page description information of the pages corresponding to the search results and the query sequence;
- providing at least one of the one or more subsequently processed search results to the application corresponding to the query sequence.
11. An information determination device for determining page description information corresponding to a target page, wherein the information determination device comprises:
a classification device, configured to determine classification-related information corresponding to a target page to be processed;
a determination device, configured to perform, according to the classification-related information, corresponding adjustment processing on candidate description information corresponding to the target page, to obtain the page description information corresponding to the target page;
wherein the classification-related information includes at least any one of the following:
- a virtual theme, meaning that the main content of the target page can reflect the access intention of a user accessing the target page;
- an exact-match object, meaning that the target page contains content information fully consistent with a user need, and that the user need is irreplaceable;
- a broad-match object, meaning that the content information of the target page is relevant to a user need;
- a mismatch object, meaning that the target page is not suitable for containing presentation information, beyond the content information of the target page itself, for the user to obtain.
12. The information determination device according to claim 11, wherein the information determination device further comprises:
a model building device, configured to perform machine learning processing according to a plurality of training pages annotated with classification information, to obtain a page classification model for page classification;
wherein the classification device is configured to:
- determine the classification-related information according to the page classification model and based on page-related information of the target page.
13. The information determination device according to claim 11, wherein the information determination device further comprises:
a matching device, configured to determine, according to the page description information, presentation information corresponding to the target page, wherein the presentation information matches the page description information.
14. The information determination device according to claim 13, wherein the classification device comprises:
an acquiring unit, configured to obtain an accessed page that a user accesses, as the target page;
a classification unit, configured to determine the classification-related information corresponding to the target page;
wherein the information determination device further comprises:
a generation device, configured to update the target page according to the presentation information, to generate a corresponding result page, wherein the result page includes the presentation information;
a providing device, configured to provide the result page to the user.
15. The information determination device according to claim 14, wherein the classification unit is configured to:
- determine the classification-related information corresponding to the target page in combination with user-related information of the user;
wherein the user-related information includes at least any one of the following:
- page access session information of the user regarding the accessed page;
- page access record information of the user;
- a page search record corresponding to the accessed page.
16. The information determination device according to claim 14 or 15, wherein the information determination device further comprises:
a position determination device, configured to determine target position information corresponding to the presentation information in the target page;
wherein the generation device is configured to:
- update the target page according to the presentation information in combination with the target position information, to generate the corresponding result page, wherein the result page includes the presentation information at the position corresponding to the target position information.
17. The information determination device according to claim 14 or 15, wherein the information determination device further comprises:
a style determination device, configured to determine target style information corresponding to the presentation information in the target page;
wherein the generation device is configured to:
- update the target page according to the presentation information in combination with the target style information, to generate the corresponding result page, wherein the result page includes the presentation information corresponding to the target style information.
18. The information determination device according to claim 13, wherein the information determination device further comprises:
a sensitivity device, configured to determine content sensitivity information of the target page;
wherein the matching device is configured to:
- determine, according to the page description information in combination with the content sensitivity information, the presentation information corresponding to the target page, wherein the presentation information matches the page description information and the content sensitivity information.
19. The information determination device according to claim 11, wherein the corresponding adjustment processing includes at least any one of the following:
- when the classification-related information includes the virtual theme, performing a matching query in a virtual theme database according to the candidate description information, and taking the corresponding matching query result as the page description information;
- when the classification-related information includes the exact-match object, taking the candidate description information as the page description information;
- when the classification-related information includes the broad-match object, performing a matching query in a broad-match object database according to the candidate description information, and taking the candidate description information together with its corresponding matching query result as the page description information;
- when the classification-related information includes the mismatch object, setting the candidate description information to empty and taking it as the page description information.
20. The information determination device according to claim 11, wherein the information determination device further comprises a search processing device, configured to:
- obtain one or more search results corresponding to a query sequence;
- perform subsequent processing on the one or more search results according to matching degree information between the page description information of the pages corresponding to the search results and the query sequence;
- provide at least one of the one or more subsequently processed search results to the application corresponding to the query sequence.
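As an informal illustration only, and not part of the claims, the adjustment processing recited in claims 9 and 19 can be sketched as a dispatch on the classification. The category labels, the function name adjust_description, and the modeling of the virtual theme database and the broad-match object database as dictionaries are all assumptions; the claims do not specify any data format.

```python
def adjust_description(category, candidate, theme_db, broad_db):
    """Return the page description information as a list of description strings.

    theme_db and broad_db are hypothetical dictionaries standing in for the
    virtual theme database and the broad-match object database.
    """
    if category == "virtual_theme":
        # Use only the matching query result from the virtual theme database.
        return list(theme_db.get(candidate, []))
    if category == "exact_match":
        # Keep the candidate description unchanged.
        return [candidate]
    if category == "broad_match":
        # Keep the candidate and add its matching query result.
        return [candidate] + list(broad_db.get(candidate, []))
    if category == "mismatch":
        # The candidate description is set to empty.
        return []
    raise ValueError("unknown category: %r" % (category,))
```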
CN201210452843.6A 2012-11-13 2012-11-13 Method and apparatus for determining page description information corresponding to a target page Active CN102999576B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201210452843.6A CN102999576B (en) 2012-11-13 2012-11-13 Method and apparatus for determining page description information corresponding to a target page

Publications (2)

Publication Number Publication Date
CN102999576A CN102999576A (en) 2013-03-27
CN102999576B true CN102999576B (en) 2016-08-17

Family

ID=47928144

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201210452843.6A Active CN102999576B (en) 2012-11-13 2012-11-13 Method and apparatus for determining page description information corresponding to a target page

Country Status (1)

Country Link
CN (1) CN102999576B (en)

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number | Priority date | Publication date | Assignee | Title
CN103345476B (en) * | 2013-06-09 | 2017-03-01 | 北京百度网讯科技有限公司 | Method and apparatus for determining presentation information corresponding to a target page
CN103399764A (en) * | 2013-07-24 | 2013-11-20 | 北京小米科技有限责任公司 | Method, device and terminal for setting interface colors
CN103440326A (en) * | 2013-09-02 | 2013-12-11 | 百度在线网络技术(北京)有限公司 | Method and apparatus for providing representation information
CN103699669B (en) * | 2013-12-30 | 2017-03-15 | 北京奇虎科技有限公司 | Method for pushing messages in a browser, and browser terminal
CN110489187B (en) * | 2018-05-15 | 2021-09-24 | 腾讯科技(深圳)有限公司 | Page refreshing method and device, storage medium and computer equipment
CN109492216A (en) * | 2018-09-19 | 2019-03-19 | 平安科技(深圳)有限公司 | Water note automatic identification and approval method, device and computer-readable storage medium

Citations (4)

* Cited by examiner, † Cited by third party
Publication number | Priority date | Publication date | Assignee | Title
CN101251855A (en) * | 2008-03-27 | 2008-08-27 | 腾讯科技(深圳)有限公司 | Equipment, system and method for cleaning internet web pages
CN101404031A (en) * | 2008-11-12 | 2009-04-08 | 北京搜狗科技发展有限公司 | Method and system for recognizing concept type web pages
CN102609407A (en) * | 2012-02-16 | 2012-07-25 | 复旦大学 | Fine-grained semantic detection method for harmful text content in a network
CN102750334A (en) * | 2012-06-01 | 2012-10-24 | 北京市农林科学院农业科技信息研究所 | Accurate agricultural information push method based on data mining (DM)

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number | Priority date | Publication date | Assignee | Title
US8260664B2 (en) * | 2010-02-05 | 2012-09-04 | Microsoft Corporation | Semantic advertising selection from lateral concepts and topics

Also Published As

Publication number Publication date
CN102999576A (en) 2013-03-27

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant