WO2013120373A1 - 搜索方法、装置及存储介质 - Google Patents
搜索方法、装置及存储介质 Download PDFInfo
- Publication number
- WO2013120373A1 WO2013120373A1 PCT/CN2012/086025 CN2012086025W WO2013120373A1 WO 2013120373 A1 WO2013120373 A1 WO 2013120373A1 CN 2012086025 W CN2012086025 W CN 2012086025W WO 2013120373 A1 WO2013120373 A1 WO 2013120373A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- information
- searched
- vector
- matching algorithm
- document
- Prior art date
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/33—Querying
- G06F16/3331—Query processing
- G06F16/334—Query execution
- G06F16/3344—Query execution using natural language analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/33—Querying
- G06F16/338—Presentation of query results
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/36—Creation of semantic tools, e.g. ontology or thesauri
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/93—Document management systems
Definitions
- the present invention relates to the field of computer network search technologies, and in particular, to a search method, device, and storage medium.
- the traditional search scheme mainly includes: searching all the associated documents in the network according to the information input by the user, and calculating the degree of association between each associated document and the information to be searched according to a certain algorithm rule, based on the degree of association to all associations.
- the document is sorted and the sorted result is returned to the user as a search result.
- the degree of relevance directly affects the ranking results of related documents, directly affecting the user's search results, and the degree of relevance is generally reflected by the relevance score.
- the word matching algorithm is usually used for correlation calculation, for example, BM25 (Best Match) algorithm, proximity (Term proximity scoring) algorithm, etc., relevance score, relevance score The higher the value, the stronger the association.
- BM25 Best Match
- proximity Term proximity scoring
- relevance score relevance score The higher the value, the stronger the association.
- the relevance score of the associated document is 0; for example: one of the associated documents is: "Beijing, it is a historical and cultural city with a history of more than 3,000 years of construction, more than 850 years of history Is the national political and cultural center, and is also the country's largest land and air production hub.
- the correlation score of the associated document is 0, indicating that it is not related to the information to be searched, however, from the semantic relationship See, the correlation between the associated document and the information to be searched is actually very good.
- the associated document may be arranged in the later search result page, which is not conducive to the user's viewing.
- the technical problem to be solved by the embodiments of the present invention is to provide a search method, a device and a storage medium, which can obtain more accurate search results.
- an embodiment of the present invention provides a search method, including:
- the obtained related documents are sorted according to the calculated correlation, and the sorting result is displayed.
- an embodiment of the present invention further provides a search apparatus, including:
- a search module configured to acquire an associated document of the information to be searched
- a calculation module configured to calculate a correlation between each associated document obtained by the search module and the information to be searched based on a word matching algorithm and a semantic matching algorithm
- a sorting module configured to perform sorting processing on all associated documents obtained by the search module according to the correlation calculated by the calculating module
- a display module configured to display a sort result obtained by the sorting module.
- an embodiment of the present invention further provides a storage medium including computer executable instructions for performing a search method, the method comprising the steps of: acquiring an associated document of information to be searched ;
- the obtained related documents are sorted according to the calculated correlation, and the sorting result is displayed.
- the embodiment of the invention combines the word matching algorithm and the semantic matching algorithm, comprehensively considers the matching of words and words, and the matching of semantic relations between words and words, and obtains a relatively accurate correlation between each associated document and the information to be searched. Sorting based on the relevance and displaying the sorting result can provide users with ideal search results, so that the user can quickly obtain related documents with high relevance from the displayed search results, satisfying their actual search requirements, and improving search efficiency. , thus improving User satisfaction. BRIEF abstract
- FIG. 1 is a flow chart of an embodiment of a search method provided by the present invention.
- step S102 shown in FIG. 1;
- FIG. 3 is a schematic diagram of an IDF table provided by the present invention.
- FIG. 4 is a schematic diagram of a M1 table provided by the present invention.
- FIG. 5 is a specific flowchart of step S103 shown in FIG. 1;
- FIG. 6 is a schematic structural diagram of an embodiment of a search apparatus provided by the present invention.
- FIG. 7 is a schematic structural diagram of an embodiment of the computing module shown in FIG. 6. Preferred embodiment of the invention
- the searching device may calculate the relevance of all associated documents of the information to be searched based on the word matching and the semantic matching algorithm between words and words, and sort and display according to the relevance, so that the user You can quickly obtain related documents with high relevance from the displayed search results to meet your own search needs and improve search efficiency.
- the information to be searched may be a search keyword sentence input by the user, and the query information may be used.
- the associated document may be: a document included in a search result obtained by using an existing web search technology based on a search keyword sentence input by a user, which may be represented by a document.
- the word matching algorithm refers to the search process based on the word matching, which may be: BM25 algorithm, proximity algorithm, etc., unless otherwise specified, the embodiment of the present invention uses the BM25 algorithm as an example for description.
- the semantic matching algorithm means that the search process is based on the semantic relationship between words and words, that is, the search process is based on mutual information between words and words.
- MI Matter
- MI ual Information
- FIG. 1 is a flowchart of an embodiment of a search method provided by the present invention. the method includes:
- the score of the relevance of each associated document to the information to be searched may be composed of two parts, one is an association score obtained based on the word matching algorithm, and the other is an association score obtained based on the semantic matching algorithm.
- the weights of the two-part correlation scores may be preset according to specific conditions, so that the correlation scores composed of the weighted two-part correlation scores can more accurately reflect the degree of association between the associated documents and the information to be searched.
- all related documents obtained by the search may be sorted and displayed according to the relevance of each related document and the information to be searched in descending order, so that the displayed information is always related to the information to be searched.
- the related documents enable the user to quickly obtain related documents with high relevance from the displayed search results, satisfying their own search requirements and improving search efficiency. It can be understood that this step can also perform sorting processing in other orders, for example, in descending order according to the relevance degree, or setting a part in descending order according to the relevance degree, one part The scores are ranked in descending order of relevance, and so on.
- step S102 includes:
- the search information is vectorized, that is, the word segmentation technique is used, and the search information is processed by word segmentation, and the information to be searched is divided into m words, which can be expressed as ⁇ to, where m and both are positive integers. , and lm.
- 5212 Perform vectorization processing on each associated document obtained, and obtain n vectors corresponding to each associated document.
- each document in the obtained related documents is vectorized, that is, using the word segmentation technology, each associated document is subjected to word segmentation, and the associated document is divided into n words, which can be expressed as ⁇ to ⁇ , where n and _/ are both positive integers, and 1 _/ n.
- step S211 and step S212 are not sequential in sequence.
- step S212 may be performed first, and then step S211 is performed.
- the process of the vectorization process in step S211-step S212 can refer to the prior art, and details are not described herein.
- the formula of the word matching algorithm can be:
- the parameters, k, and the adjustment factor can play the role of smoothing the data;
- the parameters, k, and k are constants, and the specific values can be set by the user according to the actual situation or the empirical value;
- Qtfi is the first vector ⁇ , the word frequency in the information to be searched, that is, the number of times the vector t t appears in the information to be searched;
- Tfi is a vector, the frequency of words in the associated document, ie vector ⁇ , the number of occurrences in the corresponding associated document;
- Avdl is the average length of all associated documents
- the weight of the vector ⁇ is generally the IDF (Inverse document frequency) value, which can be calculated by the following formula, which is as follows:
- the weights (IDF values) of the vectors (words) in the network may be pre-calculated and stored.
- the weights of the vectors may be stored in the form of a table.
- FIG. 3 is a schematic diagram of an IDF table provided by the present invention.
- the IDF table in the example shown in FIG. 3 stores the weights of the vectors. It can be understood that the IDF table of the example shown in FIG. 3 and the table are Each item is an example.
- step S213 the weights of the vectors in the information to be searched can be directly read from the preset IDF table, and the parameters required to obtain the word matching algorithm are calculated according to the data obtained in step S211 and step S212, and substituted. Calculated in the calculation formula of the word matching algorithm, the correlation score of the related document and the to-be-searched information is obtained.
- the formula of the semantic matching algorithm may be:
- the parameters, k, and the adjustment factor can play the role of smoothing the data;
- the parameters, k, and k are constants, and the specific values can be set by the user according to the actual situation or the empirical value;
- / for the length of the corresponding associated document, according to the result of the vectorization processing in step S212, the value of / is n; Avdl is the average length of all associated documents obtained;
- the service is a vector ⁇ ,. Mutual information with the vector.
- the mutual information between each vector (word) and each vector in the network may be pre-calculated and stored before the execution of the search process.
- the mutual information between the vectors may be stored in the form of a table. .
- FIG. 4 it is a schematic diagram of the M1 table provided by the present invention; the M1 table in the example shown in FIG. 4 stores mutual information between the vectors, and it can be understood that the M1 table of the example shown in FIG. 4 And the items in the table are examples.
- step S214 the mutual information of each vector in the to-be-searched information and each vector of the associated document can be directly read from the preset M1 table, and calculated according to the data obtained in step S211 and step S212.
- the parameters required for obtaining the semantic matching algorithm are calculated and substituted into the calculation formula of the semantic matching algorithm to obtain an association score S 2 of the associated document and the information to be searched.
- step S213 and step S214 are not sequential in sequence. For example, step S214 may be performed first, and then step S213 is performed.
- step S103 includes: S311 , according to the relevance of each associated document and the information to be searched, in order of relevance from highest to lowest. Associate documents for sorting.
- step S311 After the sorting process in step S311, the associated documents are arranged in descending order of relevance, and step S312 displays related documents arranged in descending order of relevance, so that the user can quickly display from the displayed search results. Get relevant documents with high relevance to meet your own search needs and improve search efficiency.
- XX mobile phone price/performance ratio "XX mobile phone price/performance ratio"
- XX brand mobile phone is very good value for money, and XX brand mobile phone is very durable;
- Related document 2 I am a loyal friend of XX brand mobile phone, like to play XX brand mobile phone, brush machine, download program, game In all aspects, I feel that the various softwares of the XX brand mobile phone are relatively comprehensive, so I have been playing until now;
- Step S212 performs vectorization processing on any associated document, and associates the document 1 as an example.
- n vectors are obtained, as follows: XX card ⁇ ⁇ mobile phone ⁇ cost-effective ⁇ are ⁇ very ⁇ good ⁇ ⁇ , ⁇ and ⁇ XX card ⁇ mobile ⁇ very ⁇ durable ⁇ .
- n 15, ⁇ as “XX” brand, ⁇ 2 "and” 4 "Mobile” for the “price” for "all” for the “Gen”, ⁇ ⁇ is “Yes”, ⁇ 3 ⁇ 4 For "”, ⁇ 9 for", ", 4.
- d is "XX card”
- d 12 is “mobile phone”
- d 13 is ⁇
- d 14 is “durable”
- d l5 is "of".
- the vectors may be separately counted.
- the word frequency in the information to be searched is: ⁇ is 1, ⁇ 2 is 1, and 3 is 1.
- the vector, the word frequency in the associated document, is: ⁇ is 2, ⁇ 2 is 2, and ⁇ 3 is 1.
- / is the length 15 of the associated document 1.
- Flw / / is the average length of the three associated documents.
- the weights of the vectors in the information to be searched can be read from the preset IDF table shown in FIG. 3 as follows: ⁇ is 8.435292, w 2 is 5.256969, and w 3 is 8.952069. Based on the calculation formula of the word matching algorithm, the association score of the associated document and the to-be-searched information is calculated.
- step S214 mutual information of each vector in the information to be searched and each vector of the associated document may be read from the preset M1 table shown in FIG. Based on the calculation formula of the semantic matching algorithm, the association score of the associated document and the information to be searched is calculated.
- step S215 it may be set to, for example, 0.4 according to actual needs, so that the correlation between the associated document 1 and the information to be searched is calculated to be 1.759 by using ⁇ -pair and weighted summation.
- Step S311 sorts the associated documents 1-3 in descending order of relevance to form an arrangement of "related documents 3 - associated documents 2 - associated documents.
- Step S312 displays the arrangement obtained in step S311 to the user.
- the user can obtain the most relevant related document 3 from the first search result, and the user can satisfy his actual search requirement without searching, thereby improving the search efficiency.
- the embodiment of the invention combines the word matching algorithm and the semantic matching algorithm, comprehensively considers the matching of words and words, and the matching of semantic relations between words and words, and obtains a relatively accurate correlation between each associated document and the information to be searched. Sorting based on the relevance and displaying the sorting result can provide users with ideal search results, so that users can quickly obtain relevance from the displayed search results. Higher associated documents, to meet their actual search needs, improve search efficiency, thereby improving user satisfaction.
- the search device provided by the embodiment of the present invention will be described in detail below with reference to FIG. 6 to FIG. 7. The device of the following embodiments may be used. It is applied to the above method embodiment.
- FIG. 6 is a schematic structural diagram of an embodiment of a search apparatus provided by the present invention.
- the apparatus includes:
- the search module 101 is configured to acquire an associated document of the information to be searched.
- the specific search process of the search module 101 can refer to the prior art, and details are not described herein.
- the calculating module 102 is configured to calculate, according to the word matching algorithm and the semantic matching algorithm, the relevance of each associated document obtained by the search module 101 and the information to be searched.
- the score of the relevance of each associated document to the information to be searched may be composed of two parts, one is an association score obtained based on a word matching algorithm, and the other is an association score obtained based on a semantic matching algorithm.
- the weights of the two parts of the associated scores may be preset according to specific conditions, so that the relevance scores of the weighted two-part correlation scores more accurately reflect the degree of association between the associated documents and the information to be searched.
- the sorting module 103 is configured to sort the associated documents obtained by the search module according to the correlation calculated by the calculating module 102.
- the sorting module 103 may sort all the related documents obtained by the search according to the order of relevance of each associated document and the information to be searched calculated by the calculating module 102, or may perform sorting processing in other orders, for example, According to the relevance degree, the order is from low to high, or the part is set in descending order according to the relevance degree, and the part is ranked in descending order according to the relevance degree, and so on.
- the display module 104 is configured to display the sorting result obtained by the sorting module 103.
- the display module 104 displays the sorting result obtained by the sorting module 103, so that the displayed related document that is always related to the information to be searched is always displayed, so that the user can quickly obtain the related document with high relevance from the displayed search result. , to meet their own search needs, improve search efficiency.
- FIG. 7 which is a schematic structural diagram of an embodiment of the computing module shown in FIG. 6, the computing module 102 includes:
- the first vectorization processing unit 211 is configured to perform vectorization processing on the to-be-searched information to obtain m vectors ⁇ , ⁇ .
- the first vectorization processing unit 211 performs vectorization processing on the search information, that is, uses a word segmentation technique to perform word segmentation processing on the search information, and divides the information to be searched into m words, which can be expressed as, wherein, m and both Positive integer, and lm.
- the specific processing procedure of the first vectorization processing unit 211 can refer to the prior art, and details are not described herein.
- the second vectorization processing unit 212 is configured to perform vectorization processing on each associated document obtained by the search module to obtain n vectors corresponding to each associated document.
- the second vectorization processing unit 212 performs vectorization processing on the associated document, that is, uses word segmentation technology to perform word segmentation processing on the associated document, and divides the associated document into n words, which can be expressed as 4 to , where, ! ! And ⁇ ' are both positive integers, and 1 second vectorization processing unit
- the word matching calculation unit 213 is configured to calculate, according to the word matching algorithm, an association score of the associated document processed by the second vectorization processing unit 212 and the information to be searched.
- the word matching calculation unit 213 can directly read the weights of the vectors in the information to be searched directly from the preset IDF table in the example shown in FIG. 3, and according to the first vectorization processing unit 211 and the second vectorization processing unit.
- the data obtained by 212 is used to calculate various parameters required for obtaining the word matching algorithm, and based on the calculation formula of the word matching algorithm, the associated score of the associated document and the information to be searched is calculated.
- the semantic matching calculation unit 214 is configured to calculate, according to the semantic matching algorithm, the association score S 2 of the associated document processed by the second vectorization processing unit 212 and the to-be-searched information.
- the semantic matching calculation unit 214 can directly read the mutual information of each vector in the information to be searched and each vector of the associated document from the preset M1 table in the example shown in FIG. 4, and according to the first direction
- the data obtained by the quantization processing unit 211 and the second vectorization processing unit 212 calculates various parameters required to obtain the semantic matching algorithm, and calculates a correlation between the associated document and the to-be-searched information based on a calculation formula of the semantic matching algorithm. Rating S 2 .
- the value set according to the specific situation may be such that the weighted sum and S 2 correlation degree score S can more accurately reflect the degree of association between the associated document and the information to be searched. It should be noted that the larger the value of S, the stronger the association between the associated document and the information to be searched.
- the second vectorization processing unit 212, the word matching calculation unit 213, the semantic matching calculation unit 214, and the relevance calculation unit 215 may need to repeat the work until the relevance of all associated documents to the information to be searched is obtained. Then, the sorting module 103 may sort all the related documents obtained by the search module according to the relevance of each associated document and the information to be searched, in descending order of relevance; the display module 104 Then, the sorting module 103 displays all the associated documents processed by the sorting module 103.
- the search apparatus may be: a search engine, a browser, and a terminal having a search function.
- the embodiment of the present invention combines a word matching algorithm and a semantic matching algorithm, comprehensively considers the matching of words and words, and the matching of semantic relations between words and words, and obtains each associated document and information to be searched.
- the more accurate correlation, sorting based on the relevance and displaying the sorting result can provide users with ideal search results, so that users can quickly obtain related documents with high relevance from the displayed search results, and satisfy their actual situation. Search requirements increase search efficiency and increase user satisfaction.
- the storage medium may be a magnetic disk, an optical disk, a read-only memory (ROM), or a random access memory (RAM).
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Databases & Information Systems (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Artificial Intelligence (AREA)
- Business, Economics & Management (AREA)
- General Business, Economics & Management (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
Description
Claims
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US14/347,776 US9317590B2 (en) | 2012-02-13 | 2012-12-06 | Search method, search device and storage medium |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201210031523.3A CN103246681B (zh) | 2012-02-13 | 2012-02-13 | 一种搜索方法及装置 |
CN201210031523.3 | 2012-02-13 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2013120373A1 true WO2013120373A1 (zh) | 2013-08-22 |
Family
ID=48926205
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2012/086025 WO2013120373A1 (zh) | 2012-02-13 | 2012-12-06 | 搜索方法、装置及存储介质 |
Country Status (3)
Country | Link |
---|---|
US (1) | US9317590B2 (zh) |
CN (1) | CN103246681B (zh) |
WO (1) | WO2013120373A1 (zh) |
Families Citing this family (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103699662B (zh) * | 2013-12-27 | 2018-01-19 | 贝壳网际(北京)安全技术有限公司 | 一种通知栏消息展现方法及装置 |
GB201514249D0 (en) * | 2015-08-12 | 2015-09-23 | Trw Ltd | Processing received radiation reflected from a target |
US9984031B2 (en) | 2015-10-26 | 2018-05-29 | International Business Machines Corporation | Adapter selection based on a queue time factor |
CN106815252B (zh) * | 2015-12-01 | 2020-08-25 | 阿里巴巴集团控股有限公司 | 一种搜索方法和设备 |
CN105653703A (zh) * | 2015-12-31 | 2016-06-08 | 武汉传神信息技术有限公司 | 一种文档检索匹配方法 |
CN107341152B (zh) * | 2016-04-28 | 2020-05-08 | 创新先进技术有限公司 | 一种参数输入的方法及装置 |
CN107798637A (zh) * | 2016-08-30 | 2018-03-13 | 北京国双科技有限公司 | 同案异判文书的获取方法及装置 |
CN108415903B (zh) * | 2018-03-12 | 2021-09-07 | 武汉斗鱼网络科技有限公司 | 判断搜索意图识别有效性的评价方法、存储介质和设备 |
CN110362813B (zh) * | 2018-04-09 | 2023-12-05 | 乐万家财富(北京)科技有限公司 | 基于bm25的搜索相关性度量方法、存储介质、设备及*** |
CN109388786B (zh) * | 2018-09-30 | 2024-01-23 | 广州财盟科技有限公司 | 一种文档相似度计算方法、装置、设备及介质 |
CN109408616A (zh) * | 2018-10-10 | 2019-03-01 | 中南民族大学 | 内容相似性短文本查询方法、设备、***及存储介质 |
CN110162590A (zh) * | 2019-02-22 | 2019-08-23 | 北京捷风数据技术有限公司 | 一种工程招标文本结合经济要素的数据库显示方法及其装置 |
CN111611372A (zh) * | 2019-02-25 | 2020-09-01 | 北京嘀嘀无限科技发展有限公司 | 搜索结果的排序方法及装置、音乐搜索方法及装置 |
CN109977292B (zh) * | 2019-03-21 | 2022-12-27 | 腾讯科技(深圳)有限公司 | 搜索方法、装置、计算设备和计算机可读存储介质 |
CN113361248B (zh) * | 2021-06-30 | 2022-08-12 | 平安普惠企业管理有限公司 | 一种文本的相似度计算的方法、装置、设备及存储介质 |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1741012A (zh) * | 2004-08-23 | 2006-03-01 | 富士施乐株式会社 | 文本检索装置及方法 |
US20110087701A1 (en) * | 2009-10-09 | 2011-04-14 | International Business Machines Corporation | System, method, and apparatus for pairing a short document to another short document from a plurality of short documents |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1162789C (zh) * | 2001-09-06 | 2004-08-18 | 联想(北京)有限公司 | 通过主题词矫正基于向量空间模型文本相似度计算的方法 |
CN102043833B (zh) * | 2010-11-25 | 2013-12-25 | 北京搜狗科技发展有限公司 | 一种基于查询词进行搜索的方法和搜索装置 |
US9589050B2 (en) * | 2014-04-07 | 2017-03-07 | International Business Machines Corporation | Semantic context based keyword search techniques |
-
2012
- 2012-02-13 CN CN201210031523.3A patent/CN103246681B/zh active Active
- 2012-12-06 WO PCT/CN2012/086025 patent/WO2013120373A1/zh active Application Filing
- 2012-12-06 US US14/347,776 patent/US9317590B2/en active Active
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1741012A (zh) * | 2004-08-23 | 2006-03-01 | 富士施乐株式会社 | 文本检索装置及方法 |
US20110087701A1 (en) * | 2009-10-09 | 2011-04-14 | International Business Machines Corporation | System, method, and apparatus for pairing a short document to another short document from a plurality of short documents |
Non-Patent Citations (1)
Title |
---|
PABLO CASTELLS ET AL.: "An Adaptation of the Vector-Space Model for Ontology-Based information Retrieval.", IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING., vol. 19, no. 2, February 2007 (2007-02-01), pages 261 - 272, XP011152473 * |
Also Published As
Publication number | Publication date |
---|---|
US9317590B2 (en) | 2016-04-19 |
US20140358914A1 (en) | 2014-12-04 |
CN103246681A (zh) | 2013-08-14 |
CN103246681B (zh) | 2018-10-26 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2013120373A1 (zh) | 搜索方法、装置及存储介质 | |
US11507975B2 (en) | Information processing method and apparatus | |
CN104216942B (zh) | 查询建议模板 | |
JP5913736B2 (ja) | キーワードの推薦 | |
US11687968B1 (en) | Serving advertisements based on partial queries | |
CN108628833B (zh) | 原创内容摘要确定方法及装置,原创内容推荐方法及装置 | |
US8103667B2 (en) | Ranking results of multiple intent queries | |
CN108763362A (zh) | 基于随机锚点对选择的局部模型加权融合Top-N电影推荐方法 | |
US9864747B2 (en) | Content recommendation device, recommended content search method, and program | |
WO2019023358A1 (en) | SEMANTIC SIMILARITY FOR MODEL CLASSIFICATION OF RESULTS OF MACHINE LEARNING | |
CN103345517B (zh) | 模拟tf-idf相似性计算的协同过滤推荐算法 | |
CN106651544B (zh) | 最少用户交互的会话式推荐*** | |
US10152478B2 (en) | Apparatus, system and method for string disambiguation and entity ranking | |
JP7150090B2 (ja) | ショッピング検索のための商品属性抽出方法 | |
JP2015522190A (ja) | 検索結果の生成 | |
CN107943910B (zh) | 一种基于组合算法的个性化图书推荐方法 | |
US11100169B2 (en) | Alternative query suggestion in electronic searching | |
CN103744887B (zh) | 一种用于人物搜索的方法、装置和计算机设备 | |
CN110968789B (zh) | 电子书推送方法、电子设备及计算机存储介质 | |
CN111125348A (zh) | 一种文本摘要的提取方法及装置 | |
CN109960749A (zh) | 模型获取方法、关键词生成方法、装置、介质及计算设备 | |
CN107291894A (zh) | 一种融合相似性和共同评分项数量的概率矩阵分解模型 | |
US9251264B2 (en) | Systems and methods for enabling an electronic graphical search space of a database | |
CN109144953B (zh) | 搜索文件的排序方法、装置、设备、存储介质及搜索*** | |
CN113449200A (zh) | 物品推荐方法、装置及计算机存储介质 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 12868395 Country of ref document: EP Kind code of ref document: A1 |
|
WWE | Wipo information: entry into national phase |
Ref document number: 14347776 Country of ref document: US |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
32PN | Ep: public notification in the ep bulletin as address of the adressee cannot be established |
Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 30.01.15) |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 12868395 Country of ref document: EP Kind code of ref document: A1 |