WO2021000826A1 - Information search method and apparatus, terminal and storage medium - Google Patents

Publication number
WO2021000826A1
Authority
WO
WIPO (PCT)
Prior art keywords
interest
search
points
terminal
query
Prior art date
Application number
PCT/CN2020/098785
Other languages
French (fr)
Chinese (zh)
Inventor
李俊良
Original Assignee
Oppo广东移动通信有限公司
Application filed by Oppo广东移动通信有限公司
Publication of WO2021000826A1

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/901 Indexing; Data structures therefor; Storage structures
    • G06F16/9014 Indexing; Data structures therefor; Storage structures: hash tables
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/95 Retrieval from the web
    • G06F16/953 Querying, e.g. by the use of web search engines
    • G06F16/9535 Search customisation based on user profiles and personalisation
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/95 Retrieval from the web
    • G06F16/953 Querying, e.g. by the use of web search engines
    • G06F16/9537 Spatial or temporal dependent retrieval, e.g. spatiotemporal queries
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/95 Retrieval from the web
    • G06F16/953 Querying, e.g. by the use of web search engines
    • G06F16/9538 Presentation of query results

Definitions

  • This application belongs to the field of data processing technology, and in particular relates to an information search method, device, terminal and storage medium.
  • the search engine is used to provide search services and display search results related to the query keyword (Query) entered by the user to the user.
  • Query query keyword
  • a query keyword can reflect a variety of different search intents.
  • Existing comprehensive search engines cannot parse the user's real search intent well.
  • the search results returned with priority often fail to match the user's real search intent, resulting in low information search efficiency.
  • the embodiments of the present application provide an information search method, device, terminal and storage medium to at least solve the problem that the search results preferentially returned by comprehensive search engines often fail to match the user's real search intent, so that information search efficiency is low.
  • the embodiment of the present application provides an information search method, including:
  • the search results are output in sorted order.
  • the searching for points of interest in the geographic area where the terminal is located related to the query keyword includes:
  • LBS Location-Based Service
  • the input method is a word-by-word input
  • the search for points of interest in the geographic area where the terminal is located related to the query keyword in the LBS corpus database corresponding to the input method includes:
  • in the LBS corpus database stored as a prefix tree, search for points of interest that are located in the geographic area where the terminal is located and related to the query keyword.
  • the input mode is a one-time input
  • the search for the points of interest in the geographic area where the terminal is located related to the query keyword in the LBS corpus database corresponding to the input mode includes:
  • in the LBS corpus database stored as a dictionary tree, search for points of interest that are located in the geographic area where the terminal is located and related to the query keyword.
  • the obtaining of query keywords includes:
  • the searching for points of interest in the geographic area where the terminal is located related to the query keyword includes:
  • the searching for the points of interest in the geofence where the terminal is located related to the query keyword includes:
  • the embodiment of the present application also provides an information search device, including:
  • the obtaining unit is configured to obtain query keywords
  • the searching unit is configured to search for points of interest in the geographic area where the terminal is located and related to the query keyword;
  • the adding unit is configured to determine that a point of interest is found, and to add a relevance score corresponding to the found point of interest, where the relevance score is used to characterize the correlation between the corresponding search result and the query keyword;
  • the sorting unit is configured to sort at least two search results according to the relevance score, the at least two search results including the found points of interest and matching information of the query keywords other than the found points of interest;
  • the output unit is configured to output search results in a sorted order.
  • the embodiment of the present application also provides a terminal, including a processor and a memory configured to store a computer program that can run on the processor,
  • the processor is configured to execute the steps of any one of the foregoing information searching methods when running the computer program.
  • the embodiment of the present application also provides a storage medium, the storage medium stores a computer program, and when the computer program is executed by a processor, the steps of any one of the foregoing information search methods are implemented.
  • the solution provided by the embodiments of this application considers the geographic location of the terminal when it initiates the search, and checks whether there are points of interest in the geographic area where the terminal is located that are related to the query keywords. If such points of interest exist, the relevance between the point of interest and the query keyword is increased, so that when the search results are output, the point of interest is displayed to the user as a priority search result. This increases the likelihood that the priority search results match the user's real search intent and improves the efficiency of information search.
  • FIG. 1 is a schematic diagram of an implementation process of an information search method provided by an embodiment of the present application
  • FIG. 2 is a schematic diagram of the implementation flow of searching for points of interest in the information search method provided by an embodiment of the present application;
  • FIG. 3 is a schematic diagram of the implementation process of searching for points of interest in the information search method provided by another embodiment of the present application;
  • FIG. 4 is a schematic diagram of an implementation process of an information search method provided by another embodiment of the present application.
  • FIG. 5 is a schematic diagram of an implementation flow of an information search method provided by another embodiment of the present application.
  • Figure 6 is a schematic diagram of a data structure for storing LBS corpus provided by an embodiment of the present application.
  • FIG. 7 is a schematic diagram of an inverted index structure in related technologies
  • FIG. 8 is an example diagram of search result output provided by an application embodiment of this application.
  • FIG. 9 is an example diagram of search result output provided by another application embodiment.
  • FIG. 10 is a schematic diagram of the implementation principle of the information search method provided by the embodiment of the present application.
  • Figure 11 is a structural block diagram of an information search device provided by an embodiment of the present application.
  • FIG. 12 is a schematic diagram of the hardware structure of a terminal provided by an embodiment of the present application.
  • FIG. 1 shows the implementation process of the information search method provided by the embodiment of the present invention.
  • the execution subject of the information search method is mobile terminals such as smart phones, tablets, and notebooks.
  • the information search method provided by the embodiment of the present invention includes:
  • obtaining query keywords includes:
  • Format normalization is to remove data noise in keywords input to the terminal, so as to improve the efficiency of data processing in the search process.
  • the format normalization methods include but are not limited to: converting keywords from full-width to half-width, from traditional to simplified, and from uppercase to lowercase.
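The normalization steps listed above can be sketched as follows. This is a minimal illustration, not the applicant's implementation: the function name `normalize_query` and the three-entry `TRAD_TO_SIMP` table are assumptions made for the example, and a real system would need a complete traditional-to-simplified conversion dictionary.

```python
import unicodedata

# Hypothetical, deliberately tiny traditional-to-simplified table.
# A production system would load a complete conversion dictionary.
TRAD_TO_SIMP = {"國": "国", "學": "学", "館": "馆"}


def normalize_query(raw: str) -> str:
    """Return a normalized copy of a raw query keyword."""
    # NFKC folds full-width letters, digits and the ideographic space
    # into their half-width counterparts.
    text = unicodedata.normalize("NFKC", raw)
    # Map traditional characters to simplified ones where the table knows them.
    text = "".join(TRAD_TO_SIMP.get(ch, ch) for ch in text)
    # Lowercase and trim surrounding whitespace.
    return text.lower().strip()
```

Running the three conversions in a fixed order keeps the function deterministic, so equal-looking inputs map to the same lookup key.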
  • search engines will use specific computer programs to crawl information from the Internet according to certain strategies in advance, organize and process the crawled information, form a data structure that is easy to retrieve, and store it in the database.
  • After obtaining the query keywords, the search engine searches the database based on the query keywords to obtain matching information; each piece of matching information corresponds to one search result for the query keywords.
  • S102 Search for points of interest that are related to the query keyword and located in the geographic area where the terminal is located.
  • Point of interest in English can be expressed as Point of Interest
  • Point of Interest usually refers to a certain landmark on the electronic map, which can be office buildings, communities, shops, bus stations, parks, stadiums, etc.
  • each point of interest can contain four aspects of information: name, category, coordinates and classification.
  • the point of interest is configured with geographic location information of the location of the point of interest.
  • geographic location information is usually marked with latitude and longitude.
  • searching for points of interest requires combining the geographic location of the point of interest with the geographic area where the terminal is located, where that geographic area can be an area centered on the geographic location of the terminal and lying within a set radius.
  • the geographic location of the terminal can be obtained through the positioning function of the terminal.
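The "set radius" area described above can be illustrated with a great-circle distance check. This is a sketch under stated assumptions: the haversine formula is one common way to measure the distance between two latitude/longitude pairs, and the 2000 m default radius and the function names are chosen only for the example.

```python
import math

EARTH_RADIUS_M = 6_371_000.0  # mean Earth radius in metres


def haversine_m(lat1: float, lon1: float, lat2: float, lon2: float) -> float:
    """Great-circle distance in metres between two (lat, lon) points in degrees."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlmb = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2
    return 2 * EARTH_RADIUS_M * math.asin(math.sqrt(a))


def in_terminal_area(terminal, poi, radius_m: float = 2000.0) -> bool:
    """True when the point of interest lies within radius_m of the terminal."""
    return haversine_m(*terminal, *poi) <= radius_m
```

Both arguments are `(latitude, longitude)` tuples in degrees, matching the latitude/longitude marking mentioned in the text.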
  • the geographic area where the terminal is located can also be determined in combination with the LBS. Referring to Figure 2, searching for points of interest in the geographic area where the terminal is located related to the query keyword includes:
  • S201 Determine the geographic location of the terminal.
  • the geographic location of the terminal can be obtained through the positioning function of the terminal, and the available positioning technical means include but not limited to satellite positioning, base station positioning, Wi-Fi positioning, and the like.
  • S202 Determine the geofence where the geographic location is located.
  • LBS is a value-added service that obtains the location information of the terminal through the radio communication network of a mobile telecommunications operator, or through an external positioning method, and, with the support of a Geographic Information System (GIS) platform, provides users with corresponding services.
  • GIS Geographic Information System
  • a virtual fence is used to enclose a virtual geographic boundary in the GIS platform, and the geographic area surrounded by the geographic boundary is called a geographic fence.
  • the geographic area has actually been gridded at the virtual level, and it can be considered that the geographic area is spliced by the multiple geo-fences.
  • the geographic fence delineating the geographic location can be regarded as the geographic area where the terminal is located.
  • the geographic area where the terminal is located can be determined more efficiently, and more information services can also be provided based on geofencing and user search requirements.
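Steps S201 and S202 treat the map as a grid of geofences and look up the fence that contains the terminal. A minimal sketch, assuming square cells of a fixed size in degrees (the cell size `CELL_DEG` and the tuple-shaped fence identifier are assumptions for illustration; the source does not specify how fences are drawn):

```python
import math

CELL_DEG = 0.01  # hypothetical cell size in degrees (roughly 1 km of latitude)


def geofence_id(lat: float, lon: float, cell_deg: float = CELL_DEG) -> tuple:
    """Map a position to the (row, column) index of the grid cell containing it."""
    return (math.floor(lat / cell_deg), math.floor(lon / cell_deg))
```

Two positions that fall in the same cell share an identifier, so the identifier can serve as the key for any per-geofence data.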
  • S203 Search for points of interest located in the geofence that are related to the query keyword.
  • After the geographic fence corresponding to the geographic location of the terminal is determined, the geographic area where the terminal is located is in effect determined, and it is possible to find out whether there is a point of interest related to the query keyword whose geographic location lies in the geographic fence.
  • After the matching information of the query keywords has been searched, it is possible to check, based on that matching information, whether it contains a point of interest in the geographic area where the terminal is located.
  • the thread for finding points of interest can also be started specifically, that is, the search process of the points of interest and the search process of matching information of the query keywords can be executed in parallel.
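Running the point-of-interest lookup alongside the general search can be sketched with a thread pool. The two search functions below are hypothetical stand-ins that return canned data; only the parallel structure is the point of the example.

```python
from concurrent.futures import ThreadPoolExecutor


def search_matching_info(keyword: str) -> list:
    """Stand-in for the conventional search over the general database."""
    return [f"article about {keyword}", f"encyclopedia entry: {keyword}"]


def search_points_of_interest(keyword: str) -> list:
    """Stand-in for the point-of-interest lookup in the terminal's area."""
    return [f"{keyword} Hotel"]


def search_in_parallel(keyword: str) -> tuple:
    """Run both searches concurrently and return (matching_info, points_of_interest)."""
    with ThreadPoolExecutor(max_workers=2) as pool:
        general = pool.submit(search_matching_info, keyword)
        pois = pool.submit(search_points_of_interest, keyword)
        return general.result(), pois.result()
```

Because neither search depends on the other's output, the total latency is bounded by the slower of the two rather than by their sum.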
  • searching for points of interest in the geofence related to the query keyword includes:
  • S2031 Access a setting database corresponding to the geofence, and all points of interest in the geofence are stored in the setting database.
  • the points of interest in the electronic map can be sorted out first, using geofences as the unit, to create an index for all points of interest in each geofence and store them in a database; each geofence can correspond to one setting database.
  • the points of interest in the geofence are stored in the data tables of the database in a certain data structure, and the setting database is regularly updated and maintained according to the update frequency of the points of interest.
  • S2032 Search for points of interest related to the query keyword in the setting database.
  • the search for points of interest is completed through the setting database.
  • the setting database is set up in relation to the geofence, so the corresponding setting database can be located quickly through the geofence where the terminal is located, which narrows the search for points of interest.
  • searching in information that contains only points of interest, instead of in all the matching information of the query keywords, greatly improves the search efficiency for points of interest.
  • the setting database can be regularly synchronized with the geographic information system, which can well realize the regular update and maintenance of the points of interest to ensure the accuracy of the search for the points of interest.
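Steps S2031 and S2032 can be sketched with an in-memory stand-in for the per-geofence setting databases. Everything named here is an assumption for illustration: the `Poi` record, the `FENCE_DB` dictionary, the fence identifiers, the sample entries, and the substring match used as the relatedness test.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Poi:
    name: str
    category: str
    lat: float
    lon: float


# Hypothetical stand-in: one list of points of interest per geofence.
FENCE_DB = {
    "fence-A": [
        Poi("Kapok Hotel", "hotel", 22.543, 114.057),
        Poi("Central Park", "park", 22.541, 114.055),
    ],
    "fence-B": [Poi("Riverside Station", "bus station", 22.600, 114.100)],
}


def find_pois(fence_id: str, keyword: str) -> list:
    """Return the points of interest in one geofence whose name contains the keyword."""
    needle = keyword.casefold()
    return [p for p in FENCE_DB.get(fence_id, []) if needle in p.name.casefold()]
```

Only the records of one fence are scanned, which is the narrowing effect the text attributes to keeping a setting database per geofence.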
  • searching for points of interest in the geographic area where the terminal is located related to the query keyword is intended, on the basis of the conventional search process, to recognize the query intent of the search request from the query keyword and the geographic location of the terminal, so as to determine whether the search request is related to the geographic location of the terminal.
  • S1021 According to the context of this search, determine the probability that the search target type is a point of interest.
  • the context can be understood as the environment of this search, including search scenarios, historical search records, etc.
  • the probability that the search target type of this search is a point of interest can be roughly determined. For example, in a period of time before this search occurs, a search related to a point of interest is performed, or a search result related to a point of interest is clicked in the given search results, then the search target type is a point of interest The probability is set to a larger value.
  • S1022 When it is determined that the probability is greater than the set probability value, search for points of interest that are related to the query keyword and located in the geographic area where the terminal is located.
  • Otherwise, the conventional search process can be performed directly, without triggering the search for points of interest in the geographic area where the terminal is located related to the query keyword, which avoids unnecessary point-of-interest searches and effectively improves search efficiency.
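The gating in S1021 and S1022 can be sketched as a probability estimate compared against a set value. The source does not say how the probability is computed; the additive weights, the two context signals, and the 0.5 threshold below are purely hypothetical values chosen for the example.

```python
def poi_intent_probability(recent_poi_search: bool, clicked_poi_result: bool) -> float:
    """Rough, hypothetical estimate that the current search targets a point of interest."""
    prob = 0.2  # assumed baseline
    if recent_poi_search:
        prob += 0.4  # a point-of-interest search happened shortly before
    if clicked_poi_result:
        prob += 0.3  # a point-of-interest result was clicked earlier
    return min(prob, 1.0)


def should_search_pois(prob: float, threshold: float = 0.5) -> bool:
    """Trigger the point-of-interest lookup only above the set probability value."""
    return prob > threshold
```

When `should_search_pois` returns `False`, only the conventional search would run.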
  • searching for points of interest in the geofence related to the query keyword includes:
  • S501 Determine the input mode of the query keyword.
  • word-by-word input refers to the input method of entering the characters of the query keyword one by one, in the order in which they appear in the query keyword. For example, for the query keyword "Kapok" (a three-character word in the original language), entering the three characters one at a time ("wood", "cotton", and the final character), or entering the first two characters together and then the final character, both count as word-by-word input.
  • One-time input refers to the input method of inputting all the characters of the query keyword at one time, for example, inputting "Kapok” at one time through the phrase input method or the input method of copy and paste.
  • S502 In the LBS corpus database corresponding to the input mode, search for points of interest that are related to the query keyword and located in the geographic area where the terminal is located.
  • the LBS corpus is stored in advance in three different LBS corpus databases using three different data structures; as shown in FIG. 6, the three data structures are the DAT (Double Array Trie) tree, the prefix tree, and the hash map.
  • DAT Double Array Trie
  • the DAT tree, that is, the double-array trie, is an implementation of the dictionary tree (trie).
  • the dictionary tree is a tree structure often used for word retrieval.
  • the DAT tree uses the common prefixes of words to save space.
  • the search efficiency is independent of the number of words in the LBS corpus database and depends only on the length of the longest word in the LBS corpus database and the length of the query keyword, so both query efficiency and storage space are taken into account.
  • since a search result is obtained in the DAT tree only when the query keyword is hit completely, when the input mode is one-time input, the LBS corpus database stored as a DAT tree is searched for points of interest that are located in the geographic area where the terminal is located and are related to the query keywords, so as to improve the search efficiency for points of interest through the characteristics of the DAT tree.
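The exact-match behaviour described for the dictionary tree can be illustrated with a small trie. Note the substitution: this sketch uses a plain nested-dictionary trie rather than the double-array layout, which is a more compact encoding of the same structure. The class name and the payload values are assumptions for the example.

```python
class Trie:
    """Nested-dictionary trie; a simple stand-in for a double-array trie."""

    _END = "$end"  # multi-character marker key, so it cannot clash with a single character

    def __init__(self):
        self._root = {}

    def insert(self, word: str, payload) -> None:
        """Store payload under word, sharing nodes with words that have a common prefix."""
        node = self._root
        for ch in word:
            node = node.setdefault(ch, {})
        node[self._END] = payload

    def lookup(self, word: str):
        """Return the payload only when the whole word is stored; otherwise None."""
        node = self._root
        for ch in word:
            node = node.get(ch)
            if node is None:
                return None
        return node.get(self._END)
```

The lookup walks one node per character of the query, so its cost depends on the query length and not on how many words are stored, and a partial query such as a prefix yields no result.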
  • the prefix tree, namely the marisa tree
  • the point of interest is searched in combination with the edit distance method.
  • the edit distance, also called the Levenshtein distance, is the minimum number of editing operations required to convert one string into another.
  • the permitted editing operations are replacing one character with another character, inserting a character, and deleting a character.
  • to convert "Kapok" to "Kapok Hotel", only two characters need to be inserted (counting characters in the original language), so the edit distance is only 2.
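The edit distance defined above can be computed with the standard dynamic-programming recurrence. The `suggest` helper is a hypothetical illustration of how partially typed input might be matched against stored names by comparing it with each name's leading characters; the source does not describe the matching rule in this detail.

```python
def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance: minimum insertions, deletions and substitutions."""
    prev = list(range(len(b) + 1))  # distances from the empty prefix of a
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(
                prev[j] + 1,               # delete ca
                cur[j - 1] + 1,            # insert cb
                prev[j - 1] + (ca != cb),  # substitute, free when equal
            ))
        prev = cur
    return prev[-1]


def suggest(typed: str, names: list, max_distance: int = 1) -> list:
    """Names whose leading characters are within max_distance edits of the typed text."""
    hits = []
    for name in names:
        d = edit_distance(typed, name[:len(typed)])
        if d <= max_distance:
            hits.append((d, name))
    return [name for _, name in sorted(hits)]
```

Keeping only the previous row of the table holds memory at the length of the second string.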
  • Hash Map is an implementation of a mapping interface based on a hash table. This implementation provides all optional mapping operations.
  • a fixed hash table (Hash Table) can also be used to establish a compact
  • the linked list can mount the quality scores, categories, etc. of documents, and filter by category when searching, and the search results can be sorted according to the quality scores of the documents.
  • Figure 7 shows a schematic diagram of the inverted index structure in the related technology.
  • a spatial index can be constructed from longitude, latitude, and city name, using a construction method such as Geohash or HilbertID.
  • an inverted index can also be used to store the data structure in a hash map.
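The inverted index with quality scores and categories attached to each posting can be sketched as follows. This is an illustrative assumption about the layout: postings are kept in Python lists inside a hash map keyed by term, whitespace tokenization is used for simplicity, and the class and method names are invented for the example.

```python
from collections import defaultdict


class InvertedIndex:
    """Term -> postings, where each posting carries a quality score and a category."""

    def __init__(self):
        self._postings = defaultdict(list)  # term -> [(doc_id, quality, category)]

    def add(self, doc_id: int, text: str, quality: float, category: str) -> None:
        """Index a document under each distinct lowercase term of its text."""
        for term in set(text.lower().split()):
            self._postings[term].append((doc_id, quality, category))

    def search(self, term: str, category: str = None) -> list:
        """Return matching doc ids, optionally filtered by category, best quality first."""
        hits = [p for p in self._postings.get(term.lower(), [])
                if category is None or p[2] == category]
        hits.sort(key=lambda p: p[1], reverse=True)
        return [doc_id for doc_id, _, _ in hits]
```

Filtering happens on the posting itself, so no document needs to be fetched to apply the category test or the quality ordering.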
  • S103 Determine that the point of interest is found, and increase the relevance score corresponding to the found point of interest.
  • the relevance score is used to characterize the relevance between the corresponding search result and the query keyword.
  • the query intent corresponding to the query keyword is associated with the point of interest.
  • the relevance score corresponding to the found point of interest is increased.
  • a relevance score is calculated to characterize the relevance between a search result and the query keyword; the higher the relevance score, the better the search result matches the user's real query intent and the more likely it is to be output first.
  • S104 Sort at least two search results according to the relevance score.
  • the at least two search results include the found points of interest and matching information of query keywords other than the found points of interest.
  • the search results are sorted, and the search results include the found points of interest and also include matching information for query keywords in addition to the found points of interest.
  • the distance between the point of interest and the geographic location where the terminal is located, and the historical search popularity of the point of interest can be considered to determine the sort order among these points of interest.
  • S105 Output the search results according to the sort order.
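Steps S103 to S105 can be sketched as a score boost followed by a sort. The fixed additive `POI_BOOST`, the `SearchResult` record, and the sample scores are assumptions for illustration; the source only states that the relevance score of a found point of interest is increased, not by how much or by what formula.

```python
from dataclasses import dataclass


@dataclass
class SearchResult:
    title: str
    relevance: float
    is_local_poi: bool = False  # True for a point of interest found in the terminal's area


POI_BOOST = 0.5  # hypothetical increment


def rank(results: list, boost: float = POI_BOOST) -> list:
    """Raise the score of found points of interest, then sort by score, highest first."""
    boosted = [
        SearchResult(r.title, r.relevance + (boost if r.is_local_poi else 0.0), r.is_local_poi)
        for r in results
    ]
    return sorted(boosted, key=lambda r: r.relevance, reverse=True)
```

With a base score of 0.8 for a plant article and 0.5 for a nearby hotel, the boost lifts the hotel to 1.0 and it is output first; without the boost the order is reversed.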
  • compared with the results before the relevance score is increased, the search results may change as in the following two examples:
  • Example 1: For the query keyword "Kapok", before the relevance score is increased, the search intent for "Kapok" as a plant variety is strong and the search intent for it as a hotel name is weak. However, the terminal is very close to the "Kapok Hotel" when the search is performed, so the relevance score of the search result for the point of interest "Kapok Hotel" is increased. Taken together, the distance between the terminal and the "Kapok Hotel", the context of the search, the historical search popularity of the "Kapok Hotel", and other factors turn the search intent for "Kapok" as a hotel name into a strong intent. As shown in Figure 8, the search results display the point-of-interest result "Kapok Hotel" first, and then display other matching information about "Kapok".
  • Example 2: For the query keyword "Xiao Shi", before the relevance score is increased, the search intent for "Xiao Shi" as a movie name is strong and the search intent for it as a hotel apartment name is weak. However, the terminal is very close to "Xiao Shidai" when the search is performed, so the relevance score of the search result for the point of interest "Xiao Shidai Short-term Rental Apartment" is increased. Taken together, the distance between the terminal and the apartment, the context of the search, the historical search popularity of "Xiao Shidai Short-term Rental Apartment", and other factors strengthen the search intent for "Xiao Shidai" as the name of a hotel apartment.
  • Figure 10 shows a schematic diagram of the implementation principle of the information search provided by the embodiments of the present application.
  • the query keywords are input into the scheduler.
  • the scheduler inputs the query keywords in combination with context and other information
  • the intent recognizer searches the LBS corpus database to determine whether the points of interest described in this solution exist.
  • the searcher obtains the search results for the query keywords based on the query keywords and the context information.
  • the data in the LBS corpus database is stored in a spatial index
  • other corpora are stored in an inverted index.
  • the embodiment of the application considers the geographic location of the terminal when it initiates the search, and checks whether there is a point of interest in the geographic area where the terminal is located that is related to the query keyword. If such a point of interest exists, the relevance between the point of interest and the query keyword is increased, so that when the search results are output, the point of interest is displayed to the user as a priority search result. The priority search results are therefore more likely to match the user's real search intent, which improves the efficiency of information search.
  • the embodiment of the present application also provides an information search device, which is set on a terminal such as a smart phone, a tablet, a notebook, etc., referring to FIG. 11, the information search device includes:
  • the obtaining unit 1101 is configured to obtain query keywords
  • the searching unit 1102 is configured to search for points of interest located in the geographic area where the terminal is located and related to the query keywords;
  • the adding unit 1103 is configured to determine that a point of interest is found, and increase the relevance score corresponding to the found point of interest, and the relevance score is used to represent the correlation between the corresponding search result and the query keyword;
  • the sorting unit 1104 is configured to sort at least two search results according to the relevance scores, the at least two search results including the found points of interest and matching information of query keywords other than the found points of interest;
  • the output unit 1105 is configured to output search results in a sorted order.
  • the searching unit 1102 is configured to:
  • when the determined probability is greater than the set probability value, search for points of interest in the geographic area where the terminal is located related to the query keyword.
  • the searching unit 1102 is configured to:
  • the input method is word-by-word input
  • the search for points of interest in the geographic area where the terminal is located related to the query keyword in the LBS corpus database corresponding to the input method includes:
  • the input mode is a one-time input
  • the search for the points of interest in the geographic area where the terminal is located related to the query keyword in the LBS corpus database corresponding to the input mode includes:
  • the acquiring unit 1101 is configured to:
  • the searching unit 1102 is configured to:
  • the searching for points of interest located in the geofence related to the query keyword includes:
  • the acquiring unit 1101, searching unit 1102, adding unit 1103, sorting unit 1104, and output unit 1105 can be implemented by a processor in an information search device.
  • the processor needs to run the programs stored in the memory to realize the functions of the above-mentioned program modules.
  • when the information search device provided in the above embodiment performs an information search, only the division of the above-mentioned program modules is used as an example for illustration.
  • the above-mentioned processing can be allocated to different program modules as needed. That is, the internal structure of the device is divided into different program modules to complete all or part of the processing described above.
  • the information search device and the information search method embodiments provided in the above embodiments belong to the same concept, and the specific implementation process is detailed in the method embodiments, which will not be repeated here.
  • FIG. 12 is a schematic diagram of the hardware composition structure of a terminal according to an embodiment of the application. As shown in FIG. 12, the terminal includes:
  • Communication interface 1 which can exchange information with other devices such as network equipment;
  • the processor 2 is connected to the communication interface 1 to realize information interaction with other devices, and when configured to run a computer program, it executes the information search method provided by one or more technical solutions on the terminal side.
  • the computer program is stored in the memory 3.
  • bus system is configured to implement connection and communication between these components.
  • bus system also includes a power bus, a control bus, and a status signal bus.
  • various buses are marked as bus systems in FIG. 12.
  • the memory 3 in the embodiment of the present application is configured to store various types of data to support the operation of the terminal. Examples of such data include: any computer program used to operate on the terminal.
  • the memory 3 may be a volatile memory or a non-volatile memory, and may also include both volatile and non-volatile memory.
  • the non-volatile memory can be read-only memory (ROM, Read Only Memory), programmable read-only memory (PROM, Programmable Read-Only Memory), erasable programmable read-only memory (EPROM, Erasable Programmable Read-Only Memory), electrically erasable programmable read-only memory (EEPROM, Electrically Erasable Programmable Read-Only Memory), magnetic random access memory (FRAM, ferromagnetic random access memory), flash memory (Flash Memory), magnetic surface memory, optical disc, or compact disc read-only memory (CD-ROM, Compact Disc Read-Only Memory); magnetic surface memory can be magnetic disk storage or tape storage.
  • the volatile memory may be random access memory (RAM, Random Access Memory), which is used as an external cache.
  • RAM Random Access Memory
  • SRAM Static Random Access Memory
  • SSRAM Synchronous Static Random Access Memory
  • DRAM Dynamic Random Access Memory
  • SDRAM Synchronous Dynamic Random Access Memory
  • DDRSDRAM Double Data Rate Synchronous Dynamic Random Access Memory
  • ESDRAM Enhanced Synchronous Dynamic Random Access Memory
  • SLDRAM SyncLink Dynamic Random Access Memory
  • DRRAM Direct Rambus Random Access Memory
  • the memory 3 described in the embodiment of the present application is intended to include, but is not limited to, these and any other suitable types of memory.
  • the method disclosed in the foregoing embodiment of the present application may be applied to the processor 2 or implemented by the processor 2.
  • the processor 2 may be an integrated circuit chip with signal processing capability. In the implementation process, each step of the above method can be completed by an integrated logic circuit of hardware in the processor 2 or instructions in the form of software.
  • the aforementioned processor 2 may be a general-purpose processor, a DSP, or other programmable logic devices, discrete gates or transistor logic devices, discrete hardware components, and the like.
  • the processor 2 may implement or execute the methods, steps, and logical block diagrams disclosed in the embodiments of the present application.
  • the general-purpose processor may be a microprocessor or any conventional processor.
  • the steps of the method disclosed in the embodiments of the present application can be directly embodied as being executed and completed by a hardware decoding processor, or executed by a combination of hardware and software modules in the decoding processor.
  • the software module may be located in a storage medium, and the storage medium is located in the memory 3, and the processor 2 reads the program in the memory 3, and completes the steps of the foregoing method in combination with its hardware.
  • the embodiment of the present application also provides a storage medium, that is, a computer storage medium, specifically a computer-readable storage medium, such as a memory 3 storing a computer program, which can be executed by the processor 2 of the terminal to complete the steps described in the foregoing method.
  • the computer-readable storage medium may be a memory such as FRAM, ROM, PROM, EPROM, EEPROM, Flash Memory, magnetic surface memory, optical disc, or CD-ROM.
  • the disclosed device, terminal, and method may be implemented in other ways.
  • the device embodiments described above are merely illustrative.
  • the division of the units is only a logical function division, and there may be other divisions in actual implementation, such as: multiple units or components can be combined or integrated into another system, or some features can be ignored or not implemented.
  • the coupling, direct coupling, or communication connection between the components shown or discussed may be an indirect coupling or communication connection through some interfaces, devices, or units, and may be electrical, mechanical, or in other forms.
  • the units described above as separate components may or may not be physically separate, and the components displayed as units may or may not be physical units, that is, they may be located in one place or distributed on multiple network units; Some or all of the units can be selected according to actual needs to achieve the purpose of the solution of the embodiment.
  • the functional units in the embodiments of the present application can all be integrated into one processing unit, or each unit can be individually used as a unit, or two or more units can be integrated into one unit;
  • the unit can be implemented in the form of hardware, or in the form of hardware plus software functional units.
  • the foregoing program can be stored in a computer-readable storage medium; when executed, the program performs the steps of the foregoing method embodiment; and the foregoing storage medium includes various media that can store program code, such as a mobile storage device, ROM, RAM, magnetic disk, or optical disk.
  • if the above-mentioned integrated unit of this application is implemented in the form of a software function module and sold or used as an independent product, it can also be stored in a computer-readable storage medium.
  • the computer software product is stored in a storage medium and includes several instructions for causing a computer device (which may be a personal computer, a server, or a network device, etc.) to execute all or part of the methods described in the various embodiments of the present application.
  • the aforementioned storage media include: removable storage devices, ROM, RAM, magnetic disks, or optical disks and other media that can store program codes.

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Software Systems (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

An information search method and apparatus, a terminal and a storage medium. The information search method comprises: acquiring a query keyword (S101); searching for a point of interest related to the query keyword and located in the geographical area where a terminal is located (S102); upon determining that a point of interest is found, increasing a relevance score corresponding to the found point of interest, the relevance score being used for characterizing the correlation between a corresponding search result and the query keyword (S103); according to the relevance score, sorting at least two search results, the at least two search results comprising the found point of interest and matching information of the query keyword other than the found point of interest (S104); and outputting the search results in the sorted order (S105).

Description

Information Search Method, Apparatus, Terminal, and Storage Medium
Cross-Reference to Related Applications
This application is based on and claims priority to Chinese patent application No. 201910600963.8, filed on July 4, 2019, the entire content of which is incorporated herein by reference.
Technical Field
This application belongs to the field of data processing technology, and in particular relates to an information search method, apparatus, terminal, and storage medium.
Background
A search engine provides search services and presents the user with search results related to the query keyword (Query) entered by the user.
A single query keyword can reflect several different search intents. Existing general-purpose search engines cannot resolve the user's real search intent well, and the search results they return first often fail to match that intent, which makes information search inefficient.
Summary
In view of this, the embodiments of the present application provide an information search method, apparatus, terminal, and storage medium, to at least solve the problem that the search results preferentially returned by general-purpose search engines often fail to match the user's real search intent, making information search inefficient.
The technical solutions of the embodiments of the present application are implemented as follows:
An embodiment of the present application provides an information search method, including:
acquiring a query keyword;
searching for points of interest that are related to the query keyword and located in the geographic area where the terminal is located;
upon determining that a point of interest is found, increasing the relevance score corresponding to the found point of interest, where the relevance score characterizes the correlation between the corresponding search result and the query keyword;
sorting at least two search results according to the relevance score, the at least two search results including the found point of interest and matching information of the query keyword other than the found point of interest;
outputting the search results in the sorted order.
In the above solution, the searching for points of interest that are related to the query keyword and located in the geographic area where the terminal is located includes:
determining, according to the context of the current search, the probability that the search target type is a point of interest;
when the probability is determined to be greater than a set probability value, searching for points of interest that are related to the query keyword and located in the geographic area where the terminal is located.
In the above solution, the searching for points of interest that are related to the query keyword and located in the geographic area where the terminal is located includes:
determining the input mode of the query keyword;
searching, in a location-based service (LBS, Location Based Service) corpus database corresponding to the input mode, for points of interest that are related to the query keyword and located in the geographic area where the terminal is located.
In the above solution, the input mode is character-by-character input, and the searching, in the LBS corpus database corresponding to the input mode, for points of interest that are related to the query keyword and located in the geographic area where the terminal is located includes:
searching, in an LBS corpus database stored as a prefix tree, for points of interest that are related to the query keyword and located in the geographic area where the terminal is located.
In the above solution, the input mode is one-time input, and the searching, in the LBS corpus database corresponding to the input mode, for points of interest that are related to the query keyword and located in the geographic area where the terminal is located includes:
searching, in an LBS corpus database stored as a dictionary tree, for points of interest that are related to the query keyword and located in the geographic area where the terminal is located.
In the above solution, the acquiring a query keyword includes:
performing format normalization on the keyword input to the terminal to obtain the query keyword.
In the above solution, the searching for points of interest that are related to the query keyword and located in the geographic area where the terminal is located includes:
determining the geographic location of the terminal;
determining the geofence in which the geographic location lies;
searching for points of interest that are related to the query keyword and located in the geofence.
In the above solution, the searching for points of interest that are related to the query keyword and located in the geofence where the terminal is located includes:
accessing a set database corresponding to the geofence, where the set database stores all points of interest in the geofence;
searching the set database for points of interest related to the query keyword.
An embodiment of the present application also provides an information search apparatus, including:
an acquiring unit, configured to acquire a query keyword;
a searching unit, configured to search for points of interest that are related to the query keyword and located in the geographic area where the terminal is located;
an increasing unit, configured to, upon determining that a point of interest is found, increase the relevance score corresponding to the found point of interest, where the relevance score characterizes the correlation between the corresponding search result and the query keyword;
a sorting unit, configured to sort at least two search results according to the relevance score, the at least two search results including the found point of interest and matching information of the query keyword other than the found point of interest;
an output unit, configured to output the search results in the sorted order.
An embodiment of the present application also provides a terminal, including a processor and a memory configured to store a computer program that can run on the processor,
where the processor is configured to execute the steps of any one of the foregoing information search methods when running the computer program.
An embodiment of the present application also provides a storage medium storing a computer program, where the computer program, when executed by a processor, implements the steps of any one of the foregoing information search methods.
The solution provided by the embodiments of the present application takes into account the geographic location of the terminal when it initiates the search, and checks whether there is a point of interest that is related to the query keyword and located in the geographic area where the terminal is located. If such a point of interest exists, the correlation between that point of interest and the query keyword is increased, so that the point of interest is shown to the user as a preferentially output search result. This raises the likelihood that the preferentially output search results match the user's real search intent and improves the efficiency of information search.
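The boost-then-sort idea summarized above can be sketched in Python as follows. This is only an illustrative sketch: the `SearchResult` type, the `is_local_poi` flag, and the additive boost value `POI_BOOST` are assumptions introduced here, since the text does not state how large the increase is or how results are represented.

```python
from dataclasses import dataclass


@dataclass
class SearchResult:
    title: str
    relevance: float            # relevance score against the query keyword
    is_local_poi: bool = False  # True for a POI found in the terminal's area


POI_BOOST = 0.5  # hypothetical boost; the source does not specify an amount


def rank_results(results, boost=POI_BOOST):
    """Raise the score of local POIs, then sort by descending score."""
    boosted = [
        SearchResult(
            r.title,
            r.relevance + (boost if r.is_local_poi else 0.0),
            r.is_local_poi,
        )
        for r in results
    ]
    return sorted(boosted, key=lambda r: r.relevance, reverse=True)


results = [
    SearchResult("Kapok (encyclopedia entry)", 0.8),
    SearchResult("Kapok Hotel", 0.6, is_local_poi=True),
]
ranked = rank_results(results)
```

With these made-up scores, the local point of interest moves ahead of the generic match once the boost is applied.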
Brief Description of the Drawings
FIG. 1 is a schematic flowchart of an information search method provided by an embodiment of the present application;
FIG. 2 is a schematic flowchart of searching for points of interest in the information search method provided by an embodiment of the present application;
FIG. 3 is a schematic flowchart of searching for points of interest in the information search method provided by another embodiment of the present application;
FIG. 4 is a schematic flowchart of an information search method provided by another embodiment of the present application;
FIG. 5 is a schematic flowchart of an information search method provided by another embodiment of the present application;
FIG. 6 is a schematic diagram of the data structures for storing the LBS corpus provided by an embodiment of the present application;
FIG. 7 is a schematic diagram of an inverted index structure in the related art;
FIG. 8 is an example of search result output provided by an application embodiment of the present application;
FIG. 9 is an example of search result output provided by another application embodiment;
FIG. 10 is a schematic diagram of the implementation principle of the information search method provided by an embodiment of the present application;
FIG. 11 is a structural block diagram of an information search apparatus provided by an embodiment of the present application;
FIG. 12 is a schematic diagram of the hardware structure of a terminal provided by an embodiment of the present application.
Detailed Description
FIG. 1 shows the implementation flow of the information search method provided by an embodiment of the present invention. In this embodiment, the method is executed by a mobile terminal such as a smartphone, a tablet, or a notebook computer.
Referring to FIG. 1, the information search method provided by the embodiment of the present invention includes:
S101: Acquire a query keyword.
As an embodiment of the present application, acquiring the query keyword includes:
performing format normalization on the keyword input to the terminal to obtain the query keyword.
Format normalization removes data noise from the keyword input to the terminal, which improves the efficiency of data processing during the search. Format normalization includes, but is not limited to, converting full-width characters to half-width, traditional Chinese to simplified Chinese, and uppercase to lowercase.
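A minimal Python sketch of the three normalization operations listed above is given below. It rests on stated assumptions: Unicode NFKC normalization is used for the full-width to half-width step (it also folds other compatibility characters), and the traditional-to-simplified table `T2S` is a tiny hypothetical stand-in for a full conversion dictionary.

```python
import unicodedata

# Tiny illustrative traditional-to-simplified table (hypothetical, incomplete);
# a real system would rely on a complete conversion dictionary.
T2S = str.maketrans({"飯": "饭", "館": "馆", "園": "园"})


def normalize_query(raw: str) -> str:
    """Normalize a raw keyword before it is used as the query keyword."""
    text = unicodedata.normalize("NFKC", raw)  # full-width -> half-width
    text = text.translate(T2S)                 # traditional -> simplified
    return text.lower().strip()                # uppercase -> lowercase


normalized = normalize_query("ＫＦＣ　飯館")
```

Here the full-width letters and the ideographic space are folded to their ASCII forms before the case conversion is applied.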
Here, the search engine crawls information from the Internet in advance, using specific computer programs according to certain strategies, then organizes and processes the crawled information into data structures that are easy to retrieve and stores them in a database. After the query keyword is acquired, the search engine searches the database based on the query keyword to obtain matching information for the query keyword, where each piece of matching information corresponds to one search result for the query keyword.
S102: Search for points of interest that are related to the query keyword and located in the geographic area where the terminal is located.
A point of interest (POI, Point of Interest) usually refers to a landmark on an electronic map, such as an office building, a residential community, a shop, a bus stop, a park, or a stadium. In an electronic map, each point of interest can carry four kinds of information: name, category, coordinates, and classification. Marking points of interest on the electronic map makes it easy for users to learn detailed information about road branches and surrounding buildings, and also enables navigation based on points of interest.
As described above, a point of interest is configured with the geographic location information of the place where it is located. In practice, geographic location information is usually expressed as latitude and longitude. In the embodiments of the present application, searching for points of interest requires combining the geographic location of the point of interest with the geographic area where the terminal is located. The geographic area where the terminal is located may be the area within a set radius centered on the terminal's geographic location, and the terminal's geographic location can be obtained through the terminal's positioning function. In addition, as an embodiment of the present application, the geographic area where the terminal is located can also be determined in combination with LBS. Referring to FIG. 2, searching for points of interest that are related to the query keyword and located in the geographic area where the terminal is located includes:
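For the radius-based variant of the geographic area mentioned above, one possible sketch is to filter points of interest by great-circle distance from the terminal. The haversine formula, the dictionary layout of a POI, and the sample coordinates below are assumptions for illustration; the text only says the area lies within a set radius of the terminal's location.

```python
import math

EARTH_RADIUS_M = 6_371_000  # mean Earth radius in meters


def haversine_m(lat1, lon1, lat2, lon2):
    """Great-circle distance in meters between two latitude/longitude points."""
    p1, p2 = math.radians(lat1), math.radians(lat2)
    dphi = p2 - p1
    dlmb = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(p1) * math.cos(p2) * math.sin(dlmb / 2) ** 2
    return 2 * EARTH_RADIUS_M * math.asin(math.sqrt(a))


def pois_within_radius(terminal, pois, radius_m):
    """Keep the POIs whose distance to the terminal is at most radius_m."""
    lat, lon = terminal
    return [p for p in pois if haversine_m(lat, lon, p["lat"], p["lon"]) <= radius_m]


# Hypothetical sample data
pois = [
    {"name": "Kapok Hotel", "lat": 22.5440, "lon": 114.0590},
    {"name": "Kapok Garden", "lat": 23.1291, "lon": 113.2644},
]
nearby = pois_within_radius((22.5431, 114.0579), pois, radius_m=1000)
```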
S201: Determine the geographic location of the terminal.
Here, in practice, the geographic location of the terminal can be obtained through the terminal's positioning function. The positioning techniques that can be used include, but are not limited to, satellite positioning, base-station positioning, and Wi-Fi positioning.
S202: Determine the geofence in which the geographic location lies.
Here, LBS is a value-added service that obtains the terminal's location information through the radio communication network of a mobile telecom operator or through an external positioning method, and provides corresponding services to users with the support of a Geographic Information System (GIS) platform. As one application of LBS, a virtual fence is used in the GIS platform to enclose a virtual geographic boundary, and the geographic area enclosed by that boundary is called a geofence. When the terminal enters or leaves a geofence, or moves within it, the terminal can be triggered to perform corresponding operations. When multiple virtual geofences cover a geographic area, that area has in effect been divided into a grid at the virtual level, and can be regarded as being stitched together from these geofences. In the embodiments of the present application, when the terminal is at a given geographic location, the geofence enclosing that location can be regarded as the geographic area where the terminal is located. On the one hand, this allows the geographic area where the terminal is located to be determined more efficiently; on the other hand, in subsequent development, more information services can be provided based on geofences and users' search needs.
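When the geofences tile an area like a grid, as described above, finding the terminal's geofence can be reduced to computing a grid cell index. The sketch below assumes square cells in degrees and a made-up cell size `CELL_DEG`; real geofences may be arbitrary polygons, in which case a point-in-polygon test would be needed instead.

```python
import math

CELL_DEG = 0.01  # hypothetical cell size in degrees (roughly 1 km in latitude)


def geofence_id(lat, lon, cell_deg=CELL_DEG):
    """Map a location to the id of the grid cell (geofence) that contains it."""
    return (math.floor(lat / cell_deg), math.floor(lon / cell_deg))


terminal_fence = geofence_id(22.5431, 114.0579)
neighbor_fence = geofence_id(22.5531, 114.0579)
```

Two locations share a geofence exactly when they map to the same cell id, so the lookup cost does not depend on the number of fences.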
S203: Search for points of interest that are related to the query keyword and located in the geofence.
Once the geofence corresponding to the terminal's geographic location has been determined, the geographic area where the terminal is located is in effect known, and it is possible to check whether there is a point of interest that is related to the query keyword and whose geographic location lies in that geofence. As an embodiment of the present application, the lookup can be performed on the matching information already found for the query keyword, checking whether it contains a point of interest located in the geographic area where the terminal is located; in this case the lookup must run after the matching information for the query keyword has been retrieved. As another embodiment of the present application, a dedicated thread for looking up points of interest can be started once the query keyword is acquired, so that the point-of-interest lookup and the search for matching information of the query keyword run in parallel. As shown in FIG. 3, searching for points of interest that are related to the query keyword and located in the geofence includes:
S2031: Access the set database corresponding to the geofence, where the set database stores all points of interest in the geofence.
Here, in the embodiments of the present application, the points of interest in the electronic map can be sorted out in advance: taking the geofence as the unit, an index is created for all points of interest in each geofence and stored in a database. Each geofence can have its own set database, in which the points of interest of that geofence are stored in data tables with a certain data structure, and the set database is updated and maintained periodically according to how often the points of interest change.
S2032: Search the set database for points of interest related to the query keyword.
Completing the point-of-interest lookup through a set database has two benefits. First, because each set database is associated with a geofence, the database can be located quickly from the geofence where the terminal is located, which narrows the lookup range. Second, the lookup runs only over point-of-interest information rather than over all matching information of the query keyword, which greatly improves lookup efficiency. In addition, the set database can be synchronized with the geographic information system periodically, so that points of interest are updated and maintained regularly and lookup accuracy is preserved.
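Steps S2031 and S2032 can be illustrated with an in-memory stand-in for the per-geofence databases. The `FENCE_DB` dictionary, the fence ids, and the substring match used for "related to the query keyword" are all assumptions made for this sketch; the text does not specify the storage engine or the matching rule.

```python
# Hypothetical per-geofence stores: fence id -> POIs inside that fence
FENCE_DB = {
    "fence_A": [
        {"name": "Kapok Hotel", "category": "hotel"},
        {"name": "Lotus Park", "category": "park"},
    ],
    "fence_B": [
        {"name": "Kapok Cafe", "category": "cafe"},
    ],
}


def find_pois_in_fence(fence_id, query, db=FENCE_DB):
    """S2031: pick the store for the fence; S2032: match the keyword in it."""
    candidates = db.get(fence_id, [])  # only POIs of this fence are scanned
    q = query.lower()
    return [p for p in candidates if q in p["name"].lower()]


hits = find_pois_in_fence("fence_A", "kapok")
```

Because the store is selected by fence id first, POIs from other fences (such as "Kapok Cafe" here) are never examined.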
In the embodiments of the present application, the purpose of searching for points of interest that are related to the query keyword and located in the geographic area where the terminal is located is to add, on top of the conventional search flow, a query-intent recognition step based on the query keyword and the terminal's geographic location, in order to determine whether the search request is related to the terminal's geographic location.
As an embodiment of the present application, after input of the query keyword is detected, it can first be decided whether query-intent recognition needs to be performed at all. Referring to FIG. 4, searching for points of interest that are related to the query keyword and located in the geofence includes:
S1021: Determine, according to the context of the current search, the probability that the search target type is a point of interest.
Here, the context can be understood as the environment of the current search, including the search scenario, historical search records, and so on. From the context of the current search, the probability that the search target type is a point of interest can be roughly determined. For example, if point-of-interest searches were performed in the period shortly before the current search, or if a point-of-interest result was clicked among the search results presented, the probability that the search target type is a point of interest is set to a relatively large value.
S1022: When the probability is determined to be greater than the set probability value, search for points of interest that are related to the query keyword and located in the geographic area where the terminal is located.
In the embodiments of the present application, the lookup of points of interest that are related to the query keyword and located in the geographic area where the terminal is located is triggered only when the probability that the search target type is a point of interest is greater than the set probability value. Otherwise, if that probability is very small, the conventional search flow can be carried out directly without triggering the lookup. This avoids unnecessary point-of-interest lookups and effectively improves search efficiency.
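The gating in S1021 and S1022 might look like the following sketch. The scoring rule is a toy heuristic invented for illustration: the weights 0.7 and 0.3, the threshold `POI_PROB_THRESHOLD`, and the two context signals are assumptions, since the text only says that recent POI searches or clicks should yield a larger probability.

```python
POI_PROB_THRESHOLD = 0.5  # hypothetical "set probability value"


def poi_intent_probability(recent_poi_flags, clicked_poi_result):
    """Toy estimate of the probability that the search target is a POI.

    recent_poi_flags: booleans, one per recent search, True if POI-related.
    clicked_poi_result: True if a POI result was clicked recently.
    """
    share = sum(recent_poi_flags) / len(recent_poi_flags) if recent_poi_flags else 0.0
    prob = 0.7 * share + (0.3 if clicked_poi_result else 0.0)
    return min(prob, 1.0)


def should_look_up_pois(recent_poi_flags, clicked_poi_result,
                        threshold=POI_PROB_THRESHOLD):
    """Trigger the POI lookup only when the probability exceeds the threshold."""
    return poi_intent_probability(recent_poi_flags, clicked_poi_result) > threshold


trigger = should_look_up_pois([True, True, False], clicked_poi_result=True)
skip = should_look_up_pois([False, False], clicked_poi_result=False)
```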
As an embodiment of the present application, different kinds of data structures can be built in advance for the LBS corpus and for other corpora, and data can then be read from the appropriate data structure according to the specific search scenario. The other corpora are the information, other than LBS information, that the search engine has crawled from the Internet in advance, such as web page content, documents, and pictures. As shown in FIG. 5, searching for points of interest that are related to the query keyword and located in the geofence includes:
S501: Determine the input mode of the query keyword.
When the user enters a query keyword on the terminal, two input modes are possible: character-by-character input and one-time input. Character-by-character input means entering the characters of the query keyword one after another in the order in which they appear in the keyword. For example, for the query keyword "木棉花" ("kapok"), entering the three characters "木", "棉", and "花" one by one, or entering "木棉" and then "花", both count as character-by-character input. One-time input means entering all characters of the query keyword at once, for example entering "木棉花" in a single step through a phrase input method or by copy and paste.
S502: In the LBS corpus database corresponding to the input mode, search for points of interest that are related to the query keyword and located in the geographic area where the terminal is located.
In the embodiments of the present application, the LBS corpus is stored in advance in three different LBS corpus databases using three different data structures. As shown in FIG. 6, these three data structures are the DAT (Double Array Trie) tree, the prefix tree, and the hash map.
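Steps S501 and S502 amount to classifying the input mode and then choosing a store. The sketch below assumes the terminal can observe the contents of the search box after each input event (`snapshots`), which is a hypothetical mechanism; the text does not say how the input mode is detected. The store names are placeholders.

```python
from enum import Enum


class InputMode(Enum):
    INCREMENTAL = "incremental"  # entered piece by piece
    ONE_TIME = "one_time"        # entered in a single step, e.g. by pasting


def detect_input_mode(snapshots):
    """Classify the input mode from the search-box contents after each event."""
    return InputMode.ONE_TIME if len(snapshots) <= 1 else InputMode.INCREMENTAL


# Mapping described in the text: prefix tree for character-by-character
# input, DAT (dictionary) tree for one-time input.
STORE_FOR_MODE = {
    InputMode.INCREMENTAL: "prefix_tree",
    InputMode.ONE_TIME: "dat_tree",
}


def select_store(snapshots):
    """Return the name of the LBS corpus store to query for this input."""
    return STORE_FOR_MODE[detect_input_mode(snapshots)]


typed = select_store(["木", "木棉", "木棉花"])
pasted = select_store(["木棉花"])
```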
The DAT tree, that is, the double-array trie, also called a dictionary tree, is a tree structure commonly used for word retrieval. The DAT tree exploits the common prefixes of words to save space. When a DAT tree is used for lookup, the lookup cost is independent of the number of words in the LBS corpus database and depends only on the length of the longest word in the database and the length of the query keyword, so both query efficiency and storage space are taken into account. Because a lookup in the DAT tree returns a result only when the query keyword is matched in full, as an embodiment of the present application, when the input mode of the query keyword is one-time input, the points of interest that are related to the query keyword and located in the geographic area where the terminal is located are looked up in the LBS corpus database stored as a DAT tree, so that the characteristics of the DAT tree improve the lookup efficiency.
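The exact-match behavior described above can be illustrated with a plain dictionary-based trie. Note that this is not a double-array trie: the double-array layout is a compact encoding of the same structure, and the simpler form is used here only to show why lookup cost depends on the keyword length and why a partial keyword returns nothing. The class and the POI ids are hypothetical.

```python
class Trie:
    """Plain trie; each node maps a character to a child node."""

    def __init__(self):
        self.children = {}
        self.payload = None  # POI ids stored where a complete word ends

    def insert(self, word, poi_id):
        node = self
        for ch in word:
            node = node.children.setdefault(ch, Trie())
        if node.payload is None:
            node.payload = []
        node.payload.append(poi_id)

    def lookup(self, word):
        """Return POI ids only if the whole word was inserted (exact hit)."""
        node = self
        for ch in word:  # one step per character of the keyword
            node = node.children.get(ch)
            if node is None:
                return []
        return list(node.payload or [])


trie = Trie()
trie.insert("木棉花酒店", 1)
trie.insert("木棉花公园", 2)
exact = trie.lookup("木棉花酒店")
partial = trie.lookup("木棉花")
```

The two inserted names share the nodes for "木棉花", which is the prefix sharing that saves space.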
The prefix tree is the marisa tree. In one embodiment of this application, when the input mode of the query keyword is character-by-character input, points of interest are searched for in the LBS corpus database stored as a prefix tree, in combination with edit distance, which achieves high query efficiency. For example, for the query keyword "木棉花酒店" (Kapok Hotel), once only "木棉花" has been entered, the query already returns "木棉花酒店", and the edit distance is only 2. The edit distance, also called the Levenshtein distance, is the minimum number of edit operations needed to transform one string into another; the permitted operations are replacing one character with another, inserting a character, and deleting a character. In the example above, converting "木棉花" into "木棉花酒店" requires inserting only two characters, so the edit distance is 2.
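The edit-distance example above can be checked with a short sketch. The Levenshtein function below is the standard dynamic-programming formulation; the `complete` helper, its `max_distance` parameter, and the sample names are assumptions for illustration, and the linear scan with `startswith` merely stands in for a real prefix-tree traversal.

```python
def levenshtein(a, b):
    """Minimum number of substitutions, insertions and deletions turning a into b."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # delete ca
                            curr[j - 1] + 1,      # insert cb
                            prev[j - 1] + cost))  # substitute or match
        prev = curr
    return prev[-1]


def complete(prefix, names, max_distance=2):
    """Return stored names that start with prefix, closest first."""
    hits = [(levenshtein(prefix, n), n) for n in names if n.startswith(prefix)]
    return [n for d, n in sorted(hits) if d <= max_distance]


# Hypothetical corpus entries.
names = ["木棉花酒店", "木棉花", "木棉花园林景观公司"]
```

Here `complete("木棉花", names)` keeps "木棉花" (distance 0) and "木棉花酒店" (distance 2) and drops the longer name, whose distance exceeds the threshold.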
A hash map is a hash-table-based implementation of the map interface that provides all of the optional map operations. In the embodiments of this application, a compact hash-map data structure can also be built on a fixed hash table to create an inverted index over the LBS corpus. To build the inverted index, a specific hash algorithm is applied to each token produced by word segmentation, so that every token falls into a position within an interval of length N. The document linked list attached to the inverted-index entry for a token can carry each document's quality score, category, and so on; at search time, results can be filtered by category and sorted by document quality score. FIG. 7 is a schematic diagram of an inverted index structure in the related art. For the LBS corpus, a spatial index can be built from longitude, latitude, and city name when the inverted index is created, using either the Geohash construction or the HilbertID construction.
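A minimal sketch of such an inverted index, assuming Python's built-in dictionary as the hash map. The posting layout `(doc_id, quality, category)`, the function names, and the sample documents are all hypothetical; the application does not specify them, and the spatial (Geohash or HilbertID) part is not covered here.

```python
from collections import defaultdict

# token -> posting list of (doc_id, quality_score, category)
index = defaultdict(list)


def add_document(doc_id, tokens, quality, category):
    """Attach the document to the posting list of each distinct token."""
    for token in set(tokens):
        index[token].append((doc_id, quality, category))


def search(token, category=None):
    """Filter the posting list by category, then sort by quality score, best first."""
    postings = index.get(token, [])
    if category is not None:
        postings = [p for p in postings if p[2] == category]
    return [doc_id for doc_id, _, _ in sorted(postings, key=lambda p: -p[1])]


add_document("d1", ["木棉花", "酒店"], quality=0.9, category="hotel")
add_document("d2", ["木棉花", "植物"], quality=0.7, category="plant")
add_document("d3", ["木棉花", "酒店"], quality=0.4, category="hotel")
```

Calling `search("木棉花", category="hotel")` returns only the hotel documents, ordered by their quality scores.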
In addition, in the embodiments of this application, corpora other than the LBS corpus may also be stored in a hash-map data structure by means of an inverted index.
S103: On determining that a point of interest has been found, increase the relevance score of the found point of interest, where the relevance score represents the relevance between the corresponding search result and the query keyword.
Once it is determined that a point of interest related to the query keyword exists and is located in the geographic area where the terminal is located, the query intent behind the query keyword is recognized as very likely being associated with that point of interest, and the relevance score of the found point of interest therefore needs to be raised. For every search result of the query keyword, whether it is a point of interest or other matching information, a relevance score is calculated when its output order is determined. The score represents the relevance between the search result and the query keyword: the higher the score, the better the search result matches the user's real query intent, and the more likely it is to be output first.
S104: Sort at least two search results according to the relevance score, where the at least two search results include the found point of interest and matching information of the query keyword other than the found point of interest.
After the relevance scores are determined, the search results are sorted. The search results include the found point of interest as well as matching information of the query keyword other than the found point of interest.
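Steps S103 and S104 can be sketched together as follows. The `Result` type, the `POI_BOOST` increment, and the sample scores are assumptions made for illustration; the application does not state how large the increase is or how the base scores are computed.

```python
from dataclasses import dataclass


@dataclass
class Result:
    title: str
    relevance: float
    is_poi: bool = False


POI_BOOST = 0.5  # hypothetical increment; the source gives no value


def rank(results, found_poi_titles):
    """S103: raise the score of each found POI. S104: sort by score, highest first."""
    for r in results:
        if r.is_poi and r.title in found_poi_titles:
            r.relevance += POI_BOOST
    return sorted(results, key=lambda r: r.relevance, reverse=True)


results = [
    Result("木棉花 (plant)", 0.8),
    Result("木棉花酒店", 0.5, is_poi=True),
]
ranked = rank(results, {"木棉花酒店"})
```

Before the increase the plant entry would rank first; after it, the nearby hotel moves to the top of the list.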
It should be noted that when the search results contain more than one point of interest, the order among these points of interest can be determined by jointly considering the distance between each point of interest and the terminal's geographic location and the historical search popularity of each point of interest.
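One possible way to combine distance and historical popularity is a weighted score, sketched below. The linear weighting, the weights `w_dist` and `w_heat`, the `heat` field, and the sample coordinates are assumptions; the application only says that both factors are considered. The haversine formula is the usual great-circle distance.

```python
import math


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance between two latitude/longitude points, in kilometres."""
    r = 6371.0  # mean Earth radius
    p1, p2 = math.radians(lat1), math.radians(lat2)
    dphi = p2 - p1
    dlmb = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(p1) * math.cos(p2) * math.sin(dlmb / 2) ** 2
    return 2 * r * math.asin(math.sqrt(a))


def order_pois(pois, lat, lon, w_dist=1.0, w_heat=1.0):
    """Closer and more frequently searched POIs come first."""
    def score(p):
        dist = haversine_km(lat, lon, p["lat"], p["lon"])
        return w_heat * p["heat"] - w_dist * dist
    return sorted(pois, key=score, reverse=True)


# Hypothetical POIs and terminal position.
pois = [
    {"name": "B", "lat": 22.60, "lon": 114.10, "heat": 0.9},
    {"name": "A", "lat": 22.541, "lon": 114.051, "heat": 0.2},
]
ordered = order_pois(pois, lat=22.54, lon=114.05)
```

With equal weights, the nearby but less popular POI "A" is ordered ahead of the popular but distant POI "B"; different weights would shift that balance.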
S105: Output the search results in the sorted order.
In the embodiments of this application, because the relevance score of the found point of interest has been increased, the search intent of the query keyword toward that point of interest is in effect changed from a weak intent to a strong intent. Compared with the results before the increase, the output search results may change in either of the following two ways:
Example 1: For the query keyword "木棉花" (kapok), before the relevance score is raised, the intent to search for "木棉花" as a plant species is strong, and the intent to search for it as a hotel name is weak. However, because the terminal's geographic location at the time of the search is very close to "木棉花酒店" (Kapok Hotel), the relevance score of the point-of-interest result "木棉花酒店" is raised. Taking into account the distance between the terminal and "木棉花酒店", the search context, the historical search popularity of "木棉花酒店", and other factors, the intent to search for "木棉花" as a hotel name becomes a strong intent. As shown in FIG. 8, the search result for the point of interest "木棉花酒店" is displayed first, followed by other matching information about "木棉花".
Example 2: For the query keyword "小时代" (Tiny Times), before the relevance score is raised, the intent to search for "小时代" as a movie title is strong, and the intent to search for it as the name of a hotel apartment is weak. However, because the terminal's geographic location at the time of the search is very close to "小时代短租公寓" (Tiny Times Short-term Rental Apartment), the relevance score of the point-of-interest result "小时代短租公寓" is raised. Taking into account the distance between the terminal and "小时代短租公寓", the search context, the historical search popularity of "小时代短租公寓", and other factors, the intent to search for "小时代" as the name of a hotel apartment is strengthened. As shown in FIG. 9, "小时代短租公寓" is determined to be the second-priority search result for the query keyword "小时代" and is likewise displayed near the top of the search results. Without the relevance score increase provided by the solution of the embodiments of this application, a result for "小时代短租公寓" would be unlikely to appear near the top of the search results.
FIG. 10 is a schematic diagram of the implementation principle of the information search provided by the embodiments of this application. Referring to FIG. 10, after the query keyword is obtained, it is input to a scheduler. On the one hand, the scheduler passes the query keyword, together with context and other information, to an intent recognizer, which looks up in the LBS corpus database whether a point of interest as described in this solution exists. On the other hand, a retriever obtains the search results of the query keyword based on the query keyword, the context, and other information. The data in the LBS corpus database is stored as a spatial index, while the other corpora are stored as an inverted index.
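The division of work in FIG. 10 can be sketched as three plain functions. This is only a structural illustration: the function signatures, the substring matching, and the returned dictionary are assumptions, and the real components would consult the spatial index and the inverted index rather than scan lists.

```python
def intent_recognizer(keyword, context, lbs_corpus):
    """Look up POIs in the LBS corpus; substring matching is a placeholder."""
    return [name for name in lbs_corpus if keyword in name]


def retriever(keyword, context, general_corpus):
    """Fetch other matching information; substring matching is a placeholder."""
    return [doc for doc in general_corpus if keyword in doc]


def scheduler(keyword, context, lbs_corpus, general_corpus):
    """Send the keyword and context to both components and collect their outputs."""
    return {
        "pois": intent_recognizer(keyword, context, lbs_corpus),
        "matches": retriever(keyword, context, general_corpus),
    }


# Hypothetical corpora; the context is left empty in this sketch.
lbs_corpus = ["木棉花酒店"]
general_corpus = ["木棉花 is a flowering tree", "unrelated document"]
out = scheduler("木棉花", {}, lbs_corpus, general_corpus)
```

Keeping the two lookups separate mirrors the figure: the POI lookup and the general retrieval use different stores, and their outputs are merged only when the relevance scores are compared.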
The embodiments of this application take into account the geographic location of the terminal when it initiates a search, and look for points of interest that are related to the query keyword and located in the geographic area where the terminal is located. If such a point of interest exists, its relevance to the query keyword is increased, so that it is presented to the user as a preferentially output search result. This raises the likelihood that the preferentially output search results match the user's real search intent and improves the efficiency of information search.
It should be understood that the sequence numbers of the steps in the foregoing embodiments do not imply an order of execution. The execution order of each process should be determined by its function and internal logic, and should not constitute any limitation on the implementation of the embodiments of this application.
To implement the information search method of the embodiments of this application, the embodiments of this application further provide an information search apparatus, which is provided on a terminal such as a smartphone, a tablet, or a notebook computer. Referring to FIG. 11, the information search apparatus includes:
an obtaining unit 1101, configured to obtain a query keyword;
a searching unit 1102, configured to search for points of interest that are related to the query keyword and located in the geographic area where the terminal is located;
an adding unit 1103, configured to, on determining that a point of interest has been found, increase the relevance score of the found point of interest, where the relevance score represents the relevance between the corresponding search result and the query keyword;
a sorting unit 1104, configured to sort at least two search results according to the relevance score, where the at least two search results include the found point of interest and matching information of the query keyword other than the found point of interest; and
an output unit 1105, configured to output the search results in the sorted order.
In an embodiment, the searching unit 1102 is configured to:
determine, according to the context of the current search, the probability that the search target type is a point of interest; and
when the probability is determined to be greater than a set probability value, search for points of interest that are related to the query keyword and located in the geographic area where the terminal is located.
In an embodiment, the searching unit 1102 is configured to:
determine the input mode of the query keyword; and
in the LBS corpus database corresponding to the input mode, search for points of interest that are related to the query keyword and located in the geographic area where the terminal is located.
In an embodiment, the input mode is character-by-character input, and the searching, in the LBS corpus database corresponding to the input mode, for points of interest that are related to the query keyword and located in the geographic area where the terminal is located includes:
in the LBS corpus database stored as a prefix tree, searching for points of interest that are related to the query keyword and located in the geographic area where the terminal is located.
In an embodiment, the input mode is one-shot input, and the searching, in the LBS corpus database corresponding to the input mode, for points of interest that are related to the query keyword and located in the geographic area where the terminal is located includes:
in the LBS corpus database stored as a dictionary tree, searching for points of interest that are related to the query keyword and located in the geographic area where the terminal is located.
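The selection of a database by input mode in the two embodiments above can be sketched as a simple dispatch. The `InputMode` enumeration and the `lookup` function are hypothetical names, and list scans stand in for the dictionary-tree and prefix-tree databases.

```python
from enum import Enum


class InputMode(Enum):
    INCREMENTAL = "incremental"  # character-by-character input
    ONE_SHOT = "one_shot"        # phrase input method or copy and paste


def lookup(keyword, mode, names):
    """Route the query to the lookup style that matches the input mode."""
    if mode is InputMode.ONE_SHOT:
        # Exact hit only, as with the dictionary-tree (DAT) database.
        return [n for n in names if n == keyword]
    # Prefix match, as with the prefix-tree database.
    return [n for n in names if n.startswith(keyword)]


# Hypothetical POI names.
names = ["木棉花酒店", "木棉花"]
```

A one-shot query for "木棉花" returns only the exact entry, while the same characters typed incrementally also surface "木棉花酒店" as a completion.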
In an embodiment, the obtaining unit 1101 is configured to:
perform format normalization on the keyword input to the terminal to obtain the query keyword.
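The application does not say which normalization is applied. As one plausible reading, the sketch below folds full-width characters with Unicode NFKC, collapses whitespace, and lower-cases Latin letters; the function name `normalize_keyword` and this particular choice of steps are assumptions.

```python
import unicodedata


def normalize_keyword(raw):
    """Return a canonical form of the raw keyword typed on the terminal."""
    # NFKC folds full-width letters and the ideographic space into ASCII forms.
    text = unicodedata.normalize("NFKC", raw)
    # Collapse runs of whitespace, trim both ends, and lower-case Latin letters.
    return " ".join(text.split()).lower()
```

Normalizing before lookup means that variants such as full-width and half-width spellings of the same name reach the corpus databases as a single key.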
In an embodiment, the searching unit 1102 is configured to:
determine the geographic location of the terminal;
determine the geofence in which the geographic location lies; and
search for points of interest that are related to the query keyword and located in the geofence.
In an embodiment, the searching for points of interest that are related to the query keyword and located in the geofence includes:
accessing a set database corresponding to the geofence, where the set database stores all points of interest in the geofence; and
searching the set database for points of interest related to the query keyword.
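The geofence steps above can be sketched with rectangular fences. The bounding-box shape, the `Geofence` type, the substring match, and the sample coordinates are assumptions for illustration; real geofences may be polygons, and the per-fence database would be one of the corpus databases described earlier.

```python
from dataclasses import dataclass, field


@dataclass
class Geofence:
    name: str
    min_lat: float
    max_lat: float
    min_lon: float
    max_lon: float
    pois: list = field(default_factory=list)  # stands in for the fence's set database

    def contains(self, lat, lon):
        return (self.min_lat <= lat <= self.max_lat
                and self.min_lon <= lon <= self.max_lon)


def find_pois(keyword, lat, lon, fences):
    """Locate the fence containing the terminal, then search only that fence's POIs."""
    for fence in fences:
        if fence.contains(lat, lon):
            return [p for p in fence.pois if keyword in p]
    return []


# Hypothetical fences and POIs.
fences = [
    Geofence("north", 22.6, 22.8, 113.9, 114.2, pois=["木棉花公园"]),
    Geofence("south", 22.4, 22.6, 113.9, 114.2, pois=["木棉花酒店", "小时代短租公寓"]),
]
```

Restricting the lookup to one fence keeps the candidate set small, so a keyword is only compared against POIs the terminal could plausibly be near.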
In practical applications, the obtaining unit 1101, the searching unit 1102, the adding unit 1103, the sorting unit 1104, and the output unit 1105 may be implemented by a processor in the information search apparatus. The processor runs a program stored in a memory to implement the functions of these program modules.
It should be noted that the division into the program modules above is only an example of how the information search apparatus of the foregoing embodiment performs information search. In practical applications, the processing may be assigned to different program modules as needed; that is, the internal structure of the apparatus may be divided into different program modules to complete all or part of the processing described above. In addition, the information search apparatus provided in the foregoing embodiment and the information search method embodiments belong to the same concept; for the specific implementation process, see the method embodiments, which are not repeated here.
Based on the hardware implementation of the foregoing program modules, and in order to implement the method of the embodiments of this application, the embodiments of this application further provide a terminal. FIG. 12 is a schematic diagram of the hardware structure of the terminal according to an embodiment of this application. As shown in FIG. 12, the terminal includes:
a communication interface 1, capable of exchanging information with other devices such as network devices; and
a processor 2, connected to the communication interface 1 to exchange information with other devices, and configured to execute, when running a computer program, the information search method provided by one or more of the foregoing terminal-side technical solutions. The computer program is stored in a memory 3.
In practical applications, the components in the terminal are coupled together through a bus system. It can be understood that the bus system is configured to implement connection and communication between these components. In addition to a data bus, the bus system includes a power bus, a control bus, and a status signal bus. For clarity, however, the various buses are all labeled as the bus system in FIG. 12.
The memory 3 in the embodiments of this application is configured to store various types of data to support the operation of the terminal. Examples of such data include any computer program to be run on the terminal.
It can be understood that the memory 3 may be a volatile memory or a non-volatile memory, or may include both. The non-volatile memory may be a read-only memory (ROM), a programmable read-only memory (PROM), an erasable programmable read-only memory (EPROM), an electrically erasable programmable read-only memory (EEPROM), a ferromagnetic random access memory (FRAM), a flash memory, a magnetic surface memory, an optical disc, or a compact disc read-only memory (CD-ROM); the magnetic surface memory may be a magnetic disk memory or a magnetic tape memory. The volatile memory may be a random access memory (RAM), which is used as an external cache. By way of example and not limitation, many forms of RAM are available, such as static random access memory (SRAM), synchronous static random access memory (SSRAM), dynamic random access memory (DRAM), synchronous dynamic random access memory (SDRAM), double data rate synchronous dynamic random access memory (DDRSDRAM), enhanced synchronous dynamic random access memory (ESDRAM), SyncLink dynamic random access memory (SLDRAM), and direct Rambus random access memory (DRRAM). The memory 3 described in the embodiments of this application is intended to include, but is not limited to, these and any other suitable types of memory.
The method disclosed in the foregoing embodiments of this application may be applied to the processor 2 or implemented by the processor 2. The processor 2 may be an integrated circuit chip with signal processing capability. During implementation, the steps of the foregoing method may be completed by an integrated logic circuit of hardware in the processor 2 or by instructions in the form of software. The processor 2 may be a general-purpose processor, a DSP, another programmable logic device, a discrete gate or transistor logic device, a discrete hardware component, or the like. The processor 2 can implement or execute the methods, steps, and logical block diagrams disclosed in the embodiments of this application. The general-purpose processor may be a microprocessor or any conventional processor. The steps of the method disclosed in the embodiments of this application may be directly executed and completed by a hardware decoding processor, or executed and completed by a combination of hardware and software modules in a decoding processor. The software module may be located in a storage medium in the memory 3; the processor 2 reads the program in the memory 3 and completes the steps of the foregoing method in combination with its hardware.
Optionally, when executing the program, the processor 2 implements the corresponding processes implemented by the terminal in the methods of the embodiments of this application. For brevity, details are not repeated here.
In an exemplary embodiment, the embodiments of this application further provide a storage medium, namely a computer storage medium, specifically a computer-readable storage medium, for example the memory 3 storing a computer program that can be executed by the processor 2 of the terminal to complete the steps of the foregoing method. The computer-readable storage medium may be a memory such as an FRAM, a ROM, a PROM, an EPROM, an EEPROM, a flash memory, a magnetic surface memory, an optical disc, or a CD-ROM.
In the several embodiments provided in this application, it should be understood that the disclosed apparatus, terminal, and method may be implemented in other ways. The device embodiments described above are merely illustrative. For example, the division into units is only a division by logical function, and other divisions are possible in actual implementation: multiple units or components may be combined or integrated into another system, or some features may be ignored or not executed. In addition, the coupling, direct coupling, or communication connection between the components shown or discussed may be indirect coupling or communication connection through some interfaces, devices, or units, and may be electrical, mechanical, or in other forms.
The units described above as separate components may or may not be physically separate, and the components displayed as units may or may not be physical units; that is, they may be located in one place or distributed over multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of the embodiment.
In addition, the functional units in the embodiments of this application may all be integrated into one processing unit, each unit may serve individually as a unit, or two or more units may be integrated into one unit. The integrated unit may be implemented in the form of hardware, or in the form of hardware plus software functional units.
A person of ordinary skill in the art can understand that all or part of the steps of the foregoing method embodiments may be completed by hardware under the instruction of a program. The program may be stored in a computer-readable storage medium and, when executed, performs the steps of the foregoing method embodiments. The storage medium includes various media that can store program code, such as a removable storage device, a ROM, a RAM, a magnetic disk, or an optical disc.
Alternatively, if the integrated unit of this application is implemented in the form of a software functional module and sold or used as an independent product, it may also be stored in a computer-readable storage medium. Based on this understanding, the technical solutions of the embodiments of this application, in essence or in the part that contributes to the prior art, may be embodied in the form of a software product. The computer software product is stored in a storage medium and includes several instructions that cause a computer device (which may be a personal computer, a server, a network device, or the like) to execute all or part of the methods described in the embodiments of this application. The storage medium includes various media that can store program code, such as a removable storage device, a ROM, a RAM, a magnetic disk, or an optical disc.
It should be noted that "first", "second", and the like are used to distinguish similar objects and are not necessarily used to describe a specific order or sequence.
In addition, the technical solutions described in the embodiments of this application may be combined arbitrarily, provided that they do not conflict.
The foregoing is only a specific implementation of this application, and the protection scope of this application is not limited thereto. Any change or substitution that a person skilled in the art could readily conceive within the technical scope disclosed in this application shall fall within the protection scope of this application. Therefore, the protection scope of this application shall be subject to the protection scope of the claims.

Claims (11)

  1. An information search method, comprising:
    obtaining a query keyword;
    searching for points of interest that are related to the query keyword and located in the geographic area where a terminal is located;
    on determining that a point of interest has been found, increasing the relevance score of the found point of interest, wherein the relevance score represents the relevance between the corresponding search result and the query keyword;
    sorting at least two search results according to the relevance score, wherein the at least two search results comprise the found point of interest and matching information of the query keyword other than the found point of interest; and
    outputting the search results in the sorted order.
  2. The information search method according to claim 1, wherein the searching for points of interest that are related to the query keyword and located in the geographic area where the terminal is located comprises:
    determining, according to the context of the current search, the probability that the search target type is a point of interest; and
    when the probability is determined to be greater than a set probability value, searching for points of interest that are related to the query keyword and located in the geographic area where the terminal is located.
  3. The information search method according to claim 1, wherein the searching for points of interest that are related to the query keyword and located in the geographic area where the terminal is located comprises:
    determining the input mode of the query keyword; and
    in a location-based service (LBS) corpus database corresponding to the input mode, searching for points of interest that are related to the query keyword and located in the geographic area where the terminal is located.
  4. The information search method according to claim 3, wherein the input mode is character-by-character input, and the searching, in the LBS corpus database corresponding to the input mode, for points of interest that are related to the query keyword and located in the geographic area where the terminal is located comprises:
    in the LBS corpus database stored as a prefix tree, searching for points of interest that are related to the query keyword and located in the geographic area where the terminal is located.
  5. The information search method according to claim 3, wherein the input mode is one-shot input, and the searching, in the LBS corpus database corresponding to the input mode, for points of interest that are related to the query keyword and located in the geographic area where the terminal is located comprises:
    in the LBS corpus database stored as a dictionary tree, searching for points of interest that are related to the query keyword and located in the geographic area where the terminal is located.
  6. The information search method according to claim 1, wherein the obtaining a query keyword comprises:
    performing format normalization on the keyword input to the terminal to obtain the query keyword.
  7. The information search method according to claim 1, wherein the searching for points of interest that are related to the query keyword and located in the geographic area where the terminal is located comprises:
    determining the geographic location of the terminal;
    determining the geofence in which the geographic location lies; and
    searching for points of interest that are related to the query keyword and located within the geofence.
  8. The information search method according to claim 7, wherein the searching for points of interest that are related to the query keyword and located within the geofence where the terminal is located comprises:
    accessing a preset database corresponding to the geofence, the preset database storing all points of interest within the geofence; and
    searching the preset database for points of interest related to the query keyword.
  9. An information search apparatus, comprising:
    an obtaining unit, configured to obtain a query keyword;
    a searching unit, configured to search for points of interest that are related to the query keyword and located in the geographic area where the terminal is located;
    an increasing unit, configured to, upon determining that a point of interest has been found, increase the relevance score corresponding to the found point of interest, the relevance score characterizing the correlation between the corresponding search result and the query keyword;
    a sorting unit, configured to sort at least two search results according to the relevance scores, the at least two search results comprising the found points of interest and matching information for the query keyword other than the found points of interest; and
    an output unit, configured to output the search results in the sorted order.
  10. A terminal, comprising: a processor and a memory configured to store a computer program capable of running on the processor,
    wherein the processor is configured to perform, when running the computer program, the steps of the information search method according to any one of claims 1 to 8.
  11. A storage medium storing a computer program, wherein the computer program, when executed by a processor, implements the steps of the information search method according to any one of claims 1 to 8.
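
The flow recited in claims 4 and 6 to 9 can be illustrated with a minimal Python sketch. It is not the applicant's implementation. Every name in it (`normalize_keyword`, `PrefixTree`, `Geofence`, `find_local_pois`, `rank_results`, `POI_BOOST`) is hypothetical, and it assumes rectangular geofences, NFKC folding plus lowercasing as the format normalization, and a fixed additive score boost, none of which the claims specify. Only the character-by-character input mode (prefix tree) is modeled; the one-time input mode of claim 5 is omitted.

```python
import unicodedata
from dataclasses import dataclass, field

POI_BOOST = 10.0  # hypothetical increment added to a found POI's relevance score


def normalize_keyword(raw: str) -> str:
    """Assumed format normalization: fold full-width forms, trim, lowercase."""
    return unicodedata.normalize("NFKC", raw).strip().lower()


class PrefixTree:
    """Prefix tree over POI names, for character-by-character input."""

    def __init__(self):
        self.children = {}
        self.names = []  # original names of all POIs passing through this node

    def insert(self, name: str) -> None:
        node = self
        for ch in normalize_keyword(name):
            node = node.children.setdefault(ch, PrefixTree())
            node.names.append(name)

    def search(self, prefix: str) -> list:
        node = self
        for ch in prefix:
            node = node.children.get(ch)
            if node is None:
                return []
        return list(node.names)


@dataclass
class Geofence:
    """Assumed rectangular geofence with its own database of POIs."""
    name: str
    min_lat: float
    max_lat: float
    min_lon: float
    max_lon: float
    pois: PrefixTree = field(default_factory=PrefixTree)

    def contains(self, lat: float, lon: float) -> bool:
        return (self.min_lat <= lat <= self.max_lat
                and self.min_lon <= lon <= self.max_lon)


def find_local_pois(keyword: str, lat: float, lon: float, fences: list) -> list:
    """Locate the geofence containing the terminal, then query its database."""
    for fence in fences:
        if fence.contains(lat, lon):
            return fence.pois.search(keyword)
    return []


def rank_results(raw_keyword, lat, lon, fences, other_matches):
    """Boost found POIs, then sort all results by descending relevance score.

    other_matches maps each non-POI result to its base relevance score.
    """
    keyword = normalize_keyword(raw_keyword)
    scores = dict(other_matches)
    for poi in find_local_pois(keyword, lat, lon, fences):
        scores[poi] = scores.get(poi, 0.0) + POI_BOOST
    # Ties are broken alphabetically so the order is deterministic.
    return sorted(scores, key=lambda result: (-scores[result], result))


# Usage with made-up data: two local POIs and two non-POI matches.
fence = Geofence("downtown", 22.50, 22.60, 113.90, 114.10)
fence.pois.insert("Star Cafe")
fence.pois.insert("Star Cinema")
others = {"Star Wars (film)": 8.0, "Starfish (encyclopedia entry)": 5.0}
print(rank_results(" Sta ", 22.55, 114.00, [fence], others))
```

Keeping one prefix tree per geofence mirrors the per-geofence database of claim 8: the lookup touches only the points of interest inside the terminal's own fence, so the boost never applies to a matching point of interest in another region.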
PCT/CN2020/098785 2019-07-04 2020-06-29 Information search method and apparatus, terminal and storage medium WO2021000826A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201910600963.8 2019-07-04
CN201910600963.8A CN110399568B (en) 2019-07-04 2019-07-04 Information searching method, device, terminal and storage medium

Publications (1)

Publication Number Publication Date
WO2021000826A1 (en) 2021-01-07

Family

ID=68323747

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2020/098785 WO2021000826A1 (en) 2019-07-04 2020-06-29 Information search method and apparatus, terminal and storage medium

Country Status (2)

Country Link
CN (1) CN110399568B (en)
WO (1) WO2021000826A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113961603A (en) * 2021-10-28 2022-01-21 神彩科技股份有限公司 Large-screen data display method and device, electronic equipment and storage medium

Families Citing this family (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110399568B (en) * 2019-07-04 2022-09-30 Oppo广东移动通信有限公司 Information searching method, device, terminal and storage medium
CN111026937B (en) 2019-11-13 2021-02-19 百度在线网络技术(北京)有限公司 Method, device and equipment for extracting POI name and computer storage medium
CN112923939A (en) * 2019-12-05 2021-06-08 逸驾智能科技有限公司 Method, apparatus, device and medium for navigation
CN111126334B (en) * 2019-12-31 2020-10-16 南京酷朗电子有限公司 Quick reading and processing method for technical data
CN111427983B (en) * 2020-03-23 2023-05-02 携程旅游信息技术(上海)有限公司 Service method, system, equipment and storage medium based on geographic information retrieval
CN111524518B (en) * 2020-04-01 2023-04-18 Oppo广东移动通信有限公司 Augmented reality processing method and device, storage medium and electronic equipment
CN111625701B (en) * 2020-05-25 2024-01-26 Oppo广东移动通信有限公司 Searching method, searching device, server and storage medium
CN111737574B (en) * 2020-06-19 2024-01-26 口口相传(北京)网络技术有限公司 Search information acquisition method, apparatus, computer device and readable storage medium
CN112003857A (en) * 2020-08-20 2020-11-27 深信服科技股份有限公司 Network asset collecting method, device, equipment and storage medium
CN112487159B (en) * 2020-11-19 2024-03-01 深圳市中博科创信息技术有限公司 Search method, search device, and computer-readable storage medium
CN113886685B (en) * 2021-09-23 2023-01-06 北京三快在线科技有限公司 Searching method, searching device, storage medium and electronic equipment

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130024461A1 (en) * 2011-07-22 2013-01-24 Nhn Corporation System and method for providing location-sensitive auto-complete query
US20180060438A1 (en) * 2016-08-25 2018-03-01 Linkedin Corporation Prioritizing locations for people search
CN108256125A (en) * 2018-02-26 2018-07-06 杭州数梦工场科技有限公司 Intelligent search method, device and search engine based on administrative division
CN110399568A (en) * 2019-07-04 2019-11-01 Oppo广东移动通信有限公司 Information search method, device, terminal and storage medium

Family Cites Families (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9268865B2 (en) * 2010-09-24 2016-02-23 Facebook, Inc. Ranking search results by social relevancy
CN102591867B (en) * 2011-01-07 2015-05-27 清华大学 Searching service method based on mobile device position
CN102385636A (en) * 2011-12-22 2012-03-21 陈伟 Intelligent searching method and device
CN103473235A (en) * 2012-06-07 2013-12-25 腾讯科技(深圳)有限公司 Searching method of electronic map, browsing method and system of electronic map
CN104166645A (en) * 2013-05-15 2014-11-26 大陆汽车投资(上海)有限公司 Interest point and path information obtaining method and vehicle-mounted electronic equipment
KR102128320B1 (en) * 2013-07-31 2020-06-30 팅크웨어(주) System and method for providing circumference search result
CN103902680A (en) * 2014-03-21 2014-07-02 百度在线网络技术(北京)有限公司 Search recommendation method and device
CN104239440B (en) * 2014-09-01 2017-08-25 百度在线网络技术(北京)有限公司 Search result shows method and apparatus
US10120948B2 (en) * 2015-10-13 2018-11-06 Google Llc Providing notifications based on geofencing search results
CN105279150A (en) * 2015-10-27 2016-01-27 江苏电力信息技术有限公司 Lucene full-text retrieval based Chinese word segmentation method
KR102418056B1 (en) * 2015-11-30 2022-07-07 현대오토에버 주식회사 Apparatus for searching keyword of navigation and method thereof
CN107291713B (en) * 2016-03-30 2021-02-23 阿里巴巴(中国)有限公司 Prompting method, terminal and server for searching keywords of interest points
CN105930422A (en) * 2016-04-18 2016-09-07 乐视控股(北京)有限公司 Search response method, apparatus and system
CN106354852A (en) * 2016-09-02 2017-01-25 北京百度网讯科技有限公司 Search method and device based on artificial intelligence
CN107357911A (en) * 2017-07-18 2017-11-17 北京新美互通科技有限公司 A kind of text entry method and device

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113961603A (en) * 2021-10-28 2022-01-21 神彩科技股份有限公司 Large-screen data display method and device, electronic equipment and storage medium
CN113961603B (en) * 2021-10-28 2023-03-24 神彩科技股份有限公司 Large-screen data display method and device, electronic equipment and storage medium

Also Published As

Publication number Publication date
CN110399568A (en) 2019-11-01
CN110399568B (en) 2022-09-30

Similar Documents

Publication Publication Date Title
WO2021000826A1 (en) Information search method and apparatus, terminal and storage medium
US10289717B2 (en) Semantic search apparatus and method using mobile terminal
US20120296865A1 (en) Terminal device and word stock update method thereof
US20160275148A1 (en) Database query method and device
US8682646B2 (en) Semantic relationship-based location description parsing
US20190102397A1 (en) Methods and systems for client side search ranking improvements
US8825661B2 (en) Systems and methods for two stream indexing of audio content
US8812508B2 (en) Systems and methods for extracting phases from text
CN105045852A (en) Full-text search engine system for teaching resources
CN105209858B (en) The uncertainty of business location's data disappears qi and matching
US10936819B2 (en) Query-directed discovery and alignment of collections of document passages for improving named entity disambiguation precision
WO2018201980A1 (en) Insurance policy service task allocation method, apparatus, computer device and storage device
US20080010238A1 (en) Index having short-term portion and long-term portion
US8799314B2 (en) System and method for managing information map
CN110019644B (en) Search method, apparatus and computer-readable storage medium in dialog implementation
CN113934869A (en) Database construction method, multimedia file retrieval method and device
KR20120094562A (en) System and method for searching supplementary data using keywords extraction, in translation sentence
KR20150032141A (en) Semantic searching system and method for smart device
CN103226601A (en) Method and device for image search
CN111859042A (en) Retrieval method and device and electronic equipment
KR20180044111A (en) Method for collecting data and device thereof
Chen et al. ESPM: Efficient spatial pattern matching
Fränti et al. Location-based search engine for multimedia phones
KR20070031152A (en) Methed for providing search result of input query and search engine of enabling the method
KR101263403B1 (en) Apparatus and method for keyword searching according to priority of inputted word and computer readable medium having stored thereon computer executable instruction for performing the method

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 20835214

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 20835214

Country of ref document: EP

Kind code of ref document: A1