WO2017181986A1 - 数据检索方法及*** - Google Patents

数据检索方法及*** Download PDF

Info

Publication number
WO2017181986A1
WO2017181986A1 PCT/CN2017/081357 CN2017081357W WO2017181986A1 WO 2017181986 A1 WO2017181986 A1 WO 2017181986A1 CN 2017081357 W CN2017081357 W CN 2017081357W WO 2017181986 A1 WO2017181986 A1 WO 2017181986A1
Authority
WO
WIPO (PCT)
Prior art keywords
identifier
user
link
data
message
Prior art date
Application number
PCT/CN2017/081357
Other languages
English (en)
French (fr)
Inventor
黄斌
罗勋
朱健琛
罗敏
汤善敏
刘永升
Original Assignee
腾讯科技(深圳)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 腾讯科技(深圳)有限公司 filed Critical 腾讯科技(深圳)有限公司
Publication of WO2017181986A1 publication Critical patent/WO2017181986A1/zh
Priority to US15/949,796 priority Critical patent/US10984059B2/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9535Search customisation based on user profiles and personalisation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/22Indexing; Data structures therefor; Storage structures
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/955Retrieval from the web using information identifiers, e.g. uniform resource locators [URL]
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/02Marketing; Price estimation or determination; Fundraising
    • G06Q30/0201Market modelling; Market analysis; Collecting market data
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/01Social networking

Definitions

  • the present application relates to the field of information processing technologies, and in particular, to a data retrieval method and system.
  • Existing social network systems (such as WeChat systems, instant messaging systems, etc.) all have a search function of a relational chain. Specifically, the social network system can retrieve the sharing status of a linked article by the user's friend according to the keyword input by the user. And show the sharing to the user.
  • the existing social network system generally adopts a Good Web Object (GOB) retrieval system to implement the above retrieval function.
  • the social network system stores the user and the shared data by the inverted data structure identified by the Term to the message.
  • the relationship between the articles such that when the social network system retrieves the information of the user sharing the article, the user identifier and the link identifier are used as items, and the first list and the second list of the message identifier corresponding to the user identifier and the link identifier are respectively searched.
  • the first list and the second list are subjected to an intersection operation to obtain a correspondence between the link identifier, the user identifier, and the message identifier. Since the first list and the second list are to be intersected, if the items of the two lists are relatively large, the amount of calculation of the data retrieval is relatively large.
  • the present application provides a data retrieval method and system, which realizes directly finding a message identifier in a correspondence list stored in a data retrieval system.
  • a data retrieval method comprising: receiving a data retrieval request, wherein the data retrieval request includes a user identifier and a link identifier; and if the link identifier corresponds to the user identifier, Searching for a corresponding message identifier in the preset correspondence relationship list; displaying user sharing information of the corresponding article according to the message identifier, wherein if the link identifier is included in a link identifier corresponding to the user identifier, or The user identifier is included in the user identifier corresponding to the link identifier, and then the link identifier is determined to correspond to the user identifier.
  • a data retrieval system comprising: The request receiving unit receives a data retrieval request, where the data retrieval request includes a user identifier and a link identifier; and the searching unit searches for a corresponding message identifier in the preset correspondence relationship list when the link identifier corresponds to the user identifier; a result display unit, configured to display user sharing information of the corresponding article according to the message identifier, wherein if the link identifier is included in a link identifier corresponding to the user identifier, or the user identifier is included in the link identifier In the user identifier, it is determined that the link identifier corresponds to the user identifier.
  • the data retrieval system stores the correspondence between the link identifier or the user identifier and the message identifier in the correspondence relationship list, so that the data retrieval system determines that the link identifier in the data retrieval request is associated with the user identifier, and directly corresponds to A message identifier corresponding to the link identifier or the user identifier is found in the relationship list, thereby obtaining a situation in which the user shares the article.
  • the solution provided by the present application can perform the intersection operation without performing the intersection operation, but can directly find the final result, and the calculation amount of the data retrieval is less than that of the prior art intersection operation.
  • FIG. 1 is a flowchart of a data retrieval method provided by an embodiment of the present application.
  • Example 2 is a schematic diagram of data stored in the data retrieval system of Example 1;
  • Example 3 is a schematic diagram of data storage system storage data in Example 2.
  • FIG. 4 is a schematic structural diagram of a social network system to which a data retrieval method according to an embodiment of the present application is applied;
  • FIG. 5 is a schematic diagram of a data retrieval method according to an embodiment of the present application.
  • FIG. 6 is a schematic structural diagram of a data retrieval system according to an embodiment of the present application.
  • FIG. 7 is a schematic structural diagram of another data retrieval system according to an embodiment of the present application.
  • FIG. 8 is a schematic structural diagram of hardware of a data retrieval system according to an embodiment of the present application.
  • a data retrieval method is provided, which is mainly applied to a social network system, such as a WeChat system, a microblog system, or an instant messaging system (such as a QQ system, etc.), and the client may include a client in the social network system.
  • the server the data retrieval system is loaded in the server, and the method in this embodiment is a method performed by the data retrieval system of the server.
  • the flowchart is as shown in FIG. 1 and includes:
  • Step 101 Receive a data retrieval request, where the data retrieval request includes a user identifier and a link identifier, where the user identifier is information that uniquely identifies a certain user, such as a micro signal, a microblog number, or an instant message; the link identifier is a unique identifier. Information about an article link.
  • the user can initiate a data retrieval process through a client included in the social network system.
  • the user may select at least one article link in the search interface displayed by the client, and may select the user identifier of the user, and/or at least one user identifier associated with the user to initiate a data retrieval request, and the data is
  • the information selected by the user is added to the retrieval request; thus the data retrieval system of the server receives the data retrieval request sent by the client, and performs the following steps 102 to 104.
  • the user identifier and the link identifier in the data retrieval request may be one, or multiple, respectively.
  • Step 102 Determine whether the link identifier corresponds to the user identifier. If yes, execute step 103 to indicate that the user identifier corresponds to the user sharing the link identified by the link identifier through the client; if not, the process ends. Wherein, if the link identifier is included in a link identifier corresponding to the user identifier, or the user identifier is included in a user identifier corresponding to the link identifier, determining the link identifier and the user identifier correspond.
  • Step 103 Search for a corresponding message identifier in the preset correspondence list.
  • Step 104 Display user sharing information of the corresponding article according to the message identifier, and the user sharing information may specifically include the user's comment information, the number of users sharing the article, and the like.
  • the data retrieval system of the embodiment receives the sharing request of the user for a linked article, and includes the certain in the sharing request.
  • the link identifier of the link and the user identifier of the user and may further include the user's comment information; the data retrieval system stores the information of the linked article as a message corresponding to the user identifier, and may also store the user's comment information. Go to the message and generate a message identifier for the message to uniquely identify the message.
  • the link identifier and the message identifier are then stored in the correspondence relationship list, and the link identifier is associated with the user identifier; or the user identifier and the message identifier are stored in the correspondence relationship list, and the user identifier is associated with the link identifier. Therefore, when performing this step 102, the data retrieval system can search whether the link identifier stored in the data retrieval system associated with the user identifier in the data retrieval request includes the link identifier in the data retrieval request or the search in the data retrieval system. Whether the user identifier in the request is included in the user identifier associated with the link identifier in the request, and if so, step 103 is performed.
  • the correspondence relationship list (the first correspondence relationship list) includes the link identifier and the message identifier (example one) and the correspondence relationship list (the second correspondence relationship list) includes the user identifier and the message identifier (example two).
  • the method is further introduced.
  • the data retrieval system not only needs to store the first correspondence list, but also needs to store the information of the link identifier associated with the user identifier.
  • the specific implementation process :
  • the first correspondence list may include a plurality of data blocks, each of which includes a header and a value.
  • the structure of the header may be as shown in Table 1 below, including the data length and the offset data of the next data block (next) in the file, and may also include reserve data, here
  • the data length is used to indicate the length of the data block
  • the offset data of the next data block in the file is used to indicate the location information of the next data block of the data block.
  • each data block can be as shown in Table 2 below, including the link identifier and the corresponding message identifier, wherein the link identifier can be represented by a document identifier (docid), and the message identifier is a tweet identifier.
  • the tid mainly refers to the identifier of a message published by the user through the client of the social network system, such as the identifier of the microblog message, and the WeChat message. logo.
  • the data retrieval system can find the information of the link identifier associated with the user identifier through the user identification index.
  • the structure of the user identifier index may be as shown in Table 3 below, including the offset data of the user identifier and the associated information in the file, where the user identifier may be represented by a user identification number (uni), and the associated information.
  • the offset data in the file is used to indicate location information of the information of the link identifier associated with the user identification.
  • the message identifier corresponding to the link identifier in the retrieval request is searched for.
  • the first correspondence relationship list includes a correspondence between the link identifier and the message identifier.
  • the data retrieval system may first use the filter to retrieve the data before performing step 102.
  • the plurality of user identifiers included in the filtering are filtered to filter out at least one of the user identifiers to obtain the remaining user identifiers, and the filtered user identifiers are user identifiers that do not correspond to any link identifiers; in this case, step 102 is performed.
  • the data retrieval system determines whether the link identifier in the data retrieval request is included in the link identifier corresponding to the remaining user identifier.
  • the data of the filter stored in the data retrieval system may include: a filter index and a filter value.
  • the structure of the filter index may be as shown in Table 4 below, including the link identifier and the offset data of the filter in the file, wherein the offset data of the filter in the file is used to indicate the location information of the filter. And a link identifier corresponds to a filter.
  • the structure of the filter value can be as shown in Table 5 below, including multiple data blocks, each of which includes data length and filtering policy data, wherein the data length is used to indicate the length of the filtering policy data, and the filtering policy data refers to filtering the user.
  • the identified policy data can be preset by the user.
  • the filter can be a Bloom Filter (BF).
  • the data retrieval system stores the first correspondence list and the information of the link identifier associated with the user identifier, and is stored in a permanent storage (such as a disk) of the data retrieval system.
  • the filter data can be stored in temporary storage (such as memory).
  • the data correspondence system stores the correspondence between the link identifier and the message identifier in the first correspondence relationship list, so that the data retrieval system determines that the link identifier in the data retrieval request is associated with the user identifier, A message identifier corresponding to the link identifier is found in the first correspondence relationship list, thereby obtaining a situation in which the user shares an article of a certain link.
  • the method in this embodiment can perform the intersection operation without performing the intersection operation, but can directly find the final result. Compared with the intersection operation in the prior art, the calculation amount of the data retrieval is small.
  • the data retrieval system not only needs to store the second correspondence list, but also needs to store the information of the user identifier associated with the link identifier.
  • the specific implementation process :
  • the second correspondence list may include a plurality of data blocks, each of which includes a header and a value.
  • the structure of the header may be as shown in Table 1 above, including the data length and the offset data of the next data block in the file, and may also include reserved data; the structure of the value of each data block may be as shown in Table 6 below.
  • the indication includes a user identifier and a corresponding message identifier, where the user identifier can be represented by uin, and the message identifier can be represented by tid.
  • the data retrieval system can find the information of the user identification associated with the link identifier through the link identification index.
  • the structure of the link identifier index may be as shown in Table 7 below, including offset data of the link identifier and the associated information in the file, where the link identifier may be represented by a docid, and the offset data of the associated information in the file is used to represent Location information of the information of the user identification associated with the link identifier.
  • a message identifier corresponding to the user identifier in the data retrieval request is looked up.
  • the second correspondence relationship list includes a correspondence between the user identifier and the message identifier.
  • the data retrieval system may first filter the plurality of users by using the filter before performing step 102. At least one user identifier in the identifier obtains the remaining user identifier, and the filtered user identifier is a user identifier that does not correspond to any link identifier. For example, when step 102 is performed, the data retrieval system can determine whether the remaining user identification is included in the user identification corresponding to the link identifier in the data retrieval request.
  • the structure of the filter of the present example may be as described in Table 5 in the above-described first example, and details are not described herein.
  • the structure of the identifier index stored in the data retrieval system may be as shown in Table 8 below, including the link identifier, the offset data of the associated information in the file, and the offset data of the filter in the file, where the filter is in the file.
  • the offset data is used to indicate location information of the data of the filter corresponding to the link identifier
  • the link identifier may be represented by a docid
  • the offset data of the association information in the file represents a user identifier used to represent the link identifier Location information for the information.
  • the location of the corresponding filter can be directly located by the link identifier.
  • the data retrieval system stores the information of the second correspondence list and the user identifier associated with the link identifier in a permanent storage (such as a disk) of the data retrieval system.
  • the filter data can be stored in temporary storage (such as memory).
  • the data retrieval system stores the correspondence between the user identifier and the message identifier in the second correspondence relationship list, so that the data retrieval system determines that the user identifier in the data retrieval request is associated with the link identifier,
  • the message identifier corresponding to the user identifier is directly found in the second correspondence list, so that the user shares the article.
  • the method in this embodiment can perform the intersection operation without performing the intersection operation, but can directly find the final result, compared with the prior art intersection operation. As a result, data retrieval is less computationally intensive.
  • the retrieval method adopted by the data retrieval system is similar, and the final result can be directly found from the correspondence relationship list.
  • the data retrieval system The stored data structure is an inverted data structure of the correspondence between the user identifier and the link identifier and the message identifier. Specifically, as shown in FIG. 2, the corresponding message identifier can be found by using the user identifier.
  • the data structure stored by the data retrieval system is an inverted data structure of the correspondence between the link identifier and the user identifier and the message identifier. Specifically, as shown in FIG. 3, the corresponding message identifier can be found through the link identifier.
  • the data of the link identifier and the message identifier corresponding to the user identifier will be less than the data of the user identifier and the message identifier corresponding to the link identifier.
  • almost all the user identifiers corresponding to the link identifier and the message identifier data can be stored as long as a 4 kb data block. Therefore, using the method in the first example, the amount of data stored in the data retrieval system is small.
  • the social network system of the present embodiment includes a social network client and a social network server, and a data retrieval system is loaded in the social network server.
  • the user initiates the data retrieval process of the embodiment through the social network client.
  • the user can input a keyword in the article search interface displayed by the social network client and click submit, and the social network client sends an article search request to the article server, so that the article server searches for relevant article information according to the keyword. Send the link information of the article to the social network client for display.
  • the user can select one or more links, and can select the user identifier of the user, or other user identifiers related to the user, and submit, so that the social network client sends the data retrieval request to the social network server.
  • the method according to the first example is performed by a data retrieval system in the social network server.
  • the article server is a server supported by the social network system.
  • the data retrieval system receives the data retrieval request, for example, the user identifier included in the data retrieval request is uin1, and the link identifier is docid1.
  • the data retrieval system finds a hash bucket to which the uin1 belongs, that is, a hash index, and determines a location of the hash list according to the hash list offset in the hash index; Hash the list, and determine the link associated with uin1 according to the offset of the association information of uin1 in the hash list The location of the information; and then find the information associated with the uin1, which may include the maximum docid, the minimum docid, the disk identifier (disk id) where the link identifier is located, the file id of the file, the offset of the corresponding relationship, etc. .
  • the data retrieval system determines that docid1 is within the range of the link identifier corresponding to uin1. That is, if the link identifier docid1 is included in the link identifier corresponding to the user identifier uni1, it is determined that the link identifier corresponds to the user identifier.
  • the data retrieval system may send the finally obtained tid value to the article server, and the article server obtains the user sharing information of the corresponding article according to the tid value, and sends the user sharing information to the social network client for display.
  • the present application further provides a data retrieval system, which is used to perform the method in the foregoing embodiment, and a schematic structural diagram thereof is shown in FIG.
  • the request receiving unit 10 is configured to receive a data retrieval request, where the data retrieval request includes a user identifier and a link identifier; the user identifier and the link identifier in the data retrieval request may be one, or multiple, respectively.
  • the searching unit 11 is configured to search for a corresponding message identifier in the preset correspondence relationship list if it is determined that the link identifier corresponds to the user identifier.
  • the searching unit 11 may be based on the correspondence relationship list indicating the correspondence relationship between the link identifier and the message identifier as described in Example 1 of the foregoing method embodiment or the example 2 in the above method embodiment.
  • the corresponding relationship list indicating the correspondence between the user identifier and the message identifier is used to perform a search operation. For example, if the search unit 11 determines that the link identifier included in the data retrieval request received by the request receiving unit 10 is in the link identifier corresponding to the user identifier included in the data retrieval request, the mapping between the preset representation link identifier and the message identifier is performed. In the first correspondence list, the message identifier corresponding to the link identifier included in the request is searched.
  • the determining unit 11 determines that the user identifier included in the data retrieval request received by the request receiving unit 10 is within the range of the user identifier corresponding to the link identifier in the data retrieval request, then the preset user identifier and the message identifier are included in the preset Correspondence In the second correspondence list, the message identifier corresponding to the user identifier in the request is searched.
  • the result display unit 12 is configured to display user sharing information of the corresponding article according to the message identifier searched by the searching unit 11.
  • the correspondence relationship between the link identifier or the user identifier and the message identifier is stored in the correspondence relationship list, so that when the searching unit 11 determines that the link identifier in the data retrieval request corresponds to the user identifier, A message identifier corresponding to the link identifier or the user identifier is found in the correspondence relationship list, thereby obtaining a situation in which the user shares the article.
  • the system in the present application can perform the intersection operation without performing the intersection operation, but can directly find the final result, and the calculation amount of the data retrieval is small compared with the intersection operation in the prior art.
  • the data retrieval system may include, in addition to the structure shown in FIG. 6, a filtering unit 13 and a correspondence relationship storage unit 14, wherein:
  • the filtering unit 13 is configured to filter at least one user identifier from the plurality of user identifiers to obtain remaining user identifiers by using a filter. , wherein the filtered user ID is a user ID that does not correspond to any link identifier. In this way, if the link identifier included in the data retrieval request is within the range of the link identifier corresponding to the remaining user identifiers filtered by the filtering unit 13, the search unit 11 searches in the preset first correspondence list. The link identifier corresponds to the message identifier, which reduces the workload of data retrieval.
  • the search unit 11 searches for a message corresponding to the user identifier in the preset second correspondence list. Identification reduces the workload of data retrieval.
  • the correspondence relationship storage unit 14 is configured to receive a sharing request of the user for a linked article, where the sharing request includes the link identifier of the certain link and the user identifier of the user;
  • the information of the article is stored into a message corresponding to the user identifier, and generates a message identifier of the message, associates the link identifier with the user identifier, and then stores the link identifier and the message identifier to the first
  • the user identifier and the message identifier are stored in the second correspondence list.
  • the first correspondence relationship list includes a plurality of data blocks, each data block includes a header and a value, and the header includes a data length and offset data of a next data block in a file, and each data block The value includes the link identifier and the corresponding message identifier.
  • the second correspondence list also includes a plurality of data blocks, each data block includes a header and a value, and the header includes a data length and offset data of the next data block in the file, and each data block The value includes the user ID and the corresponding message ID.
  • the search unit 11 stores the first correspondence list or the second relationship stored in the correspondence storage unit 14.
  • the corresponding message identifier is found in the series table.
  • the present application also provides a data retrieval system, the structure of which is shown in FIG. 8.
  • the data retrieval system may have relatively large differences due to different configurations or performances, and may include one or more processors (for example, one or more).
  • the memory 31 and the storage medium 32 may be short-term storage or persistent storage.
  • the program stored on storage medium 32 may include one or more modules (not shown), each of which may include a series of instruction operations to implement the methods described herein.
  • processor 30 may be arranged to communicate with storage medium 32 to perform a series of instruction operations in storage medium 32 on the data retrieval system.
  • the data retrieval system may also include one or more power sources 33, one or more wired or wireless network interfaces 34, one or more input and output interfaces 35, and/or one or more operating systems 323, such as Windows ServerTM, Mac. OS XTM, UnixTM, LinuxTM, FreeBSDTM and more.
  • the steps performed by the data retrieval system described in the above method embodiments may be based on the structure of the data retrieval system shown in FIG.
  • the program may be stored in a computer readable storage medium, and the storage medium may include: Read only memory (ROM), random access memory (RAM), magnetic or optical disk, and the like.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Accounting & Taxation (AREA)
  • Development Economics (AREA)
  • Finance (AREA)
  • Strategic Management (AREA)
  • Entrepreneurship & Innovation (AREA)
  • General Business, Economics & Management (AREA)
  • Marketing (AREA)
  • Economics (AREA)
  • Game Theory and Decision Science (AREA)
  • Software Systems (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Information Transfer Between Computers (AREA)

Abstract

本申请公开了数据检索方法及***,应用于信息处理技术领域。在本实施例的方法中,接收数据检索请求,所述数据检索请求中包括用户标识和链接标识;如果链接标识与用户标识相对应,则在预置的对应关系列表中,查找相应的消息标识;根据所述消息标识显示对应文章的用户分享信息,其中,如果所述链接标识包括在与所述用户标识对应的链接标识中,或者所述用户标识包括在与所述链接标识对应的用户标识中,则确定所述链接标识和所述用户标识相对应。本实施例中的方法可以不用执行求交操作,而是可以直接查找得到最终结果,相对于现有技术中的求交操作,数据检索的计算量较少。

Description

数据检索方法及***
本申请要求于2016年4月22日提交中国专利局、申请号为201610257637.8、发明名称为“一种数据检索方法及***”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。
技术领域
本申请涉及信息处理技术领域,特别涉及一种数据检索方法及***。
背景技术
现有的社交网络***(比如微信***、即时通信***等)都具有关系链的检索功能,具体地,社交网络***可以根据用户输入的关键字检索某一个链接的文章被该用户好友的分享情况,并将分享情况展示给用户。
现有的社交网络***一般采用优质网页数据对象(Good Object,简称GOB)检索***实现上述检索功能,具体地,社交网络***以项(Term)到消息标识的倒排数据结构储存用户与分享的文章之间的关系,这样社交网络***在检索用户分享文章的信息时,会将用户标识和链接标识作为项,分别查找得到用户标识和链接标识对应的消息标识的第一列表和第二列表,然后对第一列表和第二列表进行求交操作得到链接标识、用户标识和消息标识的对应关系。由于要对第一列表和第二列表进行求交操作,如果两个列表的项比较多,则数据检索的计算量就比较大。
发明内容
本申请提供一种数据检索方法及***,实现了在数据检索***储存的对应关系列表中直接查找到消息标识。
根据本申请的一方面,提供了一种数据检索方法,其特征在于,包括:接收数据检索请求,所述数据检索请求中包括用户标识和链接标识;如果链接标识与用户标识相对应,则在预置的对应关系列表中,查找相应的消息标识;根据所述消息标识显示对应文章的用户分享信息,其中,如果所述链接标识包括在与所述用户标识对应的链接标识中,或者所述用户标识包括在与所述链接标识对应的用户标识中,则确定所述链接标识和所述用户标识相对应。
根据本申请的另一方面,提供了一种数据检索***,其特征在于,包括: 请求接收单元,接收数据检索请求,所述数据检索请求中包括用户标识和链接标识;查找单元,在链接标识与用户标识相对应时,在预置的对应关系列表中,查找相应的消息标识;结果显示单元,根据所述消息标识显示对应文章的用户分享信息,其中,如果所述链接标识包括在与所述用户标识对应的链接标识中,或者所述用户标识包括在与所述链接标识对应的用户标识中,则确定所述链接标识和所述用户标识相对应。
根据本申请的技术方案,数据检索***中以对应关系列表储存链接标识或用户标识与消息标识的对应关系,这样数据检索***确定数据检索请求中的链接标识与用户标识相关联,则直接在对应关系列表中查找到与链接标识或用户标识对应的消息标识,从而得到用户分享文章的情况。本申请所提供的方案可以不用执行求交操作,而是可以直接查找得到最终结果,相对于现有技术中的求交操作,数据检索的计算量较少。
附图说明
为了更清楚地说明本申请实施例或现有技术中的技术方案,下面将对实施例或现有技术描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本申请的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动性的前提下,还可以根据这些附图获得其他的附图。
图1是本申请实施例提供的一种数据检索方法的流程图;
图2是示例一中数据检索***储存数据的示意图;
图3是示例二中数据检索***储存数据的示意图;
图4是应用本申请实施例的数据检索方法的社交网络***的结构示意图;
图5是应用本申请实施例提供的数据检索方法的示意图;
图6是本申请实施例提供的一种数据检索***的结构示意图;
图7是本申请实施例提供的另一种数据检索***的结构示意图;
图8是本申请实施例提供的一种数据检索***的硬件结构示意图。
具体实施方式
下面将结合本申请实施例中的附图,对本申请实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例仅仅是本申请一部分实施例,而不是全部的实施例。基于本申请中的实施例,本领域普通技术人员在没有作出创造 性劳动前提下所获得的所有其他实施例,都属于本申请保护的范围。
本申请的说明书和权利要求书及上述附图中的术语“第一”、“第二”、“第三”“第四”等(如果存在)是用于区别类似的对象,而不必用于描述特定的顺序或先后次序。应该理解这样使用的数据在适当情况下可以互换,以便这里描述的本申请的实施例例如能够以除了在这里图示或描述的那些以外的顺序实施。此外,术语“包括”和“具有”以及他们的任何变形,意图在于覆盖不排它的包含,例如,包含了一系列步骤或单元的过程、方法、***、产品或设备不必限于清楚地列出的那些步骤或单元,而是可包括没有清楚地列出的或对于这些过程、方法、产品或设备固有的其它步骤或单元。
根据本申请实施例,提供了一种数据检索方法,主要是应用于社交网络***中比如微信***、微博***或即时通信***(比如QQ***等)等,在社交网络***中可以包括客户端和服务器,在服务器中装载数据检索***,本实施例的方法是服务器的数据检索***所执行的方法,流程图如图1所示,包括:
步骤101,接收数据检索请求,数据检索请求中包括用户标识和链接标识,这里用户标识是唯一标识某一个用户的信息,比如微信号、微博号或即时通信号等;链接标识是唯一标识某一个文章链接的信息。
可以理解,用户可以通过社交网络***中包括的客户端发起数据检索流程。具体地,用户可以在客户端显示的搜索界面选择至少一个文章链接,且可以选择该用户的用户标识,和/或,与该用户相关的至少一个用户标识,以发起数据检索请求,并在数据检索请求中添加用户选择的信息;这样服务器的数据检索***会接收到客户端发送的数据检索请求,并执行如下步骤102到104。其中,在数据检索请求中的用户标识和链接标识可以分别为一个,或分别为多个。
步骤102,判断链接标识是否与用户标识相对应,如果是,则执行步骤103,说明该用户标识对应用户通过客户端分享了该链接标识所标识链接的文章;如果不是,则结束流程。其中,如果所述链接标识包括在与所述用户标识对应的链接标识中,或者所述用户标识包括在与所述链接标识对应的用户标识中,则确定所述链接标识和所述用户标识相对应。
步骤103,在预置的对应关系列表中,查找相应的消息标识。
步骤104,根据消息标识显示对应文章的用户分享信息,用户分享信息具体可以包括用户的评论信息,分享该文章的用户数等信息。
需要说明的是,当某一用户通过客户端分享某一链接的文章时,本实施例的数据检索***会接收到该用户对某一链接的文章的分享请求,在分享请求中包括该某一链接的链接标识和该用户的用户标识,且还可以包括用户的评论信息;则数据检索***会将该链接的文章的信息储存成用户标识对应的一条消息,且还可以将用户的评论信息储存到该条消息中,并生成该条消息的消息标识来唯一标识该条消息。然后将该链接标识和消息标识储存到对应关系列表中,并将该链接标识与用户标识关联;或者将用户标识与消息标识存储到对应关系列表中,并将该用户标识与链接标识关联。因此,在执行本步骤102时,数据检索***可以查找数据检索***中储存的与数据检索请求中的用户标识关联的链接标识中是否包括数据检索请求中的链接标识或者查找数据检索***中存储的与请求中的链接标识关联的用户标识中是否包括请求中的用户标识,如果有,则执行步骤103。
下面分别就对应关系列表(第一对应关系列表)包括链接标识和消息标识(示例一)以及对应关系列表(第二对应关系列表)包括用户标识和消息标识(示例二)两种情况来对本申请的方法进行进一步的介绍。
示例一
本示例中,数据检索***中不仅需要储存第一对应关系列表,还需要储存与用户标识关联的链接标识的信息。在具体实现过程中:
(1)第一对应关系列表可以包括多个数据块,每个数据块包括头部和值两部分。
其中头部的结构可以如下表1所示,包括数据长度(data length)和下一数据块(next)在文件中的偏移(offset)数据,且还可以包括预留(reserve)数据,这里数据长度用于指示该数据块的长度,下一数据块在文件中的偏移数据用于指示该数据块的下一数据块的位置信息。
表1
8bytes next offset 8bytes data length 16bytes reserve
每个数据块的值的结构可以如下表2所示,包括链接标识及对应的消息标识,其中链接标识可以用文档标识(document identity,简称docid)来表示,消息标识用推特标识(Tweet identity,简称tid)来表示,该tid主要是指用户通过社交网络***的客户端发表的一条消息的标识,比如微博消息的标识,微信消息 的标识。
表2
8bytes tid A 8bytes docid A 8bytes tid B 8bytes docid B
(2)本示例中,数据检索***可以通过用户标识索引查找到与用户标识关联的链接标识的信息。其中,用户标识索引的结构可以如下述表3所示,包括用户标识和关联信息在文件中的偏移数据,这里用户标识可以用用户识别号码(user identification number,简称uni)来表示,关联信息在文件中的偏移数据用于表示与该用户标识关联的链接标识的信息的位置信息。
表3
8bytes uin 8bytes offset
在该示例中,针对步骤103,在预置的第一对应关系列表中,查找检索请求中的链接标识对应的消息标识。其中,第一对应关系列表包括链接标识与消息标识的对应关系。
在一个具体的示例中,为了进一步地减少数据检索的工作量,如果上述数据检索请求中包括的用户标识有多个,则在执行步骤102之前,数据检索***可以先使用过滤器对数据检索请求中包括的多个用户标识进行过滤以滤除其中的至少一个用户标识得到剩余的用户标识,滤除掉的用户标识为未对应任何链接标识的用户标识;在这种情况下,在执行步骤102时,数据检索***会判断数据检索请求中的链接标识是否包括在剩余的用户标识对应的链接标识中。
具体地,数据检索***中储存的过滤器的数据可以包括:过滤器索引和过滤器值。
针对示例一,所述过滤器索引的结构可以如下表4所示,包括链接标识和过滤器在文件中的偏移数据,其中过滤器在文件中的偏移数据用于指示过滤器的位置信息,且一个链接标识对应一个过滤器。
表4
8bytes docid 8bytes offset
过滤器值的结构可以如下表5所示,包括多个数据块,每个数据块中包括数据长度和过滤策略数据,其中数据长度用于指示过滤策略数据的长度,过滤策略数据是指过滤用户标识的策略数据,可以由用户预置。这里过滤器可以是布隆过滤器(Bloom Filter,简称BF)。
表5
4bytes length X bytes BF data A 4bytes length X bytes BF data B
另外,需要说明的是,本示例中数据检索***在储存第一对应关系列表和与用户标识关联的链接标识的信息时,都是储存于该数据检索***的永久性存储器(比如磁盘)中,而过滤器的数据可以储存在暂时性存储器(比如内存)中。
可见,在应用本示例的方法中,数据检索***中以第一对应关系列表储存链接标识与消息标识的对应关系,这样数据检索***确定数据检索请求中的链接标识与用户标识相关联,则直接在第一对应关系列表中查找到与该链接标识对应的消息标识,从而得到用户分享某一链接的文章的情况。本实施例中的方法可以不用执行求交操作,而是可以直接查找得到最终结果,相对于现有技术中的求交操作,数据检索的计算量较少。
示例二
在本示例中,数据检索***中不仅需要储存第二对应关系列表,还需要储存与链接标识关联的用户标识的信息。在具体实现过程中:
(1)第二对应关系列表可以包括多个数据块,每个数据块包括头部和值两部分。
其中头部的结构可以如上述表1所示,包括数据长度和下一数据块在文件中的偏移数据,且还可以包括预留数据;每个数据块的值的结构可以如下表6所示,包括用户标识及对应的消息标识,这里用户标识可以用uin来表示,消息标识可以用tid来表示。
表6
8bytes tid A 8bytes uin A 8bytes tid B 8bytes uin B
(2)本示例中,数据检索***可以通过链接标识索引查找到与链接标识关联的用户标识的信息。其中,链接标识索引的结构可以如下述表7所示,包括链接标识和关联信息在文件中的偏移数据,这里链接标识可以用docid来表示,关联信息在文件中的偏移数据用于表示与该链接标识关联的用户标识的信息的位置信息。
表7
8bytes docid 8bytes offset
在该示例中,针对步骤103,在预置的第二对应关系列表中,查找与数据检索请求中的用户标识对应的消息标识。其中,第二对应关系列表包括用户标识与消息标识的对应关系。
根据本申请的一方面,为了进一步地减少数据检索的工作量,如果上述数据检索请求中包括的用户标识有多个,则在执行步骤102之前,数据检索***可以先使用过滤器过滤多个用户标识中的至少一个用户标识得到剩余的用户标识,滤除的用户标识为未对应任何链接标识的用户标识。例如在执行步骤102时,数据检索***可以判断剩余的用户标识是否包括在数据检索请求中的链接标识对应的用户标识中。
具体地,本示例的过滤器的结构可以如上述示例一中的表5中所述,在此不进行赘述。
另外,数据检索***中储存的标识索引的结构可以如下表8所示,包括链接标识、关联信息在文件中的偏移数据和过滤器在文件中的偏移数据,这里过滤器在文件中的偏移数据用于表示与该链接标识对应的过滤器的数据的位置信息,链接标识可以用docid来表示,而关联信息在文件中的偏移数据表示用于表示与该链接标识关联的用户标识的信息的位置信息。
表8
8bytes docid 8bytes offset 8bytes BF offset
在这种情况下,直接可以通过链接标识就可以定位到对应过滤器的位置。
另外,需要说明的是,本示例中数据检索***在储存第二对应关系列表和与链接标识关联的用户标识的信息时,都是储存于该数据检索***的永久性存储器(比如磁盘)中,而过滤器的数据可以储存在暂时性存储器(比如内存)中。
可见,在本示例二对应的方法中,数据检索***中以第二对应关系列表储存用户标识与消息标识的对应关系,这样数据检索***确定数据检索请求中的用户标识与链接标识相关联,则直接在第二对应关系列表中查找到该用户标识对应的消息标识,从而得到用户分享文章的情况。本实施例中的方法可以不用执行求交操作,而是可以直接查找得到最终结果,相对于现有技术中的求交操 作,数据检索的计算量较少。
需要说明的是,示例二与上述示例一相比,数据检索***所采用的检索方法类似,都是从对应关系列表中可以直接查找到最终结果,不同的是,上述示例一中,数据检索***储存的数据结构是用户标识到链接标识与消息标识的对应关系的倒排数据结构,具体可以如图2所示,这样通过用户标识可以找到对应的消息标识。而在示例二中,数据检索***储存的数据结构是链接标识到用户标识与消息标识的对应关系的倒排数据结构,具体可以如图3所示,这样通过链接标识可以找到对应的消息标识。其中由于一个用户所分享的文章数量,相比一篇文章被分享的用户数量较小,则用户标识对应的链接标识与消息标识的数据,会比链接标识对应的用户标识与消息标识的数据少很多,一般情况下,几乎所有用户标识对应的链接标识与消息标识的数据只要一个4kb的数据块就可以存放,因此,采用示例一中的方法,数据检索***中储存的数据量要小。
作为示例,下面将参考图4以一个具体的实施例说明根据示例一的数据检索***。如图4所示,本实施例的社交网络***包括社交网络客户端和社交网络服务器,在社交网络服务器中装载数据检索***。
(1)用户通过社交网络客户端发起本实施例的数据检索流程。
具体地,用户可以在社交网络客户端显示的文章搜索界面中输入关键字并点击提交,则社交网络客户端会向文章服务器发送文章搜索请求,这样文章服务器会根据关键字搜索相关的文章信息,并将文章的链接信息发送给社交网络客户端进行显示。
这样用户可以选择一个或多个链接,且可以选择该用户的用户标识,或与该用户相关的其它用户标识,并提交,这样社交网络客户端会向社交网络服务器发送所述的数据检索请求,并由社交网络服务器中的数据检索***执行根据示例一的方法。
其中文章服务器是社交网络***所支撑的一个服务器。
(2)数据检索***接收到数据检索请求,例如在数据检索请求中包括的用户标识为uin1,链接标识为docid1。
参考图5所示,数据检索***会查找到该uin1所属的哈希(hash)桶即哈希索引,并根据哈希索引中的哈希列表偏移确定哈希列表所在的位置;然后查找到哈希列表,并根据哈希列表中uin1的关联信息的偏移确定与uin1关联的链接标 识的信息的位置;再查找到与该uin1关联的信息,具体可以包括最大docid,最小docid,链接标识所在磁盘标识符(disk id),所在文件的标识(file id)及对应关系偏移等。
如果上述docid1在最大docid与最小docid的范围内,且在与uin1关联的链接标识的信息中匹配到该docid1,则数据检索***确定docid1在uin1对应的链接标识的范围内。也即,如果链接标识docid1包括在与所述用户标识uni1对应的链接标识中,则确定所述链接标识和所述用户标识相对应。然后根据其中的对应关系偏移确定链接标识和消息标识的第一对应关系的值的位置,然后根据该位置查找到第一对应关系的值即tid+doc的值;从tid+doc的值中取出docid1对应的tid值。当然,本领域技术人员应该理解,如果用户标识uni1包括在与所述链接标识docid1对应的用户标识中,也可确定所述链接标识和所述用户标识相对应。
(3)数据检索***可以将最终得到的tid值发送给文章服务器,由文章服务器根据该tid值得到对应文章的用户分享信息,并将用户分享信息发送给社交网络客户端进行显示。
本申请还提供一种数据检索***,该***用于执行上述实施例中的方法,其结构示意图如图6所示,具体可以包括:
请求接收单元10,用于接收数据检索请求,所述数据检索请求中包括用户标识和链接标识;在数据检索请求中的用户标识和链接标识可以分别为一个,或分别为多个。
查找单元11,用于如果确定链接标识与用户标识相对应,在预置的对应关系列表中,查找相应的消息标识。
根据本发明的另一方面,所述查找单元11可以基于如上述方法实施例中的示例一所述的表示链接标识与消息标识的对应关系的对应关系列表或者如上述方法实施例中的示例二所述的表示用户标识与消息标识的对应关系的对应关系列表来进行查找操作。例如,查找单元11如果确定请求接收单元10接收的数据检索请求中包括的链接标识在数据检索请求中包括的用户标识对应的链接标识中,则在预置的表示链接标识与消息标识的对应关系的第一对应关系列表中,查找请求中包括的链接标识对应的消息标识。再如,查找单元11如果确定请求接收单元10接收的数据检索请求中包括的用户标识在数据检索请求中的链接标识对应的用户标识的范围内,则在预置的表示用户标识与消息标识的对应关系 的第二对应关系列表中,查找请求中的用户标识对应的消息标识。
结果显示单元12,用于根据所述查找单元11查找的消息标识显示对应文章的用户分享信息。
可见,在本实施例的数据检索***中以对应关系列表储存链接标识或用户标识与消息标识的对应关系,这样当查找单元11确定数据检索请求中的链接标识与用户标识相对应,则直接在对应关系列表中查找到与链接标识或用户标识对应的消息标识,从而得到用户分享文章的情况。本申请中的***可以不用执行求交操作,而是可以直接查找得到最终结果,相对于现有技术中的求交操作,数据检索的计算量较少。
参考图7所示,在一个具体的实施例中,数据检索***除了可以包括如图6所示的结构外,还可以包括过滤单元13和对应关系储存单元14,其中:
当所述请求接收单元10接收的数据检索请求中包括的用户标识有多个时,过滤单元13,用于使用过滤器在所述多个用户标识中过滤掉至少一个用户标识得到剩余的用户标识,其中滤除的用户标识为未对应任何链接标识的用户标识。这样,所述查找单元11如果确定数据检索请求中包括的链接标识在所述过滤单元13过滤得到的剩余用户标识对应的链接标识的范围内,则在预置的第一对应关系列表中,查找所述链接标识对应的消息标识,减少了数据检索的工作量。或者,查找单元11如果确定所述过滤单元13过滤得到的剩余用户标识在所述链接标识对应的用户标识的范围内,在预置的第二对应关系列表中,查找所述用户标识对应的消息标识,减少了数据检索的工作量。
对应关系储存单元14,用于接收到用户对某一链接的文章的分享请求,所述分享请求中包括所述某一链接的链接标识和所述用户的用户标识;将所述某一链接的文章的信息储存成所述用户标识对应的一条消息,并生成所述消息的消息标识,将所述链接标识与所述用户标识关联,然后将所述链接标识和消息标识储存到所述第一对应关系列表中,或者将用户标识与消息标识存储到第二对应关系列表中。其中,所述第一对应关系列表中包括多个数据块,每个数据块包括头部和值,所述头部包括数据长度和下一数据块在文件中的偏移数据,每个数据块的值包括链接标识及对应的消息标识。而所述第二对应关系列表中也包括多个数据块,每个数据块包括头部和值,所述头部包括数据长度和下一数据块在文件中的偏移数据,每个数据块的值包括用户标识及对应的消息标识。这样查找单元11会在对应关系储存单元14储存的第一对应关系列表或者第二关 系列表中查找到对应的消息标识。
本申请还提供一种数据检索***,其结构示意图如图8所示,该数据检索***可因配置或性能不同而产生比较大的差异,可以包括一个或一个以上处理器(例如一个或一个以***处理器(central processing units,CPU))30和存储器31,一个或一个以上存储应用程序321或数据322的存储介质32(例如一个或一个以上海量存储设备)。其中,存储器31和存储介质32可以是短暂存储或持久存储。存储在存储介质32的程序可以包括一个或一个以上模块(图示没标出),每个模块可以包括实施本申请所述的方法的一系列指令操作。更进一步地,处理器30可以设置为与存储介质32通信,在数据检索***上执行存储介质32中的一系列指令操作。
数据检索***还可以包括一个或一个以上电源33,一个或一个以上有线或无线网络接口34,一个或一个以上输入输出接口35,和/或,一个或一个以上操作***323,例如Windows ServerTM,Mac OS XTM,UnixTM,LinuxTM,FreeBSDTM等等。
上述方法实施例中所述的由数据检索***所执行的步骤可以基于该图8所示的数据检索***的结构。
本领域普通技术人员可以理解上述实施例的各种方法中的全部或部分步骤是可以通过程序来指令相关的硬件来完成,该程序可以存储于一计算机可读存储介质中,存储介质可以包括:只读存储器(ROM)、随机存取存储器RAM)、磁盘或光盘等。
以上对本申请实施例所提供的数据检索方法及***进行了详细介绍,本文中应用了具体个例对本申请的原理及实施方式进行了阐述,以上实施例的说明只是用于帮助理解本申请的方法及其核心思想;同时,对于本领域的一般技术人员,依据本申请的思想,在具体实施方式及应用范围上均会有改变之处,综上所述,本说明书内容不应理解为对本申请的限制。

Claims (10)

  1. 一种数据检索方法,其特征在于,包括:
    接收数据检索请求,所述数据检索请求中包括用户标识和链接标识;
    如果链接标识与用户标识相对应,则在预置的对应关系列表中,查找相应的消息标识;
    根据所述消息标识显示对应文章的用户分享信息,
    其中,如果所述链接标识包括在与所述用户标识对应的链接标识中,或者所述用户标识包括在与所述链接标识对应的用户标识中,则确定所述链接标识和所述用户标识相对应。
  2. 如权利要求1所述的方法,其特征在于,所述数据检索请求中包括的用户标识有多个,在如果链接标识与用户标识相对应,则在预置的对应关系列表中,查找相应的消息标识之前,所述方法还包括:
    使用过滤器在所述多个用户标识中滤除至少一个用户标识得到剩余的用户标识,滤除的用户标识为未对应任何链接标识的用户标识;
    其中,所述如果链接标识与用户标识相对应,则在预置的对应关系列表中,查找相应的消息标识的步骤包括:如果链接标识与剩余的用户标识相对应,则在预置的对应关系列表中,查找相应的消息标识。
  3. 如权利要求2所述的方法,其特征在于,
    所述过滤器的数据包括:过滤器索引和过滤器值,其中,所述过滤器索引中包括链接标识和所述过滤器在文件中的偏移数据,所述过滤器值中包括多个数据块,每个数据块中包括数据长度和过滤策略数据。
  4. 如权利要求1至3任一项所述的方法,其特征在于,所述方法还包括:
    接收用户对某一链接的文章的分享请求,所述分享请求中包括所述链接的链接标识和所述用户的用户标识;
    将所述链接的文章的信息储存成所述用户标识对应的一条消息,并生成所述消息的消息标识,将所述链接标识和与所述用户标识关联,并将所述链接标识和消息标识储存到对应关系列表中以便于在对应关系列表中查找相应的消息标识;
    其中,所述对应关系列表中包括多个数据块,每个数据块包括头部和值,所述头部包括数据长度和下一数据块在文件中的偏移数据,每个数据块的值包括链接标识及对应的消息标识。
  5. 如权利要求1至3任一项所述的方法,其特征在于,所述方法还包括:
    接收用户对某一链接的文章的分享请求,所述分享请求中包括所述链接的链接标识和所述用户的用户标识;
    将所述链接的文章的信息储存成所述用户标识对应的一条消息,生成所述消息的消息标识,将所述链接标识和与所述用户标识关联,并将所述链接标识和消息标识储存到所述对应关系列表中,以便于在所述对应关系列表中查找相应的消息标识;
    其中,所述对应关系列表包括多个数据块,每个数据块包括头部和值,所述头部包括数据长度和下一数据块在文件中的偏移数据,每个数据块的值包括用户标识及对应的消息标识。
  6. 一种数据检索***,其特征在于,包括:
    请求接收单元,接收数据检索请求,所述数据检索请求中包括用户标识和链接标识;
    查找单元,在链接标识与用户标识相对应时,在预置的对应关系列表中,查找相应的消息标识;
    结果显示单元,根据所述消息标识显示对应文章的用户分享信息,
    其中,如果所述链接标识包括在与所述用户标识对应的链接标识中,或者所述用户标识包括在与所述链接标识对应的用户标识中,则确定所述链接标识和所述用户标识相对应。
  7. 如权利要求6所述的***,其特征在于,所述请求接收单元接收到的数据检索请求中包括的用户标识有多个,
    所述***还包括:
    过滤单元,使用过滤器在所述多个用户标识中滤除至少一个用户标识得到剩余的用户标识,滤除的用户标识为未对应任何链接标识的用户标识;
    其中,所述查找单元在所述链接标识与剩余的用户标识相对应时,在预置的对应关系列表中,查找相应的消息标识。
  8. 如权利要求7所述的***,其特征在于,
    所述过滤器的数据包括:过滤器索引和过滤器值,其中,所述过滤器索引中包括链接标识和所述过滤器在文件中的偏移数据,所述过滤器值中包括多个数据块,每个数据块中包括数据长度和过滤策略数据。
  9. 如权利要求6至8任一项所述的***,其特征在于,所述***还包括:
    对应关系存储单元,用于:
    接收用户对某一链接的文章的分享请求,所述分享请求中包括所述链接的链接标识和所述用户的用户标识;
    将所述链接的文章的信息储存成所述用户标识对应的一条消息,生成所述消息的消息标识,并将所述链接标识和与所述用户标识关联;
    将所述链接标识和消息标识储存到对应关系列表中,以便于在对应关系列表中查找相应的消息标识;
    其中,所述对应关系列表中包括多个数据块,每个数据块包括头部和值,所述头部包括数据长度和下一数据块在文件中的偏移数据,每个数据块的值包括链接标识及对应的消息标识。
  10. 如权利要求6至8任一项所述的方法,其特征在于,所述***还包括:
    对应关系存储单元,用于:
    接收用户对某一链接的文章的分享请求,所述分享请求中包括所述链接的链接标识和所述用户的用户标识;
    将所述链接的文章的信息储存成所述用户标识对应的一条消息,生成所述消息的消息标识,并将所述链接标识和与所述用户标识关联;将所述链接标识和消息标识储存到所述对应关系列表中,以便于在所述对应关系列表中查找相应的消息标识;
    其中,所述对应关系列表包括多个数据块,每个数据块包括头部和值,所述头部包括数据长度和下一数据块在文件中的偏移数据,每个数据块的值包括用户标识及对应的消息标识。
PCT/CN2017/081357 2016-04-22 2017-04-21 数据检索方法及*** WO2017181986A1 (zh)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US15/949,796 US10984059B2 (en) 2016-04-22 2018-04-10 Data retrieval method and system

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201610257637.8 2016-04-22
CN201610257637.8A CN107305570B (zh) 2016-04-22 2016-04-22 一种数据检索方法及***

Related Child Applications (1)

Application Number Title Priority Date Filing Date
US15/949,796 Continuation US10984059B2 (en) 2016-04-22 2018-04-10 Data retrieval method and system

Publications (1)

Publication Number Publication Date
WO2017181986A1 true WO2017181986A1 (zh) 2017-10-26

Family

ID=60116581

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2017/081357 WO2017181986A1 (zh) 2016-04-22 2017-04-21 数据检索方法及***

Country Status (3)

Country Link
US (1) US10984059B2 (zh)
CN (1) CN107305570B (zh)
WO (1) WO2017181986A1 (zh)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114338271A (zh) * 2021-12-29 2022-04-12 深圳市元征科技股份有限公司 数据处理方法、装置、终端及计算机可读存储介质

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20170228779A1 (en) * 2011-10-12 2017-08-10 MarketChorus, Inc. Audience identification and interest determination from targeted topical activity system and method
CN110225143B (zh) * 2018-09-05 2020-10-09 宁波海上鲜信息技术有限公司 一种智慧安防数据***
CN112241672B (zh) * 2019-07-19 2024-05-03 杭州海康威视数字技术股份有限公司 一种身份数据的关联方法、装置、电子设备及存储介质
US11062403B2 (en) * 2019-09-23 2021-07-13 Arthur Ray Kerr System and method for customizable link between two entities

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101977223A (zh) * 2010-10-28 2011-02-16 四川长虹电器股份有限公司 基于网络电视的信息交互方法
CN102521253A (zh) * 2011-11-17 2012-06-27 西安交通大学 一种可视化的网络用户多媒体管理方法
CN103034706A (zh) * 2012-12-07 2013-04-10 合一网络技术(北京)有限公司 一种基于信息网络的视频推荐列表的产生装置及方法
CN104639593A (zh) * 2013-11-15 2015-05-20 腾讯科技(深圳)有限公司 信息分享方法、***、浏览器及服务器

Family Cites Families (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH11351896A (ja) * 1998-06-12 1999-12-24 Matsushita Electric Ind Co Ltd 経路誘導装置とその方法、及び地図データベース記録媒体
US20080126476A1 (en) * 2004-08-04 2008-05-29 Nicholas Frank C Method and System for the Creating, Managing, and Delivery of Enhanced Feed Formatted Content
US20080177724A1 (en) * 2006-12-29 2008-07-24 Nokia Corporation Method and System for Indicating Links in a Document
US20130215116A1 (en) * 2008-03-21 2013-08-22 Dressbot, Inc. System and Method for Collaborative Shopping, Business and Entertainment
KR20130009754A (ko) * 2010-02-01 2013-01-23 점프탭, 인크. 통합형 광고 시스템
US8688774B2 (en) * 2010-08-09 2014-04-01 Eustace Prince Isidore Method, system, and devices for facilitating real-time social and business interactions/networking
US10120877B2 (en) * 2011-09-15 2018-11-06 Stephan HEATH Broad and alternative category clustering of the same, similar or different categories in social/geo/promo link promotional data sets for end user display of interactive ad links, coupons, mobile coupons, promotions and sale of products, goods and services integrated with 3D spatial geomapping and mobile mapping and social networking
CN102799693A (zh) * 2012-08-23 2012-11-28 上海广电电子科技有限公司 视频内容推送方法及***
US20150180980A1 (en) * 2013-12-24 2015-06-25 Dropbox, Inc. Systems and methods for preserving shared virtual spaces on a content management system
US10318753B2 (en) * 2014-06-30 2019-06-11 Vescel, Llc Semantic data structure and method
US10481763B2 (en) * 2014-09-17 2019-11-19 Lett.rs LLC. Mobile stamp creation and management for digital communications
CN105100223B (zh) * 2015-07-01 2019-10-15 北京奇虎科技有限公司 基于云端存储的文件分享方法、装置与***

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101977223A (zh) * 2010-10-28 2011-02-16 四川长虹电器股份有限公司 基于网络电视的信息交互方法
CN102521253A (zh) * 2011-11-17 2012-06-27 西安交通大学 一种可视化的网络用户多媒体管理方法
CN103034706A (zh) * 2012-12-07 2013-04-10 合一网络技术(北京)有限公司 一种基于信息网络的视频推荐列表的产生装置及方法
CN104639593A (zh) * 2013-11-15 2015-05-20 腾讯科技(深圳)有限公司 信息分享方法、***、浏览器及服务器

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114338271A (zh) * 2021-12-29 2022-04-12 深圳市元征科技股份有限公司 数据处理方法、装置、终端及计算机可读存储介质

Also Published As

Publication number Publication date
US10984059B2 (en) 2021-04-20
US20180232452A1 (en) 2018-08-16
CN107305570B (zh) 2020-05-12
CN107305570A (zh) 2017-10-31

Similar Documents

Publication Publication Date Title
WO2017181986A1 (zh) 数据检索方法及***
CN110168523B (zh) 改变监测跨图查询
US20210184996A1 (en) Searchable peer-to-peer system through instant messaging based topic indexes
JP2016189214A5 (zh)
US10050866B2 (en) Parallel top-K simple shortest paths discovery
US9867011B2 (en) Identifying proximity history of computer devices
US20190188220A1 (en) Systems and methods for clustering items associated with interactions
WO2019100921A1 (zh) 消息推送方法及装置
US10320937B2 (en) Community notification based on profile update
JP2017211999A (ja) グラフ・データのコンポジットタームインデックス
AU2013397053B2 (en) Content sharing method and social synchronization apparatus
CN111083179B (zh) 物联网云平台、基于物联网云平台的设备交互方法及装置
WO2017143930A1 (zh) 一种搜索结果排序方法及其设备
JP6263547B2 (ja) ユーザ記述子を使用するソーシャル・ネットワーキング・システム・ユーザへのサードパーティ通信
WO2014151507A1 (en) System and method for omni-channel identity matching
WO2014015808A1 (zh) 推送信息的方法和装置
CN111723161A (zh) 一种数据处理方法、装置及设备
US10191948B2 (en) Joins and aggregations on massive graphs using large-scale graph processing
WO2017118318A1 (zh) 一种数据存储与业务处理的方法及装置
WO2022247868A1 (zh) 一种多子图匹配方法、装置及设备
CN106844420B (zh) 基于社交网络和大数据分析的用户分组方法及装置
CN107086917A (zh) 一种并行化和结构化公钥可搜索的加密方法
CN111352938B (zh) 数据处理方法、计算机设备及存储介质
CN113377817A (zh) 数据处理方法、***、设备及存储介质
CN107818117B (zh) 一种数据表的建立方法、在线查询方法及相关装置

Legal Events

Date Code Title Description
NENP Non-entry into the national phase

Ref country code: DE

121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 17785470

Country of ref document: EP

Kind code of ref document: A1

122 Ep: pct application non-entry in european phase

Ref document number: 17785470

Country of ref document: EP

Kind code of ref document: A1