CN104636368B - Data retrieval method, device and server - Google Patents

Data retrieval method, device and server Download PDF

Info

Publication number
CN104636368B
CN104636368B CN201310557261.9A CN201310557261A CN104636368B CN 104636368 B CN104636368 B CN 104636368B CN 201310557261 A CN201310557261 A CN 201310557261A CN 104636368 B CN104636368 B CN 104636368B
Authority
CN
China
Prior art keywords
mark
user
retrieval
data object
keyword
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201310557261.9A
Other languages
Chinese (zh)
Other versions
CN104636368A (en
Inventor
朱健琛
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tencent Technology Shenzhen Co Ltd
Original Assignee
Tencent Technology Shenzhen Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tencent Technology Shenzhen Co Ltd filed Critical Tencent Technology Shenzhen Co Ltd
Priority to CN201310557261.9A priority Critical patent/CN104636368B/en
Publication of CN104636368A publication Critical patent/CN104636368A/en
Application granted granted Critical
Publication of CN104636368B publication Critical patent/CN104636368B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/951Indexing; Web crawling techniques

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a kind of data retrieval method, device and servers, belong to field of data retrieval.The described method includes: determining the mark of relational users according to the mark of retrieval user;The mark of the corresponding data object of the relational users is inquired by the mark of the relational users;According to the identification retrieval target object of the corresponding data object of the relational users inquired;The target object is exported as search result.The present invention is by determining and retrieving user the mark with the relational users of specific type relationship, inquire the mark of the corresponding data object of the relational users, according to the identification retrieval target object of the corresponding data object of the relational users inquired, it solves in the prior art when the data object quantity in Web Community including the keyword is more, the problem of server needs to consume a large amount of process resource and bandwidth resources when exporting search result achievees the purpose that improve system performance.

Description

Data retrieval method, device and server
Technical field
The present invention relates to field of data retrieval, in particular to a kind of data retrieval method, device and server.
Background technique
With the continuous development of Internet application, it is normal to be increasingly becoming many day for human beings for Web Community, such as blog or forum Important component in life.During using Web Community, it includes specific content that user, which usually needs to retrieve, Data object, such as retrieval include the log of keyword.
In existing data retrieval method, server obtains each data object in Web Community in advance, to each data Object establishes inverted index, includes between each keyword and the data object for including each keyword in the inverted index Corresponding relationship;Server receive user transmission retrieval request when, according to the keyword carried in retrieval request from fall arrange Directly retrieval includes all data objects of the keyword and exports search result in index.
In the implementation of the present invention, the inventor finds that the existing technology has at least the following problems:
In existing data retrieval method, server directly retrieve include the keyword all data objects, work as net When data object quantity in network community comprising the keyword is more, server needs to consume a large amount of place when exporting search result Resource and bandwidth resources are managed, system performance is influenced.
Summary of the invention
In order to solve in the prior art when the data object quantity in Web Community including the keyword is more, server The problem of needing to consume a large amount of process resource and bandwidth resources when exporting search result, the embodiment of the invention provides a kind of numbers According to search method, device and server.The technical solution is as follows:
On the one hand, a kind of data retrieval method is provided, which comprises
The retrieval request that retrieval user sends is received, includes keyword and the retrieval user in the retrieval request Mark;
The mark of relational users is determined according to the mark of the retrieval user, the relational users are and the retrieval user User with specific type relationship;
The corresponding data object of the relational users is inquired by the mark of the relational users and the first corresponding relationship Mark, include the mark of the mark data object corresponding with each user of each user in first corresponding relationship Corresponding relationship between knowledge, wherein in first corresponding relationship, the mark compression of the corresponding data object of each user For the numerical value of regular length;
Retrieval includes the mark of the data object of the keyword;
By the mark of the corresponding data object of the relational users and include the keyword data object mark Intersection be retrieved as the mark of target object, the target object be in the corresponding data object of the relational users with the pass The data object that keyword matches;
The target object is exported as search result.
On the other hand, a kind of data searcher is provided, described device includes:
Request receiving module includes key in the retrieval request for receiving the retrieval request of retrieval user's transmission The mark of word and the retrieval user;
Determining module is identified, for determining that the mark of relational users, the relationship are used according to the mark of the retrieval user Family is the user for having specific type relationship with the retrieval user;
Enquiry module, for the mark and the first corresponding relationship inquiry relational users pair by the relational users The mark for the data object answered, include in first corresponding relationship each user mark it is corresponding with each user Corresponding relationship between the mark of data object, wherein in first corresponding relationship, the corresponding data pair of each user The numerical value of the mark boil down to regular length of elephant;
Retrieval module, for retrieve include the keyword data object mark, the relational users are corresponding Data object mark and include that the intersection of mark of data object of the keyword is retrieved as the mark of target object, The target object is the data object to match in the corresponding data object of the relational users with the keyword;
Output module, the target object for retrieving the retrieval module export as search result.
In another aspect, providing a kind of server, the server includes:
Data searcher as described in terms of above-mentioned.
Also on the one hand, a kind of computer readable storage medium is provided, is stored thereon with program, described program is by processor It executes to realize above-mentioned data retrieval method.
Technical solution provided in an embodiment of the present invention has the benefit that
By determining the mark for the relational users that there is specific type relationship with retrieval user, it is corresponding to inquire the relational users The mark of data object solved according to the identification retrieval target object of the corresponding data object of the relational users inquired In the prior art when the data object quantity in Web Community including the keyword is more, when server exports search result The problem of needing to consume a large amount of process resource and bandwidth resources achievees the purpose that improve system performance.
Detailed description of the invention
To describe the technical solutions in the embodiments of the present invention more clearly, make required in being described below to embodiment Attached drawing is briefly described, it should be apparent that, drawings in the following description are only some embodiments of the invention, for For those of ordinary skill in the art, without creative efforts, it can also be obtained according to these attached drawings other Attached drawing.
Fig. 1 is the method flow diagram of data retrieval method provided by one embodiment of the present invention;
Fig. 2 be another embodiment of the present invention provides data retrieval method method flow diagram;
Fig. 3 be another embodiment of the present invention provides the first corresponding relationship storage schematic diagram;
Fig. 4 is the structure drawing of device of data searcher provided by one embodiment of the present invention;
Fig. 5 be another embodiment of the present invention provides data searcher structure drawing of device;
Fig. 6 is the equipment composition figure of server provided by one embodiment of the present invention.
Specific embodiment
To make the object, technical solutions and advantages of the present invention clearer, below in conjunction with attached drawing to embodiment party of the present invention Formula is described in further detail.
Referring to Figure 1, it illustrates the method flow diagram of data retrieval method provided by one embodiment of the present invention, the numbers Can be used for retrieving in community network according to search method includes designated key word, and is closed with retrieval user with specific type The corresponding data object of the user of system.The data retrieval method may include:
Step 101, the retrieval request that retrieval user sends is received, includes that keyword and the retrieval are used in the retrieval request The mark at family;
Step 102, the mark of relational users is determined according to the mark of retrieval user, which is to use with the retrieval Family has the user of specific type relationship;
Step 103, the mark of the corresponding data object of the relational users is inquired by the mark of the relational users;
Step 104, according to the identification retrieval target object of the corresponding data object of the relational users inquired, the target Object is the data object to match in the corresponding data object of the relational users with the keyword;
Step 105, which is exported as search result.
In conclusion data retrieval method provided in an embodiment of the present invention, has certain kinds with retrieval user by determining The mark of the relational users of type relationship inquires the mark of the corresponding data object of the relational users, according to the relationship inquired The identification retrieval target object of the corresponding data object of user, solve in the prior art when in Web Community include the keyword Data object quantity it is more when, when server output search result, needs to consume asking for a large amount of process resource and bandwidth resources Topic achievees the purpose that improve system performance.
Refer to Fig. 2, it illustrates another embodiment of the present invention provides data retrieval method method flow diagram, the number Can be used for retrieving in community network according to search method includes designated key word, and is closed with retrieval user with specific type The corresponding data object of the user of system.The data retrieval method may include:
Step 201, the retrieval request that retrieval user sends is received, includes keyword and retrieval user in the retrieval request Mark;
When browsing community network and initiation in the personal terminal such as retrieval client personal computer, smart phone or tablet computer When retrieval, terminal to server sends retrieval request, includes the pass retrieving user and inputting in the search box in the retrieval request Keyword, in addition, also including the mark of retrieval user in the retrieval request.
Step 202, the mark of relational users is determined according to the mark of retrieval user;
Wherein, which is the user for having specific type relationship with retrieval user, and server can be deposited in advance Store up the relation chain for characterizing the relationship type between each user.The specific type relationship can be friend relation, Tong Xueguan System, Peer Relationships, old boy network etc..
It can wrap the mark containing retrieval user and the corresponding relationship between the mark of the relational users in the relation chain; Server can be according to corresponding between the mark of retrieval user and the mark and the mark of the relational users of retrieval user Relationship determines the mark of the relational users.
The relation chain can be made of user identifier list corresponding with the mark of each user.With the specific type relationship For friend relation, relationship in the preparatory traverses network community of server between each user, and respectively each user Corresponding buddy list is established, includes the mark of the user with friend relation in buddy list;Server is by each user Mark buddy list corresponding with each user corresponding be stored as relation chain.For example, the good friend of user A includes user B, user C, user D and user E, then be stored in relation chain user A mark with include user B, user C, user D and user E Corresponding relationship between identification list.When user A initiates to retrieve, server can be according to the user A for including in retrieval request Mark the mark of user B, user C, user D and user E is inquired from relation chain.
Step 203, pass through the mark of the corresponding data object of mark inquiry relational users of relational users;
Wherein, before the corresponding data object of inquiry relational users, the mark of the available each data object of server The mark of knowledge and the corresponding user of each data object, according to the mark pair of the corresponding user of each data object The mark of each data object is clustered, to establish the first corresponding relationship and store;It include each in first corresponding relationship Corresponding relationship between the mark of the mark data object corresponding with each user of a user.Server can be used by relationship The mark of the mark at family and the corresponding data object of first corresponding relationship inquiry relational users.
In order to improve storage efficiency and search efficiency, server can be deposited in conjunction with the form of Hash table and/or conflict list The first corresponding relationship is stored up, specifically, can correspond to and close by first by hash function before server stores the first corresponding relationship The numerical value of the mark boil down to regular length of the corresponding data object of each user in system, when storing the first corresponding relationship, by this The mark for each user for including in first corresponding relationship is divided at least two user identifier groups, and each user identifier group is corresponding with Respective storage address, the corresponding user for storing the user identifier group and including in the corresponding storage address of user identifier group The mark of mark and data object corresponding with the mark of user that the user identifier group includes.It is used when server by relationship The mark of the mark at family and the corresponding data object of first corresponding relationship inquiry relational users, can determine relational users Corresponding user identifier group is identified, and inquires the corresponding number of relational users in the determining corresponding storage address of user identifier group According to the mark of object.
For example, the storage method of the first corresponding relationship can be as shown in figure 3, where it is assumed that the registration in Web Community be used Family sum is 10000, and each user's is identified as by 1~10000 User ID (IDentity, identity number incremented by successively Code), data object is the document that each user delivers in Web Community, the ID for being identified as document of data object, Fig. 3 first Each element in column is stored with the corresponding storage address of each User ID in the corresponding section ID respectively, deposits in the storage address Contain the ID of user and the ID of document that the user delivers in Web Community, and the ID of each document is pressed in a hashed form The document id of different length, is converted to the value of equal length, to improve storage efficiency by contracting storage.Assuming that server is according to inspection The ID for the user A for including in rope request is preferred true after the ID for inquiring user B, user C, user D and user E in relation chain Determine the section ID locating for the ID of user B, user C, user D and user E, then inquires user B, user respectively from the corresponding section ID C, the ID for the document that user D and user E are delivered in Web Community is greatly improved without carrying out global search to each user Recall precision.
Step 204, according to the identification retrieval target object of the corresponding data object of the relational users inquired;
Wherein, which is the data object to match in the corresponding data object of relational users with the keyword; Server can retrieve include the keyword data object mark, and by the mark of the corresponding data object of the relational users The intersection of mark of data object for knowing and include the keyword is retrieved as the mark of the target object.
Specifically, server can establish Inverted List to data object each in Web Community in advance, the Inverted List In include each keyword and include each keyword data object mark between corresponding relationship, server according to this Keyword and Inverted List retrieval include the mark of the data object of the keyword.
Further, server to the mark of the corresponding data object of the relational users inquired in step 203 and comprising The mark of the data object of the keyword carries out intersection operation, seeks the mark of the corresponding data object of relational users and comprising this The result of the intersection of the mark of the data object of keyword, intersection operation is the mark of target object.
For example, server can be by the user B inquired in step 203, user C, user D and user E in Web Community In the ID of document that delivers be retrieved as first identifier collection, and by according to inverted index inquire include keyword document ID is retrieved as second identifier collection, seeks common ground to first identifier collection and second identifier collection, and the document id in the intersection acquired is obtained It is taken as the ID of destination document.
Step 205, the authority information for obtaining the target object detects the retrieval according to the authority information of the target object and uses Whether family has the permission for checking the target object;
By taking data object is the document that each user delivers in Web Community as an example, user can set to document is delivered It checks permission, the authority information of the available destination document of server, and whether user is retrieved according to the permission detection got With the permission for checking corresponding destination document.
Step 206, if testing result is that retrieval user has the permission for checking the target object, by the target object Output is search result.
For example, by taking data object is the document delivered in Web Community of each user as an example, user can will be delivered Document setup is that good friend is allowed to check, owner is allowed to check or only allow oneself to check.Server can determine that retrieval is used Family is to allow that good friend checks and the document that allows owner to check is with checking permission to priority assignation, and by corresponding document Output is search result.
In the method shown in the embodiment of the present invention, server pre-establishes the relation chain and use for search relationship user In the first corresponding relationship of the mark of the corresponding data object of search relationship user, by taking user identifier is the ID of user as an example, clothes Business device only needs the ID of ID search relationship user from relation chain according to retrieval user, and according to the ID of relational users from first The mark of the corresponding data object of search relationship user in corresponding relationship, further according to the mark of the corresponding data object of relational users And the mark of keyword retrieval target object.In Web Community, user identifier can there are many, such as User ID, user Name or user's pet name etc. when the function that system needs extension to be retrieved based on other types of user identifier, for example need When the function that extension is retrieved based on user name, it is only necessary to a set of corresponding relation chain and the first corresponding relationship are additionally set up, The system expandability is strong.
In addition, server first passes through the mark of relation chain search relationship user in method shown in the embodiment of the present invention, then According to the ID of the mark of relational users and the document of the first corresponding relationship search relationship user, finally further according to the text of relational users Document that ID and inverted index the retrieval relational users of shelves are delivered, including keyword, wherein relation chain, first, which correspond to, closes It is mutually indepedent between system and inverted index three, the flexibility of searching system is improved, can effectively be avoided in retrieving Hot issue.
In conclusion data retrieval method provided in an embodiment of the present invention, has certain kinds with retrieval user by determining The mark of the relational users of type relationship inquires the mark of the corresponding data object of the relational users, according to the relationship inquired The identification retrieval target object of the corresponding data object of user, solve in the prior art when in Web Community include the keyword Data object quantity it is more when, when server output search result, needs to consume asking for a large amount of process resource and bandwidth resources Topic achievees the purpose that improve system performance;Secondly, data retrieval method provided in an embodiment of the present invention, by combining Hash table The first corresponding relationship is stored with the form of conflict list, can be improved storage efficiency and search efficiency;In addition, the embodiment of the present invention The data retrieval method of offer is only needed when the function that system needs extension to be retrieved based on other types of user identifier It additionally sets up a set of corresponding relation chain and the first corresponding relationship, the system expandability is strong;Finally, the embodiment of the present invention provides Data retrieval method, it is mutually indepedent between relation chain, the first corresponding relationship and inverted index three, improve searching system Flexibility, can effectively avoid the hot issue in retrieving.
Fig. 4 is referred to, it illustrates the structure drawing of device of data searcher provided by one embodiment of the present invention, the numbers The a part that can be implemented as web community server according to retrieval device, includes designated key for retrieving in community network Word, and there is the corresponding data object of the user of specific type relationship with retrieval user.The data searcher may include:
Request receiving module 301 includes related in the retrieval request for receiving the retrieval request of retrieval user's transmission The mark of keyword and the retrieval user;
Determining module 302 is identified, for determining the mark of relational users, the relationship according to the mark of the retrieval user User is the user for having specific type relationship with the retrieval user;
The mark of enquiry module 303, the relational users for being determined by the mark determining module 302 inquires the pass It is the mark of the corresponding data object of user;
Retrieval module 304, the corresponding data object of the relational users for being inquired according to the enquiry module 303 Identification retrieval target object, the target object be the corresponding data object of the relational users in the keyword phase The data object matched;
Output module 305, the target object for retrieving the retrieval module 304 export as search result.
In conclusion data searcher provided in an embodiment of the present invention, has certain kinds with retrieval user by determining The mark of the relational users of type relationship inquires the mark of the corresponding data object of the relational users, according to the relationship inquired The identification retrieval target object of the corresponding data object of user, solve in the prior art when in Web Community include the keyword Data object quantity it is more when, when server output search result, needs to consume asking for a large amount of process resource and bandwidth resources Topic achievees the purpose that improve system performance.
Refer to Fig. 5, it illustrates another embodiment of the present invention provides data searcher structure drawing of device, the number The a part that can be implemented as web community server according to retrieval device, includes designated key for retrieving in community network Word, and there is the corresponding data object of the user of specific type relationship with retrieval user.The data searcher may include:
Request receiving module 401 includes related in the retrieval request for receiving the retrieval request of retrieval user's transmission The mark of keyword and the retrieval user;
Determining module 402 is identified, for determining the mark of relational users, the relationship according to the mark of the retrieval user User is the user for having specific type relationship with the retrieval user;
The mark of enquiry module 403, the relational users for being determined by the mark determining module 402 inquires the pass It is the mark of the corresponding data object of user;
Retrieval module 404, the corresponding data object of the relational users for being inquired according to the enquiry module 403 Identification retrieval target object, the target object be the corresponding data object of the relational users in the keyword phase The data object matched;
Output module 405, the target object for retrieving the retrieval module 404 export as search result.
Described device further include:
First obtains module 406, in the pass that the enquiry module 403 is determined by the mark determining module 402 Be that the mark of user is inquired before the mark of the corresponding data object of the relational users, obtain the mark of each data object with And the mark of the corresponding user of each data object;
Cluster module 407, each data object for getting according to the first acquisition module 406 are corresponding The mark of user clusters the mark of each data object, to establish the first corresponding relationship, the described first corresponding pass It include the corresponding relationship between the mark of the mark data object corresponding with each user of each user in system;
Memory module 408, for storing first corresponding relationship;
The enquiry module 403, for by the relational users mark and first corresponding relationship inquire institute State the mark of the corresponding data object of relational users.
Described device further include:
Compression module 409, for passing through Hash letter before the memory module 408 stores first corresponding relationship Number will be in first corresponding relationship, the numerical value for identifying boil down to regular length of the corresponding data object of each user.
The memory module 408, comprising:
Grouped element 408a, for the mark for each user for including in first corresponding relationship to be divided at least two User identifier group, the user identifier group are corresponding with respective storage address;
Storage unit 408b stores the user identifier for corresponding in the corresponding storage address of the user identifier group The mark of the mark for the user that group includes and data object corresponding with the mark of user that the user identifier group includes;
The enquiry module 403, comprising:
Determination unit 403a, the corresponding user identifier group of mark for determining the relational users;
Query unit 403b, for being looked into the corresponding storage address of user identifier group that the determination unit 403a is determined Ask the mark of the corresponding data object of the relational users.
The retrieval module 404, comprising:
Retrieval unit 404a, for retrieve include the keyword data object mark;
Acquiring unit 404b, for by the mark of the corresponding data object of the relational users and include the keyword The intersection of mark of data object be retrieved as the mark of the target object.
Described device further include:
Second obtains module 410, for including the data object of the keyword in retrieval unit 404a retrieval Mark before, obtain the mark and each data object each keyword for including of each data object;
Module 411 is established in list, include each keyword for establishing Inverted List, in the Inverted List with It include the corresponding relationship between the mark of the data object of each keyword;
The retrieval unit 404a, for including the keyword according to the keyword and Inverted List retrieval Data object mark.
Described device further include:
Third obtains module 412, and the target object for retrieving the retrieval module in the output module 405 is defeated Before being out search result, the authority information of the target object is obtained;
Detection module 413, for being looked into according to whether the authority information of the target object detection retrieval user has See the permission of the target object;
The output module 405 is checked if the testing result for the detection module 413 has for the retrieval user The permission of the target object then executes described the step of exporting the target object for search result.
In conclusion data searcher provided in an embodiment of the present invention, has certain kinds with retrieval user by determining The mark of the relational users of type relationship inquires the mark of the corresponding data object of the relational users, according to the relationship inquired The identification retrieval target object of the corresponding data object of user, solve in the prior art when in Web Community include the keyword Data object quantity it is more when, when server output search result, needs to consume asking for a large amount of process resource and bandwidth resources Topic achievees the purpose that improve system performance;Secondly, data searcher provided in an embodiment of the present invention, by combining Hash table The first corresponding relationship is stored with the form of conflict list, can be improved storage efficiency and search efficiency;In addition, the embodiment of the present invention The data searcher of offer is only needed when the function that system needs extension to be retrieved based on other types of user identifier It additionally sets up a set of corresponding relation chain and the first corresponding relationship, the system expandability is strong.
Fig. 6 is referred to, constitutes and schemes it illustrates the equipment of server provided by one embodiment of the present invention, which can To include designated key word for retrieving in community network, and have the user of specific type relationship corresponding with retrieval user Data object.The server may include:
Such as above-mentioned Fig. 4 or data searcher shown in fig. 5 500.
In conclusion server provided in an embodiment of the present invention, has specific type relationship with retrieval user by determining Relational users mark, the mark of the corresponding data object of the relational users is inquired, according to the relational users pair inquired The identification retrieval target object for the data object answered solves in the prior art when the data in Web Community including the keyword When number of objects is more, the problem of needing to consume a large amount of process resource and bandwidth resources when server output search result, reach To the purpose for improving system performance;Secondly, server provided in an embodiment of the present invention, by combining Hash table and conflict list Form stores the first corresponding relationship, can be improved storage efficiency and search efficiency;In addition, service provided in an embodiment of the present invention Device, when the function that system needs extension to be retrieved based on other types of user identifier, it is only necessary to which it is a set of right to additionally set up The relation chain and the first corresponding relationship, the system expandability answered are strong.
It should be understood that data searcher provided by the above embodiment is when retrieving data object, only with above-mentioned each The division progress of functional module can according to need and for example, in practical application by above-mentioned function distribution by different function Energy module is completed, i.e., the internal structure of device is divided into different functional modules, to complete whole described above or portion Divide function.In addition, data searcher provided by the above embodiment and data retrieval method embodiment belong to same design, have Body realizes that process is detailed in embodiment of the method, and which is not described herein again.
The serial number of the above embodiments of the invention is only for description, does not represent the advantages or disadvantages of the embodiments.
Those of ordinary skill in the art will appreciate that realizing that all or part of the steps of above-described embodiment can pass through hardware It completes, relevant hardware can also be instructed to complete by program, the program can store in a kind of computer-readable In storage medium, storage medium mentioned above can be read-only memory, disk or CD etc..
The foregoing is merely presently preferred embodiments of the present invention, is not intended to limit the invention, it is all in spirit of the invention and Within principle, any modification, equivalent replacement, improvement and so on be should all be included in the protection scope of the present invention.

Claims (14)

1. a kind of data retrieval method, which is characterized in that the described method includes:
The retrieval request that retrieval user sends is received, includes the mark of keyword and the retrieval user in the retrieval request Know;
The mark of relational users is determined according to the mark of the retrieval user, the relational users are to have with the retrieval user The user of specific type relationship;
The mark of the corresponding data object of the relational users is inquired by the mark and the first corresponding relationship of the relational users Know, include in first corresponding relationship mark data object corresponding with each user of each user mark it Between corresponding relationship, wherein in first corresponding relationship, the mark boil down to of the corresponding data object of each user is solid The numerical value of measured length;
Retrieval includes the mark of the data object of the keyword;
By the mark of the corresponding data object of the relational users and include the keyword data object mark friendship Collection is retrieved as the mark of target object, the target object be in the corresponding data object of the relational users with the keyword The data object to match;
The target object is exported as search result.
2. the method according to claim 1, wherein described pass through the mark of the relational users and first pair Before the mark for answering the corresponding data object of relational users described in relational query, the method also includes:
Obtain the mark of each data object and the mark of the corresponding user of each data object;
The mark of each data object is clustered according to the mark of each corresponding user of data object, To establish first corresponding relationship;
Store first corresponding relationship.
3. according to the method described in claim 2, it is characterized in that, before the storage first corresponding relationship, the side Method further include:
By hash function by first corresponding relationship, the mark boil down to institute of the corresponding data object of each user State the numerical value of regular length.
4. according to the method described in claim 2, it is characterized in that, storage first corresponding relationship, comprising:
The mark for each user for including in first corresponding relationship is divided at least two user identifier groups, user's mark Knowledge group is corresponding with respective storage address;
In the corresponding storage address of the user identifier group the corresponding mark for storing the user that the user identifier group includes with And the mark of data object corresponding with the mark of user that the user identifier group includes;
It is described that the corresponding data of the relational users are inquired by the mark of the relational users and first corresponding relationship The mark of object, comprising:
Determine the corresponding user identifier group of the mark of the relational users;
The mark of the corresponding data object of the relational users is inquired in the determining corresponding storage address of user identifier group.
5. the method according to claim 1, wherein the retrieval includes the data object of the keyword Before mark, the method also includes:
Each keyword that the mark and each data object for obtaining each data object include;
It establishes Inverted List, includes each keyword in the Inverted List and the data that include each keyword Corresponding relationship between the mark of object;
The retrieval includes the mark of the data object of the keyword, comprising:
It include the mark of the data object of the keyword according to the keyword and Inverted List retrieval.
6. method according to any one of claims 1 to 5, which is characterized in that described to export the target object for retrieval As a result before, the method also includes:
Obtain the authority information of the target object;
Detect whether the retrieval user has the permission for checking the target object according to the authority information of the target object;
If testing result is that the retrieval user has the permission for checking the target object, execute described by the target pair As output be search result the step of.
7. a kind of data searcher, which is characterized in that described device includes:
Request receiving module, the retrieval request sent for receiving retrieval user, include in the retrieval request keyword and The mark of the retrieval user;
Determining module is identified, for determining the mark of relational users according to the mark of the retrieval user, the relational users are There is the user of specific type relationship with the retrieval user;
Enquiry module, for by the relational users mark and the first corresponding relationship to inquire the relational users corresponding The mark of data object includes the mark data corresponding with each user of each user in first corresponding relationship Corresponding relationship between the mark of object, wherein in first corresponding relationship, the corresponding data object of each user Identify the numerical value of boil down to regular length;
Retrieval module, for retrieve include the keyword data object mark, by the corresponding number of the relational users According to object mark and include that the intersection of mark of data object of the keyword is retrieved as the mark of target object, it is described Target object is the data object to match in the corresponding data object of the relational users with the keyword;
Output module, the target object for retrieving the retrieval module export as search result.
8. device according to claim 7, which is characterized in that described device further include:
First obtains module, and the mark of the relational users for being determined in the enquiry module by the mark determining module is looked into Before the mark for asking the corresponding data object of the relational users, the mark and each data of each data object are obtained The mark of the corresponding user of object;
Cluster module, the mark of the corresponding user of each data object for being got according to the first acquisition module The mark of each data object is clustered, to establish first corresponding relationship;
Memory module, for storing first corresponding relationship.
9. device according to claim 8, which is characterized in that described device further include:
Compression module, for before the memory module stores first corresponding relationship, by hash function by described the In one corresponding relationship, the numerical value of regular length described in the mark boil down to of the corresponding data object of each user.
10. device according to claim 8, which is characterized in that the memory module, comprising:
Grouped element, for the mark for each user for including in first corresponding relationship to be divided at least two user identifiers Group, the user identifier group are corresponding with respective storage address;
Storage unit stores the user identifier group and includes for corresponding in the corresponding storage address of the user identifier group The mark of the mark of user and data object corresponding with the mark of user that the user identifier group includes;
The enquiry module, comprising:
Determination unit, the corresponding user identifier group of mark for determining the relational users;
Query unit is used for inquiring the relationship in the corresponding storage address of user identifier group that the determination unit determines The mark of the corresponding data object in family.
11. device according to claim 7, which is characterized in that described device further include:
Second obtain module, for the retrieval module retrieval include the keyword the mark of data object before, Each keyword that the mark and each data object for obtaining each data object include;
Module is established in list, includes each keyword for establishing Inverted List, in the Inverted List and include Corresponding relationship between the mark of the data object of each keyword;
The retrieval module, for including the data pair of the keyword according to the keyword and Inverted List retrieval The mark of elephant.
12. according to any device of claim 7 to 11, which is characterized in that described device further include:
Third obtains module, and the target object output for retrieving the retrieval module in the output module is retrieval knot Before fruit, the authority information of the target object is obtained;
Detection module, for checking the mesh according to whether the authority information of the target object detection retrieval user has Mark the permission of object;
The output module, if the testing result for the detection module has for the retrieval user is checked the target pair The permission of elephant then executes described the step of exporting the target object for search result.
13. a kind of server, which is characterized in that the server includes:
Data searcher as described in the claims 7 to 12 are any.
14. a kind of computer readable storage medium, is stored thereon with program, which is characterized in that described program is executed by processor To realize data retrieval method as claimed in any one of claims 1 to 6.
CN201310557261.9A 2013-11-11 2013-11-11 Data retrieval method, device and server Active CN104636368B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201310557261.9A CN104636368B (en) 2013-11-11 2013-11-11 Data retrieval method, device and server

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201310557261.9A CN104636368B (en) 2013-11-11 2013-11-11 Data retrieval method, device and server

Publications (2)

Publication Number Publication Date
CN104636368A CN104636368A (en) 2015-05-20
CN104636368B true CN104636368B (en) 2019-06-04

Family

ID=53215138

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201310557261.9A Active CN104636368B (en) 2013-11-11 2013-11-11 Data retrieval method, device and server

Country Status (1)

Country Link
CN (1) CN104636368B (en)

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105871686A (en) * 2016-03-22 2016-08-17 青岛海信移动通信技术股份有限公司 Message receiving method and intelligent terminal in converged communication
CN107172178B (en) * 2017-06-05 2019-03-26 腾讯科技(深圳)有限公司 A kind of content delivery method and device
CN109344342B (en) * 2018-12-17 2021-04-09 北京百度网讯科技有限公司 Map data retrieval method, map data retrieval device, map data retrieval server and map data retrieval system
CN110413570A (en) * 2019-06-25 2019-11-05 大唐软件技术股份有限公司 A kind of document index and search method and its device
CN110399374A (en) * 2019-07-05 2019-11-01 东软集团股份有限公司 Data retrieval method, device, storage medium and electronic equipment
CN116827682B (en) * 2023-08-23 2023-11-24 腾讯科技(深圳)有限公司 Data processing method and device and computer equipment

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102402589A (en) * 2011-10-26 2012-04-04 北京百度网讯科技有限公司 Method and equipment for providing reference research information related to research request
CN102915306A (en) * 2011-08-02 2013-02-06 腾讯科技(深圳)有限公司 Searching method and searching system
CN102955778A (en) * 2011-08-18 2013-03-06 腾讯科技(深圳)有限公司 Method and system for fast search of network community data

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102915306A (en) * 2011-08-02 2013-02-06 腾讯科技(深圳)有限公司 Searching method and searching system
CN102955778A (en) * 2011-08-18 2013-03-06 腾讯科技(深圳)有限公司 Method and system for fast search of network community data
CN102402589A (en) * 2011-10-26 2012-04-04 北京百度网讯科技有限公司 Method and equipment for providing reference research information related to research request

Also Published As

Publication number Publication date
CN104636368A (en) 2015-05-20

Similar Documents

Publication Publication Date Title
CN104636368B (en) Data retrieval method, device and server
CN104598631B (en) Distributed data processing platform
CN102761627B (en) Based on cloud network address recommend method and system and the relevant device of terminal access statistics
CN108574742B (en) Domain name information collection method and domain name information collection device
CN102833262B (en) Phishing website collection and identification method and system based on whois information
US7844593B2 (en) Method and system for network search
GB2555801A (en) Identifying fraudulent and malicious websites, domain and subdomain names
CN104408159A (en) Data correlating, loading and querying method and device
CN105224636A (en) A kind of data access method and device
US11232166B2 (en) Networked page access and addressing based on partial categorization indexing
CN108154024B (en) Data retrieval method and device and electronic equipment
CN111026709B (en) Data processing method and device based on cluster access
CN102622402B (en) Server, method and system for providing information search service by using sheaf of pages
CN108243207B (en) A kind of date storage method of network cloud disk
CN109947759A (en) A kind of data directory method for building up, indexed search method and device
CN105653717B (en) A kind of method and device that information is shared
CN110708309A (en) Anti-crawler system and method
US10147095B2 (en) Chain understanding in search
CN102378407A (en) Object name resolution system and method in internet of things
CN104424316A (en) Data storage method, data searching method, related device and system
CN108804502A (en) Big data inquiry system, method, computer equipment and storage medium
CN107220262B (en) Information processing method and device
CN109302406B (en) Distributed webpage evidence obtaining method and system
CN104636384B (en) A kind of method and device handling document
US20150278372A1 (en) Search Engine and Network for Enhancing Search of Information Over a Data Network

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant