CN107463570B - Document retrieval/analysis method and device - Google Patents

Document retrieval/analysis method and device Download PDF

Info

Publication number
CN107463570B
CN107463570B CN201610390991.8A CN201610390991A CN107463570B CN 107463570 B CN107463570 B CN 107463570B CN 201610390991 A CN201610390991 A CN 201610390991A CN 107463570 B CN107463570 B CN 107463570B
Authority
CN
China
Prior art keywords
document
block
retrieval
standard
data
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201610390991.8A
Other languages
Chinese (zh)
Other versions
CN107463570A (en
Inventor
裘钢
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Suoyi Interactive Beijing Information Technology Co ltd
Original Assignee
Suoyi Interactive Beijing Information Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Suoyi Interactive Beijing Information Technology Co ltd filed Critical Suoyi Interactive Beijing Information Technology Co ltd
Priority to CN201610390991.8A priority Critical patent/CN107463570B/en
Publication of CN107463570A publication Critical patent/CN107463570A/en
Application granted granted Critical
Publication of CN107463570B publication Critical patent/CN107463570B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/903Querying

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a document retrieval method, which comprises the following steps: pre-storing first data with a predetermined format, wherein the predetermined format comprises a block structure, and the block structure at least comprises a block number and a document number list; the block structure comprises retrievable elements, and the first data is retrieved according to the retrievable elements to obtain a first retrieval result; and searching in a preset document database or a preset document set according to the first search result to generate a first search document set. The invention realizes the search of documents in the document database based on the documents other than the document database, and particularly realizes the search and analysis of relevant standard patents and other patent documents based on the standard relevant documents. Meanwhile, the invention also discloses a document retrieval device.

Description

Document retrieval/analysis method and device
Technical Field
The invention relates to the technical field of data search and analysis, in particular to a document retrieval method and a document retrieval device.
Background
With the continuous development of scientific technology, various technical documents are more and more, particularly, patent documents representing advanced technology are more and more, a plurality of search platforms are already provided at present, a plurality of search means can be provided to search patent documents and non-patent documents in a database, but the current search means basically aim at keywords, classification numbers and bibliographic items of the documents, so that the acquisition and analysis of specific categories, such as standard patent documents, are always difficult.
Disclosure of Invention
In view of the above, the present invention is proposed to provide a method and an electronic device that overcome or at least partially solve the above problems.
In one aspect of the present invention, there is provided a document retrieval method, including:
pre-storing first data with a predetermined format, wherein the predetermined format comprises a block structure, and the block structure at least comprises a block number and a document number list;
the block structure comprises retrievable elements, and the first data is retrieved according to the retrievable elements to obtain a first retrieval result;
and searching in a preset document database or a preset document set according to the first search result to generate a first search document set.
Optionally, the first search result is a document number list.
Optionally, the method includes:
the block structure at least comprises a block file corresponding to a block number, and the first retrieval result is the block file corresponding to the block number;
and searching in a preset document database or a preset document set according to the block file to generate a second searched document set.
Optionally, retrieving the first data according to the retrievable element to obtain a first retrieval result, specifically including:
retrieving the block files to obtain a block file list meeting a preset condition;
and acquiring the first retrieval result according to the block file list.
Optionally, the block number is a standard number.
Optionally, the first data includes ETSI standard data.
Optionally, the ETSI standard data includes 3GPP standard data.
Optionally, the method further includes:
and receiving a retrieval limiting condition input by a user, and retrieving document data matched with the retrieval limiting condition in the first retrieval document set to generate a third retrieval document set.
Optionally, the method further includes: and displaying the document number list and the standard file corresponding to the block number.
Optionally, the method further includes: and displaying the specific document in the document number list and the standard file corresponding to the block number.
The present invention also provides a document retrieval apparatus, comprising:
a storage unit configured to pre-store first data having a predetermined format, the predetermined format including a block structure, the block structure including at least a block number and a document number list;
the first retrieval unit is used for retrieving the first data according to the retrievable elements to obtain a first retrieval result, and the block structure comprises the retrievable elements;
and the second searching unit is used for searching in a preset document database or a preset document set according to the first searching result and generating a first searched document set.
Optionally, the first search result is a document number list.
Optionally, the block structure at least further includes a block file corresponding to a block number, and the first search result is the block file corresponding to the block number; the device includes: the second retrieval unit is used for retrieving in a preset document database or a preset document set according to the block files to generate a second retrieval document set;
optionally, the first retrieving unit specifically includes:
the block file retrieval module is used for retrieving the block files to obtain a block file list meeting preset conditions;
and the retrieval result acquisition module is used for acquiring the first retrieval result according to the partitioned file list.
Optionally, the block number is a standard number.
Optionally, the first data includes ETSI standard data.
Optionally, the ETSI standard data includes 3GPP standard data.
Optionally, the apparatus further comprises:
a receiving unit for receiving a retrieval restriction condition input by a user;
and a third retrieval unit configured to retrieve document data matching the retrieval restriction condition in the first retrieved document set to generate a third retrieved document set.
Optionally, the apparatus further comprises: and the display unit is used for displaying the document number list and the standard file corresponding to the block number.
Optionally, the apparatus further comprises: and the display unit is used for displaying the specific document in the document number list and the standard file corresponding to the block number.
The technical scheme provided by the invention can realize the search of documents in the document database based on the documents except the document database, and particularly can realize the search of relevant standard patents and non-standard patent documents based on the standard relevant documents, thereby providing a data base for the research of the standard patents and providing a data base for the research of the patents which are relevant to the standard and are not declared as the standard patents.
The foregoing description is only an overview of the technical solutions of the present invention, and the embodiments of the present invention are described below in order to make the technical means of the present invention more clearly understood and to make the above and other objects, features, and advantages of the present invention more clearly understandable.
Drawings
Various other advantages and benefits will become apparent to those of ordinary skill in the art upon reading the following detailed description of the preferred embodiments. The drawings are only for purposes of illustrating the preferred embodiments and are not to be construed as limiting the invention. Also, like reference numerals are used to refer to like parts throughout the drawings. In the drawings:
FIG. 1 shows a flow diagram of a document retrieval/analysis method proposed in accordance with the present invention;
FIG. 2 shows a schematic diagram of a partitioned file according to one embodiment of the invention;
FIG. 3 illustrates an interface diagram of an ETSI standard framework structure according to one embodiment of the present invention;
FIG. 4 illustrates an interface diagram showing a list of documents under a standard node and a standard at the same time, according to one embodiment of the invention;
FIG. 5 illustrates an interface diagram showing the simultaneous display of specific information and association criteria for a document, in accordance with one embodiment of the present invention;
FIG. 6 shows a display diagram of an ETSI standard framework structure according to one embodiment of the invention
Detailed Description
Exemplary embodiments of the present invention will be described in more detail below with reference to the accompanying drawings. While exemplary embodiments of the invention are shown in the drawings, it should be understood that the invention can be embodied in various forms and should not be limited to the embodiments set forth herein. Rather, these embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the scope of the invention to those skilled in the art.
The present invention provides a document retrieval method, as shown in fig. 1, the method comprising:
s1, pre-storing first data with a preset format, wherein the preset format comprises a block structure, and the block structure at least comprises a block number and a document number list;
s2, the block structure comprises retrievable elements, and the first data are retrieved according to the retrievable elements to obtain a first retrieval result;
and S3, searching in a preset document database or a preset document set according to the first search result to generate a first search document set.
As a first specific embodiment, an ETSI standard file is stored in advance, and the ETSI standard file is composed of one entry, and the one entry is the block structure in step S1. Each entry includes the technical field, the standard number, a family (i.e., the same family) patent document list, the patentee, the time at which the patentee claimed the standard, the document number at which the patentee claimed the standard, and the patent name and country to which the patentee belongs. In the ETSI standard, one entry corresponds to one family patent, and in practical cases, one family patent may be related to a plurality of standard numbers, and thus one family patent may appear in a plurality of entries; a standard number may have multiple families of patents entered and thus a standard number may appear in multiple entries. The association logic between the items can be established according to the standard numbers and patent document number relations between the items, and all elements in the items such as the technical field, the standard numbers, a family (namely, the same family) patent document list, the patentee, the time for the patentee to claim the standard, the file number for the patentee to claim the standard and the like can be used as retrieval elements. With respect to the block structure, it is understood that a technique may include a plurality of standard numbers, one standard number corresponding to a plurality of patent documents, and it is highly likely that one standard number corresponds to a title, a text, or a file. In a specific case, in the retrieval input box, the user inputs "type/ETSI", the server or the cloud traverses each entry in the ETSI standard document according to the instruction, extracts the patent document number in each entry, and then retrieves in a predetermined document database according to the extracted patent document number, thereby obtaining a first retrieval document set, and under the instruction, the retrieval document set obtained by the user is all standard patents declared in the ETSI standard. In this scenario, the extracted document number is the first search result in step S1. In another specific case, when the user inputs the technical field of type/3gpp-release-10, the server or the cloud matches the index word in each entry in the ETSI standard file according to the fact that "3 gpp-release-10" is used as a keyword, so as to obtain document numbers under all entries with the technology of "3 gpp-release-10", and searches in a predetermined document database according to the document numbers, so as to obtain a first search document set. As a third specific scenario, according to the type/NEC Corporation input by the user, the ETSI standard document is first retrieved according to the name of the claimant, each entry of which all NEC Corporation claims to have a standard patent is acquired, the document number under the entry is acquired, and the retrieval is performed in the predetermined document database according to the document number, thereby acquiring the first retrieved document set. The first data are partitioned and structured data, so that the index words can be conveniently determined, and the matching between the index words and the index words is efficient and feasible.
In addition to searching for patent documents in a predetermined document database based on the acquired document numbers, the present invention may also acquire the block files if the block structure further includes block files corresponding to block numbers. Specifically, the server directly searches the first data according to retrievable elements input by a user after a field "type/", such as a technology, a standard number, a family (i.e. same family) patent document list, a file number of a patent right declaration standard, and the like, so as to obtain corresponding entries, and obtain corresponding block files according to the entries. In a specific scenario, the user inputs type/TS-25.331, the server finds an entry corresponding to the standard according to TS-25.331, and obtains a patent document number corresponding to the entry, and the server also obtains a corresponding document, i.e., a corresponding standard text (i.e., the aforementioned block file) according to TS-25.331, where a specific page of the standard text is shown in fig. 2.
In the application, the block file can be directly obtained from the first data, and the second search literature set can be generated by searching in a preset literature database or a preset literature set according to the block file. The block file may be a complete file, a title, or a segment of text. According to the block files, semantic retrieval is carried out in a preset document database or a preset document set, so that patent documents which are sorted according to the semantic relevance of the block files are obtained. Therefore, the method can acquire not only the standard patent, but also the non-standard patent literature highly related to the issued standard. It is known that the sources of standard patents are basically the ones that the enterprise or patentee declares to be relevant to which part of the standard after the standard is issued, but in fact, whether the declared patents are really relevant to the standard or not has no expert or committee to examine, so that the relevance of the declared patents to the standard is actually required to be signed, and there may be some patents which are highly relevant to the standard and are not declared to be standard patents, by using the block files to establish the connection or channel between the first data and the document database, the invention provides an innovative semantic retrieval means, namely, the non-standard patent documents and the standard patent documents with high relevance to the standard can be obtained by using the files outside the document database, so as to fully present to the user various patent barriers that may be encountered in implementing the standard, and also provides a data base for standard related research.
In a second specific embodiment, the block file is searched by using a search word, for example, by inputting 5-bit coding/etsi in a search interface input box, a block file containing a keyword "5-bit coding" is obtained, then a corresponding document number list is found by using the obtained block file, and a fifth search document set is obtained by searching in a predetermined database or a predetermined document set by using the document number list; or further searching in a document database or a preset document set based on the block file to obtain a second searched document set; alternatively, a boolean search may be performed in a predetermined database or a predetermined document set based on the keyword to obtain a fourth search document set, and the fifth search document set and the second search document set may be logically or-operated to output the operation result, or the fifth search document set and the fourth search document set may be logically or-operated to output the operation result, so that all standard patents (patents declared as standards) and non-standard patents (patents not declared as standards) related to the standards may be acquired and displayed to the user.
In a third specific embodiment, the partitioned files in the first data can be used as documents in the document database to perform traditional retrieval, including boolean retrieval and semantic retrieval, so that the partitioned files form part of the documents in the document database, and the document database is enriched to a certain extent.
In a fourth specific embodiment, retrieving the first data according to the retrievable element to obtain a first retrieval result specifically includes: retrieving the block files to obtain a block file list meeting a preset condition; and acquiring the first retrieval result according to the block file list. In a specific scenario, a user wants to know a standard corresponding to a predetermined keyword and a corresponding standard patent, and then inputs an instruction, anti/(pusch w capacity request) and (channel w allocation), after receiving the instruction, a server identifies the keyword behind the anti, performs semantic matching or boolean matching on the keyword and all block files, can obtain a block file list with a predetermined degree of correlation or with the keyword according to a matching result, and obtains a corresponding entry and a patent document number under the entry according to the block file list, so that a final standard patent and a final standard file can be obtained. Through the technical means, a user can acquire a series of standard patents under a standard and corresponding standard files.
In the above embodiment, the block number is a standard number. The first data is ETSI standard data.
In a specific embodiment, the ETSI standard data includes 3GPP standard data, so that when ETSI standard data is used as the first data in step S1, the user can search for any 3 GPP-related standard and standard patent, related non-standard patent, which undoubtedly provides a very comprehensive patent data base for the implementation of 3GPP standard.
After the first search document set is acquired by searching the first data, a search restriction condition input by a user is received, and document data matching the search restriction condition is searched in the first search document set to generate a third search document set. That is, the expression for retrieving the first data may be logically operated with any retrievable or field for retrieving a document database, including an AND, OR, NAND, etc. In one scenario, a user wants to know whether down corporation, as a national enterprise lead in the telecommunications industry, has a standard patent under 3GPP, then constructs the expression "type/3 GPP and ann/down", finds none, then has a standard patent under ETSI large standard? Type/etsiand ann/large Tang, found 447, also provided a very good data base for the large enterprise's standard patent management.
As a specific implementation, a user wants to know the standard patent conditions of several competitors, and finds all patents of the competitors by constructing a search formula, ann/millet or hectometor orevora, i.e. obtaining the patents at a search module, then importing the patents to an analysis module, and looking at the patents and the standards entered by the competitors, respectively, by first grouping the patents according to the applicant, then performing search grouping on each group of grouped patents according to the first data etsi, then the analysis module matches each group of patents with the patent document number under each entry in the standards, and displays the matched patents in a standard frame structure constructed according to the standard number in the first data etsi. In order to simply show how the matched patents are displayed in the standard frame structure constructed according to the standard number in the first data etsi, the patents searched according to the search formula ann/millet or Baidu or Ori are displayed together in the standard frame structure constructed according to the standard number in the first data etsi, as shown in FIG. 3. Fig. 3 shows that the nodes of patents that are not matched in the ETSI standard framework structure are removed by default, and may of course be retained, in the case of retention, an icon in front of each node may show which nodes have matched patent documents under them, and which nodes have no matched patent documents under them, for example, a box with a "+" sign is used as a node icon to indicate that there are matched patent documents under the node, and a box with a "-" sign is used as a node icon to indicate that there are not matched patent documents under the node. It is obvious from this embodiment that the retrieval method described in this application can also be used in the analysis process of data, that is, in the analysis process, the data is searched by using the retrieval method and the searched data is grouped, and the grouped data is presented according to the data framework structure that can be expressed by the first data.
In the specific presentation, as shown in fig. 4, the document number list and the standard file corresponding to the block number may be displayed at the same time, and a preferred mode is to click a node in the trigger data frame structure by a user, display a document number list corresponding to the block number on the left, and display a block file corresponding to the block number on the right.
In the concrete presentation, as shown in fig. 5, a specific document in the document number list and the block file corresponding to the block number may also be displayed at the same time, and a preferred way is to click a specific document in the trigger data frame structure by a user, display a specific written item or full text of the specific document on the left side, and display the block file under the block number corresponding to the specific document on the right side.
The present invention also provides a document retrieval apparatus, as shown in fig. 6, comprising:
a storage unit 10 configured to pre-store first data having a predetermined format, the predetermined format including a block structure, the block structure including at least a block number and a document number list; the first data is preferably stored in tpl format, i.e. the suffix of the first file is preferably.
A first retrieving unit 20, configured to retrieve the first data according to a retrievable element to obtain a first retrieval result, where the block structure includes the retrievable element;
and a second search unit 30 configured to perform a search in a predetermined document database or a predetermined document set according to the first search result, and generate a first search document set.
The document retrieval/analysis device can be an independent retrieval device, the retrieval device can be in a browser form, namely the structural setting of browser client software and a server is realized, the data search and matching processes are finished at the server, or the structural setting of the retrieval client software and the server is realized, part of the data search and matching processes are finished at the client, part of the data search and matching processes are finished at the server, or all the data search and matching processes are finished at the client; the document retrieval/analysis device can also be a single analysis device, namely the analysis client software + the server structure setting; the document retrieval/analysis device may be a device integrating analysis and retrieval functions, that is, the device includes a retrieval module and an analysis module, a document set retrieved by the retrieval module can be directly imported to the analysis module, and the analysis module can search data by the retrieval module when searching for a group. In one embodiment, the first search result obtained by the first search unit is a document number list. The document number list is a link or channel between the first data and the document database, so as to obtain documents in the document database according to the first data.
In another embodiment, the first search result obtained by the first search unit is a block file. The block file is a link or channel between the first data and the document database, so that the documents in the document database are obtained according to the block file. Therefore, the second search means in the document search/analysis device is configured to search in a predetermined document database or a predetermined document set based on the block file to generate a second search document set, where the search is preferably a semantic search, and by the semantic search, it is possible to search based on the semantic understanding of the block file by the device to acquire a related patent document recommended according to the semantic relevance, and it is particularly pointed out that, in this way, it is possible to acquire a patent related to the standard that does not claim to belong to the standard patent.
The first retrieval unit specifically includes: the block file retrieval module is used for retrieving the block files to obtain a block file list meeting preset conditions; and the retrieval result acquisition module is used for acquiring the first retrieval result according to the partitioned file list. Through the block file retrieval module, a series of block files, especially standard files corresponding to a plurality of associated block structures, such as standard files composed of a plurality of block files under a technical branch, can be obtained, and through the block file list, a user can obtain standard patents and non-standard patents related to the standard under the technical branch.
As a specific embodiment, the first data stored by the document retrieval/analysis apparatus is ETSI standard data, and the block number is a standard number. The ETSI standard data includes 3GPP standard data. The method can conveniently search and analyze standard patents, non-standard patents and standard documents related to the 3GPP standard data.
The document retrieval/analysis apparatus further includes: a receiving unit for receiving a retrieval restriction condition input by a user; and a third retrieval unit configured to retrieve document data matching the retrieval restriction condition in the first retrieved document set to generate a third retrieved document set. The first document set composed of documents acquired based on the first data can be further searched using a search element for documents as a search restriction condition as in the conventional document set directly obtained in the patent document database, and can also be logically element-combined with other document sets.
The document retrieval/analysis apparatus further includes: and the display unit is used for displaying the document number list and the standard file corresponding to the block number. As a specific embodiment, the display unit may include a left screen on which the document number list is displayed and a right screen on which the standard file corresponding to the block number is displayed, and as another specific embodiment, the display unit may include a set screen on which the document number list is displayed on a bottom screen and the standard file corresponding to the block number is displayed on a middle set screen.
The document retrieval/analysis apparatus further includes: and the display unit is used for displaying the specific document in the document number list and the standard file corresponding to the block number. As one embodiment, the display unit may include a left screen on which a specific document may be displayed and a right screen on which a standard document corresponding to the specific document may be displayed, and as another embodiment, the display unit may include a set screen on which the specific document is displayed on a bottom screen and a standard document corresponding to the specific document is displayed on a middle set screen.
Since the document searching/analyzing apparatus described in the present invention is an apparatus for implementing the document searching/analyzing method in the embodiments of the present application, a person skilled in the art can understand the specific implementation of the document searching/analyzing apparatus of the present embodiment and various variations thereof based on the document searching/analyzing method described in the embodiments of the present application, and therefore, a detailed description of how the apparatus implements the document searching/analyzing method in the embodiments of the present application is not provided herein. The scope of the present application is intended to be covered by the claims so long as those skilled in the art can implement the methods for marking in the embodiments of the present application.
The technical scheme provided in the embodiment of the application at least has the following technical effects or advantages:
the search of documents in the document database can be realized based on documents other than the document database, particularly, the search of related standard patents and other patent documents can be realized based on standard related documents, so that a data base is provided for the research of the standard patents, and a data base is provided for the research of the standard related patents which are not declared as the standard patents.
The algorithms and displays presented herein are not inherently related to any particular computer, virtual machine, or other apparatus. Various general purpose systems may also be used with the teachings herein. The required structure for constructing such a system will be apparent from the description above. Moreover, the present invention is not directed to any particular programming language. It is appreciated that a variety of programming languages may be used to implement the teachings of the present invention as described herein, and any descriptions of specific languages are provided above to disclose the best mode of the invention.
In the description provided herein, numerous specific details are set forth. It is understood, however, that embodiments of the invention may be practiced without these specific details. In some instances, well-known methods, structures and techniques have not been shown in detail in order not to obscure an understanding of this description.
Similarly, it should be appreciated that in the foregoing description of exemplary embodiments of the invention, various features of the invention are sometimes grouped together in a single embodiment, figure, or description thereof for the purpose of streamlining the disclosure and aiding in the understanding of one or more of the various inventive aspects. However, the disclosed method should not be interpreted as reflecting an intention that: that the invention as claimed requires more features than are expressly recited in each claim. Rather, as the following claims reflect, inventive aspects lie in less than all features of a single foregoing disclosed embodiment. Thus, the claims following the detailed description are hereby expressly incorporated into this detailed description, with each claim standing on its own as a separate embodiment of this invention.
Those skilled in the art will appreciate that the modules in the device in an embodiment may be adaptively changed and disposed in one or more devices different from the embodiment. The modules or units or components of the embodiments may be combined into one module or unit or component, and furthermore they may be divided into a plurality of sub-modules or sub-units or sub-components. All of the features disclosed in this specification (including any accompanying claims, abstract and drawings), and all of the processes or elements of any method or apparatus so disclosed, may be combined in any combination, except combinations where at least some of such features and/or processes or elements are mutually exclusive. Each feature disclosed in this specification (including any accompanying claims, abstract and drawings) may be replaced by alternative features serving the same, equivalent or similar purpose, unless expressly stated otherwise.
Furthermore, those skilled in the art will appreciate that while some embodiments herein include some features included in other embodiments, rather than other features, combinations of features of different embodiments are meant to be within the scope of the invention and form different embodiments. For example, in the following claims, any of the claimed embodiments may be used in any combination.
The various component embodiments of the invention may be implemented in hardware, or in software modules running on one or more processors, or in a combination thereof. Those skilled in the art will appreciate that a microprocessor or Digital Signal Processor (DSP) may be used in practice to implement some or all of the functionality of some or all of the components of a gateway, proxy server, system according to embodiments of the present invention. The present invention may also be embodied as apparatus or device programs (e.g., computer programs and computer program products) for performing a portion or all of the methods described herein. Such programs implementing the present invention may be stored on computer-readable media or may be in the form of one or more signals. Such a signal may be downloaded from an internet website or provided on a carrier signal or in any other form.
It should be noted that the above-mentioned embodiments illustrate rather than limit the invention, and that those skilled in the art will be able to design alternative embodiments without departing from the scope of the appended claims. In the claims, any reference signs placed between parentheses shall not be construed as limiting the claim. The word "comprising" does not exclude the presence of elements or steps not listed in a claim. The word "a" or "an" preceding an element does not exclude the presence of a plurality of such elements. The invention may be implemented by means of hardware comprising several distinct elements, and by means of a suitably programmed computer. In the unit claims enumerating several means, several of these means may be embodied by one and the same item of hardware. The usage of the words first, second and third, etcetera do not indicate any ordering. These words may be interpreted as names.

Claims (12)

1. A document retrieval method, comprising:
pre-storing first data with a preset format, wherein the first data comprises standard data, the preset format comprises a block structure, the block structure comprises retrievable elements, the block structure at least comprises a block number, a document number list and a block file corresponding to the block number, and the block number is the standard number;
searching the block files to obtain a block file list meeting a preset condition, and obtaining a first search result according to the block file list, wherein the first search result is at least one of a document number list and a block file corresponding to a block number;
and searching in a preset document database or a preset document set according to the first search result to generate a first search document set.
2. The method of claim 1, comprising:
and searching in a preset document database or a preset document set according to the block file to generate a second searched document set.
3. The method of claim 1, wherein standard data is ETSI standard data, and wherein the ETSI standard data comprises 3GPP standard data.
4. The method of claim 1, further characterized in that the method further comprises:
and receiving a retrieval limiting condition input by a user, and retrieving document data matched with the retrieval limiting condition in the first retrieval document set to generate a third retrieval document set.
5. The method of claim 1 or 2, further comprising: and displaying the document number list and the standard file corresponding to the block number.
6. The method of claim 1 or 2, further comprising: and displaying the specific document in the document number list and the standard file corresponding to the block number.
7. A document retrieval apparatus, comprising:
the storage unit is used for pre-storing first data with a preset format, wherein the first data comprises ETSI standard data, the preset format comprises a block structure, the block structure comprises retrievable elements, the block structure at least comprises a block number, a document number list and a block file corresponding to the block number, and the block number is a standard number;
a first retrieval unit, configured to retrieve the first data according to a retrievable element to obtain a first retrieval result, where the first retrieval result is at least one of a document number list and a block file corresponding to a block number, and the first retrieval unit specifically includes: the device comprises a block file retrieval module, a retrieval result acquisition module and a first search module, wherein the block file retrieval module is used for retrieving the block files so as to acquire a block file list meeting a preset condition;
and the second searching unit is used for searching in a preset document database or a preset document set according to the first searching result and generating a first searched document set.
8. The apparatus of claim 7, comprising: the second retrieval unit is used for retrieving in a predetermined document database or a predetermined document set according to the block files and generating a second retrieval document set.
9. The apparatus of claim 7, wherein standard data is ETSI standard data, and wherein the ETSI standard data comprises 3GPP standard data.
10. The apparatus of any of claims 7-9, further characterized in that the apparatus further comprises:
a receiving unit for receiving a retrieval restriction condition input by a user;
and a third retrieval unit configured to retrieve document data matching the retrieval restriction condition in the first retrieved document set to generate a third retrieved document set.
11. The apparatus of any of claims 7-9, further comprising: and the display unit is used for displaying the document number list and the standard file corresponding to the block number.
12. The apparatus of any of claims 7-9, further comprising: and the display unit is used for displaying the specific document in the document number list and the standard file corresponding to the block number.
CN201610390991.8A 2016-06-02 2016-06-02 Document retrieval/analysis method and device Active CN107463570B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610390991.8A CN107463570B (en) 2016-06-02 2016-06-02 Document retrieval/analysis method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610390991.8A CN107463570B (en) 2016-06-02 2016-06-02 Document retrieval/analysis method and device

Publications (2)

Publication Number Publication Date
CN107463570A CN107463570A (en) 2017-12-12
CN107463570B true CN107463570B (en) 2020-10-13

Family

ID=60545788

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610390991.8A Active CN107463570B (en) 2016-06-02 2016-06-02 Document retrieval/analysis method and device

Country Status (1)

Country Link
CN (1) CN107463570B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110321868A (en) * 2019-07-10 2019-10-11 杭州睿琪软件有限公司 Object identifying and the method and system of display

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1389811A (en) * 2002-02-06 2003-01-08 北京造极人工智能技术有限公司 Intelligent search method of search engine
CN1975724A (en) * 2006-12-13 2007-06-06 上海汉光知识产权数据科技有限公司 Method for searching patents utilizing IPC classification
CN103164462A (en) * 2011-12-16 2013-06-19 苏州威世博知识产权服务有限公司 Method and system for downloading patent literature
CN104516979A (en) * 2014-12-31 2015-04-15 北京锐安科技有限公司 Data query method and data query system based on quadratic search
CN104978312A (en) * 2014-04-01 2015-10-14 江苏佰腾科技有限公司 Method using stock code to retrieve patent information

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1389811A (en) * 2002-02-06 2003-01-08 北京造极人工智能技术有限公司 Intelligent search method of search engine
CN1975724A (en) * 2006-12-13 2007-06-06 上海汉光知识产权数据科技有限公司 Method for searching patents utilizing IPC classification
CN103164462A (en) * 2011-12-16 2013-06-19 苏州威世博知识产权服务有限公司 Method and system for downloading patent literature
CN104978312A (en) * 2014-04-01 2015-10-14 江苏佰腾科技有限公司 Method using stock code to retrieve patent information
CN104516979A (en) * 2014-12-31 2015-04-15 北京锐安科技有限公司 Data query method and data query system based on quadratic search

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
技术标准的专利分析——以地面数字电视国家标准为例;王有国 等;《情报杂志》;20131231;第32卷(第12期);第79-82、118页 *

Also Published As

Publication number Publication date
CN107463570A (en) 2017-12-12

Similar Documents

Publication Publication Date Title
CN107729336B (en) Data processing method, device and system
US8972372B2 (en) Searching code by specifying its behavior
JP5575902B2 (en) Information retrieval based on query semantic patterns
US7660783B2 (en) System and method of ad-hoc analysis of data
US10380197B2 (en) Network searching method and network searching system
CN104715064B (en) It is a kind of to realize the method and server that keyword is marked on webpage
TWI524193B (en) Computer-readable media and computer-implemented method for semantic table of contents for search results
JP5721818B2 (en) Use of model information group in search
JP2015191655A (en) Method and apparatus for generating recommendation page
CN107885873B (en) Method and apparatus for outputting information
CN109918594B (en) Information display method and device
CN111008321A (en) Recommendation method and device based on logistic regression, computing equipment and readable storage medium
CN113407785B (en) Data processing method and system based on distributed storage system
CN107330079B (en) Method and device for presenting rumor splitting information based on artificial intelligence
CN104391978A (en) Method and device for storing and processing web pages of browsers
US20120179709A1 (en) Apparatus, method and program product for searching document
CN103793495A (en) Application message search method and system and application message acquisition method and system
US20130232139A1 (en) Electronic device and method for generating recommendation content
US10504145B2 (en) Automated classification of network-accessible content based on events
CN107463570B (en) Document retrieval/analysis method and device
WO2016101727A1 (en) Question-and-answer-based search result adjustment method and device
CN111126034A (en) Medical variable relation processing method and device, computer medium and electronic equipment
CN116521729A (en) Information classification searching method and device based on elastic search
CN107862028B (en) Method for establishing standard academic model, server and storage medium
CN107622125B (en) Information crawling method and device and electronic equipment

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant