CN106777103A - Patent document indexing method and device - Google Patents

Patent document indexing method and device

Info

Publication number
CN106777103A
Authority
CN
China
Prior art keywords
index
search
patent document
word segmentation
chinese word
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201611157229.1A
Other languages
Chinese (zh)
Other versions
CN106777103B (en
Inventor
赵大川
景俊杰
黄菲
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Kelong Technology Co Ltd
Original Assignee
Beijing Kelong Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Kelong Technology Co Ltd filed Critical Beijing Kelong Technology Co Ltd
Priority to CN201611157229.1A priority Critical patent/CN106777103B/en
Publication of CN106777103A publication Critical patent/CN106777103A/en
Application granted granted Critical
Publication of CN106777103B publication Critical patent/CN106777103B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/38Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2216/00Indexing scheme relating to additional aspects of information retrieval not explicitly covered by G06F16/00 and subgroups
    • G06F2216/11Patent retrieval

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Library & Information Science (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention provides a patent document indexing method and device. The method includes: obtaining a search expression; forming an index patent search condition from the search keywords in the search expression; searching for patent documents that satisfy the index patent search condition; and indexing those patent documents with the index information options that match them. With the patent document indexing method and device provided by the embodiments of the invention, index information can reflect the fields and technologies involved in a patent document more intuitively and concisely.

Description

Patent document indexing method and device
Technical field
The present invention relates to the technical field of data processing, and in particular to a patent document indexing method and device.
Background
At present, in order to better present the content of a patent document, a patent retrieval website needs to summarize the information that is implicit, unexpressed, or not prominent enough in the document, and thereby obtain the document's index information. When a user retrieves a patent document, its index information is displayed alongside it so that the user can better understand what the document covers.
In the related art, a patent retrieval website obtains the index information of patent documents by having a user enter the patent information of each patent document (such as the applicant, application number, and patent name) into the website, so that each patent is indexed.
In the course of implementing the present invention, the inventors found that the prior art has at least the following problem:
When indexing patent documents, a patent retrieval website can index only the above patent information of each document. The indexed content is sparse and cannot intuitively and concisely reflect the fields and technologies involved in the patent document, so it is of very limited help to users.
Summary of the invention
In view of this, the purpose of the embodiments of the present invention is to provide a patent document indexing method and device, so that index information can reflect the fields and technologies involved in a patent document more intuitively and concisely.
In a first aspect, an embodiment of the present invention provides a patent document indexing method, including:
Obtaining a search expression, where the search expression includes search keywords;
Forming an index patent search condition from the search keywords in the search expression;
Searching for patent documents that satisfy the index patent search condition;
Performing an indexing operation on the patent documents by means of the index information options that match the patent documents.
With reference to the first aspect, an embodiment of the present invention provides a first possible implementation of the first aspect, in which forming the index patent search condition from the search keywords in the search expression includes:
Obtaining the search keywords in the search expression;
Extracting Chinese segmented words from the search keywords by means of a word segmentation algorithm;
Forming the index patent search condition according to the extracted Chinese segmented words.
With reference to the first aspect, an embodiment of the present invention provides a second possible implementation of the first aspect, in which forming the index patent search condition according to the extracted Chinese segmented words includes:
Querying a preset index information set for the index information options that include the Chinese segmented words;
Determining the index information options that include a Chinese segmented word to be the index information options matching that Chinese segmented word, and displaying the Chinese segmented words and their matching index information options to the user;
When it is detected that the user has selected a Chinese segmented word, determining, from the search keywords, the index search keywords that include the Chinese segmented word selected by the user, and forming the index patent search condition according to the index search keywords and the search expression.
With reference to the first aspect, an embodiment of the present invention provides a third possible implementation of the first aspect, in which determining the index search keywords and forming the index patent search condition when it is detected that the user has selected a Chinese segmented word includes:
Monitoring for the Chinese segmented word selected by the user;
Querying the search expression with the Chinese segmented word selected by the user, and determining the search keywords in the search expression that include the selected Chinese segmented word to be index search keywords;
When it is determined that there are multiple index search keywords, combining the multiple index search keywords with the logical operators that lie between them in the search expression, to form an index limitation search expression;
Combining the search expression with the index limitation search expression by means of a logical AND operator, to form the index patent search condition.
With reference to the first aspect, an embodiment of the present invention provides a fourth possible implementation of the first aspect, in which performing the indexing operation on the patent documents by means of the index information options that match the patent documents includes:
Determining the index information option that matches the Chinese segmented word selected by the user to be the index information option matching the patent documents;
Performing the indexing operation on the patent documents by means of the determined index information option.
In a second aspect, an embodiment of the present invention further provides a patent document indexing device, including:
An acquisition module, for obtaining a search expression, where the search expression includes search keywords;
A forming module, for forming an index patent search condition from the search keywords in the search expression;
A search module, for searching for patent documents that satisfy the index patent search condition;
An indexing module, for performing an indexing operation on the patent documents by means of the index information options that match the patent documents.
With reference to the second aspect, an embodiment of the present invention provides a first possible implementation of the second aspect, in which the forming module includes:
An acquisition submodule, for obtaining the search keywords in the search expression;
An extraction submodule, for extracting Chinese segmented words from the search keywords by means of a word segmentation algorithm;
A forming submodule, for forming the index patent search condition according to the extracted Chinese segmented words.
With reference to the second aspect, an embodiment of the present invention provides a second possible implementation of the second aspect, in which the forming submodule includes:
A query unit, for querying a preset index information set for the index information options that include the Chinese segmented words;
A display unit, for determining the index information options that include a Chinese segmented word to be the index information options matching that Chinese segmented word, and displaying the Chinese segmented words and their matching index information options to the user;
A forming unit, for determining, when it is detected that the user has selected a Chinese segmented word, the index search keywords from the search keywords that include the Chinese segmented word selected by the user, and forming the index patent search condition according to the index search keywords and the search expression.
With reference to the second aspect, an embodiment of the present invention provides a third possible implementation of the second aspect, in which the forming unit is specifically used for:
Monitoring for the Chinese segmented word selected by the user;
Querying the search expression with the Chinese segmented word selected by the user, and determining the search keywords in the search expression that include the selected Chinese segmented word to be index search keywords;
When it is determined that there are multiple index search keywords, combining the multiple index search keywords with the logical operators that lie between them in the search expression, to form an index limitation search expression;
Combining the search expression with the index limitation search expression by means of a logical AND operator, to form the index patent search condition.
With reference to the second aspect, an embodiment of the present invention provides a fourth possible implementation of the second aspect, in which the indexing module includes:
A determination submodule, for determining the index information option that matches the Chinese segmented word selected by the user to be the index information option matching the patent documents;
An indexing submodule, for performing the indexing operation on the patent documents by means of the determined index information option.
The patent document indexing method and device provided by the embodiments of the present invention obtain an index patent search condition from the keywords in a search expression, search for the patent documents that satisfy that condition, and index those documents with the index information options that match them. Compared with the prior art, which can index only the patent information in a patent document (such as the applicant, application number, and patent name), the patent documents that satisfy the index patent search condition can be indexed with the index information options that match them. Depending on the index patent search condition, different aspects of the fields and technologies involved in the patent documents can therefore be indexed, so that the index information reflects those fields and technologies comprehensively, intuitively, and concisely. Users who consult the patent documents can understand the fields and technologies involved more directly, which improves the user experience.
To make the above objects, features, and advantages of the present invention more apparent, preferred embodiments are described in detail below with reference to the accompanying drawings.
Brief description of the drawings
To explain the technical solutions of the embodiments of the present invention more clearly, the drawings needed for the embodiments are briefly introduced below. It should be understood that the following drawings show only certain embodiments of the present invention and should therefore not be regarded as limiting its scope. A person of ordinary skill in the art can derive other related drawings from these drawings without creative effort.
Fig. 1 shows a schematic diagram of the application environment of the patent document indexing method provided by the embodiments of the present invention;
Fig. 2 shows a flow chart of the patent document indexing method provided by Embodiment 1 of the present invention;
Fig. 3 shows a schematic structural diagram of a patent document indexing device provided by Embodiment 2 of the present invention;
Fig. 4 shows a schematic structural diagram of the forming module in the patent document indexing device provided by Embodiment 2 of the present invention.
Detailed description of the embodiments
To make the purposes, technical solutions, and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments are described clearly and completely below with reference to the drawings. Obviously, the described embodiments are only some, not all, of the embodiments of the present invention. The components of the embodiments generally described and illustrated in the drawings herein can be arranged and designed in a variety of configurations. Therefore, the following detailed description of the embodiments provided in the drawings is not intended to limit the scope of the claimed invention, but merely represents selected embodiments. All other embodiments obtained by a person skilled in the art on the basis of the embodiments of the present invention without creative effort fall within the scope of protection of the present invention.
Fig. 1 shows a structural block diagram of a server that can be used in the embodiments of the present invention to execute the patent document indexing method. As shown in Fig. 1, the server 100 includes a memory 101, a processor 102, and a network module 103.
The memory 101 can store software programs and modules, such as the program instructions/modules corresponding to the patent document indexing method and device in the embodiments of the present invention. By running the software programs and modules stored in the memory 101, the processor 102 performs various functional applications and data processing, that is, implements the patent document indexing method in the embodiments of the present invention. The memory 101 may include high-speed random access memory and may also include non-volatile memory, such as one or more magnetic storage devices, flash memory, or other non-volatile solid-state memory. Further, the software programs and modules may include an operating system 121 and a service module 122. The operating system 121, which may be, for example, LINUX, UNIX, or WINDOWS, may include various software components and/or drivers for managing system tasks (such as memory management, storage device control, and power management) and can communicate with various hardware or software components, thereby providing a running environment for other software components. The service module 122 runs on top of the operating system 121, listens for requests from the network through the network services of the operating system 121, performs the corresponding data processing according to each request, and returns the processing result to the client. In other words, the service module 122 provides network services to clients.
The network module 103 is used to receive and send network signals. These network signals may include wireless signals or wired signals.
It can be understood that the structure shown in Fig. 1 is only illustrative. The server 100 may include more or fewer components than shown in Fig. 1, or have a configuration different from that shown in Fig. 1. Each component shown in Fig. 1 can be implemented in hardware, software, or a combination thereof. In addition, the server in the embodiments of the present invention may comprise multiple servers with different specific functions.
At present, a patent retrieval website obtains the index information of patent documents by having a user enter the patent information of each patent document (such as the applicant, application number, and patent name) into the website, so that each patent is indexed. When indexing patent documents, the website can index only this patent information. The indexed content is sparse and cannot intuitively and concisely reflect the fields and technologies involved in the patent document, so it is of very limited help to users. On this basis, the present application provides a patent document indexing method and device.
Embodiment 1
This embodiment provides a patent document indexing method, which is executed by a server. After the index patent search condition is determined, the server can search for the patent documents that satisfy it and index them with the index information options that match them, so that the index information of the patent documents reflects the fields and technologies involved comprehensively, intuitively, and concisely.
Referring to the flow chart of a patent document indexing method provided by an embodiment of the present invention, shown in Fig. 2, the method includes the following steps:
Step 200: obtain a search expression.
The search expression includes search keywords.
The search expression is a patent document search expression entered by the user; it is used to find the patent documents that the user wants to index. For example, the search expression may be:
ti=(load-sensitive hydraulic valve or heavy-duty flow valve) and ic=a61k35/78
This search expression indicates that the user wants to index the patent documents under IPC (ic) classification number a61k35/78 whose patent name (ti) includes "load-sensitive hydraulic valve" or "heavy-duty flow valve".
The user here is the indexer who indexes the patent documents.
A search keyword is a word that appears in the search expression. Such words, which are composed of Chinese segmented words, also appear in the patent name, abstract, claims, and description of patent documents. The patent name, abstract, claims, and description of patent documents can therefore be searched for these words to find the patent documents to be indexed.
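As an illustration of how search keywords relate to a search expression, the following minimal sketch pulls the keywords out of one field of a flat expression. The function name `extract_keywords`, the regular expressions, and the restriction to un-nested parentheses are assumptions made for this example; the patent does not specify how the keywords are parsed.

```python
import re


def extract_keywords(expression, field="ti"):
    """Return the keywords of `field` in a flat search expression.

    Hypothetical helper: only handles `field=(...)` groups without nested
    parentheses, and treats and/or/not as the only operators.
    """
    group = re.search(
        rf"\b{re.escape(field)}=\(([^()]*)\)", expression, flags=re.IGNORECASE
    )
    if group is None:
        return []
    # Split the group body on operators that stand alone between keywords.
    parts = re.split(r"\s+(?:and|or|not)\s+", group.group(1), flags=re.IGNORECASE)
    return [part.strip() for part in parts if part.strip()]


expression = (
    "ti=(load-sensitive hydraulic valve or heavy-duty flow valve) "
    "and ic=a61k35/78"
)
keywords = extract_keywords(expression)
```

Here `keywords` holds the two patent-name keywords of the example expression; a field that does not appear as a parenthesized group yields an empty list.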
After the search expression including the search keywords is obtained in step 200, the method continues to step 202 to form the index patent search condition.
Step 202: form an index patent search condition from the search keywords in the search expression.
The index patent search condition is a search expression formed by combining the above search expression with the index limitation search expression obtained by processing the search keywords. It is used to search for the patent documents that the user needs to index.
Step 204: search for the patent documents that satisfy the index patent search condition.
Step 206: perform an indexing operation on the patent documents by means of the index information options that match the patent documents.
The index information options are used to reflect the fields and technologies involved in the patent documents that satisfy the index patent search condition.
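Steps 200 to 206 can be read as a small pipeline. The sketch below is one possible arrangement under stated assumptions: the `PatentDocument` class, the `run_indexing` function, and the three callables passed to it are hypothetical names introduced for illustration, and the toy corpus and stand-in functions are not taken from the patent.

```python
from dataclasses import dataclass, field
from typing import Callable, Iterable, List


@dataclass
class PatentDocument:
    number: str  # hypothetical identifier
    title: str
    index_labels: List[str] = field(default_factory=list)


def run_indexing(
    search_expression: str,
    form_condition: Callable[[str], str],
    search: Callable[[str], Iterable[PatentDocument]],
    match_options: Callable[[PatentDocument], Iterable[str]],
) -> List[PatentDocument]:
    # Step 200: the search expression arrives as an argument.
    # Step 202: form the index patent search condition.
    condition = form_condition(search_expression)
    # Step 204: search for documents that satisfy the condition.
    documents = list(search(condition))
    # Step 206: attach the matching index information options.
    for doc in documents:
        for option in match_options(doc):
            if option not in doc.index_labels:
                doc.index_labels.append(option)
    return documents


# Toy stand-ins for the real search back end and option matching.
corpus = [
    PatentDocument("doc-1", "heavy-duty flow valve"),
    PatentDocument("doc-2", "bicycle frame"),
]
indexed = run_indexing(
    "ti=(flow valve)",
    form_condition=lambda expr: expr,
    search=lambda cond: [d for d in corpus if "flow valve" in d.title],
    match_options=lambda doc: ["flow valve"],
)
```

Passing the three stages in as callables keeps the sketch independent of any particular search engine or option store, which the patent leaves open.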
In summary, the patent document indexing method provided by this embodiment obtains an index patent search condition from the keywords in a search expression, searches for the patent documents that satisfy that condition, and indexes those documents with the index information options that match them. Compared with the prior art, which can index only the patent information in a patent document (such as the applicant, application number, and patent name), the patent documents that satisfy the index patent search condition can be indexed with the index information options that match them. Depending on the index patent search condition, different aspects of the fields and technologies involved in the patent documents can therefore be indexed, so that the index information reflects those fields and technologies comprehensively, intuitively, and concisely. Users who consult the patent documents can understand the fields and technologies involved more directly, which improves the user experience.
In the related art, users are not experts in every technical field and cannot have a comprehensive and accurate understanding of each one, so the search keywords entered by a user sometimes fail to fully describe the patent documents the user wants to index. To retrieve those patent documents comprehensively, forming the index patent search condition from the search keywords in the search expression includes the following steps (1) to (3):
(1) obtain the search keywords in the search expression;
(2) extract Chinese segmented words from the search keywords by means of a word segmentation algorithm;
(3) form the index patent search condition according to the extracted Chinese segmented words.
In step (2), the Chinese segmented words are the multiple independent words obtained by splitting the search keywords. In one embodiment, if the search keywords in the search expression are "load-sensitive hydraulic valve" and "heavy-duty flow valve", the word segmentation algorithm yields the following independent words: "load", "sensitive", "hydraulic", "heavy-duty", "flow", and "valve".
The word segmentation algorithm in step (2) may be any existing Chinese word segmentation algorithm capable of splitting the search keywords; these algorithms are not described one by one here.
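Since the text allows any existing Chinese word segmentation algorithm, the sketch below uses forward maximum matching against a dictionary, chosen here only because it is short; it is not stated to be the algorithm the inventors used. The dictionary and the Chinese keyword strings are assumptions: they are a plausible rendering of the valve example, not text taken from the patent.

```python
def forward_max_match(text, dictionary):
    """Split text greedily, always taking the longest dictionary word."""
    max_len = max(len(word) for word in dictionary)
    words, i = [], 0
    while i < len(text):
        for size in range(min(max_len, len(text) - i), 0, -1):
            candidate = text[i : i + size]
            # Fall back to a single character when nothing matches.
            if candidate in dictionary or size == 1:
                words.append(candidate)
                i += size
                break
    return words


# Hypothetical dictionary and keywords (assumed rendering of the example).
dictionary = {"负载", "敏感", "液压", "重负载", "流量", "阀"}
keywords = ["负载敏感液压阀", "重负载流量阀"]

# Collect each segmented word once, in order of first appearance.
segmented = []
for keyword in keywords:
    for word in forward_max_match(keyword, dictionary):
        if word not in segmented:
            segmented.append(word)
```

Because the longest match is tried first, the second keyword starts with the three-character word rather than with the shorter word it contains. A production system would more likely rely on an established segmentation library than on a hand-built dictionary.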
In summary, forming the index patent search condition from the extracted Chinese segmented words, rather than directly from the search keywords entered by the user, makes it possible to retrieve comprehensively the patent documents that the user wants to index and improves indexing accuracy.
In the related art, if the multiple keywords entered by the user are only weakly related, the search results include many patent documents that do not need to be indexed, and the user has to screen the results manually to determine which patent documents need indexing, which lowers indexing efficiency. To improve the indexing efficiency, forming the index patent search condition according to the extracted Chinese segmented words includes the following steps (1) to (3):
(1) query a preset index information set for the index information options that include the Chinese segmented words;
(2) determine the index information options that include a Chinese segmented word to be the index information options matching that Chinese segmented word, and display the Chinese segmented words and their matching index information options to the user;
(3) when it is detected that the user has selected a Chinese segmented word, determine, from the search keywords, the index search keywords that include the Chinese segmented word selected by the user, and form the index patent search condition according to the index search keywords and the search expression.
In step (1), if a Chinese segmented word obtained by splitting is "flow", index information options that include "flow", such as "flow valve" and "flowmeter", can be found in the preset index information set.
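The lookup in steps (1) and (2) amounts to a containment test between each segmented word and each option. A minimal sketch follows, assuming the preset index information set is a plain list of strings; the function name `match_index_options` and the "hydraulic pump" entry are hypothetical.

```python
def match_index_options(segmented_words, index_information_set):
    """Map each segmented word to the index information options containing it."""
    return {
        word: [option for option in index_information_set if word in option]
        for word in segmented_words
    }


# "flow valve" and "flowmeter" come from the text; "hydraulic pump" is made up.
index_information_set = ["flow valve", "flowmeter", "hydraulic pump"]
matches = match_index_options(["flow", "valve"], index_information_set)
```

The resulting mapping is what would be displayed to the user: each segmented word alongside the options it matched.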
Step (3) includes the following specific steps (31) to (35):
(31) monitor for the Chinese segmented word selected by the user;
(32) query the search expression with the Chinese segmented word selected by the user, and determine the search keywords in the search expression that include the selected Chinese segmented word to be index search keywords;
(33) when it is determined that there are multiple index search keywords, combine the multiple index search keywords with the logical operators that lie between them in the search expression, to form an index limitation search expression;
(34) combine the search expression with the index limitation search expression by means of a logical AND operator, to form the index patent search condition;
(35) when it is determined that there is only one index search keyword, combine the search expression with that index search keyword by means of a logical AND operator, to form the index patent search condition.
Optionally, the flow of steps (32) to (34) is illustrated by the following example. Suppose the search expression is: ti=((111 or 222) not (333 or 444) and 555) and ic=(123 or 321). From the search keywords "111", "222", "333", "444", and "555" of this search expression, the server determines the Chinese segmented words "1", "2", "3", "4", and "5". If the user selects the Chinese segmented words "2" and "3", the server determines the corresponding search keywords "222" and "333" to be index search keywords, and combines them with the logical NOT operator that lies between "222" and "333" in the search expression, forming the index limitation search expression "222 not 333". The index limitation search expression "222 not 333" is then combined with the search expression by a logical AND operator, giving the index patent search condition:
(ti=((111 or 222) not (333 or 444) and 555) and ic=(123 or 321)) and ti=(222 not 333).
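The worked example can be reproduced with a short sketch of steps (32) to (35). Several things here are assumptions rather than details from the patent: every keyword is a single whitespace-free token, the operator joining two index search keywords is taken to be the first operator found between them in the expression, and the index limitation is attached to the `ti` field. The function names are hypothetical.

```python
import re

OPERATORS = {"and", "or", "not"}


def tokenize(expression):
    """Split an expression into parentheses and whitespace-free tokens."""
    return re.findall(r"[()]|[^\s()]+", expression)


def operator_between(tokens, left, right):
    """Return the first logical operator found between two keyword tokens."""
    start, end = tokens.index(left), tokens.index(right)
    for token in tokens[start + 1 : end]:
        if token.lower() in OPERATORS:
            return token.lower()
    raise ValueError(f"no operator between {left!r} and {right!r}")


def build_index_condition(expression, keywords, selected_words, field="ti"):
    # Step (32): keywords that contain a selected segmented word.
    index_keywords = [
        k for k in keywords if any(word in k for word in selected_words)
    ]
    if not index_keywords:
        return expression
    # Step (33): join them with the operators found in the expression.
    tokens = tokenize(expression)
    limitation = index_keywords[0]
    for previous, current in zip(index_keywords, index_keywords[1:]):
        limitation += f" {operator_between(tokens, previous, current)} {current}"
    # Steps (34) and (35): AND the limitation onto the original expression.
    return f"({expression}) and {field}=({limitation})"


expression = "ti=((111 or 222) not (333 or 444) and 555) and ic=(123 or 321)"
keywords = ["111", "222", "333", "444", "555"]
condition = build_index_condition(expression, keywords, ["2", "3"])
```

With the selection "2" and "3", `condition` equals the index patent search condition of the worked example; selecting a single word such as "5" takes the step (35) path and appends only that keyword.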
In summary, the search expression is combined with the search keywords that include the Chinese segmented words selected by the user to form the final index patent search condition. The retrieved patent documents are therefore more relevant, the user does not need to pick out patent documents manually, and the indexing efficiency is improved.
In the related art, users are not experts in every technical field and cannot have a comprehensive and accurate understanding of each one, so the search keywords entered by a user are often unprofessional, non-standard, or inaccurate. If patent documents are indexed with these search keywords, the indexing results are inaccurate and the indexing is less effective. To make the indexing of patent documents more effective, performing the indexing operation on the patent documents by means of the index information options that match the patent documents includes the following steps (1) and (2):
(1) determine the index information option that matches the Chinese segmented word selected by the user to be the index information option matching the patent documents;
(2) perform the indexing operation on the patent documents by means of the determined index information option.
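Steps (1) and (2) can be sketched as follows. The patent does not describe how the index information is stored, so this example assumes each document is a dictionary with an `index_labels` list, and it applies every option matched by a selected word; both choices, and the name `index_documents`, are hypothetical.

```python
def index_documents(documents, selected_words, options_by_word):
    """Attach the options matched by the selected words to each document."""
    # Step (1): options matching the user's selected segmented words.
    options = []
    for word in selected_words:
        for option in options_by_word.get(word, []):
            if option not in options:
                options.append(option)
    # Step (2): record the options on every retrieved document.
    for doc in documents:
        labels = doc.setdefault("index_labels", [])
        for option in options:
            if option not in labels:
                labels.append(option)
    return documents


retrieved = [{"number": "doc-1"}, {"number": "doc-2", "index_labels": ["valve"]}]
options_by_word = {"flow": ["flow valve", "flowmeter"], "valve": ["flow valve"]}
index_documents(retrieved, ["valve"], options_by_word)
```

The labels come from the curated option set rather than from the user's raw keywords, which is the normalization benefit the text describes.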
In summary, the patent documents are indexed with the index information options that match the Chinese segmented words selected by the user, rather than directly with the keywords the user typed. This markedly improves the standardization of the index information options, raises the accuracy and effectiveness of the indexing, and benefits later retrieval, reading, and analysis of the patent documents.
Embodiment 2
This embodiment provides a patent document indexing device for executing the above patent document indexing method.
Referring to the schematic structural diagram of a patent document indexing device provided by an embodiment of the present invention, shown in Fig. 3, the device includes:
An acquisition module 300, for obtaining a search expression, where the search expression includes search keywords;
A forming module 302, for forming an index patent search condition from the search keywords in the search expression;
A search module 304, for searching for patent documents that satisfy the index patent search condition;
An indexing module 306, for performing an indexing operation on the patent documents by means of the index information options that match the patent documents.
In sum, the patent document index device that the present embodiment is provided, is indexed by the keyword in search type Patent search condition, then search meets the patent document of index patent search condition, and the mark by being matched with patent document Draw information option to index patent document, with it is existing in the art can only in patent document patent information (such as applicant, Application number, patent name etc.) carry out index and compare, can to meet the patent document of index patent search condition by with these The index information option of patent document matching carries out indexing operation, right such that it is able to the difference according to index patent search condition Field and technology involved by patent document carry out the index of different aspect so that patent document index information can comprehensively, Intuitively, field and the technology involved by patent document are compactly embodied, allows the user of access patent document more intuitively Understand the field involved by patent document and technology, improve the experience of user.
In the related art, a user is not skilled in every technical field and cannot have a comprehensive and accurate understanding of each one, so the search keywords a user enters sometimes fail to fully describe the patent documents the user wants to index. To comprehensively retrieve the patent documents the user wants to index, referring to Fig. 4, the formation module 302 includes:
Acquisition submodule 3020, for obtaining the search keywords in the search type;
Extraction submodule 3022, for extracting Chinese word segments from the search keywords by means of a word segmentation algorithm;
Formation submodule 3024, for forming the index patent search condition according to the extracted Chinese word segments.
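The extraction step above can be illustrated as follows. The source does not name a particular word segmentation algorithm, so this sketch swaps in forward maximum matching against a small dictionary; the dictionary contents, the `max_len` parameter, and both function names are assumptions made for illustration.

```python
def segment(text, dictionary, max_len=4):
    """Forward maximum matching: at each position take the longest dictionary word.

    A single character is always accepted, so the loop always advances.
    """
    segments = []
    i = 0
    while i < len(text):
        # Try the longest candidate first and shrink until something matches.
        for size in range(min(max_len, len(text) - i), 0, -1):
            piece = text[i:i + size]
            if size == 1 or piece in dictionary:
                segments.append(piece)
                i += size
                break
    return segments


def extract_segments(search_keywords, dictionary):
    """Segment every search keyword and return the distinct segments in order."""
    seen = []
    for keyword in search_keywords:
        for seg in segment(keyword, dictionary):
            if seg not in seen:
                seen.append(seg)
    return seen


# Hypothetical dictionary; a real system would load a domain lexicon instead.
DICTIONARY = {"锂电池", "电池", "隔膜", "陶瓷"}
print(extract_segments(["锂电池隔膜", "陶瓷隔膜"], DICTIONARY))
```

Because the segments are shorter and more general than the keywords the user typed, a condition built from them can reach documents that the original keywords alone would miss.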
In summary, forming the index patent search condition from the extracted Chinese word segments, rather than directly from the search keywords the user entered, makes it possible to comprehensively retrieve the patent documents the user wants to index and improves indexing accuracy.
In the related art, if the multiple keywords a user enters are only weakly related to one another, the search results will include many patent documents that do not need to be indexed. The user must then manually screen the search results to determine which patent documents need indexing, which reduces the indexing efficiency. To improve the indexing efficiency of patent documents, the formation submodule 3024 includes:
Query unit, for querying a preset index information set for index information options that include the Chinese word segments;
Display unit, for determining the index information options that include a Chinese word segment to be the index information options matching that Chinese word segment, and for displaying the Chinese word segments and their matching index information options to the user;
Formation unit, for determining, when it detects that the user has selected a Chinese word segment, the index search keywords among the search keywords that include the selected Chinese word segment, and for forming the index patent search condition according to the index search keywords and the search type.
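The query and display units above amount to a containment lookup, sketched below. The preset index information set is modeled as a plain list of strings and "includes" is read as substring containment; both choices, and the function names, are assumptions for illustration.

```python
def match_options(segments, preset_options):
    """Query/display units: map each segment to the preset options that contain it."""
    return {seg: [opt for opt in preset_options if seg in opt] for seg in segments}


def find_index_search_keywords(selected_segment, search_keywords):
    """Formation unit, first step: keywords that contain the selected segment."""
    return [kw for kw in search_keywords if selected_segment in kw]


# Hypothetical preset index information set and user input.
PRESET_OPTIONS = ["锂电池正极材料", "锂电池隔膜", "太阳能电池"]
matches = match_options(["锂电池", "隔膜"], PRESET_OPTIONS)
for seg, options in matches.items():
    print(seg, "->", options)  # what would be shown to the user

print(find_index_search_keywords("隔膜", ["锂电池隔膜", "涂覆", "陶瓷隔膜"]))
```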
The formation unit is specifically configured to:
Detect the Chinese word segment selected by the user;
Query the search type using the Chinese word segment selected by the user, and determine the search keywords in the search type that include the selected Chinese word segment to be index search keywords;
When multiple index search keywords are determined, combine the multiple index search keywords with the logical operators that connect them in the search type to form an index limitation search expression;
Combine the search type and the index limitation search expression with a logical AND operator to form the index patent search condition.
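The steps above can be sketched as a small query builder. Several details are assumptions rather than statements from the source: the search type is modeled as a flat list of alternating keywords and operators, "the logical operator between index search keywords" is read as the operator immediately preceding each later index search keyword, and a missing operator defaults to `AND`.

```python
OPERATORS = {"AND", "OR", "NOT"}


def build_index_condition(tokens, selected_segment):
    """Combine the search type with an index limitation search expression.

    tokens: the search type as a flat list, e.g. ["A", "AND", "B", "OR", "C"].
    selected_segment: the Chinese word segment the user selected.
    """
    search_type = " ".join(tokens)
    picked = []           # (operator preceding the keyword, index search keyword)
    last_operator = None
    for token in tokens:
        if token in OPERATORS:
            last_operator = token
        elif selected_segment in token:
            picked.append((last_operator, token))
    if not picked:
        # No keyword contains the segment: fall back to the original search type.
        return search_type
    limitation = picked[0][1]
    for operator, keyword in picked[1:]:
        limitation += f" {operator or 'AND'} {keyword}"
    # Logical AND of the search type and the index limitation search expression.
    return f"({search_type}) AND ({limitation})"


print(build_index_condition(["锂电池隔膜", "AND", "涂覆", "OR", "陶瓷隔膜"], "隔膜"))
```

Wrapping both parts in parentheses keeps the added AND from changing how the operators inside the original search type are grouped.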
In summary, the search type is combined with the search keywords that include the Chinese word segment selected by the user to form the final index patent search condition. The retrieved patent documents are therefore more relevant, the user does not need to screen them manually, and the indexing efficiency of patent documents is improved.
In the related art, a user is not skilled in every technical field and cannot have a comprehensive and accurate understanding of each one, so the search keywords a user enters are often unprofessional, non-standard, or inaccurate. Indexing patent documents with such search keywords makes the indexing result inaccurate and degrades the indexing effect. To improve the indexing effect on patent documents, the index module 306 includes:
Determination submodule, for determining the index information options that match the Chinese word segment selected by the user to be the index information options that match the patent documents;
Index submodule, for performing the indexing operation on the patent documents using the determined index information options.
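A minimal sketch of the two submodules above follows, assuming documents are plain dictionaries and that the segment-to-option mapping has already been computed; the `index_options` key and the function name are hypothetical.

```python
def apply_index_options(documents, selected_segment, options_by_segment):
    """Determination + index submodules in one step.

    The options matched with the user-selected segment are treated as the
    options matched with the documents, then attached without duplicates.
    """
    options = options_by_segment.get(selected_segment, [])
    for doc in documents:
        tags = doc.setdefault("index_options", [])
        for option in options:
            if option not in tags:
                tags.append(option)
    return documents


# Illustrative inputs only.
options_by_segment = {"隔膜": ["锂电池隔膜"], "锂电池": ["锂电池正极材料", "锂电池隔膜"]}
docs = [{"doc_id": "D1"}, {"doc_id": "D2", "index_options": ["锂电池隔膜"]}]
print(apply_index_options(docs, "隔膜", options_by_segment))
```

The documents are tagged with the standardized option text rather than with whatever wording the user happened to type, which is the point the paragraph above makes.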
In summary, the patent document is indexed using the index information options that match the Chinese word segment selected by the user, rather than directly using the keywords the user entered. This markedly improves the standardization of the index information options, the accuracy of the indexing result, and the indexing effect on the patent document, which benefits later retrieval, reading, and analysis of patent documents.
The computer program product for performing the patent document indexing method provided by the embodiments of the present invention includes a computer-readable storage medium storing program code. The instructions in the program code can be used to perform the method described in the preceding method embodiment; for the specific implementation, refer to the method embodiment, which is not repeated here.
Those skilled in the art will clearly understand that, for convenience and brevity of description, the specific working processes of the system, device, and units described above may refer to the corresponding processes in the preceding method embodiment and are not repeated here.
In the several embodiments provided in this application, it should be understood that the disclosed system, device, and method may be implemented in other ways. The device embodiments described above are merely illustrative. For example, the division into units is only a division by logical function, and other divisions are possible in an actual implementation; multiple units or components may be combined or integrated into another system, and some features may be omitted or not performed. In addition, the couplings, direct couplings, or communication connections shown or discussed may be indirect couplings or communication connections through communication interfaces, devices, or units, and may be electrical, mechanical, or of other forms.
The units described as separate components may or may not be physically separate, and the components shown as units may or may not be physical units; that is, they may be located in one place or distributed across multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of this embodiment.
In addition, the functional units in the embodiments of the present invention may be integrated into one processing unit, each unit may exist physically on its own, or two or more units may be integrated into one unit.
If the functions are implemented as software functional units and sold or used as an independent product, they may be stored in a computer-readable storage medium. Based on this understanding, the technical solution of the present invention in essence, or the part that contributes to the prior art, or a part of the technical solution, may be embodied as a software product. The computer software product is stored in a storage medium and includes instructions that cause a computer device (which may be a personal computer, a server, a network device, or the like) to perform all or some of the steps of the methods in the embodiments of the present invention. The storage medium includes any medium that can store program code, such as a USB flash drive, a removable hard disk, read-only memory (ROM), random access memory (RAM), a magnetic disk, or an optical disc.
The above are only specific embodiments of the present invention, and the protection scope of the present invention is not limited to them. Any change or replacement that a person skilled in the art could readily conceive within the technical scope disclosed by the present invention shall fall within the protection scope of the present invention. Therefore, the protection scope of the present invention shall be determined by the protection scope of the claims.

Claims (10)

1. A patent document indexing method, characterized by comprising:
Obtaining a search type, where the search type includes search keywords;
Forming an index patent search condition from the search keywords in the search type;
Searching for patent documents that meet the index patent search condition;
Performing an indexing operation on the patent documents using the index information options that match the patent documents.
2. The method according to claim 1, characterized in that forming the index patent search condition from the search keywords in the search type comprises:
Obtaining the search keywords in the search type;
Extracting Chinese word segments from the search keywords by means of a word segmentation algorithm;
Forming the index patent search condition according to the extracted Chinese word segments.
3. The method according to claim 2, characterized in that forming the index patent search condition according to the extracted Chinese word segments comprises:
Querying a preset index information set for index information options that include the Chinese word segments;
Determining the index information options that include a Chinese word segment to be the index information options matching that Chinese word segment, and displaying the Chinese word segments and their matching index information options to the user;
When it is detected that the user has selected a Chinese word segment, determining from the search keywords the index search keywords that include the Chinese word segment selected by the user, and forming the index patent search condition according to the index search keywords and the search type.
4. The method according to claim 3, characterized in that, when it is detected that the user has selected a Chinese word segment, determining from the search keywords the index search keywords that include the Chinese word segment selected by the user, and forming the index patent search condition according to the index search keywords and the search type, comprises:
Detecting the Chinese word segment selected by the user;
Querying the search type using the Chinese word segment selected by the user, and determining the search keywords in the search type that include the selected Chinese word segment to be index search keywords;
When multiple index search keywords are determined, combining the multiple index search keywords with the logical operators that connect them in the search type to form an index limitation search expression;
Combining the search type and the index limitation search expression with a logical AND operator to form the index patent search condition.
5. The method according to claim 3, characterized in that performing the indexing operation on the patent documents using the index information options that match the patent documents comprises:
Determining the index information options that match the Chinese word segment selected by the user to be the index information options that match the patent documents;
Performing the indexing operation on the patent documents using the determined index information options.
6. A patent document indexing device, characterized by comprising:
An acquisition module, for obtaining a search type, where the search type includes search keywords;
A formation module, for forming an index patent search condition from the search keywords in the search type;
A search module, for searching for patent documents that meet the index patent search condition;
An index module, for performing an indexing operation on the patent documents using the index information options that match the patent documents.
7. The device according to claim 6, characterized in that the formation module comprises:
An acquisition submodule, for obtaining the search keywords in the search type;
An extraction submodule, for extracting Chinese word segments from the search keywords by means of a word segmentation algorithm;
A formation submodule, for forming the index patent search condition according to the extracted Chinese word segments.
8. The device according to claim 7, characterized in that the formation submodule comprises:
A query unit, for querying a preset index information set for index information options that include the Chinese word segments;
A display unit, for determining the index information options that include a Chinese word segment to be the index information options matching that Chinese word segment, and for displaying the Chinese word segments and their matching index information options to the user;
A formation unit, for determining, when it detects that the user has selected a Chinese word segment, the index search keywords among the search keywords that include the Chinese word segment selected by the user, and for forming the index patent search condition according to the index search keywords and the search type.
9. The device according to claim 8, characterized in that the formation unit is specifically configured to:
Detect the Chinese word segment selected by the user;
Query the search type using the Chinese word segment selected by the user, and determine the search keywords in the search type that include the selected Chinese word segment to be index search keywords;
When multiple index search keywords are determined, combine the multiple index search keywords with the logical operators that connect them in the search type to form an index limitation search expression;
Combine the search type and the index limitation search expression with a logical AND operator to form the index patent search condition.
10. The device according to claim 8, characterized in that the index module comprises:
A determination submodule, for determining the index information options that match the Chinese word segment selected by the user to be the index information options that match the patent documents;
An index submodule, for performing the indexing operation on the patent documents using the determined index information options.
CN201611157229.1A 2016-12-15 2016-12-15 Patent file indexing method and device Active CN106777103B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201611157229.1A CN106777103B (en) 2016-12-15 2016-12-15 Patent file indexing method and device

Publications (2)

Publication Number Publication Date
CN106777103A true CN106777103A (en) 2017-05-31
CN106777103B CN106777103B (en) 2020-07-07

Family

ID=58888248

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201611157229.1A Active CN106777103B (en) 2016-12-15 2016-12-15 Patent file indexing method and device

Country Status (1)

Country Link
CN (1) CN106777103B (en)

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1818906A (en) * 2006-03-10 2006-08-16 上海汉光知识产权数据科技有限公司 Indexing method of patent document
TW200915107A (en) * 2007-09-28 2009-04-01 Hon Hai Prec Ind Co Ltd System and method for creating index of patent full text search
CN101661469A (en) * 2008-09-09 2010-03-03 山东科技大学 System and method for indexing and retrieving keywords of academic documents
CN101692228A (en) * 2009-05-31 2010-04-07 上海汉光知识产权数据科技有限公司 Accurate and rapid automatic indexing method of patent documents
CN101692240A (en) * 2009-08-14 2010-04-07 北京中献电子技术开发中心 Rule-based method for patent abstract automatic extraction and keyword indexing
CN102929925A (en) * 2012-09-20 2013-02-13 百度在线网络技术(北京)有限公司 Search method and device based on browsing content
US20130086084A1 (en) * 2011-10-03 2013-04-04 Steven W. Lundberg Patent mapping
US20160004768A1 (en) * 2005-09-27 2016-01-07 Patentratings, Llc Method and system for probabilistically quantifying and visualizing relevance between two or more citationally or contextually related data objects

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
汤才祥: "关于关键词标引的讨论", 《HTTP://WENKU.BAIDU.COM/VIEW/F63B173D5901020206409C12.HTML》 *

Also Published As

Publication number Publication date
CN106777103B (en) 2020-07-07

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant