CN106777103A - A kind of patent document indexing method and device - Google Patents
A kind of patent document indexing method and device Download PDFInfo
- Publication number
- CN106777103A CN106777103A CN201611157229.1A CN201611157229A CN106777103A CN 106777103 A CN106777103 A CN 106777103A CN 201611157229 A CN201611157229 A CN 201611157229A CN 106777103 A CN106777103 A CN 106777103A
- Authority
- CN
- China
- Prior art keywords
- index
- search
- patent document
- word segmentation
- chinese word
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/38—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F2216/00—Indexing scheme relating to additional aspects of information retrieval not explicitly covered by G06F16/00 and subgroups
- G06F2216/11—Patent retrieval
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Library & Information Science (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
The invention provides a kind of patent document indexing method and device.Wherein, the method includes:Obtain search type;Index patent search condition is formed by the search keyword in search type;Search meets the patent document of index patent search condition;Indexing operation is carried out to patent document by the index information option matched with patent document.By patent document indexing method provided in an embodiment of the present invention and device, index information can be enable more intuitively, compactly to embody field and technology involved by patent document.
Description
Technical field
The present invention relates to technical field of data processing, in particular to a kind of patent document indexing method and device.
Background technology
At present, patent retrieval website is in order to preferably show the content involved by patent document, it is necessary to by patent document
Implicit, not expressing or not prominent enough information is summarized, and obtains the index information of patent document.With special in user search
During sharp file, the index information of the patent document is shown in the lump, user is more fully understood patent document and covered
Content.
In correlation technique, patent retrieval website in order to obtain the index information of patent document, be by user by every specially
Patent information (such as applicant, application number, patent name) typing patent retrieval website of sharp file, is carried out with to every patent
Indexing work.
In process of the present invention is realized, inventor has found that at least there are the following problems in the prior art:
Patent retrieval website, can only be to every above-mentioned patent letter of patent document during the index for carrying out patent document
Breath is indexed, and the content of index is less, and the content of index intuitively, can not be embodied compactly involved by patent document
Field and technology, the help brought to user is very limited.
The content of the invention
In view of this, the purpose of the embodiment of the present invention is to provide a kind of patent document indexing method and device, so that mark
Fuse breath can more directly perceived, compactly embody field and the technology involved by patent document.
In a first aspect, a kind of patent document indexing method is the embodiment of the invention provides, including:
Search type is obtained, the search type includes:Search keyword;
Index patent search condition is formed by the search keyword in the search type;
Search meets the patent document of the index patent search condition;
Indexing operation is carried out to the patent document by the index information option matched with the patent document.
With reference in a first aspect, the embodiment of the invention provides the first possible implementation method of first aspect, wherein:It is logical
The search keyword crossed in the search type forms index patent search condition, including:
Obtain the search keyword in the search type;
By segmentation methods, Chinese word segmentation is extracted from the search keyword;
The Chinese word segmentation according to extracting forms the index patent search condition.
With reference in a first aspect, the embodiment of the invention provides second possible implementation method of first aspect, wherein:Root
The index patent search condition is formed according to the Chinese word segmentation for extracting, including:
From default index information aggregate, the index information option including the Chinese word segmentation is inquired;
The index information choosing that the index information option for including the Chinese word segmentation is defined as being matched with the Chinese word segmentation
, and the index information option of the Chinese word segmentation and matching is showed into the user;
When user's selection Chinese word segmentation is monitored, determine to include user's selection from the search keyword
Chinese word segmentation index search keyword, and according to index search keyword and the search type, form the index
Patent search condition.
With reference in a first aspect, the embodiment of the invention provides the third possible implementation method of first aspect, wherein:When
When monitoring user's selection Chinese word segmentation, determine to include the Chinese word segmentation of user's selection from the search keyword
Index search keyword, and according to it is described index search keyword and the search type, formed it is described index patent search bar
Part, including:
Monitor the Chinese word segmentation of user's selection;
The Chinese word segmentation selected by the user inquires about the search type, and the search type is included into the user
The search keyword of the Chinese word segmentation of selection is defined as indexing search keyword;
When it is determined that there is multiple index search keywords, will be many in the multiple index search keyword and the search type
Logical operator between the individual index search keyword is combined, and forms index limitation search expression;
Accorded with by logic and operation and the search type and the index are limited into search expression combination, form index patent
Search condition.
With reference in a first aspect, the embodiment of the invention provides the 4th kind of possible implementation method of first aspect, wherein:It is logical
Crossing the index information option matched with the patent document carries out indexing operation to the patent document, including:
The index information option that the Chinese word segmentation selected with the user is matched is defined as the patent document matching
Index information option;
Indexing operation is carried out to the patent document by the index information option for determining.
Second aspect, the embodiment of the present invention also provides a kind of patent document index device, including:
Acquisition module, for obtaining search type, the search type includes:Search keyword;
Module is formed, for forming index patent search condition by the search keyword in the search type;
Search module, the patent document of the index patent search condition is met for searching for;
Index module, rower is entered for the index information option by being matched with the patent document to the patent document
Draw operation.
With reference to second aspect, the first possible implementation method of second aspect is the embodiment of the invention provides, wherein:Institute
State to form module, including:
Acquisition submodule, for obtaining the search keyword in the search type;
Extracting sub-module, for by segmentation methods, Chinese word segmentation being extracted from the search keyword;
Submodule is formed, for forming the index patent search condition according to the Chinese word segmentation for extracting.
With reference to second aspect, second possible implementation method of second aspect is the embodiment of the invention provides, wherein:Institute
State to form submodule, including:
Query unit, for from default index information aggregate, inquiring the index information including the Chinese word segmentation
Option;
Display unit, the index information option for will include the Chinese word segmentation is defined as being matched with the Chinese word segmentation
Index information option, and by the Chinese word segmentation and matching index information option show the user;
Unit is formed, for when user's selection Chinese word segmentation is monitored, bag being determined from the search keyword
The index search keyword of the Chinese word segmentation of user's selection is included, and according to index search keyword and the search
Formula, forms the index patent search condition.
With reference to second aspect, the third possible implementation method of second aspect is the embodiment of the invention provides, wherein:Institute
State to form unit, specifically for:
Monitor the Chinese word segmentation of user's selection;
The Chinese word segmentation selected by the user inquires about the search type, and the search type is included into the user
The search keyword of the Chinese word segmentation of selection is defined as indexing search keyword;
When it is determined that there is multiple index search keywords, will be many in the multiple index search keyword and the search type
Logical operator between the individual index search keyword is combined, and forms index limitation search expression;
Accorded with by logic and operation and the search type and the index are limited into search expression combination, form index patent
Search condition.
With reference to second aspect, the 4th kind of possible implementation method of second aspect is the embodiment of the invention provides, wherein:Institute
Index module is stated, including:
Determination sub-module, it is described for the index information option that the Chinese word segmentation selected with the user is matched to be defined as
The index information option of patent document matching;
Index submodule, indexing operation is carried out for the index information option by determining to the patent document.
Patent document indexing method provided in an embodiment of the present invention and device, are indexed by the keyword in search type
Patent search condition, then search meets the patent document of index patent search condition, and the mark by being matched with patent document
Draw information option to index patent document, with it is existing in the art can only in patent document patent information (such as applicant,
Application number, patent name etc.) carry out index and compare, can to meet the patent document of index patent search condition by with these
The index information option of patent document matching carries out indexing operation, right such that it is able to the difference according to index patent search condition
Field and technology involved by patent document carry out the index of different aspect so that patent document index information can comprehensively,
Intuitively, field and the technology involved by patent document are compactly embodied, allows the user of access patent document more intuitively
Understand the field involved by patent document and technology, improve the experience of user.
To enable the above objects, features and advantages of the present invention to become apparent, preferred embodiment cited below particularly, and coordinate
Appended accompanying drawing, is described in detail below.
Brief description of the drawings
Technical scheme in order to illustrate more clearly the embodiments of the present invention, below will be attached to what is used needed for embodiment
Figure is briefly described, it will be appreciated that the following drawings illustrate only certain embodiments of the present invention, thus be not construed as it is right
The restriction of scope, for those of ordinary skill in the art, on the premise of not paying creative work, can also be according to this
A little accompanying drawings obtain other related accompanying drawings.
Fig. 1 shows the applied environment schematic diagram of the patent document indexing method that the embodiment of the present invention is provided;
Fig. 2 shows the flow chart of the patent document indexing method that the embodiment of the present invention 1 is provided;
Fig. 3 shows that a kind of patent document that the embodiment of the present invention 2 is provided indexes the structural representation of device;
Fig. 4 shows that the structure of formation module in a kind of patent document index device that the embodiment of the present invention 2 is provided is shown
It is intended to.
Specific embodiment
To make the purpose, technical scheme and advantage of the embodiment of the present invention clearer, below in conjunction with the embodiment of the present invention
Middle accompanying drawing, is clearly and completely described to the technical scheme in the embodiment of the present invention, it is clear that described embodiment is only
It is a part of embodiment of the invention, rather than whole embodiments.The present invention generally described and illustrated in accompanying drawing herein is real
The component for applying example can be arranged and designed with a variety of configurations.Therefore, it is of the invention to what is provided in the accompanying drawings below
The detailed description of embodiment is not intended to limit the scope of claimed invention, but is merely representative of selected reality of the invention
Apply example.Based on embodiments of the invention, the institute that those skilled in the art are obtained on the premise of creative work is not made
There is other embodiment, belong to the scope of protection of the invention.
Fig. 1 shows and a kind of can be applied in the embodiment of the present invention perform the knot of the server of patent document indexing method
Structure block diagram.As shown in figure 1, server 100 includes:Memory 101, processor 102 and mixed-media network modules mixed-media 103.
Memory 101 can be used to store software program and module, the patent document index side such as in the embodiment of the present invention
Method and the corresponding programmed instruction/module of device, processor 102 by run software program of the storage in memory 201 and
Module, so as to perform various function application and data processing, that is, realizes the patent document indexing method in the embodiment of the present invention.
Memory 101 may include high speed random access memory, may also include nonvolatile memory, such as one or more magnetic storage dress
Put, flash memory or other non-volatile solid state memories.Further, above-mentioned software program and module may also include:Operation
System 121 and service module 122.Wherein operating system 121, for example, can be LINUX, UNIX, WINDOWS, and it may include respectively
The component software for management system task (such as memory management, storage device control, power management etc.) and/or driving are planted,
And can mutually be communicated with various hardware or component software, so as to provide the running environment of other software component.Service module 122 is transported
Row is monitored come the request of automatic network on the basis of operating system 121 by the network service of operating system 121, according to please
The corresponding data processing of completion is asked, and returns to result to client.That is, service module 122 is used for client
Network service is provided.
Mixed-media network modules mixed-media 103 is used to receive and send network signal.Above-mentioned network signal may include wireless signal or have
Line signal.
It is appreciated that the structure shown in Fig. 1 is only to illustrate, server 100 may also include more more than shown in Fig. 1 or more
Few component, or with the configuration different from shown in Fig. 1.Each component shown in Fig. 1 can use hardware, software or its group
Close and realize.In addition, the server in the embodiment of the present invention can also include the server of multiple specific difference in functionalitys.
At present, patent retrieval website is to obtain the index information of patent document, is by every patent document by user
Patent information (such as applicant, application number, patent name) typing patent retrieval website, carry out index work with to every patent
Make.Patent retrieval website can only be entered during the index for carrying out patent document to the above-mentioned patent information of every patent document
Rower draws, and the content of index is less, and the content for indexing can not intuitively, compactly embody the neck involved by patent document
Domain and technology, the help brought to user are very limited.Based on this, the application provide a kind of patent document indexing method and dress
Put.
Embodiment 1
A kind of patent document indexing method is present embodiments provided, the executive agent of the present embodiment is server.It is determined that
After index patent search condition, may search for meeting the patent document of index patent search condition, and by with patent document
The index information option matched somebody with somebody is indexed to patent document, so that the index information of patent document can comprehensively, intuitively, compactly
Embody the field involved by patent document and technology.
Provided in an embodiment of the present invention a kind of patent document indexing method flow chart shown in Figure 2, the method includes
Following steps:
Step 200, acquisition search type.
Wherein, above-mentioned search type includes:Search keyword.
Above-mentioned search type, is exactly patent document search expression, is user input, wants to carry out for searching out user
The patent document of index.Such as:Search type can be:
Ti=(load-sensitive hydraulic valve or heavy dutys flow valve) and ic=a61k35/78, search type explanation user
It is intended to include load-sensitive hydraulic valve and heavy duty flow valve to patent name (ti) under IPC (ic) classification numbers a61k35/78
Patent document indexed.
Above-mentioned user, refers to just the indexer indexed to patent document.
Above-mentioned search keyword, is exactly the word of appearance in search type, patent name, summary in patent document, right
The word being made up of Chinese word segmentation occurs in claim and specification.Therefore, it can the patent name to patent document, pluck
The word, to occur in claims and specification is scanned for, and searches out the patent document for wanting index.
After the search type including keyword is got by above-mentioned steps 200, following step 202 can be continued and form mark
Draw patent search condition.
Step 202, index patent search condition is formed by search keyword in above-mentioned search type.
Above-mentioned index patent search condition, is combined by above-mentioned search type and is obtained according to after processing above-mentioned search keyword
The search expression that the index limitation search expression for arriving is formed, the patent document for needing index to user is scanned for.
Step 204, search meet the patent document of above-mentioned index patent search condition.
Step 206, index behaviour is carried out to above-mentioned patent document by the index information option matched with above-mentioned patent document
Make.
Above-mentioned index information option, meets involved by the patent document of above-mentioned index patent search condition for embodying
Field and technology.
In sum, the patent document indexing method that the present embodiment is provided, is indexed by the keyword in search type
Patent search condition, then search meets the patent document of index patent search condition, and the mark by being matched with patent document
Draw information option to index patent document, with it is existing in the art can only in patent document patent information (such as applicant,
Application number, patent name etc.) carry out index and compare, can to meet the patent document of index patent search condition by with these
The index information option of patent document matching carries out indexing operation, right such that it is able to the difference according to index patent search condition
Field and technology involved by patent document carry out the index of different aspect so that patent document index information can comprehensively,
Intuitively, field and the technology involved by patent document are compactly embodied, allows the user of access patent document more intuitively
Understand the field involved by patent document and technology, improve the experience of user.
In correlation technique, user is not each those skilled in the art, can not be equal to each technical field
There is comprehensive and accurate understanding and be familiar with, so the search keyword of user input is unable to the special of comprehensive representation its desired index sometimes
Sharp file.It is crucial by the search in above-mentioned search type in order to comprehensively retrieve the patent document that user wants index
Morphology is comprised the following steps (1) to step (3) into index patent search condition:
(1) search keyword in above-mentioned search type is obtained;
(2) by segmentation methods, Chinese word segmentation is extracted from above-mentioned search keyword;
(3) above-mentioned index patent search condition is formed according to the above-mentioned Chinese word segmentation for extracting.
In above-mentioned steps (2), above-mentioned Chinese word segmentation, the multiple obtained after exactly being split to above-mentioned search keyword
Independent word.In one embodiment, in above-mentioned search type search keyword " load-sensitive hydraulic valve and heavy duty
Flow valve ", then can obtain the independent word of following multiple by segmentation methods:" load ", " sensitivity ", " hydraulic pressure ", " heavy burden
Load ", " flow " and " valve ".
And, the segmentation methods described in above-mentioned steps (2) any can be entered using existing to search keyword
The Chinese Word Automatic Segmentation that row splits, no longer repeats one by one here.
In sum, index patent search bar is formed rather than the search keyword of user input by extracting Chinese word segmentation
Part, can comprehensively retrieve the patent document that user wants index, improve the accuracy of index.
In correlation technique, if multiple keyword relevancies of user input are not high, then can include Search Results
Many patent documents that need not be indexed, also needing user's artificial screening from Search Results to be just determined needs the special of index
Sharp file, reduces the index efficiency of patent document.In order to improve the index efficiency of patent document, according to the above-mentioned Chinese for extracting
Participle forms above-mentioned index patent search condition, comprises the following steps (1) to step (3):
(1) from default index information aggregate, the index information option including above-mentioned Chinese word segmentation is inquired;
(2) will include that the index information option of above-mentioned Chinese word segmentation is defined as the index information matched with above-mentioned Chinese word segmentation
Option, and the index information option of above-mentioned Chinese word segmentation and matching is showed into above-mentioned user;
(3) when above-mentioned user's selection Chinese word segmentation is monitored, determine to include above-mentioned user from above-mentioned search keyword
The index search keyword of the Chinese word segmentation of selection, and according to above-mentioned index search keyword and above-mentioned search type, formed above-mentioned
Index patent search condition.
In above-mentioned steps (1), if the Chinese word segmentation after splitting is " flow ", then can be from default index information
Inquired in set including Chinese word segmentation " flow " as:Flow valve, flowmeter etc. index information option.
Above-mentioned steps (3) include step in detail below (31) to step (35):
(31) Chinese word segmentation of above-mentioned user's selection is monitored;
(32) Chinese word segmentation selected by above-mentioned user inquires about above-mentioned search type, and above-mentioned search type is included above-mentioned
The search keyword of the Chinese word segmentation of user's selection is defined as indexing search keyword;
(33) when it is determined that there is multiple index search keywords, by multiple above-mentioned index search keywords and above-mentioned search type
Logical operator between middle multiple above-mentioned index search keywords is combined, and forms index limitation search expression;
(34) accorded with by logic and operation and above-mentioned search type and above-mentioned index are limited into search expression combination, form index
Patent search condition;
(35) when it is determined that there is only one of which to index search keyword, accord with above-mentioned search type and be somebody's turn to do by logic and operation
Index search keyword is combined, and forms index patent search condition.
Alternatively, the flow performed by above-mentioned steps (32) to step (34) is described by implementation below,
Assuming that search type is:Ti=((one one by one or 22 2) not (3 33 or 44 4) and 55 5) and ic=(123
Or 321), then server is from the search keyword of the search type " one one by one, 222,333,444 and 555 "
The Chinese word segmentation of middle determination is respectively:First, two, three, four and five, if the Chinese word segmentation of user's selection is two and three, then clothes
Business device will be defined as the corresponding search keyword " 222 and 333 " of Chinese word segmentation two and three indexing search keyword,
And by the logical NOT operator in above-mentioned search type between search keyword " 222 " and " 333 " by search keyword
" 222 " and " 333 " combine, and the index for forming above-mentioned search type limits search expression " 222 not 333 ".That
, index limitation search expression " 222 not 333 " that will be obtained is by logic and operation symbol and above-mentioned search type knot
Close, obtain indexing patent search condition:
(ti=((one one by one or 22 2) not (3 33 or 44 4) and 55 5) and ic=(123 or
321)) and ti=(2 22 not 33 3).
In sum, search type is combined to form into final mark with the search keyword of the Chinese word segmentation for including user's selection
Draw patent search condition so that the patent document relevance that retrieval is obtained is higher, and patent document is manually determined without user, improve
The index efficiency of patent document.
In correlation technique, user is not each those skilled in the art, can not be equal to each technical field
Have comprehensive and accurate understanding and be familiar with, thus the search keyword of user input also usually occur it is unprofessional, lack of standardization and not
Accurate situation, if indexed to patent document by these search keywords, can cause index result inaccurate, reduce
To the index effect of patent document.In order to improve the index effect of patent document, by the mark matched with above-mentioned patent document
Draw information option carries out indexing operation to above-mentioned patent document, comprises the following steps (1) to step (2):
(1) the index information option that the Chinese word segmentation selected with above-mentioned user is matched is defined as above-mentioned patent document matching
Index information option;
(2) the above-mentioned index information option by determining carries out indexing operation to above-mentioned patent document.
In sum, the index information option by being matched with the Chinese word segmentation that user selects is carried out to above-mentioned patent document
Indexing operation, is not through the keyword that user uses and directly indexes, and can be obviously improved the normalization of index information option, carries
The accuracy and the index effect to patent document of height index result, beneficial to the later stage is to patent document retrieval, reading and analyzes
The carrying out of work.
Embodiment 2
The present embodiment provides a kind of patent document index device, for performing above-mentioned patent document indexing method.
Provided in an embodiment of the present invention a kind of patent document shown in Figure 3 indexes the structural representation of device, including:
Acquisition module 300, for obtaining search type, above-mentioned search type includes:Search keyword;
Module 302 is formed, for forming index patent search condition by the search keyword in above-mentioned search type;
Search module 304, the patent document of above-mentioned index patent search condition is met for searching for;
Index module 306, enters for the index information option by being matched with above-mentioned patent document to above-mentioned patent document
Row indexing operation.
In sum, the patent document index device that the present embodiment is provided, is indexed by the keyword in search type
Patent search condition, then search meets the patent document of index patent search condition, and the mark by being matched with patent document
Draw information option to index patent document, with it is existing in the art can only in patent document patent information (such as applicant,
Application number, patent name etc.) carry out index and compare, can to meet the patent document of index patent search condition by with these
The index information option of patent document matching carries out indexing operation, right such that it is able to the difference according to index patent search condition
Field and technology involved by patent document carry out the index of different aspect so that patent document index information can comprehensively,
Intuitively, field and the technology involved by patent document are compactly embodied, allows the user of access patent document more intuitively
Understand the field involved by patent document and technology, improve the experience of user.
In correlation technique, user is not each those skilled in the art, can not be equal to each technical field
There is comprehensive and accurate understanding and be familiar with, so the search keyword of user input is unable to the special of comprehensive representation its desired index sometimes
Sharp file.In order to comprehensively retrieve the patent document that user wants index, referring to Fig. 4, above-mentioned formation module 302, bag
Include:
Acquisition submodule 3020, for obtaining the search keyword in above-mentioned search type;
Extracting sub-module 3022, for by segmentation methods, Chinese word segmentation being extracted from above-mentioned search keyword;
Submodule 3024 is formed, for forming above-mentioned index patent search condition according to the above-mentioned Chinese word segmentation for extracting.
In sum, index patent search bar is formed rather than the search keyword of user input by extracting Chinese word segmentation
Part, can comprehensively retrieve the patent document that user wants index, improve the accuracy of index.
In correlation technique, if multiple keyword relevancies of user input are not high, then can include Search Results
Many patent documents that need not be indexed, also needing user's artificial screening from Search Results to be just determined needs the special of index
Sharp file, reduces the index efficiency of patent document.In order to improve the index efficiency of patent document, above-mentioned formation submodule
3024, including:
Query unit, for from default index information aggregate, inquiring the index information including above-mentioned Chinese word segmentation
Option;
Display unit, the index information option for will include above-mentioned Chinese word segmentation is defined as being matched with above-mentioned Chinese word segmentation
Index information option, and by above-mentioned Chinese word segmentation and matching index information option show above-mentioned user;
Unit is formed, for when above-mentioned user's selection Chinese word segmentation is monitored, bag being determined from above-mentioned search keyword
The index search keyword of the Chinese word segmentation of above-mentioned user's selection is included, and according to above-mentioned index search keyword and above-mentioned search
Formula, forms above-mentioned index patent search condition.
Above-mentioned formation unit, specifically for:
Monitor the Chinese word segmentation of above-mentioned user's selection;
The Chinese word segmentation selected by above-mentioned user inquires about above-mentioned search type, and above-mentioned search type is included into above-mentioned user
The search keyword of the Chinese word segmentation of selection is defined as indexing search keyword;
When it is determined that there is multiple index search keywords, will be many in multiple above-mentioned index search keywords and above-mentioned search type
Logical operator between individual above-mentioned index search keyword is combined, and forms index limitation search expression;
Accorded with by logic and operation and above-mentioned search type and above-mentioned index are limited into search expression combination, form index patent
Search condition.
In sum, search type is combined to form into final mark with the search keyword of the Chinese word segmentation for including user's selection
Draw patent search condition so that the patent document relevance that retrieval is obtained is higher, and patent document is manually determined without user, improve
The index efficiency of patent document.
In correlation technique, user is not each those skilled in the art, can not be equal to each technical field
Have comprehensive and accurate understanding and be familiar with, thus the search keyword of user input also usually occur it is unprofessional, lack of standardization and not
Accurate situation, if indexed to patent document by these search keywords, can cause index result inaccurate, reduce
To the index effect of patent document.In order to improve the index effect of patent document, above-mentioned index module 306, including:
Determination sub-module, it is above-mentioned for the index information option that the Chinese word segmentation selected with above-mentioned user is matched to be defined as
The index information option of patent document matching;
Index submodule, indexing operation is carried out for the above-mentioned index information option by determining to above-mentioned patent document.
In sum, the index information option by being matched with the Chinese word segmentation that user selects is carried out to above-mentioned patent document
Indexing operation, is not through the keyword that user uses and directly indexes, and can be obviously improved the normalization of index information option, carries
The accuracy and the index effect to patent document of height index result, beneficial to the later stage is to patent document retrieval, reading and analyzes
The carrying out of work.
What the embodiment of the present invention was provided carries out the computer program product of patent document indexing method, including stores journey
The computer-readable recording medium of sequence code, the instruction that said procedure code includes can be used to perform in previous methods embodiment
The method stated, implements and can be found in embodiment of the method, will not be repeated here.
It is apparent to those skilled in the art that, for convenience and simplicity of description, the system of foregoing description,
The specific work process of device and unit, may be referred to the corresponding process in preceding method embodiment, will not be repeated here.
In several embodiments provided herein, it should be understood that disclosed system, apparatus and method, can be with
Realize by another way.Device embodiment described above is only schematical, for example, the division of said units,
It is only a kind of division of logic function, there can be other dividing mode when actually realizing, but for example, multiple units or component can
To combine or be desirably integrated into another system, or some features can be ignored, or not perform.It is another, it is shown or beg for
The coupling each other of opinion or direct-coupling or communication connection can be by some communication interfaces, device or unit it is indirect
Coupling is communicated to connect, and can be electrical, mechanical or other forms.
The above-mentioned unit that is illustrated as separating component can be or may not be it is physically separate, it is aobvious as unit
The part for showing can be or may not be physical location, you can with positioned at a place, or can also be distributed to multiple
On NE.Some or all of unit therein can be according to the actual needs selected to realize the mesh of this embodiment scheme
's.
In addition, during each functional unit in each embodiment of the invention can be integrated in a processing unit, it is also possible to
It is that unit is individually physically present, it is also possible to which two or more units are integrated in a unit.
If above-mentioned functions are to realize in the form of SFU software functional unit and as independent production marketing or when using, can be with
Storage is in a computer read/write memory medium.Based on such understanding, technical scheme is substantially in other words
The part contributed to prior art or the part of the technical scheme can be embodied in the form of software product, the meter
Calculation machine software product is stored in a storage medium, including some instructions are used to so that a computer equipment (can be individual
People's computer, server, or network equipment etc.) perform all or part of step of each embodiment above method of the invention.
And foregoing storage medium includes:USB flash disk, mobile hard disk, read-only storage (ROM, Read-Only Memory), arbitrary access are deposited
Reservoir (RAM, Random Access Memory), magnetic disc or CD etc. are various can be with the medium of store program codes.
Above-mentioned, specific embodiment only of the invention, but protection scope of the present invention above is not limited thereto, any
Those familiar with the art the invention discloses technical scope in, change or replacement can be readily occurred in, should all contain
Cover within protection scope of the present invention.Therefore, protection scope of the present invention is answered and above-mentioned is defined by scope of the claims.
Claims (10)
1. a kind of patent document indexing method, it is characterised in that including:
Search type is obtained, the search type includes:Search keyword;
Index patent search condition is formed by the search keyword in the search type;
Search meets the patent document of the index patent search condition;
Indexing operation is carried out to the patent document by the index information option matched with the patent document.
2. method according to claim 1, it is characterised in that index patent is formed by the search keyword in the search type
Search condition, including:
Obtain the search keyword in the search type;
By segmentation methods, Chinese word segmentation is extracted from the search keyword;
The Chinese word segmentation according to extracting forms the index patent search condition.
3. method according to claim 2, it is characterised in that it is special that the Chinese word segmentation according to extracting forms the index
Sharp search condition, including:
From default index information aggregate, the index information option including the Chinese word segmentation is inquired;
The index information option for including the Chinese word segmentation is defined as the index information option matched with the Chinese word segmentation, and
The index information option of the Chinese word segmentation and matching is showed into user;
When user's selection Chinese word segmentation is monitored, in determining to include that the user selects from the search keyword
The index search keyword of literary participle, and according to index search keyword and the search type, form the index patent
Search condition.
4. method according to claim 3, it is characterised in that when user's selection Chinese word segmentation is monitored, searched from described
Determine to include the index search keyword of the Chinese word segmentation that the user selects in rope keyword, and closed according to the index search
Keyword and the search type, form the index patent search condition, including:
Monitor the Chinese word segmentation of user's selection;
The Chinese word segmentation selected by the user inquires about the search type, and the search type is included into user's selection
Chinese word segmentation search keyword be defined as index search keyword;
When it is determined that there is multiple index search keywords, by multiple institutes in the multiple index search keyword and the search type
The logical operator stated between index search keyword is combined, and forms index limitation search expression;
Accorded with by logic and operation and the search type and the index are limited into search expression combination, form index patent search
Condition.
5. method according to claim 3, it is characterised in that by the index information option that is matched with the patent document to institute
Stating patent document carries out indexing operation, including:
The index information option that the Chinese word segmentation selected with the user is matched is defined as the index of the patent document matching
Information option;
Indexing operation is carried out to the patent document by the index information option for determining.
6. a kind of patent document indexes device, it is characterised in that including:
Acquisition module, for obtaining search type, the search type includes:Search keyword;
Module is formed, for forming index patent search condition by the search keyword in the search type;
Search module, the patent document of the index patent search condition is met for searching for;
Index module, index behaviour is carried out for the index information option by being matched with the patent document to the patent document
Make.
7. device according to claim 6, it is characterised in that the formation module, including:
Acquisition submodule, for obtaining the search keyword in the search type;
Extracting sub-module, for by segmentation methods, Chinese word segmentation being extracted from the search keyword;
Submodule is formed, for forming the index patent search condition according to the Chinese word segmentation for extracting.
8. device according to claim 7, it is characterised in that the formation submodule, including:
Query unit, for from default index information aggregate, inquiring the index information option including the Chinese word segmentation;
Display unit, the index information option for will include the Chinese word segmentation is defined as the mark matched with the Chinese word segmentation
Draw information option, and the index information option of the Chinese word segmentation and matching is showed into user;
Unit is formed, for when user's selection Chinese word segmentation is monitored, determining to include institute from the search keyword
The index search keyword of the Chinese word segmentation of user's selection is stated, and according to index search keyword and the search type, shape
Into the index patent search condition.
9. device according to claim 8, it is characterised in that the formation unit, specifically for:
Monitor the Chinese word segmentation of user's selection;
The Chinese word segmentation selected by the user inquires about the search type, and the search type is included into user's selection
Chinese word segmentation search keyword be defined as index search keyword;
When it is determined that there is multiple index search keywords, by multiple institutes in the multiple index search keyword and the search type
The logical operator stated between index search keyword is combined, and forms index limitation search expression;
Accorded with by logic and operation and the search type and the index are limited into search expression combination, form index patent search
Condition.
10. device according to claim 8, it is characterised in that the index module, including:
Determination sub-module, for the index information option that the Chinese word segmentation selected with the user is matched to be defined as into the patent
The index information option of file matching;
Index submodule, indexing operation is carried out for the index information option by determining to the patent document.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201611157229.1A CN106777103B (en) | 2016-12-15 | 2016-12-15 | Patent file indexing method and device |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201611157229.1A CN106777103B (en) | 2016-12-15 | 2016-12-15 | Patent file indexing method and device |
Publications (2)
Publication Number | Publication Date |
---|---|
CN106777103A true CN106777103A (en) | 2017-05-31 |
CN106777103B CN106777103B (en) | 2020-07-07 |
Family
ID=58888248
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201611157229.1A Active CN106777103B (en) | 2016-12-15 | 2016-12-15 | Patent file indexing method and device |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN106777103B (en) |
Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1818906A (en) * | 2006-03-10 | 2006-08-16 | 上海汉光知识产权数据科技有限公司 | Indexing method of patent document |
TW200915107A (en) * | 2007-09-28 | 2009-04-01 | Hon Hai Prec Ind Co Ltd | System and method for creating index of patent full text search |
CN101661469A (en) * | 2008-09-09 | 2010-03-03 | 山东科技大学 | System and method for indexing and retrieving keywords of academic documents |
CN101692228A (en) * | 2009-05-31 | 2010-04-07 | 上海汉光知识产权数据科技有限公司 | Accurate and rapid automatic indexing method of patent documents |
CN101692240A (en) * | 2009-08-14 | 2010-04-07 | 北京中献电子技术开发中心 | Rule-based method for patent abstract automatic extraction and keyword indexing |
CN102929925A (en) * | 2012-09-20 | 2013-02-13 | 百度在线网络技术(北京)有限公司 | Search method and device based on browsing content |
US20130086084A1 (en) * | 2011-10-03 | 2013-04-04 | Steven W. Lundberg | Patent mapping |
US20160004768A1 (en) * | 2005-09-27 | 2016-01-07 | Patentratings, Llc | Method and system for probabilistically quantifying and visualizing relevance between two or more citationally or contextually related data objects |
-
2016
- 2016-12-15 CN CN201611157229.1A patent/CN106777103B/en active Active
Patent Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20160004768A1 (en) * | 2005-09-27 | 2016-01-07 | Patentratings, Llc | Method and system for probabilistically quantifying and visualizing relevance between two or more citationally or contextually related data objects |
CN1818906A (en) * | 2006-03-10 | 2006-08-16 | 上海汉光知识产权数据科技有限公司 | Indexing method of patent document |
TW200915107A (en) * | 2007-09-28 | 2009-04-01 | Hon Hai Prec Ind Co Ltd | System and method for creating index of patent full text search |
CN101661469A (en) * | 2008-09-09 | 2010-03-03 | 山东科技大学 | System and method for indexing and retrieving keywords of academic documents |
CN101692228A (en) * | 2009-05-31 | 2010-04-07 | 上海汉光知识产权数据科技有限公司 | Accurate and rapid automatic indexing method of patent documents |
CN101692240A (en) * | 2009-08-14 | 2010-04-07 | 北京中献电子技术开发中心 | Rule-based method for patent abstract automatic extraction and keyword indexing |
US20130086084A1 (en) * | 2011-10-03 | 2013-04-04 | Steven W. Lundberg | Patent mapping |
CN102929925A (en) * | 2012-09-20 | 2013-02-13 | 百度在线网络技术(北京)有限公司 | Search method and device based on browsing content |
Non-Patent Citations (1)
Title |
---|
汤才祥: "关于关键词标引的讨论", 《HTTP://WENKU.BAIDU.COM/VIEW/F63B173D5901020206409C12.HTML》 * |
Also Published As
Publication number | Publication date |
---|---|
CN106777103B (en) | 2020-07-07 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN108804641B (en) | Text similarity calculation method, device, equipment and storage medium | |
RU2671044C1 (en) | Method and device for data storage | |
US20200089699A1 (en) | Leveraging Concepts with Information Retrieval Techniques and Knowledge Bases | |
Trippe | Patinformatics: Tasks to tools | |
CN109634698B (en) | Menu display method and device, computer equipment and storage medium | |
CN106339756A (en) | Training data generation method and device and searching method and device | |
EP3539018A1 (en) | Apparatus and method for semantic search | |
CN109033105A (en) | The method and apparatus for obtaining judgement document's focus | |
WO2009009192A2 (en) | Adaptive archive data management | |
CN111125086B (en) | Method, device, storage medium and processor for acquiring data resources | |
CN108875065B (en) | Indonesia news webpage recommendation method based on content | |
CN109213921A (en) | A kind of searching method and device of merchandise news | |
CN110263021B (en) | Theme library generation method based on personalized label system | |
CN104484392A (en) | Method and device for generating database query statement | |
JP6223721B2 (en) | Formation of optimal comparison criteria within associative memory | |
CN104598485B (en) | The method and apparatus for handling database table | |
US20080158160A1 (en) | Central storage for data entry processing | |
CN104778202B (en) | The analysis method and system of event evolutionary process based on keyword | |
JP7256357B2 (en) | Information processing device, control method, program | |
CN106777103A (en) | A kind of patent document indexing method and device | |
CN114022086B (en) | Purchasing method, device, equipment and storage medium based on BOM identification | |
CN105095225A (en) | Method and apparatus for obtaining file data | |
CN106708793B (en) | Annotate footnote recognition methods, device and electronic equipment | |
CN114780589A (en) | Multi-table connection query method, device, equipment and storage medium | |
CN114385436A (en) | Server grouping method and device, electronic equipment and storage medium |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |