CN1818906A - Indexing method of patent document - Google Patents
Indexing method of patent document Download PDFInfo
- Publication number
- CN1818906A CN1818906A CN 200610024618 CN200610024618A CN1818906A CN 1818906 A CN1818906 A CN 1818906A CN 200610024618 CN200610024618 CN 200610024618 CN 200610024618 A CN200610024618 A CN 200610024618A CN 1818906 A CN1818906 A CN 1818906A
- Authority
- CN
- China
- Prior art keywords
- patent documentation
- word
- technical
- index
- classification
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Images
Landscapes
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
A patent documentation index method, the steps are: providing a patent documentation database of correlating technologic theme; set up the technologic class of the theme and corresponding key word/words; the compartmentalization of the technologic class is according to the included content of a common patent documentation; and also classify by the technologic means. It can classify more of the class of the technologic means if it is necessary; index the part or the whole patent documentation, and set up the corresponding relation between every documentation and its key word/words. The index process may for the part of the patent documentation, for the extra part is automatic index.
Description
Technical field
The present invention relates to a kind of data indexing method, particularly the indexing method of patent documentation.
Background technology
The retrieval of present patent documentation and the important process that index has become numerous science-and-technology enterprises.The patent documentation and the corresponding access entry thereof that utilize various countries Patent Office or other intellecture property tissue to provide can find relevant patent documentation easily.And these documents are read and index, to make things convenient for other people finder and reading.
But because the patent documentation enormous amount, under the common technical theme, just may exist hundreds and thousands of, even up to ten thousand pieces of relevant patent documentations.If an industry or a technical field are carried out the patent strategy analysis, the patent documentation quantity that may find so will be tens thousand of or hundreds thousand of pieces of writing.For example, human gene just comprises the 300000 pieces of patent documentations of having an appointment, piece patent documentation surplus low-voltage electrical apparatus also has 80,000.After finding these patents, it is read with hand indexing need drop into a large amount of man power and materials.Traditional treatment method has two kinds, and a kind of is that hand indexing is read and done to all documents, and this mode wastes time and energy, and inefficiency.Another kind method is, dwindles range of search, with the quantity control of patent documentation within limits, for example is controlled in 10,000 pieces.Read one by one then and index, this method may cause some to have the patent documentation of important value to be rejected, and has increased the risk of invading other people patent.
In addition,, make when doing index ninety-nine times out of a hundred, all need hand filling keyword/word, technical classification or other patent information owing to lack effective index instrument.This makes indexing work itself also become a heavy task.
How accelerating the reading and the index speed of patent documentation, reduce the man power and material's that works in this respect input, is the technical problem to be solved in the present invention.
Summary of the invention
The objective of the invention is to, a kind of method of carrying out the patent documentation index fast is provided.For reaching above-mentioned purpose, the present invention adopts following technical scheme:
A kind of indexing method of patent documentation may further comprise the steps:
1) provides a patent literature database for one technical topic;
2) set up the technical classification of this technical theme and the crucial character/word of correspondence thereof; Technical classification is to divide according to the content that one piece of patent documentation should comprise usually herein, and it can comprise technological means, technical characterstic, technology effect, application etc.; In addition, also can classify to technological means.If necessary can also further the classification to the classification of technological means;
3) select part or all of patent documentation to carry out index,, set up the corresponding relation of itself and crucial character/word and technical classification each piece patent documentation; In this step, according to the patent documentation of index, correction technique classification or technical classification correspondence key word,
Wherein, above-mentioned patent documentation data storehouse, technical classification, crucial character/word etc. all are stored in the computing machine.By input equipment input informations such as mouse or keyboards, related computer program responds this incident, thereby sets up the corresponding relation of patent documentation and crucial character/word and technical classification, makes the indexer need not the too much out of Memory of typing, reduce workload, simplified the patent indexing process.
As a kind of improvement of the present invention, above-mentioned index process can an index partial monopoly document, for not index part, adopts following method to finish the index process automatically:
The crucial character/word and the technical classification that obtain according to the patent documentation of index, retrieve all or part of patent documentation, according to result for retrieval, set up the corresponding relation of patent documentation and crucial character/word and technical classification, finish the indexing work of the patent documentation of index not.Can not need like this index is not partly read and hand indexing, reduce workload.
As another improvement of the present invention, after the index of finishing a small amount of patent documentation, the crucial character/word and the technical classification that just begin to utilize the patent documentation of index to obtain, retrieve the patent documentation of not index, set up the not corresponding relation of index patent documentation and crucial character/word and technical classification according to result for retrieval.At this moment, may there be following two kinds of situations:
First kind of situation is, the partial monopoly document is not by index, need this moment it is replenished index, revise crucial character/word or technical classification, according to revising crucial character/word in back or technical classification, retrieve all or part of patent documentation, set up the corresponding relation of patent documentation and crucial character/word and technical classification according to result for retrieval.
Another kind of situation is that some patent documentation fails to occur in a plurality of technical classifications.Also may exist following two kinds of situations this moment, and the one, the technical theme of this patent documentation and retrieval is uncorrelated, at this moment only needs this patent documentation of deletion to get final product; Another kind of possible situation is, this technical classification or crucial character/word exist to be omitted, can read and index this moment to these patent documentations, further revise crucial character/word or technical classification, and according to actual needs, whether decision is retrieved all or part of patent documentation, and is set up the corresponding relation of new patent documentation and crucial character/word and technical classification according to result for retrieval according to further revised crucial character/word or technical classification.
Adopt technical solution of the present invention, can accelerate the reading and the index speed of patent documentation, reduce the man power and material's that works in this respect input, particularly when the patent documentation enormous amount of needs reading and index, its beneficial effect is very obvious.
Further specify the present invention below in conjunction with drawings and Examples.
Description of drawings
The embodiment that Fig. 1 embodiment of the invention hand indexing and automatic indexing combine.
Embodiment
Embodiment one
A kind of indexing method of patent documentation is an example to weld this technical field in the present embodiment, may further comprise the steps:
1) provides a patent literature database for one technical topic; Promptly relevant patent documentation data storehouse with solder technology, this patent documentation can be the patent documentation that each country, area or international organization provide, the patent documentation that for example more common China, the U.S., Japan, Britain, France, Germany, EUROPEAN PATENT OFFICE etc. provide.
2) set up the technical classification of solder technology theme and the crucial character/word of correspondence thereof, technical classification is to divide according to the content that one piece of patent documentation should comprise usually herein, and it can comprise " technological means ", " technical characterstic ", " technology effect ", " application " etc.; Wherein technological means can be divided into " process ", " welding material ", " welder ", " manufacture method " again; If necessaryly can also do further classification to the classification of technological means.The pairing crucial character/word of technical classification is the character/word according to the concrete feature extraction of technical classification; For example the corresponding key word of " process " technical classification has: " vertical direction ", " butt joint ", " tacking ", " laser is auxiliary ", " repairing ", " patch " etc.
3) select part or all of patent documentation to carry out hand indexing,, set up the corresponding relation of itself and crucial character/word and technical classification each piece patent documentation; In this step, can increase the key word of technical classification correspondence according to the patent documentation of index.
Above-mentioned patent documentation data storehouse, technical classification, crucial character/word etc. can be various spoken and written languages forms (only explaining with Chinese in the embodiment of the invention), by existing computing machine and software programming technique it are controlled and manage.By input equipment input informations such as mouse or keyboards, respond this incident by related computer program, finish the foundation of corresponding relation.For example, when the technological means of one piece of patent documentation of front opening is " process ", when its corresponding key word is " vertical direction ", can adopt click " process " and " vertical direction " this moment, can finish the foundation of the corresponding relation of getting speech and current patent documentation and " process " and " vertical direction " from summary.The indexer need not typing " process ", " vertical direction " or other patent documentation information, just can finish the index of " process " and " vertical direction " like a cork.Thereby reduced workload, simplified the patent indexing process.
Embodiment two
On embodiment one basis, improve.Among the embodiment one, only artificial index partial monopoly document, for not index part, adopt following automatic indexing method to finish the index process:
The crucial character/word and the technical classification that obtain according to the patent documentation of index, retrieve all or part of patent documentation, according to result for retrieval, set up the corresponding relation of patent documentation and crucial character/word and technical classification, finish the indexing work of the patent documentation of index not.Can not need like this index is not partly read and hand indexing, reduce workload.
Embodiment three
What adopt in the foregoing description two is first index partial monopoly document, utilizes the technical classification of index and key word/word information to retrieve then, finishes index.For the patent documentation that makes hand indexing more representative, promptly represented in all relevant patent documentation dissimilar basically, in embodiment three, just after the index of finishing a small amount of patent documentation, the crucial character/word and the technical classification that just begin to utilize the patent documentation of index to obtain, retrieve the patent documentation of not index, set up the not corresponding relation of index patent documentation and crucial character/word and technical classification according to result for retrieval.When the partial monopoly document occurring not by index, can select partly or entirely to replenish index as required, revise crucial character/word or technical classification, according to revising crucial character/word in back or technical classification, retrieve all or part of patent documentation, set up the corresponding relation of patent documentation and crucial character/word and technical classification according to result for retrieval.As shown in Figure 1, after this process circulation several times, promptly carry out a small amount of index earlier, obtain crucial character/word or technical classification, retrieve automatic indexing then, crucial character/word of hand indexing correction or technical classification are retrieved automatic indexing more again,, the final patent documentation index result that must get will be more accurate.
Among above-mentioned three embodiment, may have a small amount of patent documentation and in a plurality of technical classifications, not occur.This may following reason cause: a kind of reason is that the technical theme of this patent documentation and retrieval is uncorrelated, at this moment only needs this patent documentation of deletion to get final product; Another kind of reason is, this technical classification or crucial character/word exist to be omitted, can read and index this moment to these patent documentations, further revise crucial character/word or technical classification, and according to actual needs, whether decision is retrieved all or part of patent documentation, and is rebulid the corresponding relation of patent documentation and crucial character/word and technical classification according to result for retrieval according to further revised crucial character/word or technical classification.
Above-mentioned three embodiment only are specifying technical solution of the present invention; the present invention is not limited to above-mentioned three embodiment; as long as adopt this technological means of automatic indexing, all within protection scope of the present invention by computer program search complete patent documentation.
Claims (6)
1, a kind of indexing method of patent documentation is characterized in that may further comprise the steps:
1) provides a patent literature database for one technical topic;
2) set up the technical classification of this technical theme and the crucial character/word of correspondence thereof;
3) select part or all of patent documentation to carry out index,, set up the corresponding relation of itself and crucial character/word and technical classification each piece patent documentation; In this process, according to the patent documentation of index, the key word of correction technique classification or technical classification correspondence,
Wherein, above-mentioned patent documentation data storehouse, technical classification, crucial character/word are stored in the computing machine, and by the input equipment input information, related computer program responds this input information, sets up the corresponding relation of patent documentation and crucial character/word and technical classification.
2, a kind of indexing method of patent documentation is characterized in that may further comprise the steps:
1) provides a patent literature database for one technical topic;
2) set up the technical classification of this technical theme and the crucial character/word of correspondence thereof;
3) select the part patent documentation to carry out index,, set up the corresponding relation of itself and crucial character/word and technical classification each piece patent documentation; In this step, according to the patent documentation of index, correction technique classification or technical classification correspondence key word,
4) the crucial character/word and the technical classification of the patent documentation acquisition of basis index, retrieve all or part of patent documentation, according to result for retrieval, set up the corresponding relation of patent documentation and crucial character/word and technical classification, finish the not index of the patent documentation of index;
Wherein, above-mentioned patent documentation data storehouse, technical classification, crucial character/word are stored in the computing machine, and by the input equipment input information, related computer program responds this input information, sets up the corresponding relation of patent documentation and crucial character/word and technical classification.
3, the indexing method of a kind of patent documentation according to claim 2 is characterized in that: after the described step 4), also wrap following steps:
When existing the partial monopoly document not by index, it is replenished index, revise crucial character/word or technical classification, according to revising crucial character/word in back or technical classification, retrieve all or part of patent documentation, set up the corresponding relation of patent documentation and crucial character/word and technical classification according to result for retrieval.
4, according to the indexing method of claim 2 or 3 described a kind of patent documentations, it is characterized in that: when some patent documentation fails to occur,, then delete this patent documentation in a plurality of technical classifications if the technical theme of this patent documentation and retrieval is uncorrelated.
5, according to the indexing method of claim 2 or 3 described a kind of patent documentations, it is characterized in that: when some patent documentation fails to occur in a plurality of technical classifications, if this patent documentation is relevant with the technical theme of retrieval, this patent documentation of index then, correction technique classification or crucial character/word.
6, the indexing method of a kind of patent documentation according to claim 5, it is characterized in that: after revising crucial character/word or technical classification, retrieve all or part of patent documentation, and set up the corresponding relation of new patent documentation and crucial character/word and technical classification according to result for retrieval.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN 200610024618 CN1818906A (en) | 2006-03-10 | 2006-03-10 | Indexing method of patent document |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN 200610024618 CN1818906A (en) | 2006-03-10 | 2006-03-10 | Indexing method of patent document |
Publications (1)
Publication Number | Publication Date |
---|---|
CN1818906A true CN1818906A (en) | 2006-08-16 |
Family
ID=36918918
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN 200610024618 Pending CN1818906A (en) | 2006-03-10 | 2006-03-10 | Indexing method of patent document |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN1818906A (en) |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106777103A (en) * | 2016-12-15 | 2017-05-31 | 北京科华万象科技有限公司 | A kind of patent document indexing method and device |
CN107609169A (en) * | 2017-09-27 | 2018-01-19 | 合肥博力生产力促进中心有限公司 | A kind of patent name back-stage management analysis system based on database |
CN109213855A (en) * | 2018-09-12 | 2019-01-15 | 合肥汇众知识产权管理有限公司 | Document labeling method based on patent drafting |
CN109582154A (en) * | 2018-11-15 | 2019-04-05 | 苏州征之魂专利技术服务有限公司 | A kind of calibration of patent document high speed and its storage method |
CN114297312A (en) * | 2021-12-31 | 2022-04-08 | 北京中知智慧科技有限公司 | Method and device for indexing patent data by multi-user cooperative operation database |
-
2006
- 2006-03-10 CN CN 200610024618 patent/CN1818906A/en active Pending
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106777103A (en) * | 2016-12-15 | 2017-05-31 | 北京科华万象科技有限公司 | A kind of patent document indexing method and device |
CN107609169A (en) * | 2017-09-27 | 2018-01-19 | 合肥博力生产力促进中心有限公司 | A kind of patent name back-stage management analysis system based on database |
CN109213855A (en) * | 2018-09-12 | 2019-01-15 | 合肥汇众知识产权管理有限公司 | Document labeling method based on patent drafting |
CN109582154A (en) * | 2018-11-15 | 2019-04-05 | 苏州征之魂专利技术服务有限公司 | A kind of calibration of patent document high speed and its storage method |
CN114297312A (en) * | 2021-12-31 | 2022-04-08 | 北京中知智慧科技有限公司 | Method and device for indexing patent data by multi-user cooperative operation database |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Tallis | Semantic word processing for content authors | |
CN113391871B (en) | RPA element intelligent fusion picking method and system | |
CN1818906A (en) | Indexing method of patent document | |
CN1591401A (en) | Data processing system using application program edition | |
CN1517904A (en) | Ink marking device and associated application programmed interface | |
CN1867894A (en) | Automatic generation of user interface descriptions through sketching | |
CN1656457A (en) | System and method for managing native application data | |
CN101079024A (en) | Special word list dynamic generation system and method | |
CN116361487A (en) | Multi-source heterogeneous policy knowledge graph construction and storage method and system | |
CN102722495A (en) | Indexing method of patent document | |
KR100697359B1 (en) | Automatic recording method of description details and file names for title block of AUTO CAD | |
CN1975723A (en) | Fast indexing method for patent documents | |
CN110008313A (en) | A kind of unsupervised text snippet method of extraction-type | |
CN113609838A (en) | Document information extraction and mapping method and system | |
Ciravegna et al. | LearningPinocchio: Adaptive information extraction for real world applications | |
CN1834954A (en) | System and method of realizing automatic generation of electronic file | |
Grabar et al. | WikiWars-UA: Ukrainian corpus annotated with temporal expressions | |
Brambilla | Generation of webml web application models from business process specifications | |
CN116861337A (en) | Electric power engineering label draws and discernment platform based on fuse LSTM | |
CN1342967A (en) | Unified recognizing method for multi-speed working pattern | |
CN101685463A (en) | Classified indexing method for patent literature | |
WO2007049800A1 (en) | Document creation support device | |
Sporleder | A galois lattice based approach to lexical inheritance hierarchy learning | |
CN112559753A (en) | Management framework of natural language text processing and analyzing task based on business process management technology | |
Costa et al. | A sign matching technique to support searches in sign language texts |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C02 | Deemed withdrawal of patent application after publication (patent law 2001) | ||
WD01 | Invention patent application deemed withdrawn after publication |