CN110276079A - A kind of dictionary method for building up, information retrieval method and corresponding system - Google Patents
A kind of dictionary method for building up, information retrieval method and corresponding system Download PDFInfo
- Publication number
- CN110276079A CN110276079A CN201910568339.4A CN201910568339A CN110276079A CN 110276079 A CN110276079 A CN 110276079A CN 201910568339 A CN201910568339 A CN 201910568339A CN 110276079 A CN110276079 A CN 110276079A
- Authority
- CN
- China
- Prior art keywords
- vocabulary
- term
- word
- library
- dictionary
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/953—Querying, e.g. by the use of web search engines
- G06F16/9535—Search customisation based on user profiles and personalisation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/279—Recognition of textual entities
- G06F40/289—Phrasal analysis, e.g. finite state techniques or chunking
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/30—Semantic analysis
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02D—CLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
- Y02D10/00—Energy efficient computing, e.g. low power processors, power management or thermal management
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Databases & Information Systems (AREA)
- Health & Medical Sciences (AREA)
- Artificial Intelligence (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Computational Linguistics (AREA)
- General Health & Medical Sciences (AREA)
- Data Mining & Analysis (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
The embodiment of the invention discloses a kind of dictionary method for building up, information retrieval method and corresponding systems, wherein, the dictionary method for building up includes: to obtain the association vocabulary of each vocabulary according to the specific explanations corresponding with vocabulary described in each of each vocabulary in dictionary library;Each vocabulary is stored in the meaning of a word library pre-established using the association vocabulary of the vocabulary and the vocabulary as vocabulary group;Sorted logic relationship between each vocabulary and the association vocabulary of the vocabulary is stored in the classification associated library pre-established.The embodiment of the present invention is according to the vocabulary and specific explanations of dictionary library, the association vocabulary of each vocabulary is stored in meaning of a word library, and the sorted logic relationship between each vocabulary is stored in classification associated library, when for information retrieval, vocabulary extension is carried out to term, obtains associative search word, and then retrieved according to associative search word, obtained search result is more comprehensive, is extended to initial results.
Description
Technical field
The present embodiments relate to technical field of information retrieval, and in particular to a kind of dictionary method for building up, information retrieval side
Method and corresponding system.
Background technique
Currently, common information retrieval method is, according to the term (being referred to as keyword) that user inputs, search
Engine is retrieved according to term, and is provided search result and responded.Such search engine, can be with for keyword search
The higher search result of specific aim is provided, in most cases, for the term of input, opinion can be directly given or result is answered
Case.
But usually, the scalability for the search result that this term according to input obtains is relatively limited, nothing
Method provides preferably judgement and decision support foundation.
Summary of the invention
For this purpose, the embodiment of the present invention provides a kind of dictionary method for building up, information retrieval method and corresponding system, to solve
In the prior art due to term it is single caused by the big problem of search result limitation.
To achieve the goals above, the embodiment of the present invention provides the following technical solutions:
According to a first aspect of the embodiments of the present invention, a kind of dictionary method for building up is provided, comprising:
S1 obtains each institute according to the specific explanations corresponding with vocabulary described in each of each vocabulary in dictionary library
The association vocabulary that predicate converges;
S2 is stored in and pre-establishes using the association vocabulary of the vocabulary and the vocabulary as vocabulary group for each vocabulary
Meaning of a word library in;
Sorted logic relationship between each vocabulary and the association vocabulary of the vocabulary is stored in point pre-established by S3
In class correlation database.
Further, the step S1 is specifically included:
S11 collects each vocabulary and the corresponding specific explanations of each vocabulary in dictionary library;
S12 carries out participle fractionation to the specific explanations of each vocabulary, and is obtained according to logic characterization of relation therein
Take the association vocabulary of each vocabulary, wherein the association vocabulary include the near synonym of each vocabulary, synonym, antonym,
Superordinate term and hyponym;
Corresponding, the step S2 is specifically included:
For each vocabulary, by the near synonym of the vocabulary and the vocabulary, synonym, antonym, superordinate term and lower justice
Word is stored in meaning of a word library as vocabulary group, and chooses one of vocabulary as first vocabulary.
Further, the sorted logic relationship includes synonym, near synonym, antonym, superordinate term, hyponym and key
Word.
Further, after the step S2 further include:
A, the corpus material other than dictionary library is collected, and cutting is carried out to the corpus material using Chinese word cutting method,
Obtain multiple participles;
B, for participle described in each, the meaning of a word library is accessed, if the participle is not in any vocabulary in the meaning of a word library
In group, a or c is thened follow the steps;
C, verification verifying is carried out to the participle, which is brought into existing vocabulary group or newly-built vocabulary group pair
The meaning of a word library is updated, and using the participle as first vocabulary of newly-built vocabulary group.
According to a second aspect of the embodiments of the present invention, a kind of information retrieval method is provided, comprising:
S1 ' inquires the pass of first term according to the first term of input from meaning of a word library or classification associated library
Join term;
S2 ' is retrieved according to the associative search word of first term, obtains corresponding search result;Alternatively, root
It is retrieved according to the second term of input, obtains corresponding search result;
Wherein, second term is the term selected from the associative search word of first term, described
Meaning of a word library and the classification associated library are established based on the dictionary method for building up.
Further, the step S1 ' is specifically included:
According to first term, the same of first term is inquired in the meaning of a word library or the classification associated library
The associative search word of adopted word, near synonym, antonym, superordinate term and hyponym as first term.
Further, further includes:
With first term for original term, by the association of first term and first term
Term is presented with tree;
And the sorted logic relationship of first term He its associative search word is presented in the form of statements.
In terms of third according to an embodiment of the present invention, provides a kind of dictionary and establishes system, comprising:
Module is obtained, for obtaining according to the specific explanations corresponding with vocabulary described in each of each vocabulary in dictionary library
Take the association vocabulary of each vocabulary;
First preserving module, for for each vocabulary, using the association vocabulary of the vocabulary and the vocabulary as vocabulary group,
It is stored in the meaning of a word library pre-established;
Second preserving module, for saving the sorted logic relationship between each vocabulary and the association vocabulary of the vocabulary
In the classification associated library pre-established.
4th aspect according to an embodiment of the present invention, provides a kind of information retrieval system, comprising:
Enquiry module inquires described first from meaning of a word library or classification associated library for the first term according to input
The associative search word of term;
Retrieval module obtains retrieving knot accordingly for being retrieved according to the associative search word of first term
Fruit;Alternatively, obtaining corresponding search result for being retrieved according to the second term of input;
Wherein, second term is the term selected from the associative search word of first term, described
Meaning of a word library and the classification associated library are established based on the dictionary method for building up.
5th aspect according to an embodiment of the present invention, provides a kind of computer storage medium, is stored thereon with calculating
Machine program when the computer program is executed by processor, realizes dictionary method for building up or information retrieval method.
The embodiment of the present invention has the advantages that the vocabulary and specific explanations according to dictionary library, by the pass of each vocabulary
Connection vocabulary is stored in meaning of a word library, and the sorted logic relationship between each vocabulary is stored in classification associated library, for information
When retrieval, vocabulary extension is carried out to term, associative search word is obtained, and then retrieved according to associative search word, obtains
Search result is more comprehensive, is extended to initial results.
Detailed description of the invention
It, below will be to embodiment party in order to illustrate more clearly of embodiments of the present invention or technical solution in the prior art
Formula or attached drawing needed to be used in the description of the prior art are briefly described.It should be evident that the accompanying drawings in the following description is only
It is merely exemplary, it for those of ordinary skill in the art, without creative efforts, can also basis
The attached drawing of offer, which is extended, obtains other implementation attached drawings.
Structure depicted in this specification, ratio, size etc., only to cooperate the revealed content of specification, for
Those skilled in the art understands and reads, and is not intended to limit the invention enforceable qualifications, therefore does not have technical
Essential meaning, the modification of any structure, the change of proportionate relationship or the adjustment of size are not influencing the function of the invention that can be generated
Under effect and the purpose that can reach, should all still it fall in the range of disclosed technology contents obtain and can cover.
Fig. 1 is a kind of dictionary method for building up flow chart provided by one embodiment of the present invention;
Fig. 2 is the meaning of a word library method for building up flow chart of one embodiment of the invention;
Fig. 3 is the update method flow chart in the meaning of a word library of one embodiment of the invention;
Fig. 4 is a kind of information retrieval method flow chart of one embodiment of the invention;
Fig. 5 is that a kind of dictionary of one embodiment of the invention establishes system connection block diagram;
Fig. 6 is that a kind of information retrieval system of one embodiment of the invention connects block diagram.
Specific embodiment
Embodiments of the present invention are illustrated by particular specific embodiment below, those skilled in the art can be by this explanation
Content disclosed by book is understood other advantages and efficacy of the present invention easily, it is clear that described embodiment is the present invention one
Section Example, instead of all the embodiments.Based on the embodiments of the present invention, those of ordinary skill in the art are not doing
Every other embodiment obtained under the premise of creative work out, shall fall within the protection scope of the present invention.
Referring to Fig. 1, a kind of dictionary method for building up of one embodiment of the invention is provided, comprising: S1, according in dictionary library
Each vocabulary specific explanations corresponding with vocabulary described in each, obtain the association vocabulary of each vocabulary;S2, for
Each vocabulary is stored in the meaning of a word library pre-established using the association vocabulary of the vocabulary and the vocabulary as vocabulary group;S3,
Sorted logic relationship between each vocabulary and the association vocabulary of the vocabulary is stored in the classification associated library pre-established.
The embodiment of the present invention parses every according to each vocabulary specific explanations corresponding with each vocabulary of dictionary library
Each vocabulary and association vocabulary are stored in meaning of a word library by the association vocabulary of one vocabulary, and by point between each vocabulary
Logic of class relationship is stored in classification associated library, when carrying out for information retrieval, in meaning of a word library and can be divided according to original term
The conjunctive word that original term is found in class correlation database carries out vocabulary extension to original term, obtains associative search word, in turn
It is retrieved according to associative search word, it is comprehensive compared to the search result obtained only according to original term, to search result
It is extended.
Referring to fig. 2, in one embodiment of the invention, the step S1 is specifically included: S11, is collected every in dictionary library
One vocabulary and the corresponding specific explanations of each vocabulary;S12 carries out participle to the specific explanations of each vocabulary and tears open
Point, and according to the association vocabulary of logic therein characterization each vocabulary of Relation acquisition, wherein the association vocabulary includes each
The near synonym of a vocabulary, synonym, antonym, superordinate term, hyponym.
Corresponding, the step S2 is specifically included: for each vocabulary, by the near synonym of the vocabulary and the vocabulary,
Synonym, antonym, superordinate term and hyponym are stored in meaning of a word library as vocabulary group, and choose one of vocabulary conduct
First vocabulary.
Specifically, the detailed process for establishing meaning of a word library is, from dictionary library, such as in Xinhua dictionary, modern Chinese dictionary
Collect each vocabulary specific explanations corresponding with each vocabulary.It is directed to each vocabulary, for the specific solution of the vocabulary
It releases and fractionation participle, filtering and acquisition word segmentation result is carried out using Chinese word segmentation component.Artificial check and correction, core can be added in the process
To, work for correction, so that more reliable basic words participle table and words filter table are formed, get words=participle 1,
Words=participle 2 ..., words n }.Into the specific explanations of vocabulary, according to the specific vocabulary in specific explanations, search out specific
Explain the correlation with vocabulary, the vocabulary classification such as superordinate term, hyponym, same/near synonym, antonym including vocabulary summarizes out
The association word finder of each vocabulary.
After having found the association vocabulary of each vocabulary, for each vocabulary, using the vocabulary be associated with vocabulary as word
Remittance group is stored in meaning of a word library, and selects a vocabulary as first vocabulary of the vocabulary group in vocabulary group, and will be in vocabulary group
Sorted logic relationship between every two vocabulary is stored in classification associated library.Wherein, the sorted logic between two vocabulary closes
System includes synonym, near synonym, antonym, superordinate term, hyponym and keyword.
Wherein, sorted logic relationship is stated format by vocabulary are as follows: [{ classification=specific vocabulary, original vocabulary=vocabulary 1, mesh
Vocabulary=vocabulary 2, { ... } ...], wherein specific vocabulary refers to: superordinate term, hyponym, with/near synonym, antonym etc..
Wherein, when specific vocabulary is superordinate term, the superordinate term for illustrating vocabulary 1 is vocabulary 2 or vocabulary 2 is vocabulary 1
Hyponym;When specific vocabulary is hyponym, the hyponym for illustrating vocabulary 1 is vocabulary 2 or vocabulary 2 is the hyponym of vocabulary 1;Tool
When pronouns, general term for nouns, numerals and measure words remittance is synonym, the synonym for illustrating vocabulary 1 is vocabulary 2 or vocabulary 2 is the synonym of vocabulary 1;Specifically vocabulary is
When antonym, the antonym for illustrating vocabulary 1 is vocabulary 2 or vocabulary 2 is the antonym of vocabulary 1.
Illustrate to establish meaning of a word library, classification associated library and key in the embodiment of the present invention below with several specific examples
The method and process of dictionary, since Xinhua dictionary, modern Chinese dictionary are mainly used for the explanation of words, use corresponding words
Allusion quotation mainly obtains the superordinate term of vocabulary, hyponym, according to the acquisitions such as specific vocabulary " also crying ", " non-" with/near synonym, antisense
Word.It is as follows:
Example one: " [natural law] is present in the rule inside the objective things of nature, is also law of nature."
It obtains superordinate term: parsing superordinate term, { classification=superordinate term, original vocabulary=natural law, purpose vocabulary=rule
Rule }, meaning is that the superordinate term of the natural law is rule.
Obtain with/near synonym: for example: " also crying ", " also cry ... or ", "Yes", " being exactly ", " also saying " back can be cut roughly
It is taken as synonym, example: " natural law] it is present in the rule inside the objective things of nature, also it is law of nature.", solution
It is precipitated with/near synonym association, { classification=synonym, original vocabulary=natural law, purpose vocabulary=law of nature }, meaning
Synonym for the natural law is law of nature.
Sorted logic relationship is got in the example: [{ classification=synonym, original vocabulary=natural law, purpose vocabulary
=law of nature }, { classification=superordinate term, original vocabulary=natural law, purpose vocabulary=rule }];Keyword: it { advises naturally
Rule=[nature, objective things, rule] }, i.e., classification=keyword, and original vocabulary=natural law, purpose vocabulary=from
Right boundary }, { classification=keyword, original vocabulary=natural law, purpose vocabulary=objective things }, classification=keyword, it is original
Vocabulary=the natural law, purpose vocabulary=rule } }, meaning be the natural law keyword include nature, objective things,
Rule.
The example two: " science of [natural science] research nature various substances and phenomenon.Including physics, chemistry, animal
, botany, mineralogy, physiology, mathematics etc. ".
Obtain hyponym: information " physics, chemistry, zoology, botany, mineralogy, physiology behind including in explanation
, mathematics " is the hyponym of vocabulary " natural science ", parses hyponym association, [classification=hyponym, original vocabulary=from
So science, purpose vocabulary=physics }, { classification=hyponym, original vocabulary=natural science, purpose vocabulary=chemistry }, { class
Not=hyponym, original vocabulary=natural science, purpose vocabulary=zoology }, { classification=hyponym, original vocabulary=nature
Science, purpose vocabulary=botany }, { classification=hyponym, original vocabulary=natural science, purpose vocabulary=mineralogy }, { class
Not=hyponym, original vocabulary=natural science, purpose vocabulary=physiology }, { classification=hyponym, original vocabulary=nature
Science, purpose vocabulary=mathematics }].
Classification is got in the example: [{ classification=hyponym, original vocabulary=natural science, purpose vocabulary=physics
Learn, { classification=hyponym, original vocabulary=natural science, purpose vocabulary=chemistry }, classification=hyponym, original vocabulary=
Natural science, purpose vocabulary=zoology }, { classification=hyponym, original vocabulary=natural science, purpose vocabulary=plant
Learn, { classification=hyponym, original vocabulary=natural science, purpose vocabulary=mineralogy }, { classification=hyponym, original vocabulary
=natural science, purpose vocabulary=physiology }, { classification=hyponym, original vocabulary=natural science, purpose vocabulary=number
Learn, { classification=superordinate term, original vocabulary=natural science, purpose vocabulary=science }], keyword: { natural science=[natural
Boundary, substance, phenomenon, science] }.
Obtain antonym: for example: " non-", " no ", "no", behind can intercept roughly as antonymy.Further really
Whether recognize is antonym.Example: " [artificial] is made, non-natural :~fiber |~ice |~earth satellite.", parsing
Antisense word association out, { classification=antonym, original vocabulary=artificial, purpose vocabulary=natural }, meaning is artificial antisense
Word is natural.
Same/near synonym, the antonym of vocabulary, which can inquire same/near synonym, the dictionary of antonyms or corpus logic analysis, to be come
The lexical relation on basis is obtained, is read with/near synonym, dictionary of antonyms acquisition vocabulary association.With/near synonym example: [{ classification
=synonym, original vocabulary=happiness, purpose vocabulary=happiness }, { classification=synonym, original vocabulary=happiness, purpose vocabulary
=joyful].Antonym example: [{ classification=antonym, original vocabulary=happiness, purpose vocabulary=sadness }, { classification=antisense
Word, original vocabulary=happiness, purpose vocabulary=sad }].
Corpus logic analysis mainly remits fractionation sentence according to logical word, gets identical or opposite meaning.It obtains
When taking opposite meaning, i.e., the former piece of same or similar meaning, comparison obtains the suitable result and turnover of holding as a result, two kinds of results are thick
Slightly it is determined as opposite.Example: " although this bridge has built many years, she is still very firm." and " this bridge has been built
Many years, it appears that some slack and undisciplined appearance ".Wherein " although this bridge has built many years, she is still very hard
Gu." in " still " below for turnover as a result, extracting vocabulary " firm ";" this bridge has built many years, it appears that some
Slack and undisciplined appearance." hold below to be suitable as a result, extracting vocabulary " slack and undisciplined ".Antisense word association is parsed, and classification=antonym, it is former
Beginning vocabulary=firm, purpose vocabulary=slack and undisciplined }, meaning is that firm antonym is slack and undisciplined.
By above-mentioned to various dictionaries and corpus logic analysis, each vocabulary and the conjunctive word of each vocabulary are obtained
Converge (with/near synonym, antonym, superordinate term, hyponym etc.), using each vocabulary be associated with vocabulary as one group of vocabulary, i.e. word
Remittance group, and choose one of vocabulary and each vocabulary group is stored in meaning of a word library for first vocabulary, and point of each vocabulary
Logic of class relationship is stored in classification associated library.The associative key of each vocabulary can also be stored in keywords database,
The associative key of each vocabulary can be inquired by keywords database.
Referring to Fig. 3, after the step S2 further include: the corpus material other than a, collection dictionary library, and use Chinese word segmentation
Method carries out cutting to the corpus material, obtains multiple participles;B, for participle described in each, the meaning of a word library is accessed,
If the participle in any vocabulary group in the meaning of a word library, does not then follow the steps a or c;C, the participle test
Card, which is brought into existing vocabulary group or newly-built vocabulary group is updated the meaning of a word library, and by the participle
First vocabulary as newly-built vocabulary group.
The above-mentioned data source used when establishing meaning of a word library and classification associated library is mainly various dictionaries, therefore, based on each
The data source in meaning of a word library and class library that kind dictionary is established is comprehensive not enough, and the embodiment of the present invention carries out data source rich
Richness is constantly updated the meaning of a word library and classification associated library that have built up.
Specifically, website corpus can be acquired or be obtained, using Chinese word cutting method cutting corpus material, participle knot is obtained
Fruit;Meaning of a word library is accessed for each participle, if the participle in the vocabulary group in meaning of a word library, reads first word in the vocabulary group
It converges, and then accesses classification associated library, to obtain the classification associated of the participle.
If the vocabulary further splits the participle not in vocabulary group, in meaning of a word library and classification associated library after fractionation
In inquired, as desk checking auxiliary input information.
For the participle not in all vocabulary of vocabulary group, using manually to the vocabulary verification, error correction, verifying, if should
Vocabulary can be included in meaning of a word library in existing vocabulary group, for example, the conjunctive word of the vocabulary is the vocabulary in a certain vocabulary group,
It then brings the vocabulary into existing vocabulary group, obtains first vocabulary, and then obtain the classification associated of the vocabulary;If the vocabulary is not
Belong to any vocabulary group, then create vocabulary group, vocabulary group is put into meaning of a word library by first vocabulary of the vocabulary as the vocabulary group
In, while this yuan of vocabulary being brought into classification associated library, to obtain the classification associated of the vocabulary.
Referring to fig. 4, a kind of information retrieval method of one embodiment of the invention is provided, comprising: S1 ', according to input
First term inquires the associative search word of first term from meaning of a word library or classification associated library;S2 ', according to described
The associative search word of first term is retrieved, and corresponding search result is obtained;Alternatively, according to the second term of input into
Row retrieval, obtains corresponding search result;Wherein, second term is from the associative search word of first term
The term of selection, the meaning of a word library and the classification associated library are established based on dictionary method for building up.
In an embodiment of the invention, the step S1 ' is specifically included: according to first term, in institute's predicate
Synonym, near synonym, antonym, superordinate term and the hyponym that first term is inquired in adopted library or the classification associated library are made
For the associative search word of first term.
The various embodiments described above establish meaning of a word library, classification associated library and keywords database, and the embodiment of the present invention is carrying out letter
When breath retrieval, according to the first term of input, the associative search of the first term is inquired from meaning of a word library or classification associated library
Word (same/near synonym of mainly the first term, superordinate term, hyponym).Then according to the associative search word of the first term
It is retrieved, obtains search result, it will comprehensively much compared to the search result retrieved only with the first term;
Also can according to need and choose a part of term from the associative search word of the first term and retrieved, targetedly into
Row retrieval.
In one embodiment of the invention, further includes: with first term for original term, by described first
The associative search word of term and first term is presented with tree;And by first term and
The sorted logic relationship of its associative search word is presented in the form of statements.
Specifically, inquiring the associative search of the first term from meaning of a word library according to the first term that user inputs
Word, i.e. same/near synonym of the first term, superordinate term, hyponym, antonym etc., by the first term and its associative search word
It is presented with tree.And it is patrolled in the classification inquired in classification associated library between the first term and its associative search word
The relationship of collecting is presented in the form of statements.That is the relevant information of the first term of user's input is presented to the user, when
When user retrieves, reference can be used as to be retrieved according to the relevant information of presentation.
Referring to Fig. 5, the dictionary for providing one embodiment of the invention establishes system, including obtains module 51, first and save
Module 52 and the second preserving module 53.
Module 51 is obtained, is used for according to the specific explanations corresponding with vocabulary described in each of each vocabulary in dictionary library,
Obtain the association vocabulary of each vocabulary.
First preserving module 52 is used for for each vocabulary, using the association vocabulary of the vocabulary and the vocabulary as vocabulary
Group is stored in the meaning of a word library pre-established.
Second preserving module 53, for protecting the sorted logic relationship between each vocabulary and the association vocabulary of the vocabulary
It is stored in the classification associated library pre-established.
A kind of dictionary provided in an embodiment of the present invention establishes a kind of dictionary method for building up that system and previous embodiment provide
Corresponding, the technical characteristic that dictionary provided in this embodiment establishes system can refer to the phase of dictionary method for building up in previous embodiment
Technical characteristic is closed, details are not described herein.
Referring to Fig. 6, a kind of information retrieval system of one embodiment of the invention, including enquiry module 61 and retrieval are provided
Module 62.
Enquiry module 61 inquires described for the first term according to input from meaning of a word library or classification associated library
The associative search word of one term.
Retrieval module 62 is retrieved accordingly for being retrieved according to the associative search word of first term
As a result;Alternatively, obtaining corresponding search result for being retrieved according to the second term of input.
Wherein, second term is the term selected from the associative search word of first term, described
Meaning of a word library and the classification associated library are established based on dictionary method for building up described in the various embodiments described above.
A kind of information retrieval method that a kind of information retrieval system provided in an embodiment of the present invention and previous embodiment provide
Corresponding, the technical characteristic of information retrieval system provided in this embodiment can refer to the phase of information retrieval method in previous embodiment
Technical characteristic is closed, details are not described herein.
In one embodiment of the invention, a kind of computer storage medium is additionally provided, computer journey is stored thereon with
Sequence when the computer program is executed by processor, realizes dictionary method for building up or information retrieval method.
A kind of dictionary method for building up, information retrieval method and corresponding system provided by the invention, according to the word of dictionary library
The association vocabulary of each vocabulary is stored in meaning of a word library, and the sorted logic between each vocabulary is closed by remittance and specific explanations
System is stored in classification associated library;When carrying out information retrieval, inspection can be found in meaning of a word library and classification associated library according to term
The associative search word of rope word carries out vocabulary extension to term, obtains associative search word, and then examined according to associative search word
Rope, obtained search result is more comprehensive, is extended to initial results;And also by the associative search word of term and
Sorted logic relationship between term and associative search word is presented to the user, for reference, is the judgement and decision of user
It supports to provide foundation.
Although above having used general explanation and specific embodiment, the present invention is described in detail, at this
On the basis of invention, it can be made some modifications or improvements, this will be apparent to those skilled in the art.Therefore,
These modifications or improvements without departing from theon the basis of the spirit of the present invention are fallen within the scope of the claimed invention.
Claims (10)
1. a kind of dictionary method for building up characterized by comprising
S1 obtains each institute's predicate according to the specific explanations corresponding with vocabulary described in each of each vocabulary in dictionary library
The association vocabulary of remittance;
S2 is stored in the word pre-established using the association vocabulary of the vocabulary and the vocabulary as vocabulary group for each vocabulary
In adopted library;
Sorted logic relationship between each vocabulary and the association vocabulary of the vocabulary is stored in the classification pre-established and closed by S3
Join in library.
2. dictionary method for building up according to claim 1, which is characterized in that the step S1 is specifically included:
S11 collects each vocabulary and the corresponding specific explanations of each vocabulary in dictionary library;
S12 carries out participle fractionation to the specific explanations of each vocabulary, and every according to logic therein characterization Relation acquisition
The association vocabulary of one vocabulary, wherein the association vocabulary includes the near synonym, synonym, antonym, upper justice of each vocabulary
Word and hyponym;
Corresponding, the step S2 is specifically included:
For each vocabulary, the near synonym of the vocabulary and the vocabulary, synonym, antonym, superordinate term and hyponym are made
It is stored in meaning of a word library for vocabulary group, and chooses one of vocabulary as first vocabulary.
3. dictionary method for building up according to claim 2, which is characterized in that the sorted logic relationship include synonym,
Near synonym, antonym, superordinate term, hyponym and keyword.
4. dictionary method for building up according to claim 2, which is characterized in that after the step S2 further include:
A, the corpus material other than dictionary library is collected, and cutting is carried out to the corpus material using Chinese word cutting method, is obtained
Multiple participles;
B, for participle described in each, access the meaning of a word library, if the participle not in any vocabulary group in the meaning of a word library,
Then follow the steps a or c;
C, verification verifying is carried out to the participle, which is brought into existing vocabulary group or newly-built vocabulary group is to described
Meaning of a word library is updated, and using the participle as first vocabulary of newly-built vocabulary group.
5. a kind of information retrieval method characterized by comprising
S1 ' inquires the association inspection of first term according to the first term of input from meaning of a word library or classification associated library
Rope word;
S2 ' is retrieved according to the associative search word of first term, obtains corresponding search result;Alternatively, according to defeated
The second term entered is retrieved, and corresponding search result is obtained;
Wherein, second term is the term selected from the associative search word of first term, the meaning of a word
Library and the classification associated library are established based on dictionary method for building up according to any one of claims 1-4.
6. information retrieval method according to claim 5, which is characterized in that the step S1 ' is specifically included:
According to first term, the synonymous of first term is inquired in the meaning of a word library or the classification associated library
The associative search word of word, near synonym, antonym, superordinate term and hyponym as first term.
7. information retrieval method according to claim 6, which is characterized in that further include:
With first term for original term, by first term and the associative search of first term
Word is presented with tree;
And the sorted logic relationship of first term He its associative search word is presented in the form of statements.
8. a kind of dictionary establishes system characterized by comprising
Module is obtained, for obtaining every according to the specific explanations corresponding with vocabulary described in each of each vocabulary in dictionary library
The association vocabulary of one vocabulary;
First preserving module, for being saved for each vocabulary using the association vocabulary of the vocabulary and the vocabulary as vocabulary group
In the meaning of a word library pre-established;
Second preserving module, it is pre- for the sorted logic relationship between each vocabulary and the association vocabulary of the vocabulary to be stored in
In the classification associated library first established.
9. a kind of information retrieval system characterized by comprising
Enquiry module inquires first retrieval for the first term according to input from meaning of a word library or classification associated library
The associative search word of word;
Retrieval module obtains corresponding search result for being retrieved according to the associative search word of first term;Or
Person obtains corresponding search result for being retrieved according to the second term of input;
Wherein, second term is the term selected from the associative search word of first term, the meaning of a word
Library and the classification associated library are established based on dictionary method for building up according to any one of claims 1-4.
10. a kind of computer storage medium, which is characterized in that be stored thereon with computer program, the computer program is located
When managing device execution, dictionary method for building up or information retrieval method are realized.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910568339.4A CN110276079B (en) | 2019-06-27 | 2019-06-27 | Word stock establishment method, information retrieval method and corresponding system |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910568339.4A CN110276079B (en) | 2019-06-27 | 2019-06-27 | Word stock establishment method, information retrieval method and corresponding system |
Publications (2)
Publication Number | Publication Date |
---|---|
CN110276079A true CN110276079A (en) | 2019-09-24 |
CN110276079B CN110276079B (en) | 2023-05-26 |
Family
ID=67962399
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201910568339.4A Active CN110276079B (en) | 2019-06-27 | 2019-06-27 | Word stock establishment method, information retrieval method and corresponding system |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN110276079B (en) |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113051898A (en) * | 2019-12-27 | 2021-06-29 | 北京阿博茨科技有限公司 | Word meaning accumulation and word segmentation method, tool and system for structured data searched by natural language |
CN113407668A (en) * | 2021-06-11 | 2021-09-17 | 武夷学院 | Data processing method and device for cognitive association capacity training |
CN113515585A (en) * | 2020-04-10 | 2021-10-19 | 中国石油化工股份有限公司 | Construction method, retrieval method and system of special lexicon in dangerous chemical safety field |
CN117953875A (en) * | 2024-03-27 | 2024-04-30 | 成都启英泰伦科技有限公司 | Offline voice command word storage method based on semantic understanding |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2000222410A (en) * | 1999-01-28 | 2000-08-11 | Matsushita Electric Ind Co Ltd | Thesaurus retrieving device and thesaurus retrieval system |
TW200424874A (en) * | 2003-05-09 | 2004-11-16 | Webgenie Information Ltd | Automatic thesaurus construction method |
US20120124084A1 (en) * | 2010-11-06 | 2012-05-17 | Ning Zhu | Method to semantically search domain name by utilizing hyponym, hypernym, troponym, entailment and coordinate term |
CN108959314A (en) * | 2017-05-24 | 2018-12-07 | 西安科技大市场创新云服务股份有限公司 | A kind of semantic retrieving method and device |
-
2019
- 2019-06-27 CN CN201910568339.4A patent/CN110276079B/en active Active
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2000222410A (en) * | 1999-01-28 | 2000-08-11 | Matsushita Electric Ind Co Ltd | Thesaurus retrieving device and thesaurus retrieval system |
TW200424874A (en) * | 2003-05-09 | 2004-11-16 | Webgenie Information Ltd | Automatic thesaurus construction method |
US20120124084A1 (en) * | 2010-11-06 | 2012-05-17 | Ning Zhu | Method to semantically search domain name by utilizing hyponym, hypernym, troponym, entailment and coordinate term |
CN108959314A (en) * | 2017-05-24 | 2018-12-07 | 西安科技大市场创新云服务股份有限公司 | A kind of semantic retrieving method and device |
Non-Patent Citations (1)
Title |
---|
翟羽佳等: "基于文本挖掘的中文领域本体构建方法研究", 《情报科学》 * |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113051898A (en) * | 2019-12-27 | 2021-06-29 | 北京阿博茨科技有限公司 | Word meaning accumulation and word segmentation method, tool and system for structured data searched by natural language |
CN113515585A (en) * | 2020-04-10 | 2021-10-19 | 中国石油化工股份有限公司 | Construction method, retrieval method and system of special lexicon in dangerous chemical safety field |
CN113407668A (en) * | 2021-06-11 | 2021-09-17 | 武夷学院 | Data processing method and device for cognitive association capacity training |
CN113407668B (en) * | 2021-06-11 | 2022-10-11 | 武夷学院 | Data processing method and device for cognitive association capacity training |
CN117953875A (en) * | 2024-03-27 | 2024-04-30 | 成都启英泰伦科技有限公司 | Offline voice command word storage method based on semantic understanding |
Also Published As
Publication number | Publication date |
---|---|
CN110276079B (en) | 2023-05-26 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
KR101173561B1 (en) | Question type and domain identifying apparatus and method | |
CN103678576B (en) | The text retrieval system analyzed based on dynamic semantics | |
CN100416570C (en) | FAQ based Chinese natural language ask and answer method | |
CN103136352B (en) | Text retrieval system based on double-deck semantic analysis | |
CN100595763C (en) | Full text retrieval system based on natural language | |
CN110059311A (en) | A kind of keyword extracting method and system towards judicial style data | |
CN102144229B (en) | System for extracting term from document containing text segment | |
KR101524889B1 (en) | Identification of semantic relationships within reported speech | |
Yin et al. | Facto: a fact lookup engine based on web tables | |
Rahman et al. | STRICT: Information retrieval based search term identification for concept location | |
CN110276079A (en) | A kind of dictionary method for building up, information retrieval method and corresponding system | |
CN109582704A (en) | Recruitment information and the matched method of job seeker resume | |
CN107908712A (en) | Cross-language information matching process based on term extraction | |
CN107967290A (en) | A kind of knowledge mapping network establishing method and system, medium based on magnanimity scientific research data | |
CN111104488B (en) | Method, device and storage medium for integrating retrieval and similarity analysis | |
Al-Taani et al. | An extractive graph-based Arabic text summarization approach | |
KR100835706B1 (en) | System and method for korean morphological analysis for automatic indexing | |
CN108763272A (en) | A kind of event information analysis method, computer readable storage medium and terminal device | |
JP5718405B2 (en) | Utterance selection apparatus, method and program, dialogue apparatus and method | |
Mahendra et al. | Acquiring relational patterns from wikipedia: A case study | |
CN109284441A (en) | Dynamic self-adapting network sensitive information detection method and device | |
US20120072443A1 (en) | Data searching system and method for generating derivative keywords according to input keywords | |
CN108764972A (en) | A kind of film box office prediction technique and device | |
Hakkani-Tur et al. | Statistical sentence extraction for information distillation | |
CN110019814B (en) | News information aggregation method based on data mining and deep learning |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |