CN110276079A - A kind of dictionary method for building up, information retrieval method and corresponding system - Google Patents

A kind of dictionary method for building up, information retrieval method and corresponding system Download PDF

Info

Publication number
CN110276079A
CN110276079A CN201910568339.4A CN201910568339A CN110276079A CN 110276079 A CN110276079 A CN 110276079A CN 201910568339 A CN201910568339 A CN 201910568339A CN 110276079 A CN110276079 A CN 110276079A
Authority
CN
China
Prior art keywords
vocabulary
term
word
library
dictionary
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201910568339.4A
Other languages
Chinese (zh)
Other versions
CN110276079B (en
Inventor
谷晓佳
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Priority to CN201910568339.4A priority Critical patent/CN110276079B/en
Publication of CN110276079A publication Critical patent/CN110276079A/en
Application granted granted Critical
Publication of CN110276079B publication Critical patent/CN110276079B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9535Search customisation based on user profiles and personalisation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • G06F40/289Phrasal analysis, e.g. finite state techniques or chunking
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/30Semantic analysis
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Databases & Information Systems (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Data Mining & Analysis (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The embodiment of the invention discloses a kind of dictionary method for building up, information retrieval method and corresponding systems, wherein, the dictionary method for building up includes: to obtain the association vocabulary of each vocabulary according to the specific explanations corresponding with vocabulary described in each of each vocabulary in dictionary library;Each vocabulary is stored in the meaning of a word library pre-established using the association vocabulary of the vocabulary and the vocabulary as vocabulary group;Sorted logic relationship between each vocabulary and the association vocabulary of the vocabulary is stored in the classification associated library pre-established.The embodiment of the present invention is according to the vocabulary and specific explanations of dictionary library, the association vocabulary of each vocabulary is stored in meaning of a word library, and the sorted logic relationship between each vocabulary is stored in classification associated library, when for information retrieval, vocabulary extension is carried out to term, obtains associative search word, and then retrieved according to associative search word, obtained search result is more comprehensive, is extended to initial results.

Description

A kind of dictionary method for building up, information retrieval method and corresponding system
Technical field
The present embodiments relate to technical field of information retrieval, and in particular to a kind of dictionary method for building up, information retrieval side Method and corresponding system.
Background technique
Currently, common information retrieval method is, according to the term (being referred to as keyword) that user inputs, search Engine is retrieved according to term, and is provided search result and responded.Such search engine, can be with for keyword search The higher search result of specific aim is provided, in most cases, for the term of input, opinion can be directly given or result is answered Case.
But usually, the scalability for the search result that this term according to input obtains is relatively limited, nothing Method provides preferably judgement and decision support foundation.
Summary of the invention
For this purpose, the embodiment of the present invention provides a kind of dictionary method for building up, information retrieval method and corresponding system, to solve In the prior art due to term it is single caused by the big problem of search result limitation.
To achieve the goals above, the embodiment of the present invention provides the following technical solutions:
According to a first aspect of the embodiments of the present invention, a kind of dictionary method for building up is provided, comprising:
S1 obtains each institute according to the specific explanations corresponding with vocabulary described in each of each vocabulary in dictionary library The association vocabulary that predicate converges;
S2 is stored in and pre-establishes using the association vocabulary of the vocabulary and the vocabulary as vocabulary group for each vocabulary Meaning of a word library in;
Sorted logic relationship between each vocabulary and the association vocabulary of the vocabulary is stored in point pre-established by S3 In class correlation database.
Further, the step S1 is specifically included:
S11 collects each vocabulary and the corresponding specific explanations of each vocabulary in dictionary library;
S12 carries out participle fractionation to the specific explanations of each vocabulary, and is obtained according to logic characterization of relation therein Take the association vocabulary of each vocabulary, wherein the association vocabulary include the near synonym of each vocabulary, synonym, antonym, Superordinate term and hyponym;
Corresponding, the step S2 is specifically included:
For each vocabulary, by the near synonym of the vocabulary and the vocabulary, synonym, antonym, superordinate term and lower justice Word is stored in meaning of a word library as vocabulary group, and chooses one of vocabulary as first vocabulary.
Further, the sorted logic relationship includes synonym, near synonym, antonym, superordinate term, hyponym and key Word.
Further, after the step S2 further include:
A, the corpus material other than dictionary library is collected, and cutting is carried out to the corpus material using Chinese word cutting method, Obtain multiple participles;
B, for participle described in each, the meaning of a word library is accessed, if the participle is not in any vocabulary in the meaning of a word library In group, a or c is thened follow the steps;
C, verification verifying is carried out to the participle, which is brought into existing vocabulary group or newly-built vocabulary group pair The meaning of a word library is updated, and using the participle as first vocabulary of newly-built vocabulary group.
According to a second aspect of the embodiments of the present invention, a kind of information retrieval method is provided, comprising:
S1 ' inquires the pass of first term according to the first term of input from meaning of a word library or classification associated library Join term;
S2 ' is retrieved according to the associative search word of first term, obtains corresponding search result;Alternatively, root It is retrieved according to the second term of input, obtains corresponding search result;
Wherein, second term is the term selected from the associative search word of first term, described Meaning of a word library and the classification associated library are established based on the dictionary method for building up.
Further, the step S1 ' is specifically included:
According to first term, the same of first term is inquired in the meaning of a word library or the classification associated library The associative search word of adopted word, near synonym, antonym, superordinate term and hyponym as first term.
Further, further includes:
With first term for original term, by the association of first term and first term Term is presented with tree;
And the sorted logic relationship of first term He its associative search word is presented in the form of statements.
In terms of third according to an embodiment of the present invention, provides a kind of dictionary and establishes system, comprising:
Module is obtained, for obtaining according to the specific explanations corresponding with vocabulary described in each of each vocabulary in dictionary library Take the association vocabulary of each vocabulary;
First preserving module, for for each vocabulary, using the association vocabulary of the vocabulary and the vocabulary as vocabulary group, It is stored in the meaning of a word library pre-established;
Second preserving module, for saving the sorted logic relationship between each vocabulary and the association vocabulary of the vocabulary In the classification associated library pre-established.
4th aspect according to an embodiment of the present invention, provides a kind of information retrieval system, comprising:
Enquiry module inquires described first from meaning of a word library or classification associated library for the first term according to input The associative search word of term;
Retrieval module obtains retrieving knot accordingly for being retrieved according to the associative search word of first term Fruit;Alternatively, obtaining corresponding search result for being retrieved according to the second term of input;
Wherein, second term is the term selected from the associative search word of first term, described Meaning of a word library and the classification associated library are established based on the dictionary method for building up.
5th aspect according to an embodiment of the present invention, provides a kind of computer storage medium, is stored thereon with calculating Machine program when the computer program is executed by processor, realizes dictionary method for building up or information retrieval method.
The embodiment of the present invention has the advantages that the vocabulary and specific explanations according to dictionary library, by the pass of each vocabulary Connection vocabulary is stored in meaning of a word library, and the sorted logic relationship between each vocabulary is stored in classification associated library, for information When retrieval, vocabulary extension is carried out to term, associative search word is obtained, and then retrieved according to associative search word, obtains Search result is more comprehensive, is extended to initial results.
Detailed description of the invention
It, below will be to embodiment party in order to illustrate more clearly of embodiments of the present invention or technical solution in the prior art Formula or attached drawing needed to be used in the description of the prior art are briefly described.It should be evident that the accompanying drawings in the following description is only It is merely exemplary, it for those of ordinary skill in the art, without creative efforts, can also basis The attached drawing of offer, which is extended, obtains other implementation attached drawings.
Structure depicted in this specification, ratio, size etc., only to cooperate the revealed content of specification, for Those skilled in the art understands and reads, and is not intended to limit the invention enforceable qualifications, therefore does not have technical Essential meaning, the modification of any structure, the change of proportionate relationship or the adjustment of size are not influencing the function of the invention that can be generated Under effect and the purpose that can reach, should all still it fall in the range of disclosed technology contents obtain and can cover.
Fig. 1 is a kind of dictionary method for building up flow chart provided by one embodiment of the present invention;
Fig. 2 is the meaning of a word library method for building up flow chart of one embodiment of the invention;
Fig. 3 is the update method flow chart in the meaning of a word library of one embodiment of the invention;
Fig. 4 is a kind of information retrieval method flow chart of one embodiment of the invention;
Fig. 5 is that a kind of dictionary of one embodiment of the invention establishes system connection block diagram;
Fig. 6 is that a kind of information retrieval system of one embodiment of the invention connects block diagram.
Specific embodiment
Embodiments of the present invention are illustrated by particular specific embodiment below, those skilled in the art can be by this explanation Content disclosed by book is understood other advantages and efficacy of the present invention easily, it is clear that described embodiment is the present invention one Section Example, instead of all the embodiments.Based on the embodiments of the present invention, those of ordinary skill in the art are not doing Every other embodiment obtained under the premise of creative work out, shall fall within the protection scope of the present invention.
Referring to Fig. 1, a kind of dictionary method for building up of one embodiment of the invention is provided, comprising: S1, according in dictionary library Each vocabulary specific explanations corresponding with vocabulary described in each, obtain the association vocabulary of each vocabulary;S2, for Each vocabulary is stored in the meaning of a word library pre-established using the association vocabulary of the vocabulary and the vocabulary as vocabulary group;S3, Sorted logic relationship between each vocabulary and the association vocabulary of the vocabulary is stored in the classification associated library pre-established.
The embodiment of the present invention parses every according to each vocabulary specific explanations corresponding with each vocabulary of dictionary library Each vocabulary and association vocabulary are stored in meaning of a word library by the association vocabulary of one vocabulary, and by point between each vocabulary Logic of class relationship is stored in classification associated library, when carrying out for information retrieval, in meaning of a word library and can be divided according to original term The conjunctive word that original term is found in class correlation database carries out vocabulary extension to original term, obtains associative search word, in turn It is retrieved according to associative search word, it is comprehensive compared to the search result obtained only according to original term, to search result It is extended.
Referring to fig. 2, in one embodiment of the invention, the step S1 is specifically included: S11, is collected every in dictionary library One vocabulary and the corresponding specific explanations of each vocabulary;S12 carries out participle to the specific explanations of each vocabulary and tears open Point, and according to the association vocabulary of logic therein characterization each vocabulary of Relation acquisition, wherein the association vocabulary includes each The near synonym of a vocabulary, synonym, antonym, superordinate term, hyponym.
Corresponding, the step S2 is specifically included: for each vocabulary, by the near synonym of the vocabulary and the vocabulary, Synonym, antonym, superordinate term and hyponym are stored in meaning of a word library as vocabulary group, and choose one of vocabulary conduct First vocabulary.
Specifically, the detailed process for establishing meaning of a word library is, from dictionary library, such as in Xinhua dictionary, modern Chinese dictionary Collect each vocabulary specific explanations corresponding with each vocabulary.It is directed to each vocabulary, for the specific solution of the vocabulary It releases and fractionation participle, filtering and acquisition word segmentation result is carried out using Chinese word segmentation component.Artificial check and correction, core can be added in the process To, work for correction, so that more reliable basic words participle table and words filter table are formed, get words=participle 1, Words=participle 2 ..., words n }.Into the specific explanations of vocabulary, according to the specific vocabulary in specific explanations, search out specific Explain the correlation with vocabulary, the vocabulary classification such as superordinate term, hyponym, same/near synonym, antonym including vocabulary summarizes out The association word finder of each vocabulary.
After having found the association vocabulary of each vocabulary, for each vocabulary, using the vocabulary be associated with vocabulary as word Remittance group is stored in meaning of a word library, and selects a vocabulary as first vocabulary of the vocabulary group in vocabulary group, and will be in vocabulary group Sorted logic relationship between every two vocabulary is stored in classification associated library.Wherein, the sorted logic between two vocabulary closes System includes synonym, near synonym, antonym, superordinate term, hyponym and keyword.
Wherein, sorted logic relationship is stated format by vocabulary are as follows: [{ classification=specific vocabulary, original vocabulary=vocabulary 1, mesh Vocabulary=vocabulary 2, { ... } ...], wherein specific vocabulary refers to: superordinate term, hyponym, with/near synonym, antonym etc..
Wherein, when specific vocabulary is superordinate term, the superordinate term for illustrating vocabulary 1 is vocabulary 2 or vocabulary 2 is vocabulary 1 Hyponym;When specific vocabulary is hyponym, the hyponym for illustrating vocabulary 1 is vocabulary 2 or vocabulary 2 is the hyponym of vocabulary 1;Tool When pronouns, general term for nouns, numerals and measure words remittance is synonym, the synonym for illustrating vocabulary 1 is vocabulary 2 or vocabulary 2 is the synonym of vocabulary 1;Specifically vocabulary is When antonym, the antonym for illustrating vocabulary 1 is vocabulary 2 or vocabulary 2 is the antonym of vocabulary 1.
Illustrate to establish meaning of a word library, classification associated library and key in the embodiment of the present invention below with several specific examples The method and process of dictionary, since Xinhua dictionary, modern Chinese dictionary are mainly used for the explanation of words, use corresponding words Allusion quotation mainly obtains the superordinate term of vocabulary, hyponym, according to the acquisitions such as specific vocabulary " also crying ", " non-" with/near synonym, antisense Word.It is as follows:
Example one: " [natural law] is present in the rule inside the objective things of nature, is also law of nature."
It obtains superordinate term: parsing superordinate term, { classification=superordinate term, original vocabulary=natural law, purpose vocabulary=rule Rule }, meaning is that the superordinate term of the natural law is rule.
Obtain with/near synonym: for example: " also crying ", " also cry ... or ", "Yes", " being exactly ", " also saying " back can be cut roughly It is taken as synonym, example: " natural law] it is present in the rule inside the objective things of nature, also it is law of nature.", solution It is precipitated with/near synonym association, { classification=synonym, original vocabulary=natural law, purpose vocabulary=law of nature }, meaning Synonym for the natural law is law of nature.
Sorted logic relationship is got in the example: [{ classification=synonym, original vocabulary=natural law, purpose vocabulary =law of nature }, { classification=superordinate term, original vocabulary=natural law, purpose vocabulary=rule }];Keyword: it { advises naturally Rule=[nature, objective things, rule] }, i.e., classification=keyword, and original vocabulary=natural law, purpose vocabulary=from Right boundary }, { classification=keyword, original vocabulary=natural law, purpose vocabulary=objective things }, classification=keyword, it is original Vocabulary=the natural law, purpose vocabulary=rule } }, meaning be the natural law keyword include nature, objective things, Rule.
The example two: " science of [natural science] research nature various substances and phenomenon.Including physics, chemistry, animal , botany, mineralogy, physiology, mathematics etc. ".
Obtain hyponym: information " physics, chemistry, zoology, botany, mineralogy, physiology behind including in explanation , mathematics " is the hyponym of vocabulary " natural science ", parses hyponym association, [classification=hyponym, original vocabulary=from So science, purpose vocabulary=physics }, { classification=hyponym, original vocabulary=natural science, purpose vocabulary=chemistry }, { class Not=hyponym, original vocabulary=natural science, purpose vocabulary=zoology }, { classification=hyponym, original vocabulary=nature Science, purpose vocabulary=botany }, { classification=hyponym, original vocabulary=natural science, purpose vocabulary=mineralogy }, { class Not=hyponym, original vocabulary=natural science, purpose vocabulary=physiology }, { classification=hyponym, original vocabulary=nature Science, purpose vocabulary=mathematics }].
Classification is got in the example: [{ classification=hyponym, original vocabulary=natural science, purpose vocabulary=physics Learn, { classification=hyponym, original vocabulary=natural science, purpose vocabulary=chemistry }, classification=hyponym, original vocabulary= Natural science, purpose vocabulary=zoology }, { classification=hyponym, original vocabulary=natural science, purpose vocabulary=plant Learn, { classification=hyponym, original vocabulary=natural science, purpose vocabulary=mineralogy }, { classification=hyponym, original vocabulary =natural science, purpose vocabulary=physiology }, { classification=hyponym, original vocabulary=natural science, purpose vocabulary=number Learn, { classification=superordinate term, original vocabulary=natural science, purpose vocabulary=science }], keyword: { natural science=[natural Boundary, substance, phenomenon, science] }.
Obtain antonym: for example: " non-", " no ", "no", behind can intercept roughly as antonymy.Further really Whether recognize is antonym.Example: " [artificial] is made, non-natural :~fiber |~ice |~earth satellite.", parsing Antisense word association out, { classification=antonym, original vocabulary=artificial, purpose vocabulary=natural }, meaning is artificial antisense Word is natural.
Same/near synonym, the antonym of vocabulary, which can inquire same/near synonym, the dictionary of antonyms or corpus logic analysis, to be come The lexical relation on basis is obtained, is read with/near synonym, dictionary of antonyms acquisition vocabulary association.With/near synonym example: [{ classification =synonym, original vocabulary=happiness, purpose vocabulary=happiness }, { classification=synonym, original vocabulary=happiness, purpose vocabulary =joyful].Antonym example: [{ classification=antonym, original vocabulary=happiness, purpose vocabulary=sadness }, { classification=antisense Word, original vocabulary=happiness, purpose vocabulary=sad }].
Corpus logic analysis mainly remits fractionation sentence according to logical word, gets identical or opposite meaning.It obtains When taking opposite meaning, i.e., the former piece of same or similar meaning, comparison obtains the suitable result and turnover of holding as a result, two kinds of results are thick Slightly it is determined as opposite.Example: " although this bridge has built many years, she is still very firm." and " this bridge has been built Many years, it appears that some slack and undisciplined appearance ".Wherein " although this bridge has built many years, she is still very hard Gu." in " still " below for turnover as a result, extracting vocabulary " firm ";" this bridge has built many years, it appears that some Slack and undisciplined appearance." hold below to be suitable as a result, extracting vocabulary " slack and undisciplined ".Antisense word association is parsed, and classification=antonym, it is former Beginning vocabulary=firm, purpose vocabulary=slack and undisciplined }, meaning is that firm antonym is slack and undisciplined.
By above-mentioned to various dictionaries and corpus logic analysis, each vocabulary and the conjunctive word of each vocabulary are obtained Converge (with/near synonym, antonym, superordinate term, hyponym etc.), using each vocabulary be associated with vocabulary as one group of vocabulary, i.e. word Remittance group, and choose one of vocabulary and each vocabulary group is stored in meaning of a word library for first vocabulary, and point of each vocabulary Logic of class relationship is stored in classification associated library.The associative key of each vocabulary can also be stored in keywords database, The associative key of each vocabulary can be inquired by keywords database.
Referring to Fig. 3, after the step S2 further include: the corpus material other than a, collection dictionary library, and use Chinese word segmentation Method carries out cutting to the corpus material, obtains multiple participles;B, for participle described in each, the meaning of a word library is accessed, If the participle in any vocabulary group in the meaning of a word library, does not then follow the steps a or c;C, the participle test Card, which is brought into existing vocabulary group or newly-built vocabulary group is updated the meaning of a word library, and by the participle First vocabulary as newly-built vocabulary group.
The above-mentioned data source used when establishing meaning of a word library and classification associated library is mainly various dictionaries, therefore, based on each The data source in meaning of a word library and class library that kind dictionary is established is comprehensive not enough, and the embodiment of the present invention carries out data source rich Richness is constantly updated the meaning of a word library and classification associated library that have built up.
Specifically, website corpus can be acquired or be obtained, using Chinese word cutting method cutting corpus material, participle knot is obtained Fruit;Meaning of a word library is accessed for each participle, if the participle in the vocabulary group in meaning of a word library, reads first word in the vocabulary group It converges, and then accesses classification associated library, to obtain the classification associated of the participle.
If the vocabulary further splits the participle not in vocabulary group, in meaning of a word library and classification associated library after fractionation In inquired, as desk checking auxiliary input information.
For the participle not in all vocabulary of vocabulary group, using manually to the vocabulary verification, error correction, verifying, if should Vocabulary can be included in meaning of a word library in existing vocabulary group, for example, the conjunctive word of the vocabulary is the vocabulary in a certain vocabulary group, It then brings the vocabulary into existing vocabulary group, obtains first vocabulary, and then obtain the classification associated of the vocabulary;If the vocabulary is not Belong to any vocabulary group, then create vocabulary group, vocabulary group is put into meaning of a word library by first vocabulary of the vocabulary as the vocabulary group In, while this yuan of vocabulary being brought into classification associated library, to obtain the classification associated of the vocabulary.
Referring to fig. 4, a kind of information retrieval method of one embodiment of the invention is provided, comprising: S1 ', according to input First term inquires the associative search word of first term from meaning of a word library or classification associated library;S2 ', according to described The associative search word of first term is retrieved, and corresponding search result is obtained;Alternatively, according to the second term of input into Row retrieval, obtains corresponding search result;Wherein, second term is from the associative search word of first term The term of selection, the meaning of a word library and the classification associated library are established based on dictionary method for building up.
In an embodiment of the invention, the step S1 ' is specifically included: according to first term, in institute's predicate Synonym, near synonym, antonym, superordinate term and the hyponym that first term is inquired in adopted library or the classification associated library are made For the associative search word of first term.
The various embodiments described above establish meaning of a word library, classification associated library and keywords database, and the embodiment of the present invention is carrying out letter When breath retrieval, according to the first term of input, the associative search of the first term is inquired from meaning of a word library or classification associated library Word (same/near synonym of mainly the first term, superordinate term, hyponym).Then according to the associative search word of the first term It is retrieved, obtains search result, it will comprehensively much compared to the search result retrieved only with the first term; Also can according to need and choose a part of term from the associative search word of the first term and retrieved, targetedly into Row retrieval.
In one embodiment of the invention, further includes: with first term for original term, by described first The associative search word of term and first term is presented with tree;And by first term and The sorted logic relationship of its associative search word is presented in the form of statements.
Specifically, inquiring the associative search of the first term from meaning of a word library according to the first term that user inputs Word, i.e. same/near synonym of the first term, superordinate term, hyponym, antonym etc., by the first term and its associative search word It is presented with tree.And it is patrolled in the classification inquired in classification associated library between the first term and its associative search word The relationship of collecting is presented in the form of statements.That is the relevant information of the first term of user's input is presented to the user, when When user retrieves, reference can be used as to be retrieved according to the relevant information of presentation.
Referring to Fig. 5, the dictionary for providing one embodiment of the invention establishes system, including obtains module 51, first and save Module 52 and the second preserving module 53.
Module 51 is obtained, is used for according to the specific explanations corresponding with vocabulary described in each of each vocabulary in dictionary library, Obtain the association vocabulary of each vocabulary.
First preserving module 52 is used for for each vocabulary, using the association vocabulary of the vocabulary and the vocabulary as vocabulary Group is stored in the meaning of a word library pre-established.
Second preserving module 53, for protecting the sorted logic relationship between each vocabulary and the association vocabulary of the vocabulary It is stored in the classification associated library pre-established.
A kind of dictionary provided in an embodiment of the present invention establishes a kind of dictionary method for building up that system and previous embodiment provide Corresponding, the technical characteristic that dictionary provided in this embodiment establishes system can refer to the phase of dictionary method for building up in previous embodiment Technical characteristic is closed, details are not described herein.
Referring to Fig. 6, a kind of information retrieval system of one embodiment of the invention, including enquiry module 61 and retrieval are provided Module 62.
Enquiry module 61 inquires described for the first term according to input from meaning of a word library or classification associated library The associative search word of one term.
Retrieval module 62 is retrieved accordingly for being retrieved according to the associative search word of first term As a result;Alternatively, obtaining corresponding search result for being retrieved according to the second term of input.
Wherein, second term is the term selected from the associative search word of first term, described Meaning of a word library and the classification associated library are established based on dictionary method for building up described in the various embodiments described above.
A kind of information retrieval method that a kind of information retrieval system provided in an embodiment of the present invention and previous embodiment provide Corresponding, the technical characteristic of information retrieval system provided in this embodiment can refer to the phase of information retrieval method in previous embodiment Technical characteristic is closed, details are not described herein.
In one embodiment of the invention, a kind of computer storage medium is additionally provided, computer journey is stored thereon with Sequence when the computer program is executed by processor, realizes dictionary method for building up or information retrieval method.
A kind of dictionary method for building up, information retrieval method and corresponding system provided by the invention, according to the word of dictionary library The association vocabulary of each vocabulary is stored in meaning of a word library, and the sorted logic between each vocabulary is closed by remittance and specific explanations System is stored in classification associated library;When carrying out information retrieval, inspection can be found in meaning of a word library and classification associated library according to term The associative search word of rope word carries out vocabulary extension to term, obtains associative search word, and then examined according to associative search word Rope, obtained search result is more comprehensive, is extended to initial results;And also by the associative search word of term and Sorted logic relationship between term and associative search word is presented to the user, for reference, is the judgement and decision of user It supports to provide foundation.
Although above having used general explanation and specific embodiment, the present invention is described in detail, at this On the basis of invention, it can be made some modifications or improvements, this will be apparent to those skilled in the art.Therefore, These modifications or improvements without departing from theon the basis of the spirit of the present invention are fallen within the scope of the claimed invention.

Claims (10)

1. a kind of dictionary method for building up characterized by comprising
S1 obtains each institute's predicate according to the specific explanations corresponding with vocabulary described in each of each vocabulary in dictionary library The association vocabulary of remittance;
S2 is stored in the word pre-established using the association vocabulary of the vocabulary and the vocabulary as vocabulary group for each vocabulary In adopted library;
Sorted logic relationship between each vocabulary and the association vocabulary of the vocabulary is stored in the classification pre-established and closed by S3 Join in library.
2. dictionary method for building up according to claim 1, which is characterized in that the step S1 is specifically included:
S11 collects each vocabulary and the corresponding specific explanations of each vocabulary in dictionary library;
S12 carries out participle fractionation to the specific explanations of each vocabulary, and every according to logic therein characterization Relation acquisition The association vocabulary of one vocabulary, wherein the association vocabulary includes the near synonym, synonym, antonym, upper justice of each vocabulary Word and hyponym;
Corresponding, the step S2 is specifically included:
For each vocabulary, the near synonym of the vocabulary and the vocabulary, synonym, antonym, superordinate term and hyponym are made It is stored in meaning of a word library for vocabulary group, and chooses one of vocabulary as first vocabulary.
3. dictionary method for building up according to claim 2, which is characterized in that the sorted logic relationship include synonym, Near synonym, antonym, superordinate term, hyponym and keyword.
4. dictionary method for building up according to claim 2, which is characterized in that after the step S2 further include:
A, the corpus material other than dictionary library is collected, and cutting is carried out to the corpus material using Chinese word cutting method, is obtained Multiple participles;
B, for participle described in each, access the meaning of a word library, if the participle not in any vocabulary group in the meaning of a word library, Then follow the steps a or c;
C, verification verifying is carried out to the participle, which is brought into existing vocabulary group or newly-built vocabulary group is to described Meaning of a word library is updated, and using the participle as first vocabulary of newly-built vocabulary group.
5. a kind of information retrieval method characterized by comprising
S1 ' inquires the association inspection of first term according to the first term of input from meaning of a word library or classification associated library Rope word;
S2 ' is retrieved according to the associative search word of first term, obtains corresponding search result;Alternatively, according to defeated The second term entered is retrieved, and corresponding search result is obtained;
Wherein, second term is the term selected from the associative search word of first term, the meaning of a word Library and the classification associated library are established based on dictionary method for building up according to any one of claims 1-4.
6. information retrieval method according to claim 5, which is characterized in that the step S1 ' is specifically included:
According to first term, the synonymous of first term is inquired in the meaning of a word library or the classification associated library The associative search word of word, near synonym, antonym, superordinate term and hyponym as first term.
7. information retrieval method according to claim 6, which is characterized in that further include:
With first term for original term, by first term and the associative search of first term Word is presented with tree;
And the sorted logic relationship of first term He its associative search word is presented in the form of statements.
8. a kind of dictionary establishes system characterized by comprising
Module is obtained, for obtaining every according to the specific explanations corresponding with vocabulary described in each of each vocabulary in dictionary library The association vocabulary of one vocabulary;
First preserving module, for being saved for each vocabulary using the association vocabulary of the vocabulary and the vocabulary as vocabulary group In the meaning of a word library pre-established;
Second preserving module, it is pre- for the sorted logic relationship between each vocabulary and the association vocabulary of the vocabulary to be stored in In the classification associated library first established.
9. a kind of information retrieval system characterized by comprising
Enquiry module inquires first retrieval for the first term according to input from meaning of a word library or classification associated library The associative search word of word;
Retrieval module obtains corresponding search result for being retrieved according to the associative search word of first term;Or Person obtains corresponding search result for being retrieved according to the second term of input;
Wherein, second term is the term selected from the associative search word of first term, the meaning of a word Library and the classification associated library are established based on dictionary method for building up according to any one of claims 1-4.
10. a kind of computer storage medium, which is characterized in that be stored thereon with computer program, the computer program is located When managing device execution, dictionary method for building up or information retrieval method are realized.
CN201910568339.4A 2019-06-27 2019-06-27 Word stock establishment method, information retrieval method and corresponding system Active CN110276079B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910568339.4A CN110276079B (en) 2019-06-27 2019-06-27 Word stock establishment method, information retrieval method and corresponding system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910568339.4A CN110276079B (en) 2019-06-27 2019-06-27 Word stock establishment method, information retrieval method and corresponding system

Publications (2)

Publication Number Publication Date
CN110276079A true CN110276079A (en) 2019-09-24
CN110276079B CN110276079B (en) 2023-05-26

Family

ID=67962399

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910568339.4A Active CN110276079B (en) 2019-06-27 2019-06-27 Word stock establishment method, information retrieval method and corresponding system

Country Status (1)

Country Link
CN (1) CN110276079B (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113051898A (en) * 2019-12-27 2021-06-29 北京阿博茨科技有限公司 Word meaning accumulation and word segmentation method, tool and system for structured data searched by natural language
CN113407668A (en) * 2021-06-11 2021-09-17 武夷学院 Data processing method and device for cognitive association capacity training
CN113515585A (en) * 2020-04-10 2021-10-19 中国石油化工股份有限公司 Construction method, retrieval method and system of special lexicon in dangerous chemical safety field
CN117953875A (en) * 2024-03-27 2024-04-30 成都启英泰伦科技有限公司 Offline voice command word storage method based on semantic understanding

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2000222410A (en) * 1999-01-28 2000-08-11 Matsushita Electric Ind Co Ltd Thesaurus retrieving device and thesaurus retrieval system
TW200424874A (en) * 2003-05-09 2004-11-16 Webgenie Information Ltd Automatic thesaurus construction method
US20120124084A1 (en) * 2010-11-06 2012-05-17 Ning Zhu Method to semantically search domain name by utilizing hyponym, hypernym, troponym, entailment and coordinate term
CN108959314A (en) * 2017-05-24 2018-12-07 西安科技大市场创新云服务股份有限公司 A kind of semantic retrieving method and device

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2000222410A (en) * 1999-01-28 2000-08-11 Matsushita Electric Ind Co Ltd Thesaurus retrieving device and thesaurus retrieval system
TW200424874A (en) * 2003-05-09 2004-11-16 Webgenie Information Ltd Automatic thesaurus construction method
US20120124084A1 (en) * 2010-11-06 2012-05-17 Ning Zhu Method to semantically search domain name by utilizing hyponym, hypernym, troponym, entailment and coordinate term
CN108959314A (en) * 2017-05-24 2018-12-07 西安科技大市场创新云服务股份有限公司 A kind of semantic retrieving method and device

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
翟羽佳等: "基于文本挖掘的中文领域本体构建方法研究", 《情报科学》 *

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113051898A (en) * 2019-12-27 2021-06-29 北京阿博茨科技有限公司 Word meaning accumulation and word segmentation method, tool and system for structured data searched by natural language
CN113515585A (en) * 2020-04-10 2021-10-19 中国石油化工股份有限公司 Construction method, retrieval method and system of special lexicon in dangerous chemical safety field
CN113407668A (en) * 2021-06-11 2021-09-17 武夷学院 Data processing method and device for cognitive association capacity training
CN113407668B (en) * 2021-06-11 2022-10-11 武夷学院 Data processing method and device for cognitive association capacity training
CN117953875A (en) * 2024-03-27 2024-04-30 成都启英泰伦科技有限公司 Offline voice command word storage method based on semantic understanding

Also Published As

Publication number Publication date
CN110276079B (en) 2023-05-26

Similar Documents

Publication Publication Date Title
KR101173561B1 (en) Question type and domain identifying apparatus and method
CN103678576B (en) The text retrieval system analyzed based on dynamic semantics
CN100416570C (en) FAQ based Chinese natural language ask and answer method
CN103136352B (en) Text retrieval system based on double-deck semantic analysis
CN100595763C (en) Full text retrieval system based on natural language
CN110059311A (en) A kind of keyword extracting method and system towards judicial style data
CN102144229B (en) System for extracting term from document containing text segment
KR101524889B1 (en) Identification of semantic relationships within reported speech
Yin et al. Facto: a fact lookup engine based on web tables
Rahman et al. STRICT: Information retrieval based search term identification for concept location
CN110276079A (en) A kind of dictionary method for building up, information retrieval method and corresponding system
CN109582704A (en) Recruitment information and the matched method of job seeker resume
CN107908712A (en) Cross-language information matching process based on term extraction
CN107967290A (en) A kind of knowledge mapping network establishing method and system, medium based on magnanimity scientific research data
CN111104488B (en) Method, device and storage medium for integrating retrieval and similarity analysis
Al-Taani et al. An extractive graph-based Arabic text summarization approach
KR100835706B1 (en) System and method for korean morphological analysis for automatic indexing
CN108763272A (en) A kind of event information analysis method, computer readable storage medium and terminal device
JP5718405B2 (en) Utterance selection apparatus, method and program, dialogue apparatus and method
Mahendra et al. Acquiring relational patterns from wikipedia: A case study
CN109284441A (en) Dynamic self-adapting network sensitive information detection method and device
US20120072443A1 (en) Data searching system and method for generating derivative keywords according to input keywords
CN108764972A (en) A kind of film box office prediction technique and device
Hakkani-Tur et al. Statistical sentence extraction for information distillation
CN110019814B (en) News information aggregation method based on data mining and deep learning

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant