CN105550253A - Method and device for obtaining type relation - Google Patents

Method and device for obtaining type relation Download PDF

Info

Publication number
CN105550253A
CN105550253A CN201510900876.6A CN201510900876A CN105550253A CN 105550253 A CN105550253 A CN 105550253A CN 201510900876 A CN201510900876 A CN 201510900876A CN 105550253 A CN105550253 A CN 105550253A
Authority
CN
China
Prior art keywords
type
relationship
entity
description text
specifying
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201510900876.6A
Other languages
Chinese (zh)
Other versions
CN105550253B (en
Inventor
葛宁
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Chongqing Duxiaoman Youyang Technology Co ltd
Original Assignee
Beijing Baidu Netcom Science and Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Baidu Netcom Science and Technology Co Ltd filed Critical Beijing Baidu Netcom Science and Technology Co Ltd
Priority to CN201510900876.6A priority Critical patent/CN105550253B/en
Publication of CN105550253A publication Critical patent/CN105550253A/en
Application granted granted Critical
Publication of CN105550253B publication Critical patent/CN105550253B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/50Information retrieval; Database structures therefor; File system structures therefor of still image data
    • G06F16/58Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/583Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
    • G06F16/5854Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content using shape and object relationship
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/28Databases characterised by their database models, e.g. relational or object models
    • G06F16/284Relational databases
    • G06F16/288Entity relationship models
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/50Information retrieval; Database structures therefor; File system structures therefor of still image data
    • G06F16/58Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/583Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
    • G06F16/5846Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content using extracted text
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/903Querying
    • G06F16/90335Query processing
    • G06F16/90344Query processing by using string matching techniques

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Library & Information Science (AREA)
  • Computational Linguistics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Machine Translation (AREA)

Abstract

Embodiments of the invention provide a method and a device for obtaining a type relation. On one hand, the method provided by the embodiments of the invention comprises the following steps: obtaining entireties and description texts of the entireties to obtain types corresponding to the entireties, and generating the description text of each type according to the description texts of the entities corresponding to each type; and then, extracting M groups of types conforming to the specified type relation from the description text of each type, wherein M is a positive integer. Therefore, the method and the device in the technical scheme provided by the embodiments of the invention can be used for automatically obtaining the relation of the types of the entities, improving the efficiency of obtaining the relation of the types of the entities and reducing the cost of obtaining the relation of the types of the entities.

Description

A kind of acquisition methods of type of relationship and device
[technical field]
The present invention relates to technical field of internet application, particularly relate to a kind of acquisition methods and device of type of relationship.
[background technology]
Mainly comprise entity and the relation between entity and entity in knowledge mapping, therefore the relation obtained between two entities builds and improves the requisite step of knowledge mapping.In prior art, according to relation between the type of entity and type, more entity and inter-entity relation can be excavated, to realize constantly improving knowledge mapping.
In prior art, relation between type and type needs manually gather and add in knowledge mapping, artificial acquisition mode mainly relies on experimental knowledge to carry out artificially collecting and analyzing, require higher to the know-how of personnel, and artificial gatherer process also compares very complicated, therefore, between this artificial grab type, to obtain efficiency comparison low for the mode of relation, and procurement cost is higher.
[summary of the invention]
In view of this, embodiments provide a kind of acquisition methods and device of type of relationship, automatically can obtain the relation between the type of entity and type, improve the acquisition efficiency of the relation between the type of entity and type, reduce the procurement cost of the type of entity and the relation of type.
The one side of the embodiment of the present invention, provides a kind of acquisition methods of type of relationship, comprising:
Obtain the description text of each entity and each entity;
Obtain the type that each entity is corresponding;
According to the description text of each entity corresponding to each type, generate the description text of each type;
According to the type of relationship of specifying, from the description text of each type, extract the M set type meeting described type of relationship of specifying, M is positive integer.
Aspect as above and arbitrary possible implementation, provide a kind of implementation further, the type that each entity of described acquisition is corresponding, comprising:
According to classification of type knowledge, and according to type, each entity is polymerized, to obtain type corresponding to each entity; Or,
By each entity input type disaggregated model respectively, to make classification of type model carry out classification of type to each entity, to obtain type corresponding to each entity.
Aspect as above and arbitrary possible implementation, provide a kind of implementation further, and the description text of the described each entity corresponding according to each type, generates the description text of each type, comprising:
Word process is cut to the description text of each entity corresponding to each type, cuts word result to obtain;
Use pattern knowledge base is mated respectively cutting in word result;
If one is cut in word result the keyword comprising and define in described type knowledge base, extract and comprise the text fragments that this cuts word result;
According to each text fragments extracted, generate the description text of each type.
Aspect as above and arbitrary possible implementation, provide a kind of implementation further, the described type of relationship according to specifying, and extracts the M set type meeting described type of relationship of specifying, comprising from the description text of each type:
Obtain the relationship templates of specifying, the corresponding type of relationship of described relationship templates, described relationship templates comprises the content of text of the type of relationship between instruction two types;
Utilize described relationship templates, in the description text of each type, carry out character match, from the description text of each type, extract N set type; N is greater than or equal to M, and is positive integer;
According to the N set type extracted, obtain the M set type meeting described type of relationship of specifying.
Aspect as above and arbitrary possible implementation, provide a kind of implementation further, the N set type that described basis extracts, and obtains the M set type meeting described type of relationship of specifying, comprising:
Carry out title normalized to P type in described N set type, P is positive integer;
For the N set type after normalized, according to belonging to the same type of different group and described type of relationship of specifying, N set type is merged into described M set type.
Aspect as above and arbitrary possible implementation, provide a kind of implementation further, described method also comprises: add described type of relationship of specifying and the M set type meeting described type of relationship of specifying to knowledge mapping.
The one side of the embodiment of the present invention, provides a kind of acquisition device of type of relationship, comprising:
Receiver module, for obtaining the description text of each entity and each entity;
Sort module, for obtaining type corresponding to each entity;
Generation module, for the description text according to each entity corresponding to each type, generates the description text of each type;
Acquisition module, for according to the type of relationship of specifying, from the description text of each type, extract the M set type meeting described type of relationship of specifying, M is positive integer.
Aspect as above and arbitrary possible implementation, provide a kind of implementation, described sort module further, specifically for:
According to classification of type knowledge, and according to type, each entity is polymerized, to obtain type corresponding to each entity; Or,
By each entity input type disaggregated model respectively, to make classification of type model carry out classification of type to each entity, to obtain type corresponding to each entity.
Aspect as above and arbitrary possible implementation, provide a kind of implementation, described generation module further, specifically for:
Word process is cut to the description text of each entity corresponding to each type, cuts word result to obtain;
Use pattern knowledge base is mated respectively cutting in word result;
If one is cut in word result the keyword comprising and define in described type knowledge base, extract and comprise the text fragments that this cuts word result;
According to each text fragments extracted, generate the description text of each type.
Aspect as above and arbitrary possible implementation, provide a kind of implementation, described acquisition module further, specifically for:
Obtain the relationship templates of specifying, the corresponding type of relationship of described relationship templates, described relationship templates comprises the content of text of the type of relationship between instruction two types;
Utilize described relationship templates, in the description text of each type, carry out character match, from the description text of each type, extract N set type; N is greater than or equal to M, and is positive integer;
According to the N set type extracted, obtain the M set type meeting described type of relationship of specifying.
Aspect as above and arbitrary possible implementation, provide a kind of implementation further, and described acquisition module is used for the N set type according to extracting, when acquisition meets the M set type of described type of relationship of specifying, specifically for:
Carry out title normalized to P type in described N set type, P is positive integer;
For the N set type after normalized, according to belonging to the same type of different group and described type of relationship of specifying, N set type is merged into described M set type.
Aspect as above and arbitrary possible implementation, provide a kind of implementation further, described device also comprises: processing module, for adding described type of relationship of specifying and the M set type meeting described type of relationship of specifying to knowledge mapping.
As can be seen from the above technical solutions, the embodiment of the present invention has following beneficial effect:
The technical scheme that the embodiment of the present invention provides, automatically the relation between the type of entity and type can be obtained, compared with relying on the mode of relation between the artificial type gathering entity in prior art, by relation between the automatic type gathering entity in the embodiment of the present invention, artificial collection can be avoided, therefore improve the acquisition efficiency of the relation between the type of entity and type, reduce the procurement cost of the type of entity and the relation of type.
[accompanying drawing explanation]
In order to be illustrated more clearly in the technical scheme of the embodiment of the present invention, be briefly described to the accompanying drawing used required in embodiment below, apparently, accompanying drawing in the following describes is only some embodiments of the present invention, for those of ordinary skill in the art, under the prerequisite not paying creative work, other accompanying drawing can also be obtained according to these accompanying drawings.
Fig. 1 is the schematic flow sheet of the acquisition methods of the type of relationship that the embodiment of the present invention provides;
Fig. 2 is the functional block diagram of the acquisition device of the type of relationship that the embodiment of the present invention provides.
[embodiment]
Technical scheme for a better understanding of the present invention, is described in detail the embodiment of the present invention below in conjunction with accompanying drawing.
Should be clear and definite, described embodiment is only the present invention's part embodiment, instead of whole embodiments.Based on the embodiment in the present invention, those of ordinary skill in the art, not making other embodiments all obtained under creative work prerequisite, belong to the scope of protection of the invention.
The term used in embodiments of the present invention is only for the object describing specific embodiment, and not intended to be limiting the present invention." one ", " described " and " being somebody's turn to do " of the singulative used in the embodiment of the present invention and appended claims is also intended to comprise most form, unless context clearly represents other implications.
Should be appreciated that term "and/or" used herein is only a kind of incidence relation describing affiliated partner, can there are three kinds of relations in expression, and such as, A and/or B, can represent: individualism A, exists A and B simultaneously, these three kinds of situations of individualism B.In addition, character "/" herein, general expression forward-backward correlation is to the relation liking a kind of "or".
Depend on linguistic context, word as used in this " if " can be construed as into " ... time " or " when ... time " or " in response to determining " or " in response to detection ".Similarly, depend on linguistic context, phrase " if determination " or " if detecting (the conditioned disjunction event of statement) " can be construed as " when determining " or " in response to determining " or " when detecting (the conditioned disjunction event of statement) " or " in response to detection (the conditioned disjunction event of statement) ".
Embodiment one
The embodiment of the present invention provides a kind of acquisition methods of type of relationship, please refer to Fig. 1, the schematic flow sheet of the acquisition methods of its type of relationship provided for the embodiment of the present invention, and as shown in the figure, the method comprises the following steps:
S101, obtains the description text of each entity and each entity.
Concrete, given some entities can be obtained, and the description text of each entity in given some entities.
Be company for entity, the description text of company can be the description text of corporate business.Such as, the product that the scope of business that can comprise company in the description text of corporate business describes, the title of company, company produce, company produce the vendor name etc. of required starting material and necessary parts.
Such as, for entity " Huawei Tech Co., Ltd ", it describes text can for " Huawei Tech Co., Ltd is the private communication scientific & technical corporation of a production and sales communication facilities, and general headquarters are positioned at Chinese Guangdong province Huawei Base, Bantian, Longgang District, Shenzhen City.The product of Huawei relates generally to exchange network, transmission network, wireless and wired fixed access network network and data communication network in communication network and wireless terminal product, for common carrier and specialized network owner provide hardware device, software, service and solution all over the world ".
S102, obtains the type that each entity is corresponding.
Concrete, for given each entity, need first to obtain type corresponding to each entity.
Illustrate, the method obtaining type corresponding to each entity can include but not limited to following two kinds:
The first: according to classification of type knowledge, and according to type, each entity is polymerized, to obtain type corresponding to each entity.
Be understandable that, described classification of type knowledge comprises classification of type tree, and the node in classification of type tree is type, and the child node of type is Extended-type word.
In a concrete implementation procedure, word can be cut to the description text of given each entity, with the description text obtaining each entity corresponding cut word result.Then the type in use pattern classification tree or Extended-type word, carry out character match respectively cutting in word result.If cut word result can hit certain type or certain Extended-type word, then can think that the type that this cuts entity corresponding to word result is that this cuts the type of word result hit, or this cuts the type corresponding to Extended-type word of word result hit.Like this, for certain entity, just can according to the description text of this entity corresponding cut word result, determine the type of this entity, achieve and according to type, each entity be polymerized, obtain the type that each entity is corresponding, be also just equivalent to obtain all types of in some entities corresponding to each type.
Be company with entity below, the type of entity is industry belonging to company is that example is illustrated.
For entity " Huawei Tech Co., Ltd ", it describes text for " Huawei Tech Co., Ltd is the private communication scientific & technical corporation of a production and sales communication facilities, and general headquarters are positioned at Chinese Guangdong province Huawei Base, Bantian, Longgang District, Shenzhen City.The product of Huawei relates generally to exchange network, transmission network, wireless and wired fixed access network network and data communication network in communication network and wireless terminal product, for common carrier and specialized network owner provide hardware device, software, service and solution all over the world ".
Describe after text cuts word to this, all types of or all types of expansion word is utilized to cut in word result carry out character match at this, word result is cut in discovery can hit type " communication equipment manufacturing industry ", be somebody's turn to do " communication equipment manufacturing industry " subtype as " manufacturing industry ", can using the type of subtype " communication equipment manufacturing industry " as above-mentioned entity in the embodiment of the present invention.Or, cut word result and can also hit Extended-type word, as " telephone set, mobile phone, the network switch ", then according to type corresponding to Extended-type word that this cuts the hit of word result, also can determine that the type of above-mentioned entity is for " communication equipment manufacturing industry ".Like this, just its type can be obtained for entity " Huawei Tech Co., Ltd ", and, utilize said method also can obtain some entities corresponding to the type " communication equipment manufacturing industry ", thus realize by carrying out cluster to each entity, obtain the type of each entity, and all types of in some types corresponding to each entity.
The second: by each entity input type disaggregated model respectively, to make classification of type model carry out classification of type to each entity, to obtain type corresponding to each entity.
In a concrete implementation procedure, the description text of a large amount of entities, entity and entity corresponding types can be utilized as training sample, machine learning is carried out to this training sample, generates classification of type model.Then, for given each entity, can by description Text Input the type disaggregated model of the title of each entity or entity, to make classification of type model, type identification is carried out to its entity, classification of type model can obtain and export the type of each entity, be also just equivalent to obtain all types of in some entities corresponding to each type.
S103, according to the description text of each entity corresponding to each type, generates the description text of each type.
Concrete, due to the type of each entity can be obtained in S102, therefore can according to the type of each entity, obtain all types of in some entities corresponding to each type.In this step, need the description text according to each entity corresponding to each type, describe text for each type generates.
Illustrate, according to the description text of each entity corresponding to each type, the method generating the description text of each type can include but not limited to:
First, word process is cut to the description text of each entity corresponding to each type, cuts word result to obtain.Then, use pattern knowledge base is mated respectively cutting in word result; If one is cut in word result the keyword comprising and define in described type knowledge base, extract and comprise the text fragments that this cuts word result.Finally, according to each text fragments extracted, generate the description text of each type.
It should be noted that, the title of type, another name and Extended-type word etc. in type knowledge base, can be comprised.
In a concrete implementation procedure, for each type, the description text of each entity corresponding for the type can be carried out cutting word, obtain the description text of this entity corresponding cut word result.Then, use pattern knowledge base, carry out character match respectively cutting in word result, if find, certain is cut word result and has hit the title of the type, another name or Extended-type word, just can extracting from the description text of this entity and comprise the text fragments that this cuts word result, cutting sentence or one section of text of word result as comprised.So, corresponding text fragments can be extracted from the description text of each entity corresponding to the type, then utilize the some text fragments extracted to form set, using the description text of this set as the type.Aforesaid way can be used to obtain the description text of the type for each type, thus all types of description texts can be obtained.
Take entity as company, type industry belonging to entity is example, the description text of type can comprise " starting material of industry a are provided by industry b ", " industry a relies on industry b " or " industry a is by the impact of industry b " etc., these can as the description text of industry a, and industry a belongs to the type of company.
S104, according to the type of relationship of specifying, from the description text of each type, extract the M set type meeting described type of relationship of specifying, M is positive integer.
Concrete, after the description text obtaining each entity, according to the type of relationship of specifying, can extract the some set types meeting this specified relationship from the description text of each type, wherein, every set type can comprise two different types.
Illustrate, in the embodiment of the present invention, according to the type of relationship of specifying, from the description text of each type, extract the method meeting the M set type of described type of relationship of specifying can include but not limited to:
First, obtain the relationship templates of specifying, the corresponding type of relationship of this relationship templates, described relationship templates comprises the content of text of the type of relationship between instruction two types.Then, utilize this relationship templates, in the description text of each type, carry out character match, from the description text of each type, extract N set type; N is greater than or equal to M, and is positive integer; Finally, according to the N set type extracted, obtain the M set type meeting described type of relationship of specifying.
Be understandable that, type of relationship between two types due to what define in relationship templates, therefore, when utilizing relationship templates to mate in the description text of type, if the content in description text and the character of relationship templates match, then can extract a set type from description text, some text fragments can be comprised in the description text of each type, for the description text of the type, some set types can be extracted, two types in the every set type extracted are exactly the type meeting type of relationship corresponding to this relationship templates, thus the type of relationship of two types can be obtained.
Be company with entity, with type, belonging to entity, industry is example, and the type of relationship that the relationship templates of specifying is corresponding is Relationship.Such as, relationship templates can comprise " xx starting material are provided by xx ", " xx relies on xx " or the impact of xx " xx by ", the Relationship that these relationship templates are corresponding is " relation of downstream and upstream ", namely in two industries that the often group industry utilizing these relationship templates to extract from the description text of type comprises, the former is the downstream industry of the latter, the latter is the former upstream industry, and therefore, the relation of two industries is relations of upstream industry and downstream industry.
Illustrate, in the embodiment of the present invention, according to the N set type extracted, obtaining the method meeting the M set type of described type of relationship of specifying can include but not limited to:
First, carry out title normalized to P type in described N set type, wherein, P is positive integer, and P is less than 2N.Then, for the N set type after normalized, according to belonging to the same type of different group and described type of relationship of specifying, N set type is merged into described M set type.
Such as, if comprise same type in certain two set type, but the title of the type in two groups is different, and one of them is title accurately, and another is another name, then can by unified for the title of the type in two groups be title accurately.After Uniform Name, just easily two set types comprising an identical type can be merged, generate one group of new type, the type of relationship that before this set type still meets, two set types meet.
Industry for type, if the first set type meeting the relation of upstream industry and downstream industry comprises " industry 1-industry 2 ", second set type comprises " industry 2-industry 3 ", therefore, two groups of industries can be combined, be merged into one group of industry, be i.e. " industry 1-industry 2-industry 3 ", in this group industry, previous industry is all the upstream industry of a rear industry, and a rear industry is all the downstream industry of previous industry.
Alternatively, in one of the present embodiment possible implementation, obtain after meeting the M set type of the type of relationship of specifying in S104, can described type of relationship and the M set type meeting described type of relationship be added to knowledge mapping.
Such as, the M set type of acquisition can be added under type of relationship described in knowledge mapping, in M set type, plural type can be comprised.Or, can, in knowledge mapping in each type, be also the upper another type of the type mark, and the type of relationship of mark above and between another type.
Industry for type, relation between two industries can be added and add some groups of industries for Relationship in the relevant knowledge mapping of the relevant knowledge mapping of business or market, represent that the two or more industry in every group industry has the sector relation.Thus the excavation of relation between industry can be realized, obtain the relation between two industries, the relation between industry may be used for disclosing business supply chain, be the important component part of process commercial competition information.Therefore, the pass between industry ties up in real life has very big effect.
The embodiment of the present invention provides the device embodiment realizing each step and method in said method embodiment further.
Please refer to Fig. 2, the functional block diagram of the acquisition device of its type of relationship provided for the embodiment of the present invention.As shown in the figure, this device comprises:
Receiver module 21, for obtaining the description text of each entity and each entity;
Sort module 22, for obtaining type corresponding to each entity;
Generation module 23, for the description text according to each entity corresponding to each type, generates the description text of each type;
Acquisition module 24, for according to the type of relationship of specifying, from the description text of each type, extract the M set type meeting described type of relationship of specifying, M is positive integer.
In a concrete implementation procedure, described sort module 22, specifically for:
According to classification of type knowledge, and according to type, each entity is polymerized, to obtain type corresponding to each entity; Or, by each entity input type disaggregated model respectively, to make classification of type model carry out classification of type to each entity, to obtain type corresponding to each entity.
In a concrete implementation procedure, described generation module 23, specifically for:
Word process is cut to the description text of each entity corresponding to each type, cuts word result to obtain;
Use pattern knowledge base is mated respectively cutting in word result;
If one is cut in word result the keyword comprising and define in described type knowledge base, extract and comprise the text fragments that this cuts word result;
According to each text fragments extracted, generate the description text of each type.
In a concrete implementation procedure, described acquisition module 24, specifically for:
Obtain the relationship templates of specifying, the corresponding type of relationship of described relationship templates, described relationship templates comprises the content of text of the type of relationship between instruction two types;
Utilize described relationship templates, in the description text of each type, carry out character match, from the description text of each type, extract N set type; N is greater than or equal to M, and is positive integer;
According to the N set type extracted, obtain the M set type meeting described type of relationship of specifying.
In a concrete implementation procedure, described acquisition module 24 for according to the N set type extracted, obtains when meeting the M set type of described type of relationship of specifying, specifically for:
Carry out title normalized to P type in described N set type, P is positive integer;
For the N set type after normalized, according to belonging to the same type of different group and described type of relationship of specifying, N set type is merged into described M set type.
Optionally, in one of the present embodiment possible implementation, described device also comprises:
Processing module 25, for adding described type of relationship of specifying and the M set type meeting described type of relationship of specifying to knowledge mapping.
Because each unit in the present embodiment can perform the method shown in Fig. 1, the part that the present embodiment is not described in detail, can with reference to the related description to Fig. 1.
The technical scheme of the embodiment of the present invention has following beneficial effect:
In the embodiment of the present invention, by obtaining the description text of each entity and each entity; Thus, obtain the type that each entity is corresponding, and, according to the description text of each entity corresponding to each type, generate the description text of each type; And then according to the type of relationship of specifying, extract the M set type meeting described type of relationship of specifying from the description text of each type, M is positive integer.
The technical scheme that the embodiment of the present invention provides, automatically the relation between the type of entity and type can be obtained, compared with relying on the mode of relation between the artificial type gathering entity in prior art, by relation between the automatic type gathering entity in the embodiment of the present invention, artificial collection can be avoided, therefore improve the acquisition efficiency of the relation between the type of entity and type, reduce the procurement cost of the type of entity and the relation of type, use manpower and material resources sparingly.
Those skilled in the art can be well understood to, and for convenience and simplicity of description, the system of foregoing description, the specific works process of device and unit, with reference to the corresponding process in preceding method embodiment, can not repeat them here.
In several embodiment provided by the present invention, should be understood that, disclosed system, apparatus and method, can realize by another way.Such as, device embodiment described above is only schematic, such as, the division of described unit, is only a kind of logic function and divides, and actual can have other dividing mode when realizing, such as, multiple unit or assembly can in conjunction with or another system can be integrated into, or some features can be ignored, or do not perform.Another point, shown or discussed coupling each other or direct-coupling or communication connection can be by some interfaces, and the indirect coupling of device or unit or communication connection can be electrical, machinery or other form.
The described unit illustrated as separating component or can may not be and physically separates, and the parts as unit display can be or may not be physical location, namely can be positioned at a place, or also can be distributed in multiple network element.Some or all of unit wherein can be selected according to the actual needs to realize the object of the present embodiment scheme.
In addition, each functional unit in each embodiment of the present invention can be integrated in a processing unit, also can be that the independent physics of unit exists, also can two or more unit in a unit integrated.Above-mentioned integrated unit both can adopt the form of hardware to realize, and the form that hardware also can be adopted to add SFU software functional unit realizes.
The above-mentioned integrated unit realized with the form of SFU software functional unit, can be stored in a computer read/write memory medium.Above-mentioned SFU software functional unit is stored in a storage medium, comprising some instructions in order to make a computer installation (can be personal computer, server, or network equipment etc.) or processor (Processor) perform the part steps of method described in each embodiment of the present invention.And aforesaid storage medium comprises: USB flash disk, portable hard drive, ROM (read-only memory) (Read-OnlyMemory, ROM), random access memory (RandomAccessMemory, RAM), magnetic disc or CD etc. various can be program code stored medium.
The foregoing is only preferred embodiment of the present invention, not in order to limit the present invention, within the spirit and principles in the present invention all, any amendment made, equivalent replacement, improvement etc., all should be included within the scope of protection of the invention.

Claims (12)

1. an acquisition methods for type of relationship, is characterized in that, described method comprises:
Obtain the description text of each entity and each entity;
Obtain the type that each entity is corresponding;
According to the description text of each entity corresponding to each type, generate the description text of each type;
According to the type of relationship of specifying, from the description text of each type, extract the M set type meeting described type of relationship of specifying, M is positive integer.
2. method according to claim 1, is characterized in that, the type that each entity of described acquisition is corresponding, comprising:
According to classification of type knowledge, and according to type, each entity is polymerized, to obtain type corresponding to each entity; Or,
By each entity input type disaggregated model respectively, to make classification of type model carry out classification of type to each entity, to obtain type corresponding to each entity.
3. method according to claim 1, is characterized in that, the description text of the described each entity corresponding according to each type, generates the description text of each type, comprising:
Word process is cut to the description text of each entity corresponding to each type, cuts word result to obtain;
Use pattern knowledge base is mated respectively cutting in word result;
If one is cut in word result the keyword comprising and define in described type knowledge base, extract and comprise the text fragments that this cuts word result;
According to each text fragments extracted, generate the description text of each type.
4. method according to claim 1, is characterized in that, the described type of relationship according to specifying, and extracts the M set type meeting described type of relationship of specifying, comprising from the description text of each type:
Obtain the relationship templates of specifying, the corresponding type of relationship of described relationship templates, described relationship templates comprises the content of text of the type of relationship between instruction two types;
Utilize described relationship templates, in the description text of each type, carry out character match, from the description text of each type, extract N set type; N is greater than or equal to M, and is positive integer;
According to the N set type extracted, obtain the M set type meeting described type of relationship of specifying.
5. method according to claim 4, is characterized in that, the N set type that described basis extracts, and obtains the M set type meeting described type of relationship of specifying, comprising:
Carry out title normalized to P type in described N set type, P is positive integer;
For the N set type after normalized, according to belonging to the same type of different group and described type of relationship of specifying, N set type is merged into described M set type.
6. method according to any one of claim 1 to 5, is characterized in that, described method also comprises: add described type of relationship of specifying and the M set type meeting described type of relationship of specifying to knowledge mapping.
7. an acquisition device for type of relationship, is characterized in that, described device comprises:
Receiver module, for obtaining the description text of each entity and each entity;
Sort module, for obtaining type corresponding to each entity;
Generation module, for the description text according to each entity corresponding to each type, generates the description text of each type;
Acquisition module, for according to the type of relationship of specifying, from the description text of each type, extract the M set type meeting described type of relationship of specifying, M is positive integer.
8. device according to claim 7, is characterized in that, described sort module, specifically for:
According to classification of type knowledge, and according to type, each entity is polymerized, to obtain type corresponding to each entity; Or,
By each entity input type disaggregated model respectively, to make classification of type model carry out classification of type to each entity, to obtain type corresponding to each entity.
9. device according to claim 7, is characterized in that, described generation module, specifically for:
Word process is cut to the description text of each entity corresponding to each type, cuts word result to obtain;
Use pattern knowledge base is mated respectively cutting in word result;
If one is cut in word result the keyword comprising and define in described type knowledge base, extract and comprise the text fragments that this cuts word result;
According to each text fragments extracted, generate the description text of each type.
10. device according to claim 7, is characterized in that, described acquisition module, specifically for:
Obtain the relationship templates of specifying, the corresponding type of relationship of described relationship templates, described relationship templates comprises the content of text of the type of relationship between instruction two types;
Utilize described relationship templates, in the description text of each type, carry out character match, from the description text of each type, extract N set type; N is greater than or equal to M, and is positive integer;
According to the N set type extracted, obtain the M set type meeting described type of relationship of specifying.
11. devices according to claim 10, is characterized in that, described acquisition module is used for the N set type according to extracting, when acquisition meets the M set type of described type of relationship of specifying, specifically for:
Carry out title normalized to P type in described N set type, P is positive integer;
For the N set type after normalized, according to belonging to the same type of different group and described type of relationship of specifying, N set type is merged into described M set type.
12. devices according to any one of claim 7 to 11, it is characterized in that, described device also comprises:
Processing module, for adding described type of relationship of specifying and the M set type meeting described type of relationship of specifying to knowledge mapping.
CN201510900876.6A 2015-12-09 2015-12-09 Method and device for acquiring type relationship Active CN105550253B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510900876.6A CN105550253B (en) 2015-12-09 2015-12-09 Method and device for acquiring type relationship

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510900876.6A CN105550253B (en) 2015-12-09 2015-12-09 Method and device for acquiring type relationship

Publications (2)

Publication Number Publication Date
CN105550253A true CN105550253A (en) 2016-05-04
CN105550253B CN105550253B (en) 2021-02-12

Family

ID=55829442

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510900876.6A Active CN105550253B (en) 2015-12-09 2015-12-09 Method and device for acquiring type relationship

Country Status (1)

Country Link
CN (1) CN105550253B (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106156286A (en) * 2016-06-24 2016-11-23 广东工业大学 Type extraction system and method towards technical literature knowledge entity
CN107031617A (en) * 2017-02-22 2017-08-11 湖北文理学院 The method and device that a kind of automobile intelligent drives
CN107844471A (en) * 2016-09-20 2018-03-27 科大讯飞股份有限公司 A kind of text describes kind identification method and device
CN108154198A (en) * 2018-01-25 2018-06-12 北京百度网讯科技有限公司 Knowledge base entity normalizing method, system, terminal and computer readable storage medium
CN109558584A (en) * 2018-10-26 2019-04-02 平安科技(深圳)有限公司 Business connection prediction technique, device, computer equipment and storage medium
CN111209389A (en) * 2019-12-31 2020-05-29 天津外国语大学 Movie story generation method

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103049490A (en) * 2012-12-05 2013-04-17 北京海量融通软件技术有限公司 Attribute generation system and generation method among knowledge network nodes
CN103488789A (en) * 2013-10-08 2014-01-01 百度在线网络技术(北京)有限公司 Recommendation method, recommendation device and search engine

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103049490A (en) * 2012-12-05 2013-04-17 北京海量融通软件技术有限公司 Attribute generation system and generation method among knowledge network nodes
CN103488789A (en) * 2013-10-08 2014-01-01 百度在线网络技术(北京)有限公司 Recommendation method, recommendation device and search engine

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106156286A (en) * 2016-06-24 2016-11-23 广东工业大学 Type extraction system and method towards technical literature knowledge entity
CN106156286B (en) * 2016-06-24 2019-09-17 广东工业大学 Type extraction system and method towards technical literature knowledge entity
CN107844471A (en) * 2016-09-20 2018-03-27 科大讯飞股份有限公司 A kind of text describes kind identification method and device
CN107031617A (en) * 2017-02-22 2017-08-11 湖北文理学院 The method and device that a kind of automobile intelligent drives
CN108154198A (en) * 2018-01-25 2018-06-12 北京百度网讯科技有限公司 Knowledge base entity normalizing method, system, terminal and computer readable storage medium
CN108154198B (en) * 2018-01-25 2021-07-13 北京百度网讯科技有限公司 Knowledge base entity normalization method, system, terminal and computer readable storage medium
CN109558584A (en) * 2018-10-26 2019-04-02 平安科技(深圳)有限公司 Business connection prediction technique, device, computer equipment and storage medium
CN111209389A (en) * 2019-12-31 2020-05-29 天津外国语大学 Movie story generation method
CN111209389B (en) * 2019-12-31 2023-08-11 天津外国语大学 Movie story generation method

Also Published As

Publication number Publication date
CN105550253B (en) 2021-02-12

Similar Documents

Publication Publication Date Title
CN105550253A (en) Method and device for obtaining type relation
CN102722709B (en) Method and device for identifying garbage pictures
CN102591880B (en) Information providing method and device
CN104750789A (en) Label recommendation method and device
CN102567494B (en) Website classification method and device
CN104102639B (en) Popularization triggering method based on text classification and device
CN103324666A (en) Topic tracing method and device based on micro-blog data
CN105468583A (en) Entity relationship obtaining method and device
CN106201890B (en) The performance optimization method and server of a kind of application
CN111309910A (en) Text information mining method and device
CN102890683B (en) Information providing method and device
CN105677857B (en) method and device for accurately matching keywords with marketing landing pages
CN104866308A (en) Scenario image generation method and apparatus
CN103778226A (en) Method for establishing language information recognition model and language information recognition device
CN105183710A (en) Method for automatically generating document summary
CN105302807A (en) Method and apparatus for obtaining information category
CN104346408A (en) Method and equipment for labeling network user
CN105045412A (en) Method and system for generating candidate item of input method
CN104657437A (en) Monitoring method and monitoring device for promotion status data
CN116881430A (en) Industrial chain identification method and device, electronic equipment and readable storage medium
TR200300425T2 (en) Method of displaying the operating behavior of the facilities
CN103309851B (en) The rubbish recognition methods of short text and system
CN113569118B (en) Self-media pushing method, device, computer equipment and storage medium
CN107395695A (en) Service push method and device
CN104391798A (en) Software feature information extracting method

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
TA01 Transfer of patent application right
TA01 Transfer of patent application right

Effective date of registration: 20191209

Address after: 201210 room j1328, floor 3, building 8, No. 55, Huiyuan Road, Jiading District, Shanghai

Applicant after: SHANGHAI YOUYANG NEW MEDIA INFORMATION TECHNOLOGY Co.,Ltd.

Address before: 100085 Baidu building, No. 10, ten Street, Haidian District, Beijing

Applicant before: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY Co.,Ltd.

GR01 Patent grant
GR01 Patent grant
CP03 Change of name, title or address
CP03 Change of name, title or address

Address after: 401120 b7-7-2, Yuxing Plaza, No.5, Huangyang Road, Yubei District, Chongqing

Patentee after: Chongqing duxiaoman Youyang Technology Co.,Ltd.

Address before: 201210 room j1328, 3 / F, building 8, 55 Huiyuan Road, Jiading District, Shanghai

Patentee before: SHANGHAI YOUYANG NEW MEDIA INFORMATION TECHNOLOGY Co.,Ltd.