CN104899304A - Named entity identification method and device - Google Patents

Named entity identification method and device

Info

Publication number
CN104899304A
CN104899304A · Application CN201510321448.8A · Granted publication CN104899304B
Authority
CN
China
Prior art keywords
word
sample word
to-be-predicted word
vector
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201510321448.8A
Other languages
Chinese (zh)
Other versions
CN104899304B (en)
Inventor
姜文
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Jingdong Century Trading Co Ltd
Beijing Jingdong Shangke Information Technology Co Ltd
Original Assignee
Beijing Jingdong Century Trading Co Ltd
Beijing Jingdong Shangke Information Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Jingdong Century Trading Co Ltd, Beijing Jingdong Shangke Information Technology Co Ltd filed Critical Beijing Jingdong Century Trading Co Ltd
Priority to CN201510321448.8A priority Critical patent/CN104899304B/en
Publication of CN104899304A publication Critical patent/CN104899304A/en
Application granted granted Critical
Publication of CN104899304B publication Critical patent/CN104899304B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/35Clustering; Classification
    • G06F16/353Clustering; Classification into predefined classes
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • G06F40/284Lexical analysis, e.g. tokenisation or collocates
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/30Semantic analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Data Mining & Analysis (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Evolutionary Computation (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Databases & Information Systems (AREA)
  • Machine Translation (AREA)

Abstract

The invention provides a named entity identification method and a named entity identification device capable of accurately identifying named entities, in particular named entities in the e-commerce field. The method comprises: acquiring a vector library; segmenting a training-corpus text string into a plurality of ordered sample words; querying the vector library for each sample word in turn to build a first feature vector, which comprises the word vector and part-of-speech vector corresponding to that sample word as well as the entity tag vector corresponding to the preceding word; taking all the first feature vectors together as the training input and training a neural-network named entity recognition model; segmenting a to-be-predicted text string into a plurality of to-be-predicted words; querying the vector library for each to-be-predicted word in turn to build a second feature vector, which comprises the word vector and part-of-speech vector corresponding to that to-be-predicted word as well as the entity tag vector corresponding to the preceding word; and inputting the second feature vector of each to-be-predicted word into the model to output the entity tag of that word.

Description

Named entity recognition method and device
Technical field
The present invention relates to the field of natural language processing technology, and in particular to a named entity recognition method and device.
Background technology
With the rapid development of Internet technology, information services are becoming increasingly widespread. Named entity recognition is fundamental to information-service applications such as information extraction, question answering systems, syntactic analysis, machine translation and Internet metadata annotation. A named entity (entity for short) refers to person names, organization names, place names and all other entities identified by a name; in a broader sense, named entities also include numbers, dates, currencies, addresses and the like.
Techniques that train named entity recognition with neural network technology already exist in the prior art. Existing methods have at least the following shortcomings: (1) they rely mainly on the word itself as the input feature, so the model features are limited and the forward-backward dependency between entity tags is not introduced directly, which leads to low recognition accuracy, especially when recognizing named entities in the e-commerce field; (2) because the initial values of the network are generated randomly, the final parameter optimization result may not be good enough, and the long training time lowers development efficiency; (3) the distribution of the training data is not fully taken into account, so the model fits different entities unevenly.
Named entities in the e-commerce field, such as product names (Nokia 1020, ThinkPad E431 14-inch notebook computer), prices and product attributes, are usually made up of one or more consecutive words in a sentence, and their part-of-speech pattern is typically of the form "noun + number". In short, named entities in the e-commerce field have distinctive characteristics, and a recognition method or device tailored to them is urgently needed.
Summary of the invention
In view of this, the present invention provides a named entity recognition method and device that can accurately recognize named entities, in particular named entities in the e-commerce field.
To achieve the above object, according to one aspect of the present invention, a named entity recognition method is provided, comprising: acquiring a vector library, the vector library comprising word vectors respectively corresponding to a plurality of words, part-of-speech vectors respectively corresponding to a plurality of part-of-speech classes, and entity tag vectors respectively corresponding to a plurality of entity tag classes; segmenting a training-corpus text string into a plurality of ordered sample words; querying the vector library for each sample word in turn to build a first feature vector, the first feature vector comprising the word vector corresponding to the sample word, the part-of-speech vector corresponding to the sample word and the entity tag vector corresponding to the word preceding the sample word; taking all the first feature vectors corresponding to the sample words as a whole as the training input of a neural network, solving the network parameters with the neural network back-propagation (BP) algorithm, and obtaining a neural-network named entity recognition model; segmenting a to-be-predicted text string into a plurality of ordered to-be-predicted words; querying the vector library for each to-be-predicted word in turn to build a second feature vector, the second feature vector comprising the word vector corresponding to the to-be-predicted word, the part-of-speech vector corresponding to the to-be-predicted word and the entity tag vector corresponding to the word preceding the to-be-predicted word; and inputting the second feature vector corresponding to each to-be-predicted word into the neural-network named entity recognition model respectively, and outputting the entity tag of the to-be-predicted word.
Optionally, the first feature vector further comprises the word vectors and part-of-speech vectors corresponding to the words adjacent to the sample word, and the second feature vector further comprises the word vectors and part-of-speech vectors corresponding to the words adjacent to the to-be-predicted word.
Optionally, when the first feature vector is built for the first of the ordered sample words, the word preceding the first sample word is a preset character string; and when the second feature vector is built for the first of the ordered to-be-predicted words, the word preceding the first to-be-predicted word is a preset character string.
Optionally, the training input of the neural network further comprises negative example samples.
To achieve the above object, according to another aspect of the present invention, a named entity recognition device is provided, comprising: a vector library acquisition module for acquiring a vector library, the vector library comprising word vectors respectively corresponding to a plurality of words, part-of-speech vectors respectively corresponding to a plurality of part-of-speech classes, and entity tag vectors respectively corresponding to a plurality of entity tag classes; a first word segmentation module for segmenting a training-corpus text string into a plurality of ordered sample words; a first building module for querying the vector library for each sample word in turn to build a first feature vector, the first feature vector comprising the word vector corresponding to the sample word, the part-of-speech vector corresponding to the sample word and the entity tag vector corresponding to the word preceding the sample word; a training module for taking all the first feature vectors corresponding to the sample words as a whole as the training input of a neural network, solving the network parameters with the neural network back-propagation (BP) algorithm, and obtaining a neural-network named entity recognition model; a second word segmentation module for segmenting a to-be-predicted text string into a plurality of ordered to-be-predicted words; a second building module for querying the vector library for each to-be-predicted word in turn to build a second feature vector, the second feature vector comprising the word vector corresponding to the to-be-predicted word, the part-of-speech vector corresponding to the to-be-predicted word and the entity tag vector corresponding to the word preceding the to-be-predicted word; and a prediction module for inputting the second feature vector corresponding to each to-be-predicted word into the neural-network named entity recognition model respectively, and outputting the entity tag of the to-be-predicted word.
Optionally, the first feature vector further comprises the word vectors and part-of-speech vectors corresponding to the words adjacent to the sample word, and the second feature vector further comprises the word vectors and part-of-speech vectors corresponding to the words adjacent to the to-be-predicted word.
Optionally, the first building module is further configured to use a preset character string as the word preceding the first sample word when building the first feature vector for the first of the ordered sample words, and the second building module is further configured to use a preset character string as the word preceding the first to-be-predicted word when building the second feature vector for the first of the ordered to-be-predicted words.
Optionally, in the training module, the training input of the neural network further comprises negative example samples.
According to the technical solution of the present invention, a more reasonable feature vector is used both to train the model and to make predictions: it contains not only the features of the current word itself but also the part-of-speech feature of the current word and the entity tag feature of the word preceding it. Compared with existing recognition techniques that consider only the word itself, the information taken into account is more comprehensive, so the final recognition result is more accurate; the accuracy is particularly high when recognizing entities in the e-commerce field.
Brief description of the drawings
The accompanying drawings are provided for a better understanding of the present invention and do not constitute an undue limitation of the present invention, wherein:
Fig. 1 is a flowchart of the main steps of a named entity recognition method according to an embodiment of the present invention;
Fig. 2 is a schematic diagram of the main components of a named entity recognition device according to an embodiment of the present invention.
Detailed description of the embodiments
Exemplary embodiments of the present invention are explained below with reference to the accompanying drawings, including various details of the embodiments to aid understanding; they should be regarded as merely exemplary. Accordingly, those of ordinary skill in the art will appreciate that various changes and modifications can be made to the embodiments described herein without departing from the scope and spirit of the present invention. Likewise, for clarity and conciseness, descriptions of well-known functions and structures are omitted from the following description.
For better understanding by those skilled in the art, the relevant terms are briefly introduced first.
Word: the surface form of the word itself.
Word vector: the vectorized representation of a word; each word is represented by a multi-dimensional vector.
Part of speech: the grammatical category of a word. Words are usually divided into two classes covering 12 parts of speech. One class is content words: nouns, verbs, adjectives, numerals, adverbs, onomatopoeia, measure words and pronouns. The other class is function words: prepositions, conjunctions, auxiliary words and interjections.
Part-of-speech vector: the vectorized representation of a part of speech; each part of speech is represented by a multi-dimensional vector, preferably a discrete multi-dimensional vector.
Entity tag: each entity tag denotes one type of entity. For example, WID denotes a commodity ID, WB denotes the first word of a product name, WI denotes a middle word of a product name, WE denotes the last word of a product name, and O denotes any other word. For example: Xiaomi (WB) 2s (WI) red (WI) mobile phone (WE) how-about-it (O).
Entity tag vector: the vectorized representation of an entity tag; each entity tag is represented by a multi-dimensional vector, preferably a discrete multi-dimensional vector.
It should be noted that the dimensions of the word vectors, part-of-speech vectors and entity tag vectors do not need to be the same and can be set flexibly as required.
Fig. 1 is a flowchart of the main steps of a named entity recognition method according to an embodiment of the present invention. As shown in Fig. 1, the named entity recognition method may comprise steps A to G.
Step A: acquire a vector library. The vector library comprises word vectors respectively corresponding to a plurality of words, part-of-speech vectors respectively corresponding to a plurality of part-of-speech classes, and entity tag vectors respectively corresponding to a plurality of entity tag classes.
In an embodiment of the invention, for a given corpus, word2vec can be used to determine the word vector corresponding to each word in the corpus. Word2vec is a tool open-sourced by Google in 2013 that characterizes words as real-valued vectors; it maps words into a K-dimensional vector space, and vector operations between words can even correspond to semantic relations. Pre-computing word vectors with word2vec therefore saves time, improves efficiency and can improve accuracy. The part-of-speech vectors and entity tag vectors can be obtained by random initialization. The word vectors, part-of-speech vectors and entity tag vectors obtained by the above process are stored in the vector library for later use.
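A minimal sketch of step A follows. It assumes the gensim implementation of word2vec (the embodiment only requires a word2vec tool) and uses illustrative vector dimensions and tag sets; the preset string "$BEGIN" is added to every lookup table so that the later steps can query it.

```python
import numpy as np
from gensim.models import Word2Vec  # assumption: gensim's word2vec stands in for the word2vec tool

# Segmented training corpus (illustrative tokens taken from Table 1).
corpus = [["iphone", "price"], ["Huawei", "Honor", "6"], ["Xiaomi", "1s", "red", "phone"]]

# Word vectors: pre-trained with word2vec over the corpus.
w2v = Word2Vec(sentences=corpus, vector_size=50, window=5, min_count=1, seed=1)
word_vec = {w: w2v.wv[w] for w in w2v.wv.index_to_key}

rng = np.random.default_rng(1)

# Part-of-speech vectors and entity tag vectors: randomly initialised, as described above.
pos_classes = ["n", "v", "a", "m", "d", "o", "q", "r", "p", "c", "u", "e"]  # 12 illustrative POS codes
pos_vec = {p: rng.normal(size=10) for p in pos_classes}

entity_tags = ["W", "WID", "WB", "WI", "WE", "O", "$BEGIN"]  # "W" is the simplified commodity tag of the worked example
tag_vec = {t: rng.normal(size=5) for t in entity_tags}

# The preset string "$BEGIN" also receives a random word vector and POS vector (see the worked example).
word_vec["$BEGIN"] = rng.normal(size=50)
pos_vec["$BEGIN"] = rng.normal(size=10)

vector_library = {"word": word_vec, "pos": pos_vec, "tag": tag_vec}
```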
Step B: segment the training-corpus text string into a plurality of ordered sample words.
In an embodiment of the present invention, training-corpus text strings can be extracted from e-commerce website data and then segmented to obtain a plurality of ordered sample words, as shown in Table 1 (a segmentation sketch follows the table):
Table 1: Training-corpus text strings and sample words
Training-corpus text string | Ordered sample words
"iphone price" | "iphone", "price"
"Huawei Honor 6" | "Huawei", "Honor", "6"
"Xiaomi 1s red mobile phone" | "Xiaomi", "1s", "red", "mobile phone"
...... | ......
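A sketch of step B, assuming the jieba segmenter (the embodiment only requires word segmentation, so any segmenter with a similar interface would do); the example strings mirror Table 1.

```python
import jieba  # assumption: jieba is used as the word segmenter

corpus_strings = ["iphone价格", "华为荣耀6", "小米1s红色手机"]   # raw training-corpus text strings
sample_words = [jieba.lcut(s) for s in corpus_strings]          # ordered sample words per string
# Expected to be close to [['iphone', '价格'], ['华为', '荣耀', '6'], ['小米', '1s', '红色', '手机']]
# (the exact segmentation depends on jieba's dictionary).
```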
Step C: query the vector library for each sample word in turn to build a first feature vector. The first feature vector comprises the word vector corresponding to the sample word, the part-of-speech vector corresponding to the sample word and the entity tag vector corresponding to the word preceding the sample word. Besides the word and part-of-speech information of the sample word itself, the first feature vector thus also carries the entity tag information of the preceding word. The method of the present invention trains the model on the basis of this first feature vector; compared with prior art that trains the model relying only on the information of the word itself, the information considered is more comprehensive, so the final recognition result is more accurate.
It should be noted that "the first feature vector comprises the word vector corresponding to the sample word, the part-of-speech vector corresponding to the sample word and the entity tag vector corresponding to the preceding word" means that the first feature vector is spliced together from those three vectors, for example: first feature vector = [word vector of the sample word, part-of-speech vector of the sample word, entity tag vector of the preceding word]. The present invention does not restrict the splicing order when the vectors are concatenated, and different splicing orders do not affect the principle of the present invention; however, once the splicing order is determined for the whole method it is no longer changed, so that all first feature vectors have a consistent format.
The detailed process of step C is illustrated as follows. Suppose the ordered sample words obtained above are "sample word 1 + sample word 2 + sample word 3 + sample word 4 ..."; first feature vectors then need to be built in turn for sample word 1, sample word 2, sample word 3, sample word 4 and so on. Assume the word window width is set to 0. When building the first feature vector for sample word 1 (i.e. the first sample word), there is originally no word before it, so the preset character string "$BEGIN" is artificially added as the word preceding sample word 1. The entity tag vector of this preset character string "$BEGIN" already exists in the vector library and is generally a randomly initialized vector. For sample word 1, suppose the word vector of sample word 1 queried from the vector library is denoted X1, the part-of-speech vector of sample word 1 is denoted Z1, and the entity tag vector of "$BEGIN" is denoted T0; then the first feature vector of sample word 1 = [X1, Z1, T0]. For sample word 2, suppose the word vector of sample word 2 queried from the vector library is denoted X2, the part-of-speech vector of sample word 2 is Z2, and the entity tag vector of the preceding word (i.e. sample word 1) is denoted T1; then the first feature vector of sample word 2 = [X2, Z2, T1]. By analogy, the first feature vectors corresponding to all the sample words can be obtained.
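The construction above (word window width 0) can be sketched as follows; `vector_library` is the library from step A, `pos_of` maps each word to its part of speech, and the annotated entity tags of the training corpus supply the "preceding tag" for the next word. The function name is illustrative, not part of the patent.

```python
import numpy as np

def first_feature_vectors(words, pos_of, gold_tags, lib):
    """Build the first feature vector of every sample word, window width 0."""
    feats = []
    prev_tag = "$BEGIN"                       # preset string stands in for the missing preceding word
    for word, tag in zip(words, gold_tags):
        x = lib["word"][word]                 # word vector of the current sample word (X)
        z = lib["pos"][pos_of[word]]          # part-of-speech vector of the current sample word (Z)
        t = lib["tag"][prev_tag]              # entity tag vector of the PRECEDING word (T)
        feats.append(np.concatenate([x, z, t]))
        prev_tag = tag                        # the annotated tag becomes the "preceding tag" for the next word
    return feats

# e.g. first_feature_vectors(["iphone", "price"], {"iphone": "n", "price": "n"}, ["W", "O"], vector_library)
```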
In an embodiment of the present invention, the first feature vector may further comprise the word vectors and part-of-speech vectors corresponding to the words adjacent to the sample word. "Further comprise" here again means "additionally spliced from the following vectors". The "adjacent words" of a sample word are the sample words located before or after the current sample word at a distance not greater than the word window width. For example, if the word window width is 1, the adjacent words of a sample word are the word immediately before and the word immediately after the current sample word, and the first feature vector of the current sample word can be written as [word vector of the preceding word, word vector of the current sample word, word vector of the following word, part-of-speech vector of the preceding word, part-of-speech vector of the current sample word, part-of-speech vector of the following word, entity tag vector of the preceding word]. Other values of the word window width can be handled by analogy and are not repeated here. It should be noted that the present invention does not restrict the value of the word window width, which can be set flexibly as required; however, once determined it is no longer changed, so that all first feature vectors have a consistent format. It should also be noted that, when the word window width is increased, preset character strings can be added before the first sample word to serve as the adjacent words located before it, and preset character strings can likewise be added after the last sample word to serve as the adjacent words located after it; those skilled in the art can derive the specific practice from the above, which is not repeated here. In this embodiment, the first feature vector additionally takes into account the word and part-of-speech information of the adjacent words, so the information considered is more comprehensive and the final recognition result is more accurate.
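With a word window width of 1, the seven vectors are spliced in exactly the order listed above. The sketch below assumes that the vector library and the `pos_of` lookup also hold entries for the boundary preset strings ("$BEGIN" before the first word and a hypothetical "$END" after the last one); "$END" is an illustrative name, not taken from the patent.

```python
def windowed_first_feature(i, words, pos_of, prev_tag, lib):
    """First feature vector of words[i] for word window width 1 (seven spliced vectors)."""
    left = words[i - 1] if i > 0 else "$BEGIN"
    right = words[i + 1] if i + 1 < len(words) else "$END"   # hypothetical trailing preset string
    parts = [lib["word"][left], lib["word"][words[i]], lib["word"][right],               # word vectors: prev, current, next
             lib["pos"][pos_of[left]], lib["pos"][pos_of[words[i]]], lib["pos"][pos_of[right]],  # POS vectors: prev, current, next
             lib["tag"][prev_tag]]                                                       # entity tag vector of the preceding word
    return np.concatenate(parts)
```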
Step D: take all the first feature vectors corresponding to the sample words as a whole as the training input of the neural network, solve the network parameters with the neural network back-propagation (BP) algorithm, and obtain the neural-network named entity recognition model. Specifically, the squared error can be used to build the overall objective function of the model, and stochastic gradient descent can be used to solve the parameters of the neural network, yielding the final neural-network named entity recognition model.
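Step D can be sketched as a small feed-forward network trained by back-propagation with a squared-error objective and stochastic gradient descent, as described above; the hidden-layer size, learning rate and epoch count are illustrative assumptions.

```python
import numpy as np

class TinyNERNet:
    """One hidden layer, sigmoid activations, squared-error loss, plain SGD."""
    def __init__(self, n_in, n_hidden=30, lr=0.1, seed=0):
        rng = np.random.default_rng(seed)
        self.W1 = rng.normal(scale=0.1, size=(n_in, n_hidden))
        self.b1 = np.zeros(n_hidden)
        self.W2 = rng.normal(scale=0.1, size=(n_hidden, 1))
        self.b2 = np.zeros(1)
        self.lr = lr

    @staticmethod
    def _sigmoid(x):
        return 1.0 / (1.0 + np.exp(-x))

    def forward(self, x):
        self.h = self._sigmoid(x @ self.W1 + self.b1)        # hidden activations
        self.o = self._sigmoid(self.h @ self.W2 + self.b2)   # network output h(X)
        return self.o

    def sgd_step(self, x, y):
        """One stochastic-gradient step on the loss 0.5 * (h(X) - y)^2."""
        o = self.forward(x)
        err = o - y                                  # d(loss)/d(output)
        delta_o = err * o * (1.0 - o)                # back-propagate through the output sigmoid
        delta_h = (delta_o @ self.W2.T) * self.h * (1.0 - self.h)
        self.W2 -= self.lr * np.outer(self.h, delta_o)
        self.b2 -= self.lr * delta_o
        self.W1 -= self.lr * np.outer(x, delta_h)
        self.b1 -= self.lr * delta_h
        return float(0.5 * err[0] ** 2)

# Training loop: labels are 1 for an entity tag and 0 for "other" (O), as in the worked example below.
# net = TinyNERNet(n_in=len(feats[0]))
# for epoch in range(20):
#     for x, y in zip(feats, labels):
#         net.sgd_step(np.asarray(x), float(y))
```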
In an embodiment of the present invention, negative example samples may also be included in the training input of the neural network. Because the entity tags in real training-corpus text strings are usually unevenly distributed, the model may fit some named entities poorly. To address this, during model training, negative examples can be sampled at random in proportion to the distribution of the entity tags, so that the distribution is as even as possible and the model fits all named entity tags more accurately.
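A sketch of the negative-example balancing described above: "other" (O) words are down-sampled at random so that positive and negative examples enter the training input in a roughly fixed proportion; the ratio is an illustrative assumption.

```python
import random

def balance_training_input(examples, neg_per_pos=1.0, seed=0):
    """examples: list of (feature_vector, label) pairs, label 1 = entity tag, 0 = other (O)."""
    positives = [e for e in examples if e[1] == 1]
    negatives = [e for e in examples if e[1] == 0]
    rng = random.Random(seed)
    keep = min(len(negatives), int(neg_per_pos * len(positives)))
    balanced = positives + rng.sample(negatives, keep)  # keep all positives, down-sample negatives
    rng.shuffle(balanced)
    return balanced
```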
Step E: segment the to-be-predicted text string into a plurality of ordered to-be-predicted words.
In an embodiment of the present invention, the to-be-predicted text string can be obtained from a sentence input by the user and then segmented to obtain a plurality of ordered to-be-predicted words.
Step F: query the vector library for each to-be-predicted word in turn to build a second feature vector. The second feature vector comprises the word vector corresponding to the to-be-predicted word, the part-of-speech vector corresponding to the to-be-predicted word and the entity tag vector corresponding to the word preceding the to-be-predicted word.
It should be noted that, when the second feature vector is built for the first of the ordered to-be-predicted words, the preset character string "$BEGIN" can be added before the first to-be-predicted word as its preceding word. This operation is similar to adding the preset character string before the first sample word, as described above.
It should also be noted that the format of the second feature vector corresponding to a to-be-predicted word should be consistent with that of the first feature vector corresponding to a sample word. That is, the kinds of component vectors included in the second feature vector and their splicing order must match the first feature vector. For example, when the first feature vector further comprises the word vectors and part-of-speech vectors of the words adjacent to the sample word, the second feature vector correspondingly also comprises the word vectors and part-of-speech vectors of the words adjacent to the to-be-predicted word.
Step G: input the second feature vector corresponding to each to-be-predicted word into the neural-network named entity recognition model respectively, and output the entity tag of the to-be-predicted word.
For better understanding by those skilled in the art, a specific example of the named entity recognition method is given below.
(1) Use the word2vec tool to obtain the vector library.
(2) Suppose one training-corpus text string is "iphone price"; segmentation yields two sample words, "iphone" and "price". The part of speech of "iphone" is noun (n) and its entity tag is the commodity entity tag W. The part of speech of "price" is noun (n) and its entity tag is the other-entity tag O.
(3) First build the first feature vector corresponding to "iphone". Because "iphone" is the first sample word, "$BEGIN" (whose word vector, part-of-speech vector and entity tag vector are all randomly initialized) needs to be added before it. Suppose the word window width in this embodiment is 1. Query the vector library and take out the word vectors corresponding to the preceding word "$BEGIN", the current sample word "iphone" and the following word "price", denoted Xi-1, Xi and Xi+1; the part-of-speech vectors corresponding to these three words, denoted Zi-1, Zi and Zi+1; and the entity tag vector of the added "$BEGIN", denoted Ti-1. These seven vectors are spliced together in order to form the first feature vector corresponding to "iphone" = [Xi-1, Xi, Xi+1, Zi-1, Zi, Zi+1, Ti-1].
(4) Feed the first feature vector into the input layer of the neural network as the input quantity and obtain the output h(X). In this embodiment, the entity tags W/O are converted to the discrete representation 1/0. Since the entity tag of "iphone" is known to be "W", the desired output here is 1. Use the gradient descent algorithm for parameter optimization so that the error is minimized. After running all training-corpus text strings through the above training process, the final neural-network named entity recognition model is obtained. (5) Suppose one to-be-predicted text string is "Nokia white"; the segmentation result is the two to-be-predicted words "Nokia" and "white", and the parts of speech of "Nokia" and "white" are known to be noun (n).
(6) The process of building the second feature vector corresponding to "Nokia" is as follows: add "$BEGIN" before "Nokia"; query the vector library to obtain the word vectors corresponding to "$BEGIN", "Nokia" and "white", then the part-of-speech vectors corresponding to "$BEGIN", "Nokia" and "white", and finally the entity tag vector of "$BEGIN". These seven vectors are spliced together in order to obtain the second feature vector corresponding to "Nokia".
(7) Input the second feature vector corresponding to "Nokia" into the neural-network named entity recognition model obtained in step (4) to predict the entity tag of "Nokia". If the model outputs h(X) = 0.8, which is greater than the midpoint 0.5, "Nokia" is labeled W (commodity entity); if the model outputs h(X) = 0.2, which is less than the midpoint 0.5, "Nokia" is labeled O (other entity).
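Steps (5) to (7) can be sketched end to end as follows, reusing the vector library and network from the earlier sketches: each to-be-predicted word is turned into a second feature vector (here with window width 0, feeding back the predicted tag of the preceding word) and the output h(X) is thresholded at the midpoint 0.5. The simplified W/O tag set matches the worked example; the function name is illustrative.

```python
def predict_entity_tags(words, pos_of, lib, net):
    """Predict the entity tag of each to-be-predicted word (simplified W/O tagging)."""
    tags, prev_tag = [], "$BEGIN"
    for word in words:
        x = np.concatenate([lib["word"][word],        # word vector of the to-be-predicted word
                            lib["pos"][pos_of[word]], # its part-of-speech vector
                            lib["tag"][prev_tag]])    # entity tag vector of the preceding word
        score = net.forward(x)[0]                     # h(X)
        tag = "W" if score > 0.5 else "O"             # threshold at the midpoint 0.5
        tags.append(tag)
        prev_tag = tag                                # the predicted tag feeds the next word's feature
    return tags

# e.g. predict_entity_tags(["Nokia", "white"], {"Nokia": "n", "white": "n"}, vector_library, net)
# (assumes "Nokia" and "white" have word vectors in the library; unseen words would need an OOV vector)
```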
Fig. 2 is a schematic diagram of the main components of a named entity recognition device according to an embodiment of the present invention. As shown in Fig. 2, the named entity recognition device 20 may comprise: a vector library acquisition module 21, a first word segmentation module 22, a first building module 23, a training module 24, a second word segmentation module 25, a second building module 26 and a prediction module 27.
The vector library acquisition module 21 is used to acquire the vector library, which comprises word vectors respectively corresponding to a plurality of words, part-of-speech vectors respectively corresponding to a plurality of part-of-speech classes, and entity tag vectors respectively corresponding to a plurality of entity tag classes. Optionally, word2vec is used to determine the word vectors corresponding to the plurality of words; pre-computing them with word2vec saves training time.
The first word segmentation module 22 is used to segment the training-corpus text string into a plurality of ordered sample words.
The first building module 23 is used to query the vector library for each sample word in turn to build the first feature vector, which comprises the word vector corresponding to the sample word, the part-of-speech vector corresponding to the sample word and the entity tag vector corresponding to the word preceding the sample word.
The training module 24 is used to take all the first feature vectors corresponding to the sample words as a whole as the training input of the neural network, solve the network parameters with the neural network back-propagation (BP) algorithm, and obtain the neural-network named entity recognition model.
The second word segmentation module 25 is used to segment the to-be-predicted text string into a plurality of ordered to-be-predicted words.
The second building module 26 is used to query the vector library for each to-be-predicted word in turn to build the second feature vector, which comprises the word vector corresponding to the to-be-predicted word, the part-of-speech vector corresponding to the to-be-predicted word and the entity tag vector corresponding to the word preceding the to-be-predicted word.
The prediction module 27 is used to input the second feature vector corresponding to each to-be-predicted word into the neural-network named entity recognition model respectively, and to output the entity tag of the to-be-predicted word.
In an embodiment of the present invention, the first feature vector may further comprise the word vectors and part-of-speech vectors corresponding to the words adjacent to the sample word, and the second feature vector may further comprise the word vectors and part-of-speech vectors corresponding to the words adjacent to the to-be-predicted word. In this embodiment, the first and second feature vectors additionally take into account the word and part-of-speech information of the adjacent words, so the information considered is more comprehensive and the final recognition result is more accurate.
In an embodiment of the present invention, the first building module 23 may further be used to take a preset character string as the word preceding the first sample word when building the first feature vector for the first of the ordered sample words, and the second building module 26 may further be used to take a preset character string as the word preceding the first to-be-predicted word when building the second feature vector for the first of the ordered to-be-predicted words. This solves the problem that no word originally exists before the first sample word or the first to-be-predicted word.
In an embodiment of the present invention, in the training module 24, negative example samples are also included in the training input of the neural network. Introducing negative example samples keeps the sample distribution as even as possible, so that the model fits all named entity tags more accurately.
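A structural sketch of the device of Fig. 2, assuming a plain-Python composition in which each method plays the role of one of the modules 21 to 27 and reuses the step sketches above (jieba, first_feature_vectors, predict_entity_tags and TinyNERNet); the class and method names are illustrative, and the sketch assumes the segmented words and their POS tags are covered by the vector library.

```python
class NamedEntityRecognitionDevice:
    """Named entity recognition device 20: modules 21-27 wired together."""

    def __init__(self, vector_library, net, pos_of):
        self.lib = vector_library     # module 21: vector library acquisition
        self.net = net                # model produced/used by modules 24 and 27
        self.pos_of = pos_of          # part-of-speech lookup shared by the building modules

    def train(self, corpus_strings, gold_tags_per_string, epochs=20):
        for _ in range(epochs):
            for text, gold in zip(corpus_strings, gold_tags_per_string):
                words = jieba.lcut(text)                                           # module 22: first segmentation
                feats = first_feature_vectors(words, self.pos_of, gold, self.lib)  # module 23: first building
                for x, tag in zip(feats, gold):                                    # module 24: BP training
                    self.net.sgd_step(np.asarray(x), 1.0 if tag != "O" else 0.0)

    def predict(self, text):
        words = jieba.lcut(text)                                                   # module 25: second segmentation
        return predict_entity_tags(words, self.pos_of, self.lib, self.net)         # modules 26-27: building + prediction
```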
In summary, the named entity recognition method and device of the present invention use a more reasonable feature vector both to train the model and to make predictions. This feature vector contains not only the features of the current word itself but also the part-of-speech feature of the current word and the entity tag feature of the word preceding it. Compared with existing recognition techniques that consider only the word itself, the information taken into account is more comprehensive, so the final recognition result is more accurate, and the accuracy is particularly high when recognizing entities in the e-commerce field.
The above embodiments do not limit the scope of protection of the present invention. It should be understood that, depending on design requirements and other factors, those skilled in the art may make various modifications, combinations, sub-combinations and substitutions. Any modification, equivalent replacement, improvement and the like made within the spirit and principles of the present invention shall be included within the scope of protection of the present invention.

Claims (8)

1. A named entity recognition method, characterized by comprising:
acquiring a vector library, the vector library comprising word vectors respectively corresponding to a plurality of words, part-of-speech vectors respectively corresponding to a plurality of part-of-speech classes, and entity tag vectors respectively corresponding to a plurality of entity tag classes;
segmenting a training-corpus text string into a plurality of ordered sample words;
querying the vector library for each sample word in turn to build a first feature vector, the first feature vector comprising the word vector corresponding to the sample word, the part-of-speech vector corresponding to the sample word, and the entity tag vector corresponding to the word preceding the sample word;
taking all the first feature vectors corresponding to the sample words as a whole as the training input of a neural network, solving the network parameters with the neural network back-propagation (BP) algorithm, and obtaining a neural-network named entity recognition model;
segmenting a to-be-predicted text string into a plurality of ordered to-be-predicted words;
querying the vector library for each to-be-predicted word in turn to build a second feature vector, the second feature vector comprising the word vector corresponding to the to-be-predicted word, the part-of-speech vector corresponding to the to-be-predicted word, and the entity tag vector corresponding to the word preceding the to-be-predicted word;
inputting the second feature vector corresponding to each to-be-predicted word into the neural-network named entity recognition model respectively, and outputting the entity tag of the to-be-predicted word.
2. The method according to claim 1, characterized in that
the first feature vector further comprises the word vectors corresponding to the words adjacent to the sample word and the part-of-speech vectors corresponding to the words adjacent to the sample word, and
the second feature vector further comprises the word vectors corresponding to the words adjacent to the to-be-predicted word and the part-of-speech vectors corresponding to the words adjacent to the to-be-predicted word.
3. The method according to claim 1, characterized in that
when the first feature vector is built for the first of the ordered sample words, the word preceding the first sample word is a preset character string, and
when the second feature vector is built for the first of the ordered to-be-predicted words, the word preceding the first to-be-predicted word is a preset character string.
4. The method according to claim 1, characterized in that the training input of the neural network further comprises negative example samples.
5. A named entity recognition device, characterized by comprising:
a vector library acquisition module for acquiring a vector library, the vector library comprising word vectors respectively corresponding to a plurality of words, part-of-speech vectors respectively corresponding to a plurality of part-of-speech classes, and entity tag vectors respectively corresponding to a plurality of entity tag classes;
a first word segmentation module for segmenting a training-corpus text string into a plurality of ordered sample words;
a first building module for querying the vector library for each sample word in turn to build a first feature vector, the first feature vector comprising the word vector corresponding to the sample word, the part-of-speech vector corresponding to the sample word, and the entity tag vector corresponding to the word preceding the sample word;
a training module for taking all the first feature vectors corresponding to the sample words as a whole as the training input of a neural network, solving the network parameters with the neural network back-propagation (BP) algorithm, and obtaining a neural-network named entity recognition model;
a second word segmentation module for segmenting a to-be-predicted text string into a plurality of ordered to-be-predicted words;
a second building module for querying the vector library for each to-be-predicted word in turn to build a second feature vector, the second feature vector comprising the word vector corresponding to the to-be-predicted word, the part-of-speech vector corresponding to the to-be-predicted word, and the entity tag vector corresponding to the word preceding the to-be-predicted word;
a prediction module for inputting the second feature vector corresponding to each to-be-predicted word into the neural-network named entity recognition model respectively, and outputting the entity tag of the to-be-predicted word.
6. The device according to claim 5, characterized in that
the first feature vector further comprises the word vectors corresponding to the words adjacent to the sample word and the part-of-speech vectors corresponding to the words adjacent to the sample word, and
the second feature vector further comprises the word vectors corresponding to the words adjacent to the to-be-predicted word and the part-of-speech vectors corresponding to the words adjacent to the to-be-predicted word.
7. The device according to claim 5, characterized in that
the first building module is further configured to use a preset character string as the word preceding the first sample word when building the first feature vector for the first of the ordered sample words, and
the second building module is further configured to use a preset character string as the word preceding the first to-be-predicted word when building the second feature vector for the first of the ordered to-be-predicted words.
8. The device according to claim 5, characterized in that, in the training module, the training input of the neural network further comprises negative example samples.
CN201510321448.8A 2015-06-12 2015-06-12 Named entity recognition method and device Active CN104899304B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510321448.8A CN104899304B (en) 2015-06-12 2015-06-12 Named entity recognition method and device

Publications (2)

Publication Number Publication Date
CN104899304A true CN104899304A (en) 2015-09-09
CN104899304B CN104899304B (en) 2018-02-16

Family

ID=54031966

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510321448.8A Active CN104899304B (en) 2015-06-12 2015-06-12 Named entity recognition method and device

Country Status (1)

Country Link
CN (1) CN104899304B (en)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20050209844A1 (en) * 2004-03-16 2005-09-22 Google Inc., A Delaware Corporation Systems and methods for translating chinese pinyin to chinese characters
US7171350B2 (en) * 2002-05-03 2007-01-30 Industrial Technology Research Institute Method for named-entity recognition and verification
CN101075228A (en) * 2006-05-15 2007-11-21 松下电器产业株式会社 Method and apparatus for named entity recognition in natural language
CN101576910A (en) * 2009-05-31 2009-11-11 北京学之途网络科技有限公司 Method and device for identifying product naming entity automatically
CN104615589A (en) * 2015-02-15 2015-05-13 百度在线网络技术(北京)有限公司 Named-entity recognition model training method and named-entity recognition method and device

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
姚霖等 (YAO Lin et al.): "词边界字向量的中文命名实体识别" ("Chinese named entity recognition with word-boundary character vectors"), 《智能***学报》 *
毕海滨等 (BI Haibin et al.): "基于语义与SVM的中文实体关系抽取" ("Chinese entity relation extraction based on semantics and SVM"), 《第18届全国信息存储技术学术会议论文集》 (Proceedings of the 18th National Conference on Information Storage Technology) *

Cited By (68)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106294313A (en) * 2015-06-26 2017-01-04 微软技术许可有限责任公司 Study embeds for entity and the word of entity disambiguation
CN106815193A (en) * 2015-11-27 2017-06-09 北京国双科技有限公司 Model training method and device and wrong word recognition methods and device
CN106815194A (en) * 2015-11-27 2017-06-09 北京国双科技有限公司 Model training method and device and keyword recognition method and device
CN105550227A (en) * 2015-12-07 2016-05-04 中国建设银行股份有限公司 Named entity identification method and device
CN105468780B (en) * 2015-12-18 2019-01-29 北京理工大学 The normalization method and device of ProductName entity in a kind of microblogging text
CN105468780A (en) * 2015-12-18 2016-04-06 北京理工大学 Normalization method and device of product name entity in microblog text
CN105701077B (en) * 2016-01-13 2018-04-13 夏峰 A kind of multilingual literature detection method and system
CN105701087A (en) * 2016-01-13 2016-06-22 夏峰 Formula plagiarism detection method and system
CN105701077A (en) * 2016-01-13 2016-06-22 夏峰 Multi-language literature detection method and system
CN105550172B (en) * 2016-01-13 2018-06-01 夏峰 A kind of distributed text detection method and system
CN105701086B (en) * 2016-01-13 2018-06-01 夏峰 A kind of sliding window document detection method and system
CN105701213A (en) * 2016-01-13 2016-06-22 夏峰 Literature comparison method and system
CN105701087B (en) * 2016-01-13 2018-03-16 夏峰 A kind of formula plagiarizes detection method and system
CN105701075B (en) * 2016-01-13 2018-04-13 夏峰 A kind of document associated detecting method and system
CN105701213B (en) * 2016-01-13 2018-12-28 夏峰 A kind of document control methods and system
CN105701086A (en) * 2016-01-13 2016-06-22 夏峰 Method and system for detecting literature through sliding window
CN105701075A (en) * 2016-01-13 2016-06-22 夏峰 Joint detection method and system for literature
CN105550172A (en) * 2016-01-13 2016-05-04 夏峰 Distributive text detection method and system
CN107195296A (en) * 2016-03-15 2017-09-22 阿里巴巴集团控股有限公司 A kind of audio recognition method, device, terminal and system
CN107506345A (en) * 2016-06-14 2017-12-22 科大讯飞股份有限公司 The construction method and device of language model
CN106095988A (en) * 2016-06-21 2016-11-09 上海智臻智能网络科技股份有限公司 Automatic question-answering method and device
CN106202255A (en) * 2016-06-30 2016-12-07 昆明理工大学 Merge the Vietnamese name entity recognition method of physical characteristics
CN106202054B (en) * 2016-07-25 2018-12-14 哈尔滨工业大学 A kind of name entity recognition method towards medical field based on deep learning
CN106202054A (en) * 2016-07-25 2016-12-07 哈尔滨工业大学 A kind of name entity recognition method learnt based on the degree of depth towards medical field
CN106557462A (en) * 2016-11-02 2017-04-05 数库(上海)科技有限公司 Name entity recognition method and system
CN106570170A (en) * 2016-11-09 2017-04-19 武汉泰迪智慧科技有限公司 Text classification and naming entity recognition integrated method and system based on depth cyclic neural network
CN108074565A (en) * 2016-11-11 2018-05-25 上海诺悦智能科技有限公司 Phonetic order redirects the method and system performed with detailed instructions
CN108228682B (en) * 2016-12-21 2020-09-29 财团法人工业技术研究院 Character string verification method, character string expansion method and verification model training method
CN108228682A (en) * 2016-12-21 2018-06-29 财团法人工业技术研究院 Character string verification method, character string expansion method and verification model training method
CN106682220A (en) * 2017-01-04 2017-05-17 华南理工大学 Online traditional Chinese medicine text named entity identifying method based on deep learning
CN108428137A (en) * 2017-02-14 2018-08-21 阿里巴巴集团控股有限公司 Generate the method and device of abbreviation, verification electronic banking rightness of business
CN106844351A (en) * 2017-02-24 2017-06-13 黑龙江特士信息技术有限公司 A kind of medical institutions towards multi-data source organize class entity recognition method and device
CN107122582A (en) * 2017-02-24 2017-09-01 黑龙江特士信息技术有限公司 Towards the diagnosis and treatment class entity recognition method and device of multi-data source
CN106933803A (en) * 2017-02-24 2017-07-07 黑龙江特士信息技术有限公司 A kind of medical equipment class entity recognition method and device towards multi-data source
CN106933803B (en) * 2017-02-24 2020-02-21 黑龙江特士信息技术有限公司 Medical equipment type entity identification method and device oriented to multiple data sources
CN106844351B (en) * 2017-02-24 2020-02-21 易保互联医疗信息科技(北京)有限公司 Medical institution organization entity identification method and device oriented to multiple data sources
CN106933802B (en) * 2017-02-24 2020-02-21 黑龙江特士信息技术有限公司 Multi-data-source-oriented social security entity identification method and device
CN107122582B (en) * 2017-02-24 2019-12-06 黑龙江特士信息技术有限公司 diagnosis and treatment entity identification method and device facing multiple data sources
CN106933802A (en) * 2017-02-24 2017-07-07 黑龙江特士信息技术有限公司 A kind of social security class entity recognition method and device towards multi-data source
CN107291693B (en) * 2017-06-15 2021-01-12 广州赫炎大数据科技有限公司 Semantic calculation method for improved word vector model
CN107291693A (en) * 2017-06-15 2017-10-24 广州赫炎大数据科技有限公司 A kind of semantic computation method for improving term vector model
CN107818080A (en) * 2017-09-22 2018-03-20 新译信息科技(北京)有限公司 Term recognition methods and device
CN107967251A (en) * 2017-10-12 2018-04-27 北京知道未来信息技术有限公司 A kind of name entity recognition method based on Bi-LSTM-CNN
CN107908614A (en) * 2017-10-12 2018-04-13 北京知道未来信息技术有限公司 A kind of name entity recognition method based on Bi LSTM
CN107832289A (en) * 2017-10-12 2018-03-23 北京知道未来信息技术有限公司 A kind of name entity recognition method based on LSTM CNN
CN107885721A (en) * 2017-10-12 2018-04-06 北京知道未来信息技术有限公司 A kind of name entity recognition method based on LSTM
CN107766559A (en) * 2017-11-06 2018-03-06 第四范式(北京)技术有限公司 Training method, trainer, dialogue method and the conversational system of dialog model
CN107766559B (en) * 2017-11-06 2019-12-13 第四范式(北京)技术有限公司 training method, training device, dialogue method and dialogue system for dialogue model
CN107886943A (en) * 2017-11-21 2018-04-06 广州势必可赢网络科技有限公司 Voiceprint recognition method and device
CN110083820A (en) * 2018-01-26 2019-08-02 普天信息技术有限公司 A kind of improved method and device of benchmark participle model
CN110083820B (en) * 2018-01-26 2023-06-27 普天信息技术有限公司 Improvement method and device of benchmark word segmentation model
CN110276066B (en) * 2018-03-16 2021-07-27 北京国双科技有限公司 Entity association relation analysis method and related device
CN110276066A (en) * 2018-03-16 2019-09-24 北京国双科技有限公司 The analysis method and relevant apparatus of entity associated relationship
CN108920457A (en) * 2018-06-15 2018-11-30 腾讯大地通途(北京)科技有限公司 Address Recognition method and apparatus and storage medium
RU2699687C1 (en) * 2018-06-18 2019-09-09 Общество с ограниченной ответственностью "Аби Продакшн" Detecting text fields using neural networks
CN109101481A (en) * 2018-06-25 2018-12-28 北京奇艺世纪科技有限公司 A kind of name entity recognition method, device and electronic equipment
CN109101481B (en) * 2018-06-25 2022-07-22 北京奇艺世纪科技有限公司 Named entity identification method and device and electronic equipment
CN109657230A (en) * 2018-11-06 2019-04-19 众安信息技术服务有限公司 Merge the name entity recognition method and device of term vector and part of speech vector
CN110162772B (en) * 2018-12-13 2020-06-26 北京三快在线科技有限公司 Named entity identification method and device
CN110162772A (en) * 2018-12-13 2019-08-23 北京三快在线科技有限公司 Name entity recognition method and device
CN110309515A (en) * 2019-07-10 2019-10-08 北京奇艺世纪科技有限公司 Entity recognition method and device
CN110309515B (en) * 2019-07-10 2023-08-11 北京奇艺世纪科技有限公司 Entity identification method and device
CN111079418B (en) * 2019-11-06 2023-12-05 科大讯飞股份有限公司 Named entity recognition method, device, electronic equipment and storage medium
CN111079418A (en) * 2019-11-06 2020-04-28 科大讯飞股份有限公司 Named body recognition method and device, electronic equipment and storage medium
CN111444720A (en) * 2020-03-30 2020-07-24 华南理工大学 Named entity recognition method for English text
US11675978B2 (en) 2021-01-06 2023-06-13 International Business Machines Corporation Entity recognition based on multi-task learning and self-consistent verification
CN113408273B (en) * 2021-06-30 2022-08-23 北京百度网讯科技有限公司 Training method and device of text entity recognition model and text entity recognition method and device
CN113408273A (en) * 2021-06-30 2021-09-17 北京百度网讯科技有限公司 Entity recognition model training and entity recognition method and device

Also Published As

Publication number Publication date
CN104899304B (en) 2018-02-16

Similar Documents

Publication Publication Date Title
CN104899304A (en) Named entity identification method and device
CN111125331B (en) Semantic recognition method, semantic recognition device, electronic equipment and computer readable storage medium
CN109685056A (en) Obtain the method and device of document information
CN110427623A (en) Semi-structured document Knowledge Extraction Method, device, electronic equipment and storage medium
CN111666427B (en) Entity relationship joint extraction method, device, equipment and medium
CN104615589A (en) Named-entity recognition model training method and named-entity recognition method and device
CN111563384B (en) Evaluation object identification method and device for E-commerce products and storage medium
CN111475617A (en) Event body extraction method and device and storage medium
CN112800239B (en) Training method of intention recognition model, and intention recognition method and device
CN112650858B (en) Emergency assistance information acquisition method and device, computer equipment and medium
CN112990035B (en) Text recognition method, device, equipment and storage medium
WO2021027125A1 (en) Sequence labeling method and apparatus, computer device and storage medium
CN111160041B (en) Semantic understanding method and device, electronic equipment and storage medium
CN110110213B (en) Method and device for mining user occupation, computer readable storage medium and terminal equipment
CN109408802A (en) A kind of method, system and storage medium promoting sentence vector semanteme
CN109508458A (en) The recognition methods of legal entity and device
CN113449528B (en) Address element extraction method and device, computer equipment and storage medium
CN109308311A (en) A kind of multi-source heterogeneous data fusion system
CN113723077B (en) Sentence vector generation method and device based on bidirectional characterization model and computer equipment
CN114240672A (en) Method for identifying green asset proportion and related product
CN116644148A (en) Keyword recognition method and device, electronic equipment and storage medium
CN113032523B (en) Extraction method and device of triple information, electronic equipment and storage medium
CN114936271A (en) Method, apparatus and medium for natural language translation database query
CN111708819B (en) Method, apparatus, electronic device, and storage medium for information processing
CN110019829A (en) Data attribute determines method, apparatus

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant