CN113688233A

CN113688233A - Text understanding method for semantic search of knowledge graph

Info

Publication number: CN113688233A
Application number: CN202110870572.5A
Authority: CN
Inventors: 陈运文; 王文广; 贺梦洁; 纪达麒; 桂洪冠; 金克; 冯佳妮; 纪传俊
Original assignee: Daguan Data Suzhou Co ltd
Current assignee: Daguan Data Suzhou Co ltd
Priority date: 2021-07-30
Filing date: 2021-07-30
Publication date: 2021-11-23

Abstract

The invention discloses a text understanding method for knowledge graph semantic search, which aims at the input text to be understood and comprises the following steps: obtaining semantic information of each word element in the text through a large-scale pre-training model, and generating a semantic vector; identifying an entity type and a relationship type through a convolutional neural network, a first softmax classifier for entity classification and a second softmax classifier for relationship classification based on the semantic vector; based on the semantic vector, performing sequence annotation through a CRF (conditional random access memory), and extracting an entity; classifying the text through a Bi-LSTM model and a third softmax classifier for classifying question sentences based on the semantic vector; and retrieving knowledge graph acquisition information as feedback based on the recognized entity type and relationship type, the extracted entity and the classification result of the text. The invention uses a uniform method to complete four tasks simultaneously, so that the system is more concise.

Description

Text understanding method for semantic search of knowledge graph

Technical Field

The invention belongs to the field of natural language processing, and particularly relates to a text understanding method for semantic search of a knowledge graph.

Background

Semantic search refers to a search engine that, instead of being limited to the literal of a requested sentence input by a user, accurately captures the true intention behind the sentence input by the user by looking at the essence through phenomena, and searches the sentence so as to more accurately return a search result that best meets the needs of the user.

In the knowledge graph, semantic search can be carried out through anthropomorphic question asking, under the scene, the question to be retrieved can be clearly described like a person, the text of the question description is analyzed and understood, the result is searched and matched from the knowledge graph, and the result required by the user is returned.

Disclosure of Invention

Aiming at the problems in the prior art, the invention provides a text understanding method for semantic search of a knowledge graph.

In order to achieve the purpose, the invention adopts the following technical scheme:

a method of text understanding for a knowledge-graph semantic search, the method comprising, for an input text to be understood, the steps of:

obtaining semantic information of each word element in the text through a large-scale pre-training model, and generating a semantic vector;

identifying an entity type and a relationship type through a convolutional neural network, a first softmax classifier for entity classification and a second softmax classifier for relationship classification based on the semantic vector;

based on the semantic vector, performing sequence annotation through a CRF (conditional random access memory), and extracting an entity;

classifying the text through a Bi-LSTM model and a third softmax classifier for classifying question sentences based on the semantic vector;

and retrieving knowledge graph acquisition information as feedback based on the recognized entity type and relationship type, the extracted entity and the classification result of the text.

Preferably, the large scale pre-training model uses one of BERT, RoBERTa, ERINE, Albert, GPT.

Preferably, the classification result of the text includes a fact class, a statistic class, a non-class and a relation class.

A computer storage medium having a computer program stored therein which, when executed, implements the method.

A system for text understanding for knowledge-graph semantic search, the system comprising:

the semantic acquisition module is used for acquiring semantic information of each word element in the text through a large-scale pre-training model and generating a semantic vector;

the entity classification module is used for identifying entity types through a convolutional neural network and a first softmax classifier for entity classification based on the semantic vector;

the relation classification module is used for identifying the relation type through a convolutional neural network and a second softmax classifier for relation classification based on the semantic vector;

the entity extraction module is used for carrying out sequence marking through CRF based on the semantic vector to extract an entity;

the question classification module is used for classifying the texts through a Bi-LSTM model and a third softmax classifier for question classification based on the semantic vector;

and the retrieval feedback module is used for retrieving the knowledge graph acquisition information as feedback based on the recognized entity type and relationship type, the extracted entity and the classification result of the text.

Compared with the prior art, the invention has the beneficial effects that:

1. the intention recognition is realized through question classification, and answers which a user wants are understood, specifically, the problem is divided into four types, namely an actual type, a statistic type, a non-type, a relation type and the like through the model, but the model is not limited to the four types, and the problem can be subdivided or adjusted according to specific conditions, for example, in the professional field of a power grid, the problem can be divided into a maintenance type, an operation and maintenance type, a financial type, a party building type and the like;

2. the information needed for better retrieval in semantic search is guided by identifying the input text segments as different entity types or relationship types;

3. extracting the entity of the input text by an entity extraction method so as to directly match the entity from the knowledge graph;

4. the invention uses a uniform method to complete four tasks simultaneously, so that the system is more concise;

5. since these four tasks are closely related in a single text entry, the effect is better than four separate tasks.

Drawings

In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, it is obvious that the drawings in the following description are only some embodiments of the present invention, and for those skilled in the art, other drawings can be obtained according to the drawings without creative efforts.

Fig. 1 is a schematic flow chart of 4 downstream tasks performed simultaneously according to an embodiment of the present invention.

Fig. 2 is a system architecture diagram of an embodiment of the present invention.

Detailed Description

The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be obtained by a person skilled in the art without inventive effort based on the embodiments of the present invention, are within the scope of the present invention.

In the description of the present invention, it is to be understood that the terms "longitudinal", "lateral", "upper", "lower", "front", "rear", "left", "right", "vertical", "horizontal", "top", "bottom", "inner", "outer", and the like, indicate orientations or positional relationships based on those shown in the drawings, and are used merely for convenience of description and for simplicity of description, and do not indicate or imply that the referenced devices or elements must have a particular orientation, be constructed in a particular orientation, and be operated, and thus, are not to be construed as limiting the present invention.

As shown in fig. 1-2, semantic information of each token (tokens) in the input text is obtained through the same large-scale pre-training model. The large-scale pre-training model may use a general pre-training model such as BERT, RoBERTA, ERINE, Albert, GPT, etc., and may be a special pre-training model similar to a network structure, such as Fin-BERT trained with financial corpora, Ele-BERT trained with power grid domain corpora, Ship-BERT trained with Ship domain, etc.

After the semantic vectors are obtained, the downstream task uses four different models to achieve the four tasks of text understanding.

The entity type identification and the relation type identification share a Convolutional Neural Network (CNN), and the entity type classification and the relation type classification are realized by using respective softmax classifiers, so that the corresponding entity type and the corresponding relation type are understood, and the corresponding key text segment is identified. For example, in the architecture diagram, "who" characterizes "people" in this entity category, and two lemmas, "look" and "play", characterize the relationship of "< people, rehearsal, people >".

And the entity identification task directly uses CRF to label the sequence after the semantic vector, so that entity extraction is realized, and the entities in the input text and the corresponding entity types are extracted. Note that the entity extraction and the entity type identification are different, the entity extraction is to extract which text segments in the text are an entity, and the corresponding entity type, for example, "sunny and Wen" in the architecture diagram is an entity of "person". The entity type identification identifies that the entity is not an entity, but words which may indicate the entity type, such as "who" in the architecture diagram indicates "person" in the entity category.

In question classification, a Bi-LSTM model is used for further understanding from semantic vectors of the lemmas to obtain global semantic information, and the question is classified into different categories, such as four categories in the architecture diagram, including fact category, statistic category, non-category and relationship category. However, the present invention is not limited to a specific category, and may be adjusted to suit the category in different application scenarios. The above figure returns "fact class", and as a result, the entity of a person and various attribute information corresponding to the entity should be returned. For the statistical class, various aggregation algorithms are called to perform statistical calculation, and a statistical result is returned. For example, "do you have several prefectures in Fujian province? "then need to calculate the number of grade city of Fujian province, return the calculated result. For "nongeneric" the logic judgment result is returned, for example, "how is the city of building belong to the province of Fujian? "return result" yes ". For the "relationship class", the relationship between the entities needs to be obtained from the graph. For example, "what is a relationship between the city of the good fortune and the province of the good fortune? "return" < location, province, location > ".

Although the present invention has been described in detail with respect to the above embodiments, it will be understood by those skilled in the art that modifications or improvements based on the disclosure of the present invention may be made without departing from the spirit and scope of the invention, and these modifications and improvements are within the spirit and scope of the invention.

Claims

1. A method of text understanding for knowledge-graph semantic search, characterized in that, for an input text to be understood, the method comprises the steps of:

obtaining semantic information of each word element in the text through a large-scale pre-training model, and generating a semantic vector; identifying an entity type and a relationship type through a convolutional neural network, a first softmax classifier for entity classification and a second softmax classifier for relationship classification based on the semantic vector; based on the semantic vector, performing sequence annotation through a CRF (conditional random access memory), and extracting an entity;

2. The method for textual understanding of knowledge-graph semantic search of claim 1, wherein the large scale pre-training model uses one of BERT, RoBERTa, ERINE, Albert, GPT.

3. The method for textual understanding of knowledge-graph semantic search of claim 1, wherein the classification results of text include fact, statistical, non-category, and relationship categories.

4. A computer storage medium, characterized in that the storage medium has stored therein a computer program which, when executed, implements the method of any one of claims 1-3.

5. A system for text understanding for knowledge-graph semantic search, the system comprising: