CN109933773B - Multiple semantic statement analysis system and method - Google Patents

Multiple semantic statement analysis system and method

Info

Publication number
CN109933773B
Authority
CN
China
Prior art keywords
neural network
text
vectors
semantic
feature
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201711353550.1A
Other languages
Chinese (zh)
Other versions
CN109933773A (en)
Inventor
Yang Min (杨敏)
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shanghai Aigine Information Technology Co ltd
Original Assignee
Shanghai Aigine Information Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shanghai Aigine Information Technology Co ltd filed Critical Shanghai Aigine Information Technology Co ltd
Priority to CN201711353550.1A priority Critical patent/CN109933773B/en
Publication of CN109933773A publication Critical patent/CN109933773A/en
Application granted granted Critical
Publication of CN109933773B publication Critical patent/CN109933773B/en


Classifications

    • Y - GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02 - TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02D - CLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00 - Energy efficient computing, e.g. low power processors, power management or thermal management

Landscapes

  • Machine Translation (AREA)

Abstract

The invention relates to a multiple semantic sentence analysis system comprising a text encoding module, a neural network module and a text decoding module. The text encoding module converts text information into multiple groups of text vector information, the neural network module converts the multiple groups of text vector information into multiple groups of semantic vectors according to a set algorithm, and the text decoding module converts the multiple groups of semantic vectors back into text information and outputs it. The invention also discloses a multiple semantic sentence analysis method comprising the steps of text encoding, text logic analysis and text decoding output. By analyzing multiple semantic sentences with a neural network module that combines several neural network algorithms, the invention can parse such sentences accurately and efficiently.

Description

Multiple semantic statement analysis system and method
Technical Field
The invention relates to a semantic sentence analysis system and method, in particular to a system and method for analyzing sentences that contain multiple semantics, and belongs to the field of natural language artificial intelligence.
Background
With the continuous improvement and development of neural network technology, artificial intelligence products have gradually entered many fields, and products with human-machine interaction functions are increasingly favored by users. Such intelligent products can carry out human-machine interaction, understand the user's intention and then take the corresponding action. For example, a user can wake up Apple's Siri and tell it to make a call, play music and so on, and the device then completes these instructions automatically. Microsoft XiaoIce can hold a simple chat with the user. The human-machine interaction function of these products saves the user's time on the one hand (hands-free, direct speech) and increases the fun of the product on the other (chat interaction). Behind these functions, semantic understanding is a critical technology and also one that still needs improvement.
Current semantic understanding techniques can accurately understand sentences that contain only a single semantic meaning. For example, user: "Navigate to People's Square." Robot: "Navigating to People's Square." However, for a sentence containing multiple semantics, such as "Navigate to People's Square and eat at an affordable restaurant," the user needs the robot to execute two instructions: navigate to People's Square, and search for affordable restaurants near People's Square. This type of sentence contains two or more semantics and cannot be parsed accurately and efficiently by current semantic understanding techniques.
Disclosure of Invention
The invention discloses a novel scheme that analyzes multiple semantic sentences with a neural network module combining several neural network algorithms, and thereby solves the problem that conventional schemes cannot accurately and efficiently analyze such sentences.
The multiple semantic sentence analysis system of the present scheme comprises a text encoding module, a neural network module and a text decoding module. The text encoding module converts text information into multiple groups of text vector information, the neural network module converts the multiple groups of text vector information into multiple groups of semantic vectors according to a set algorithm, and the text decoding module converts the multiple groups of semantic vectors back into text information and outputs it.
The invention also discloses a multiple semantic sentence analysis method based on the multiple semantic sentence analysis system, which comprises a text encoding module, a neural network module and a text decoding module. The method comprises the following steps: (1) the text encoding module encodes the input text information and converts it into word-vector format; (2) the neural network module decomposes the text information in word-vector format into several sentences in word-vector format; (3) the neural network module performs logical analysis and calculation on the several sentences in word-vector format according to a set algorithm to obtain multiple groups of semantic vectors; (4) the text decoding module decodes the multiple groups of semantic vectors into text information and outputs it to the specific application layer.
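The four steps above can be pictured as a thin software pipeline. The following is a minimal sketch of that data flow only, not the patented implementation: the names TextEncoder, NeuralNetworkModule, TextDecoder and parse_multi_semantic are hypothetical, the embedding table is random rather than pre-trained, and the neural network step is replaced by a naive chunk-and-pool placeholder.

```python
from typing import List

import numpy as np


class TextEncoder:
    """Step (1): convert input text into word-vector (embedding) form."""

    def __init__(self, vocab: dict, dim: int = 8, seed: int = 0):
        rng = np.random.default_rng(seed)
        self.vocab = vocab
        # Stand-in for a pre-trained embedding table: one random row per word.
        self.table = rng.normal(size=(len(vocab), dim)).astype(np.float32)

    def encode(self, text: str) -> np.ndarray:
        ids = [self.vocab[w] for w in text.lower().split() if w in self.vocab]
        return self.table[ids]  # shape (R1, F1): number of words x embedding length


class NeuralNetworkModule:
    """Steps (2)-(3): decompose the word-vector sequence into one block per clause."""

    def parse(self, word_vectors: np.ndarray, n_semantics: int) -> List[np.ndarray]:
        # Placeholder for the CNN/RNN/DNN pipeline described later:
        # split the sequence into n_semantics chunks and mean-pool each chunk.
        chunks = np.array_split(word_vectors, n_semantics)
        return [chunk.mean(axis=0) for chunk in chunks]


class TextDecoder:
    """Step (4): map each semantic vector back to text for the application layer."""

    def decode(self, semantic_vectors: List[np.ndarray]) -> List[str]:
        # A trained decoder would generate words; this stub only reports shapes.
        return [f"semantic {i}: vector of length {v.shape[0]}"
                for i, v in enumerate(semantic_vectors, start=1)]


def parse_multi_semantic(text: str, encoder: TextEncoder,
                         network: NeuralNetworkModule, decoder: TextDecoder,
                         n_semantics: int) -> List[str]:
    return decoder.decode(network.parse(encoder.encode(text), n_semantics))


vocab = {"navigate": 0, "to": 1, "people": 2, "square": 3, "and": 4, "eat": 5, "a": 6, "meal": 7}
print(parse_multi_semantic("navigate to people square and eat a meal",
                           TextEncoder(vocab), NeuralNetworkModule(), TextDecoder(),
                           n_semantics=2))
```

Running the stub on the example sentence with n_semantics=2 only demonstrates the module boundaries and the shapes passed between them; the real processing inside the neural network module is detailed below.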
Further, the processing procedure of the neural network module in the method of the present scheme comprises: (1) processing the input sentence in word-vector format with a convolutional neural network to extract a feature vector representing the sentence, the size of which is adjusted according to the application scene; (2) decomposing the feature vector with a recurrent neural network into a number of feature vectors corresponding to the number of semantics contained in the sentence, the decomposed feature vectors corresponding one-to-one to those semantics; (3) combining the recurrent neural network and a deep neural network to perform a logical operation on each decomposed feature vector and extract the semantic vectors.
Furthermore, in the processing procedure of the neural network module of this scheme, the recurrent neural network and the deep neural network are combined to perform a logical operation on each decomposed feature vector, producing a semantic vector and a logic vector. The logic vector represents the dependency relationship between semantics, and each subsequent semantic vector is obtained by combining the corresponding feature vector with the logic vector of the previous semantic.
The multiple semantic sentence analysis system and method analyze multiple semantic sentences with a neural network module that combines several neural network algorithms, and can therefore parse such sentences accurately and efficiently.
Drawings
FIG. 1 is a schematic diagram of the multiple semantic sentence analysis system and method of the present invention.
FIG. 2 is a flow chart of the neural network module processing information.
Detailed Description
The invention discloses a method that uses artificial intelligence to understand multiple semantics in natural language. Current deep learning techniques have been very successful in text classification, text labeling, translation and other fields, so this scheme provides a semantic understanding method based on deep learning to solve the problem of multiple semantic sentence analysis. Fig. 1 shows a block diagram of the multiple semantic understanding method of this scheme. The scheme comprises three modules: a text encoding module, a neural network module and a text decoding module. The text encoding module converts text into a computable form: the text is encoded by word embedding, after which it becomes several groups of vectors; this step is also called vectorization of the text, and the model must be trained in advance on a large amount of text. The neural network module is the core module of this scheme. Neural network models are generally built end to end, that is, input corresponds directly to output. This method adds logical reasoning on top of the end-to-end basis and computes the logical relation between the current sentence and the preceding ones, so that the model can combine context to comprehensively understand the multiple semantics in the input text. The text decoding module is similar to the text encoding module: it converts the output vectors of the neural network into concrete words, so this model also needs to be trained in advance on a large amount of text.
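As an illustration of the word-embedding (vectorization) step performed by the text encoding module, the sketch below uses a PyTorch embedding table; the class name EmbeddingTextEncoder and the vocabulary and dimension values are assumptions for the example, and a real encoder would load weights trained on a large corpus as described above.

```python
import torch
import torch.nn as nn


class EmbeddingTextEncoder(nn.Module):
    """Word-embedding (vectorization) step of the text encoding module."""

    def __init__(self, vocab_size: int, embed_dim: int):
        super().__init__()
        # In the scheme above this table would be pre-trained on a large corpus.
        self.embedding = nn.Embedding(vocab_size, embed_dim)

    def forward(self, token_ids: torch.Tensor) -> torch.Tensor:
        # token_ids: (R1,) word indices  ->  (R1, F1) word-vector matrix
        return self.embedding(token_ids)


encoder = EmbeddingTextEncoder(vocab_size=10000, embed_dim=128)
word_vectors = encoder(torch.tensor([17, 42, 309, 5]))  # 4 words -> shape (4, 128)
```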
As shown in Fig. 1, taking "navigate to People's Square and eat a meal" as an example, the multiple semantic method of this scheme comprises the following processing steps; the parsed result of this example is also written out as plain data after the steps.
(1) Text encoding: the input text is converted into word vectors.
(2) The neural network module analyzes the text and decomposes it into sentence one: "navigate to People's Square"; sentence two: "eat a meal".
(3) The neural network module performs logical analysis and calculation on sentence one and sentence two, and outputs semantic one: navigation, People's Square; semantic two: search for restaurants near People's Square.
(4) Text decoding: the result is decoded and output to the specific application layer.
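Written out as plain data, the two semantics produced for this example could look as follows; the "intent"/"slots" frame layout is only an illustrative convention and is not prescribed by this scheme.

```python
# The two semantics of the example, written out as plain data.
parsed_example = [
    {"intent": "navigation", "slots": {"destination": "People's Square"}},
    {"intent": "restaurant_search", "slots": {"near": "People's Square"}},
]
for frame in parsed_example:
    print(frame["intent"], frame["slots"])
```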
Fig. 2 shows a block diagram of the neural network model of this scheme. The input of the model is the encoded text (word vectors), and the output is the semantic vectors obtained after parsing the text. The algorithms used by the neural network model of this scheme are: the convolutional neural network (CNN), the recurrent neural network (RNN) and the deep neural network (DNN).
A CNN is a multi-layer neural network used to process two-dimensional data, and is regarded as the first truly successful and robust deep learning method based on a multi-layer hierarchical network. By exploiting spatial correlations in the data, a CNN reduces the number of trainable parameters in the network and improves the efficiency of back-propagation training for a feed-forward network; it is also considered a deep learning method because it requires very little data preprocessing.
An RNN is used to process sequence data, i.e. data for which the current output of a sequence is related to the previous outputs. Concretely, the network memorizes the preceding information and applies it to the computation of the current output: the nodes between hidden layers are no longer unconnected, and the input of the hidden layer includes not only the output of the input layer but also the output of the hidden layer at the previous moment. In theory an RNN can process sequences of any length. This overcomes the limitation of an ordinary neural network, which is fully connected from the input layer to the hidden layer to the output layer but has no connections among nodes across time; for example, to predict the next word of a sentence one generally needs the preceding words, because the words in a sentence are not independent of one another. RNNs have proven very successful in practice for word-vector representation, sentence well-formedness checking, part-of-speech tagging and so on. Among RNNs, the most widely used and successful model at present is the LSTM (Long Short-Term Memory) network, which better captures long- and short-term dependencies.
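A minimal PyTorch usage sketch of an LSTM follows; the tensor sizes are arbitrary example values. It shows the property described above: each output depends on all earlier inputs through the recurrent hidden and cell state.

```python
import torch
import torch.nn as nn

# 9 time steps of 128-dimensional inputs, batch size 1; sizes are arbitrary examples.
lstm = nn.LSTM(input_size=128, hidden_size=64, batch_first=True)
sequence = torch.randn(1, 9, 128)
outputs, (h_n, c_n) = lstm(sequence)  # outputs: (1, 9, 64); h_n, c_n: (1, 1, 64)
# outputs[:, t, :] depends on inputs 0..t through the recurrent hidden and cell state,
# which is the "memory of previous information" described above.
```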
DNN refers to a deep neural network. Unlike the RNN (recurrent neural network) and the CNN (convolutional neural network), a DNN here denotes a fully connected neuron structure: adjacent layers are fully connected, with no convolutional units and no connections across time, i.e. every neuron in layer i is connected to every neuron in layer i+1. A DNN can also be understood as a neural network with many hidden layers.
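A small fully connected network of this kind can be sketched as follows; the layer widths are arbitrary example values and not part of the scheme.

```python
import torch
import torch.nn as nn

# Every neuron of one layer feeds every neuron of the next; no convolution, no recurrence.
dnn = nn.Sequential(
    nn.Linear(64, 256), nn.ReLU(),
    nn.Linear(256, 256), nn.ReLU(),  # "many hidden layers": stack more of these
    nn.Linear(256, 32),
)
output_vector = dnn(torch.randn(1, 64))  # (1, 64) -> (1, 32)
```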
Taking the processing of "navigate to People's Square, eat a meal, then watch a movie" as an example, assume that this sentence has already been converted into a word-vector matrix of size (R1, F1), where R1 is the number of words and F1 is the length of each word vector. The main steps of the model are as follows.
First, the input word vectors are processed with the CNN to extract a feature vector that represents the sentence. The size of the feature vector can be customized and generally needs to be adjusted to the application scene. The processed feature vector has size (R2, F2), where R2 is the length and F2 is the number of features.
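The CNN step can be sketched with a one-dimensional convolution over the word-vector matrix, as below; the kernel size, channel counts and concrete values of R1, F1 and F2 are illustrative assumptions rather than values taken from this scheme.

```python
import torch
import torch.nn as nn

R1, F1 = 12, 128   # 12 words, 128-dimensional word vectors
F2 = 200           # feature number, customisable per application scene
conv = nn.Conv1d(in_channels=F1, out_channels=F2, kernel_size=3, padding=1)

word_vectors = torch.randn(1, R1, F1)          # (batch, R1, F1)
features = conv(word_vectors.transpose(1, 2))  # Conv1d expects (batch, F1, R1)
features = features.transpose(1, 2)            # (batch, R2, F2); with this padding R2 == R1
```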
Then, according to the number of semantics contained in the sentence, the feature vector is decomposed by the RNN. The sentence contains three semantics, so it is decomposed into feature vector 1 of size (N1, F2), feature vector 2 of size (N2, F2) and feature vector 3 of size (N3, F2); the three feature vectors correspond respectively to the navigation, eating and movie-watching semantics of the sentence.
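The decomposition can be illustrated by splitting the feature matrix into one block per semantic, as below; in this scheme the RNN determines the segmentation, whereas in the sketch the segment lengths N1, N2 and N3 are simply given so that the resulting shapes can be shown.

```python
import torch

R2, F2 = 12, 200
features = torch.randn(R2, F2)

# Segment lengths for the three semantics; they must sum to R2 in this toy split.
N1, N2, N3 = 5, 3, 4
feat_1, feat_2, feat_3 = torch.split(features, [N1, N2, N3], dim=0)
# feat_1: (N1, F2) -> "navigate", feat_2: (N2, F2) -> "eat", feat_3: (N3, F2) -> "watch a movie"
```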
Finally, the RNN and the DNN are combined to perform a logical operation on each decomposed feature vector and extract the semantic vectors. So that the neural network can "remember" the logical relationships between the semantics, the model adds a logic vector to each semantic, which represents the dependency of the current semantic on the previous semantic. In this example, since nothing precedes the first semantic, only feature vector 1 is used to compute semantic vector 1 and logic vector 1; each subsequent semantic vector is obtained by combining the corresponding feature vector with the logic vector of the previous semantic.
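A hedged sketch of this chained step follows: each feature block is summarized by an RNN (a GRU here, standing in for the recurrent part), concatenated with the logic vector of the previous semantic, and mapped by fully connected (DNN) layers to a new semantic vector and logic vector. The class name SemanticStep, the GRU choice and all dimensions are assumptions made for the illustration.

```python
import torch
import torch.nn as nn


class SemanticStep(nn.Module):
    """One RNN + DNN step: a feature block plus the previous logic vector in,
    a semantic vector plus a new logic vector out."""

    def __init__(self, feat_dim: int, logic_dim: int, sem_dim: int):
        super().__init__()
        self.rnn = nn.GRU(input_size=feat_dim, hidden_size=feat_dim, batch_first=True)
        self.to_semantic = nn.Linear(feat_dim + logic_dim, sem_dim)  # the "DNN" part
        self.to_logic = nn.Linear(feat_dim + logic_dim, logic_dim)

    def forward(self, feat_block: torch.Tensor, prev_logic: torch.Tensor):
        # feat_block: (1, N_i, feat_dim); prev_logic: (1, logic_dim)
        _, h = self.rnn(feat_block)                     # h: (1, 1, feat_dim)
        joint = torch.cat([h[-1], prev_logic], dim=-1)  # (1, feat_dim + logic_dim)
        return self.to_semantic(joint), self.to_logic(joint)


step = SemanticStep(feat_dim=200, logic_dim=16, sem_dim=64)
logic = torch.zeros(1, 16)  # semantic 1 has no preceding text, so its logic input is empty
for block in (torch.randn(1, n, 200) for n in (5, 3, 4)):
    semantic_vec, logic = step(block, logic)  # later semantics see the previous logic vector
```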
The model finally outputs the semantic vectors; after decoding, the specific semantics are obtained and provided to the upper-layer application, which can then respond to the user's instructions.
The multiple semantic sentence analysis system and method of this scheme combine advanced deep learning techniques, making comprehensive use of CNN, RNN and DNN, and provide a method that can accurately parse multiple semantics. Practice has shown that, after training on a large-scale corpus, the method can effectively solve the problem of multiple semantic sentence analysis in human-machine interaction. On the basis of these characteristics, the multiple semantic sentence analysis system and method have prominent substantive features and represent significant progress compared with existing schemes.
The multiple semantic sentence analysis system and method of this scheme are not limited to the content disclosed in the specific embodiments. The technical solutions presented in the examples can be extended on the basis of the understanding of those skilled in the art, and simple substitutions made by those skilled in the art according to this scheme in combination with common general knowledge also fall within the scope of this scheme.

Claims (2)

1. A multiple semantic sentence analysis system, characterized by comprising a text encoding module, a neural network module and a text decoding module, wherein the text encoding module converts text information into multiple groups of text vector information, the neural network module converts the multiple groups of text vector information into multiple groups of semantic vectors according to a set algorithm, and the text decoding module converts the multiple groups of semantic vectors into text information and outputs it; the processing procedure of the neural network module comprises the following steps:
(1) processing the input sentence in word-vector format with a convolutional neural network and extracting a feature vector representing the sentence, the size of the feature vector being customized according to the application scene, wherein the processed feature vector has size (R2, F2), R2 representing the length and F2 representing the number of features;
(2) decomposing the feature vector with a recurrent neural network into a number of feature vectors corresponding to the number of semantics contained in the sentence, the decomposed feature vectors corresponding to the semantics contained in the sentence;
(3) combining the recurrent neural network and a deep neural network to perform a logical operation on each decomposed feature vector and extract the semantic vectors;
wherein, in the processing procedure of the neural network module, the recurrent neural network and the deep neural network are combined to perform a logical operation on each decomposed feature vector to obtain a semantic vector and a logic vector, the logic vector represents the dependency relationship between semantics, and each subsequent semantic vector is obtained by combining the corresponding feature vector with the logic vector of the previous feature vector.
2. A multiple semantic sentence analysis method, based on a multiple semantic sentence analysis system comprising a text encoding module, a neural network module and a text decoding module, characterized by comprising the following steps:
(1) the text encoding module encodes the input text information and converts it into word-vector format;
(2) the neural network module decomposes the text information in word-vector format into several sentences in word-vector format;
(3) the neural network module performs logical analysis and calculation on the several sentences in word-vector format according to a set algorithm to obtain multiple groups of semantic vectors;
(4) the text decoding module decodes the multiple groups of semantic vectors into text information and outputs it to the specific application layer;
wherein the processing procedure of the neural network module comprises the following steps:
(1) processing the input sentence in word-vector format with a convolutional neural network and extracting a feature vector representing the sentence, the size of the feature vector being customized according to the application scene, wherein the processed feature vector has size (R2, F2), R2 representing the length and F2 representing the number of features;
(2) decomposing the feature vector with a recurrent neural network into a number of feature vectors corresponding to the number of semantics contained in the sentence, the decomposed feature vectors corresponding to the semantics contained in the sentence;
(3) combining the recurrent neural network and a deep neural network to perform a logical operation on each decomposed feature vector and extract the semantic vectors;
wherein, in the processing procedure of the neural network module, the recurrent neural network and the deep neural network are combined to perform a logical operation on each decomposed feature vector to obtain a semantic vector and a logic vector, the logic vector represents the dependency relationship between semantics, and each subsequent semantic vector is obtained by combining the corresponding feature vector with the logic vector of the previous feature vector.
CN201711353550.1A 2017-12-15 2017-12-15 Multiple semantic statement analysis system and method Active CN109933773B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201711353550.1A CN109933773B (en) 2017-12-15 2017-12-15 Multiple semantic statement analysis system and method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201711353550.1A CN109933773B (en) 2017-12-15 2017-12-15 Multiple semantic statement analysis system and method

Publications (2)

Publication Number Publication Date
CN109933773A CN109933773A (en) 2019-06-25
CN109933773B (en) 2023-05-26

Family

ID=66980529

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201711353550.1A Active CN109933773B (en) 2017-12-15 2017-12-15 Multiple semantic statement analysis system and method

Country Status (1)

Country Link
CN (1) CN109933773B (en)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11157705B2 (en) * 2019-07-22 2021-10-26 International Business Machines Corporation Semantic parsing using encoded structured representation
CN112052953B (en) * 2020-07-21 2022-09-09 清华大学 Embeddable cascade logic system for neural inference system and inference method thereof
CN113239166B (en) * 2021-05-24 2023-06-06 清华大学深圳国际研究生院 Automatic man-machine interaction method based on semantic knowledge enhancement
CN114021576A (en) * 2021-10-27 2022-02-08 四川启睿克科技有限公司 Text-based natural language understanding decision method
CN114881040B (en) * 2022-05-12 2022-12-06 桂林电子科技大学 Method and device for processing semantic information of paragraphs and storage medium

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105787560A (en) * 2016-03-18 2016-07-20 北京光年无限科技有限公司 Dialogue data interaction processing method and device based on recurrent neural network
CN106847271A (en) * 2016-12-12 2017-06-13 北京光年无限科技有限公司 A kind of data processing method and device for talking with interactive system
CN106875940A (en) * 2017-03-06 2017-06-20 吉林省盛创科技有限公司 A kind of Machine self-learning based on neutral net builds knowledge mapping training method
CN107153672A (en) * 2017-03-22 2017-09-12 中国科学院自动化研究所 User mutual intension recognizing method and system based on Speech Act Theory
US9830315B1 (en) * 2016-07-13 2017-11-28 Xerox Corporation Sequence-based structured prediction for semantic parsing

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10769189B2 (en) * 2015-11-13 2020-09-08 Microsoft Technology Licensing, Llc Computer speech recognition and semantic understanding from activity patterns
US9858263B2 (en) * 2016-05-05 2018-01-02 Conduent Business Services, Llc Semantic parsing using deep neural networks for predicting canonical forms

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105787560A (en) * 2016-03-18 2016-07-20 北京光年无限科技有限公司 Dialogue data interaction processing method and device based on recurrent neural network
US9830315B1 (en) * 2016-07-13 2017-11-28 Xerox Corporation Sequence-based structured prediction for semantic parsing
CN106847271A (en) * 2016-12-12 2017-06-13 北京光年无限科技有限公司 A kind of data processing method and device for talking with interactive system
CN106875940A (en) * 2017-03-06 2017-06-20 吉林省盛创科技有限公司 A kind of Machine self-learning based on neutral net builds knowledge mapping training method
CN107153672A (en) * 2017-03-22 2017-09-12 中国科学院自动化研究所 User mutual intension recognizing method and system based on Speech Act Theory

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
Instruction intent understanding method using deep denoising autoencoder deep learning; Li Hanqing et al.; Journal of Shanghai Jiao Tong University; 2016-07-28 (No. 07); pp. 118-123 *
Question-answer matching method based on deep learning; Rong Guanghui et al.; Journal of Computer Applications; 2017-10-10 (No. 10); pp. 133-137 *

Also Published As

Publication number Publication date
CN109933773A (en) 2019-06-25

Similar Documents

Publication Publication Date Title
CN109933773B (en) Multiple semantic statement analysis system and method
US11194972B1 (en) Semantic sentiment analysis method fusing in-depth features and time sequence models
CN108763284B (en) Question-answering system implementation method based on deep learning and topic model
CN111639175B (en) Self-supervision dialogue text abstract method and system
CN111581401B (en) Local citation recommendation system and method based on depth correlation matching
CN110209789B (en) Multi-modal dialog system and method for guiding user attention
US20230244879A1 (en) Language representation model system, pre-training method and apparatus, device, and medium
CN110717017A (en) Method for processing corpus
CN113127624B (en) Question-answer model training method and device
CN110059324B (en) Neural network machine translation method and device based on dependency information supervision
CN111581970B (en) Text recognition method, device and storage medium for network context
CN110245228A (en) The method and apparatus for determining text categories
CN116204674B (en) Image description method based on visual concept word association structural modeling
CN113065344A (en) Cross-corpus emotion recognition method based on transfer learning and attention mechanism
CN112100375A (en) Text information generation method and device, storage medium and equipment
CN113705315A (en) Video processing method, device, equipment and storage medium
CN108363685B (en) Self-media data text representation method based on recursive variation self-coding model
CN116050352A (en) Text encoding method and device, computer equipment and storage medium
Nithya et al. Deep learning based analysis on code-mixed tamil text for sentiment classification with pre-trained ulmfit
CN114611529B (en) Intention recognition method and device, electronic equipment and storage medium
CN116341564A (en) Problem reasoning method and device based on semantic understanding
CN114519353B (en) Model training method, emotion message generation method and device, equipment and medium
CN112800186B (en) Reading understanding model training method and device and reading understanding method and device
Jalaja et al. A behavioral chatbot using encoder-decoder architecture: Humanizing conversations
CN114648005A (en) Multi-fragment machine reading understanding method and device for multitask joint learning

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant