CN114417889A

CN114417889A - Task type dialogue system and method based on seq2seq framework

Info

Publication number: CN114417889A
Application number: CN202210043220.7A
Authority: CN
Inventors: 冯卫森; 冯落落; 李沛; 李晓瑜; 高明; 王建华; 尹青山
Original assignee: Shandong New Generation Information Industry Technology Research Institute Co Ltd
Current assignee: Shandong New Generation Information Industry Technology Research Institute Co Ltd
Priority date: 2022-01-14
Filing date: 2022-01-14
Publication date: 2022-04-29

Abstract

The invention provides a task-based dialogue system and method based on a seq2seq architecture, which comprises four parts, namely an intention and slot filling network module, a database operation module, a trust tracking module and an optimization and generation network module; the method comprises the following steps: the bert output of the classification task and the vector of the input label are mapped to the same vector space, and the groove value extraction task is trained through a CRF layer; the database operation module is mainly used for inquiring other slot values according to the identified slot values, trusts a tracking layer, takes the output of the intention and slot value network module and the database operation module as input, takes the output of the layer as input, takes the input of intention and generation network, optimizes and generates the network module to select the answer An with the highest score as the answer of the current round of conversation, and carries out subsequent prediction on the answer. The invention combines the task type dialogue system and the chatting type dialogue system, and combines the module of the task type dialogue system inside, thereby ensuring the function of context and carrying out integral optimization.

Description

Task type dialogue system and method based on seq2seq framework

Technical Field

The invention relates to a task type dialogue system and method based on a seq2seq framework, belonging to the technical field of intelligent question answering.

Background

A traditional task type dialogue system adopts a pipeline structure. Due to the fact that the structure adopts a modular mode, errors of the upstream task parameters are conducted to the downstream. The chatting dialogue system adopts a seq2seq structure and produces corresponding answers through an Encoder module and a Decoder module. But the chatty type dialogue system does not consider the role of context, and the generated NLG answer has uncontrollable property.

Disclosure of Invention

The invention aims to provide a task type dialogue system and method based on a seq2seq framework, which not only consider the context influence and avoid generating uncertain answers, but also can carry out overall optimization.

In order to achieve the purpose, the invention is realized by the following technical scheme:

a task-based dialogue system based on a seq2seq architecture comprises four parts, namely an intention and slot filling network module, a database operation module, a trust tracking module and an optimization and generation network module;

the intention and trough value network module is built by adopting a Bert + CRF structure, wherein the Bert is used for intention classification, the CRF is used for trough value extraction, and the module pre-trains three tasks, namely an intention classification task, a trough value extraction task and a masking word prediction task of the Bert;

the database operation module is mainly used for inquiring other slot values according to the identified slot values;

the trust tracking module (Belief Tracker) adopts a Roberta model, and outputs a vector S containing all information from the beginning to the current round_n+1；

And the optimization and generation network module adopts a classification algorithm, selects the highest score as the answer of the current round of conversation in the existing action, and takes the answer as the input of the trust tracking module for subsequent prediction.

A task-based dialog method based on seq2seq architecture comprises the following steps:

1) bert output __ CLS __ Token a of classification task_clsAnd the vector y of the input label_intentMapping to the same vector space by maximizing the similarity S of the predicted value and the label positive case_I ⁺＝h_CLSh⁺ _intentAnd minimizing the similarity S of the predicted value and the label negative case_I ^-＝h_CLSh^- _intentThe optimization is carried out, and the specific formula is as follows:

2) the groove value extraction task is trained through a CRF layer, and the formula is as follows:

L_E＝L_CRF(a,y_entity)

wherein y is_entityInputting a hidden vector of a label for an entity, a being a corresponding vector Token, L of a data input_CRF(.) is the negative similarity of the CRF;

3) the overall loss of the intent and slot value fill layers is calculated as:

L_total＝L_I+L_E+L_M

in the formula L_MMask prediction for Bert self;

4) the database operation module is mainly used for inquiring other slot values according to the identified slot values and searching other slot values through select, where food and drink according to the keywords provided by the system;

5) a trust tracking layer using the output of the intention and slot value network module and the database operation module as input U_IOne, with the output Sn of the layer as one of the inputs, one of the inputs An of the intent and generation network, the Belief Tracker, as the final input

I_b＝U_I+Sn+An

Wherein U is_IRepresenting the user intention and the corresponding entity in the current round, Sn representing all states from the beginning to the previous round, An representing the answer to the user question in the previous round;

6) and the optimizing and generating network module adopts a classification algorithm for a primary final result, selects the answer An with the highest score as the answer of the current conversation in the existing action, and takes the answer as the input of the Belief Tracker for subsequent prediction.

The invention has the advantages that: the present invention combines a task-based dialog system with a chat-based dialog system. The whole adopts a seq2seq structure, and the module of the task dialog system is combined inside, so that the context effect is ensured, and the whole optimization can be carried out.

Drawings

The accompanying drawings, which are included to provide a further understanding of the invention and are incorporated in and constitute a part of this specification, illustrate embodiments of the invention and together with the description serve to explain the principles of the invention and not to limit the invention.

Fig. 1 is a schematic view of the overall structure of the present invention.

Detailed Description

The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.

The entry to this model is the intent and slot value network. The network is built by adopting a Bert + CRF structure. Bert is used for intent classification and CRF is used for bin value extraction. During pre-training, three tasks, namely an intention classification task, a groove value extraction task and a Bert self mask word prediction task, are mainly considered.

Bert output __ CLS __ Token a of classification task_clsAnd the vector y of the input label_intentMapping to the sameVector space by maximizing the similarity S of the predicted value and the label positive case_I ⁺＝h_CLSh⁺ _intentAnd minimizing the similarity S of the predicted value and the label negative case_I ^-＝h_CLSh^- _intentThe optimization is carried out, and the specific formula is as follows:

the groove value extraction task is trained through a CRF layer, and the formula is as follows:

L_E＝L_CRF(a,y_entity)

the overall loss of the intent and slot value fill layers is calculated as:

L_total＝L_I+L_E+L_M

in the formula L_MMask prediction for Bert self;

and the database operation module is mainly used for inquiring other slot values according to the identified slot values and is used for answering and generating. For example:

human: giving me a cup of beverage.

The system comprises the following steps: ask what beverage you need?

Human: what do you have?

The system comprises the following steps: we have coffee, milk tea, and fruit juice.

In the above dialog, after the system extracts the "beverage" keyword, it can search through select where food is drink.

A trust tracking layer, which takes the output of the intention and slot value network module and the database operation module as input U_IOne, with the output Sn of the layer as one of the inputs, one of the inputs An of the intent and generation network, the Belief Tracker, as the final input

I_b＝U_I+Sn+An

Wherein U is_IRepresenting the user intention and the corresponding entity in the current round, Sn representing all states from the beginning to the previous round, An representing the answer to the user question in the previous round; the network considers the historical information and the influence of the information of the current round. The network adopts the Roberta model. The output is a vector S containing all information from the beginning to the current round_n+1。

The optimization and generation network module adopts a classification algorithm, and because the model is used for a task-based dialog system, the accuracy of output results is most important. And (4) adopting a classification algorithm as a final result, selecting the answer An with the highest score as the answer of the current conversation in the existing action, and taking the answer as the input of a Belief Tracker for subsequent prediction.

Claims

1. A task-based dialogue system based on a seq2seq architecture is characterized by comprising four parts, namely an intention and slot filling network module, a database operation module, a trust tracking module and an optimization and generation network module;

the trust tracking module adopts a Roberta model and outputs a vector S containing all information from the beginning to the current round_n+1；

2. A task-based dialog method using the seq2seq architecture as claimed in claim 1, characterized in that it comprises the following steps:

1) of classification tasksbert output __ CLS __ Token a_clsAnd the vector y of the input label_intentMapping to the same vector space by maximizing the similarity S of the predicted value and the label positive case_I ⁺＝h_CLSh⁺ _intentAnd minimizing the similarity S of the predicted value and the label negative case_I ^-＝h_CLSh^- _intentThe optimization is carried out, and the specific formula is as follows:

L_E＝L_CRF(a,y_entity)

3) the overall loss of the intent and slot value fill layers is calculated as:

L_total＝L_I+L_E+L_M

in the formula L_MMask prediction for Bert self;

5) a trust tracking layer module using the output of the intention and slot value network module and the database operation module as input U_IOne, with the output Sn of the layer as one of the inputs, one of the inputs An of the intent and generation network, the Belief Tracker, as the final input

I_b＝U_I+Sn+An