CN106997375B

CN106997375B - Customer service reply recommendation method based on deep learning

Info

Publication number: CN106997375B
Application number: CN201710112855.7A
Authority: CN
Inventors: 王东辉; 梁建增; 庄越挺
Original assignee: Zhejiang University ZJU
Current assignee: Zhejiang University ZJU
Priority date: 2017-02-28
Filing date: 2017-02-28
Publication date: 2020-08-18
Anticipated expiration: 2037-02-28
Also published as: CN106997375A

Abstract

The invention discloses a customer service reply recommendation method based on deep learning, which is used for recommending customer service reply by directly learning a conversation model from a conversation record of customer service; the method carries out model construction through an end-to-end training mode, and is high in construction speed compared with the traditional method based on rules and artificial features; the method improves the coverage of the reply content in the customer service reply recommendation system; meanwhile, the invention can be applied to the customer service in various vertical fields, including but not limited to: e-commerce, medicine, law, etc.

Description

Customer service reply recommendation method based on deep learning

Technical Field

The invention belongs to the technical field of customer service assistance, and particularly relates to a customer service reply recommendation method based on deep learning.

Background

With the continuous development of the internet economy, the scale and the volume of the e-commerce platform for providing online goods and service shopping are increasing, and the changing trend puts higher requirements on the service efficiency of online customer service. Meanwhile, some new customer service scenes such as online health consultation and online legal consultation relate to more knowledge and have higher requirements on professional staff.

The existing solution ideas are two kinds: one is to use an intelligent customer service robot to directly take over a part of consultation tasks instead of manual work; the other method is to use a computer to assist manual customer service so as to improve the working efficiency of the customer service. The common intelligent customer service robot usually uses a manually constructed knowledge base as a core engine to replace manual work to complete some common simple consultation services. The disadvantages of this method are mainly: the robot can only process common standard problems generally, and is difficult to process some problems with strong individuation or low occurrence frequency; the establishment of a complex knowledge base requires a great deal of cost; the acceptance of the robot by the user is not as good as that of manual customer service. Computer-assisted human customer service, in general, can accomplish tasks that are not directly interactive with the user, such as: standard reply recommendations, frequently asked question and answer pair recommendations. The existing customer service auxiliary system is mainly designed aiming at scenes with relatively less required professional knowledge, such as trade disputes, commodity information consultation and the like.

The existing solution needs to manually establish and arrange a previous knowledge base or a corpus, and the cost is high. Meanwhile, the coverage rate problem exists in the knowledge base and the standard reply, the design is usually carried out aiming at the problems with more occurrence times, and the coverage is less for the problems with stronger individuation. The traditional online customer service scene problems are distributed more intensively, and the related professional knowledge is relatively less, so that the problems of construction cost and coverage rate of a knowledge base and corpora are relatively easy to solve. However, in a scene with higher professional requirements, as the depth and the breadth of knowledge are increased, the difficulty in constructing a knowledge base and corresponding linguistic data is increased, and the higher problem coverage rate is difficult to realize.

Disclosure of Invention

Aiming at the problems in the prior art, the invention provides a customer service reply recommendation method based on deep learning, which can recommend reply texts which are possibly required by the customer service currently for the customer service according to a conversation record in a customer service consultation process, thereby reducing the input amount of the customer service in the working process and improving the working efficiency of the customer service.

The technical scheme of the invention is as follows:

the customer service reply recommendation method based on deep learning is characterized by comprising the following steps:

(1) clustering all customer service replies in the chat corpus and constructing a candidate customer service reply set;

(2) training word vectors on a customer service chat corpus;

(3) carrying out standardization processing on the conversation records of the customer service chat corpus;

(4) training a dialogue model on the processed corpus;

(5) and inputting the current conversation record of the customer service into the model, and acquiring the recommended reply at the moment.

The step of clustering all replies in the step (1) is as follows:

1.1, processing the speech by using a Chinese word segmentation device to obtain a word segmentation result;

1.2 running an LDA theme generation model on the word segmentation result and calculating the representation of each reply vector;

1.3 running a k-means clustering algorithm on all the replied vector representations, and completing clustering of replied texts by using Euclidean distance through a distance measurement function to obtain n clusters: { c₁，…，c_n}

Wherein: d () is a function of the calculated distance, X, Y are vector representations of the two replies, X_i，y_iComponents X, Y, respectively;

1.4 setting a threshold m from { c }₁,…,c_nRandomly sampling m sentences in each corresponding class cluster to form a candidate reply set C;

the step of training the word vector in the step (2) is as follows:

2.1 using a Chinese word segmentation device to perform word segmentation processing on all corpora;

2.2 training word vectors on the corpus of the divided words by using a word vector training tool to obtain a word vector matrix.

The step (3) of normalizing the corpus comprises the following steps:

3.1 deleting meaningless format control symbols in all chat records and non-manually generated conversations;

3.2 dividing the conversation in the chat log into the following according to the role of generating the conversation: the method comprises the following steps that two types of conversations of a user and a customer service are adopted, and a plurality of conversation records of each service are processed as follows on the basis: { user, customer service, … }, i.e., each set of conversations begins with the user's conversation, with the user and customer service conversations occurring strictly in alternation;

3.3, carrying out truncation processing on the overlong dialogue single sentence dialogue and the field-crossing dialogue sequence.

The step of training the deep dialogue model in the step (4) is as follows:

4.1 randomly picking a set of samples from the dataset, a set of dialog sequences for each sample can be expressed as { (u)₀,h₀,u₁,h₁…), … } wherein u_iI-th sentence, h, representing a user in a dialog sequence_iThe ith sentence of the dialogue representing the customer service in the dialogue sequence;

4.2 randomly selecting one customer service dialogue from the corpus as a negative sample for each customer service dialogue in the sample

4.3 for all dialogs in the sample:

the coding is carried out by using a single sentence coder (Utterance Encoder) provided by the invention, and the vector expressions of each sentence are respectively obtained:

4.4 represent one sample as a sequence of vectors:

on the basis, the sequence is coded by using a Context coder (Context Encoder) to obtain a vector output sequence which represents the coding result of the dialog history at each moment of the dialog:

4.5 at each moment of the customer service session, using the corresponding moment

As input, the local loss function value is calculated using the following formula:

wherein margin is a manually set threshold, Sim (x, y) is a similarity function, and the calculation method is shown in the following formula:

4.6 add all local losses in the current batch of samples to obtain the loss function value required by the update, and the calculation formula is as follows:

wherein: n is the number of samples contained in the current batch, m_iIs the number of customer service sessions, loss, contained in the ith sample_i，jLocal loss values of the jth sentence in the ith sample at the moment corresponding to customer service are obtained;

4.7 update all parameters in the dialogue model using gradient descent;

4.8 if the iteration number reaches a threshold (the threshold can be set according to the requirement), the model is saved, the iteration is ended, and if not, the step 4.1 is returned.

The flow of encoding a single sentence by the single sentence encoder in said step 4.3 is as follows:

4.3.1 according to the different roles of the generated sentences, assigning a role mark to each sentence, setting the role mark of the sentence generated by the user to be 0, setting the role mark of the sentence generated by the customer service or the candidate reply to be 1, and using a symbol r to represent the role mark;

4.3.2 replacing each word in the sentence by the word vector representation form corresponding to the word vector matrix trained in the step (2) by searching the word vector matrix, and representing the word as

4.3.3 and color coordinates the corner with each word vector in the sentenceThe cascading is respectively carried out, and the process can be represented as follows: v. of^w′＝[v^w；r]；

4.3.4, using a two-layer GRU network to read in the processed word vector sequence and obtain the output of the last moment as the encoding result of Utterance Encoder, wherein the updating mode of each GRU unit is shown as the following formula:

z_t＝σ(W_z·[h_t-1,x_t])

r_t＝(W_r·[h_t-1,x_t])

h_t＝(1-z_t)*h_t-1+z_t*h_t

wherein x_tRepresenting input at a certain moment of time, h_tIndicating the output at the corresponding time instant.

The flow of encoding the dialog record sequence by the context encoder in said step 4.4 is as follows:

4.4.1 reading in the vector representation of each sentence in the dialogue record:

4.4.2 insert an all zero vector of the same length at the head of the vector sequence:

4.4.3 use a sliding window of length 2 to splice the vectors in the sequence, resulting in an output of:

4.4.4 using the spliced vector sequence as input, encoding it using a two-layer GRU network, taking the output at each time and expressing as:

wherein

An output representing the ith time instant;

4.4.5 the context expression vector of each time is combined with the sentence vector corresponding to the time in a cascade manner, and the output obtained at the ith time is as follows:

wherein

Represents the output of the GRU network at time i,

a vector encoding of the ith utterance representing the user in the dialog sequence,

a vector code representing an ith utterance serviced in the dialog sequence;

4.4.6 remapping the stitched vector with the linear layer, the calculation is as follows

V_i′＝W^T·V_i

Wherein W^TIs a linear layer weight matrix obtained by training, V_iFor the spliced vector, V_i' A vector representing a context for context at that time represents a result.

The process of selecting a reply according to the history of the current customer service conversation in the step (5) is as follows:

5.1 the candidate reply set constructed in step (1) is represented as: a ═ a₀,a₁,…,a_nIn which a_iRepresenting a candidate reply text;

5.2 Using a single sentence encoder to represent each candidate reply sentence as a vector, the process is the same as in step 4.3, and its output is represented as:

5.3 using Utterance Encoder to code each dialog in the current dialog record, the processing steps are the same as 4.3, and the processing result is expressed as:

5.4 use

The encoding process is the same as step 4.4 as the input of the context encoder, and the output of the last moment is taken out as the encoding result of the current context vector, V^c；

5.5 traversing the vector codes of all the candidate replies, and calculating the matching degree of each candidate reply and the current context, wherein the calculation method is shown as the following formula:

and 5.6, comparing the matching degree scores of all the candidate replies, and taking the highest score as the recommended reply.

The invention provides a customer service reply recommendation method based on a depth model by utilizing a chat conversation record generated in the service process of the customer service, and provides a method for applying the customer service reply recommendation method to an actual scene. Therefore, compared with the traditional method, the method has the following beneficial effects:

(1) the method carries out model construction in an end-to-end training mode, and the construction speed is higher;

(2) the method of the invention improves the coverage of the reply content in the customer service reply recommendation system;

(3) the method of the invention ensures that the construction of the customer service reply recommendation system does not need the participation of personnel with expert knowledge;

(4) the method of the invention can be applied to the customer service in various vertical fields, including but not limited to: e-commerce, medicine, law, etc.

Drawings

FIG. 1 is a schematic view of the main process of the present invention.

Fig. 2 is a schematic structural diagram of a single sentence encoder proposed in the present invention.

Fig. 3 is a schematic structural diagram of a context encoder proposed in the present invention.

Detailed Description

In order to more specifically describe the present invention, the following detailed description is provided for the technical solution of the present invention with reference to the accompanying drawings and the specific embodiments.

The method flow of the invention specifically comprises the following steps:

(1) extracting and screening a candidate reply set from the customer service chat corpus;

(2) training word vectors by using customer service chatting corpus;

(3) carrying out standardization processing on customer service chat corpora;

(4) training a customer service dialogue model;

(5) selecting recommended reply content using a conversation model based on a current conversation record

The step (1) is divided into the following steps:

1-1: screening all dialogues spoken by customer service from the corpus;

1-2: segmenting the selected dialogs by using a knot segmentation tool;

1-3: performing topic modeling on the dialog after word segmentation by using an LDA topic model, and acquiring vector representation of the dialog;

1-4: clustering the screened replies on a vector space by using a k-means clustering algorithm;

1-5: and randomly selecting equal replies in each cluster, and performing deduplication operation on the replies to form a candidate reply set.

The step (2) is divided into the following steps:

2-1, segmenting all conversation records by using a bus segmentation tool;

2-2 training word vectors on the segmented word conversation records, the specific tools used are: word2 vec.

The step (3) is divided into the following steps:

3-1 deleting all format control symbols in the chat records and non-artificially generated conversations;

3-2 dividing the conversations in the chat log into: the method comprises the following steps that two types of conversations of a user and a customer service are adopted, and a plurality of conversation records of each service are processed as follows on the basis: the sequence of user, host, user, host, …, i.e. each set of dialogs starts with a user's dialog and the dialogs of user and customer service appear strictly alternating, where user stands for user and host stands for customer service.

The step (4) is divided into the following steps:

4-1 initializing model parameters, wherein a word vector weight matrix is initialized by using pre-trained word vectors, and the rest parameters are initialized by using random numbers which accord with Gaussian distribution;

4-2, extracting n samples from the data by using a random sampling method to serve as the currently trained batch;

4-3, randomly selecting a customer service dialogue as a negative sample reply from the corpus for replying all customer services in the sample;

4-4 as shown in fig. 1, a single sentence encoder is used to encode all sentences in a sample to obtain their corresponding vector representations, and the specific process is as follows: setting character codes of sentences, converting all words into word vector representations, splicing the character codes and the word vectors, coding a vector sequence by using a double-layer GRU (generalized regression Unit), and acquiring the output of the last moment as a coding result;

4-5 as shown in fig. 2, for each sample, vector representations of all sentences are taken as input, and the vector sequences are encoded by using a context encoder respectively, so as to obtain dialog context representation results at different times, wherein the specific process is as follows: filling an all-zero vector with the same length in the head of a vector sequence, splicing adjacent vectors in the vector sequence by using a sliding window with the length of 2, reading the spliced vector sequence by using a double-layer GRU network, calculating the output at each moment, cascading the output of the GRU network and the vector code of an original sentence at each moment, performing linear transformation and dimension reduction by using a linear layer, and taking out the output as a context coding result at the moment;

4-6, respectively calculating the similarity degree of the original reply and the negative sample reply with the context code at the moment at all the time when the customer service replies for each sample, and calculating a local loss value by using a change loss function;

4-7, adding all local loss values of all samples in the current batch to obtain a global loss value;

4-8, updating all parameters in the model by using a gradient descent method, and setting the learning rate to be 0.01;

4-9, judging whether the change degree of the iteration times and the global loss value reaches a threshold value, if so, stopping training and storing the model, and if not, returning to the step 4-2.

The step (5) is divided into the following steps:

5-1, using a single sentence coder to code all sentences in the candidate set into a vector form;

5-2, coding each dialog in the current dialog record by using a single sentence coder;

5-3, reading in vector representation of each sentence in the current conversation by using a context encoder, and taking the output of the last moment as a current context representation result;

5-4, traversing all candidate replies, calculating the similarity degree of the candidate replies and the current context by using cosine similarity, and selecting the reply with the highest score from the candidate replies and recommending the reply to the customer service.

The above examples are not intended to limit the present invention, and the present invention is not limited to the above embodiments, and the present invention is within the scope of the present invention as long as the requirements of the present invention are met.

Claims

1. A customer service reply recommendation method based on deep learning comprises the following steps:

(1) clustering all customer service replies in the chat corpus and constructing a candidate customer service reply set; training word vectors on a customer service chat corpus;

(4) training a dialogue model on the corpus after the standardization processing;

(5) inputting the current conversation record of the customer service into the conversation model to obtain a corresponding recommendation reply;

the specific steps of the dialogue model training in the step (4) are as follows:

4.1 randomly picking a batch of samples from the dataset, each sample being a multi-group dialog sequence that can be expressed as { (u)₀,h₀,u₁,h₁…), … }, where u is_iI-th sentence, h, representing a user in a dialog sequence_iThe ith sentence of the dialogue representing the customer service in the dialogue sequence;

4.3 for all dialogues of each set of dialog sequences in the sample:

all use single sentence encoder to encode, all encode each sentence into the fixed length vector, obtain the vector expression of each sentence respectively:

4.4 represent each set of dialog sequences as a sequence of vectors:

on the basis, the sequence is coded by using a context coder, the coding result of the dialog record generated when each dialog appears can be obtained, each dialog of the user and the customer service corresponds to a moment, and a vector output sequence is obtained after coding and represents the coding result of the dialog history at each moment:

wherein k is the number of dialogs contained in the set of dialog sequences;

4.5 at each moment of the customer service session, use the corresponding moment i

4.6 add all local losses in the current batch of samples to obtain a loss function value required by the update, wherein the calculation formula is as follows:

wherein: n is the number of samples contained in the current batch, m_iNumber of customer service replies, loss, contained in the ith sample_i,Replying a local loss value at a corresponding moment for the jth customer service in the ith sample;

4.7 update all parameters in the dialogue model using gradient descent;

4.8 if the iteration times reach the threshold value, the model is saved, the iteration is ended, and if not, the step 4.1 is returned.

2. The customer service reply recommendation method according to claim 1, wherein: the step (1) specifically comprises the following steps:

1.1, processing all customer service replies in the corpus by using a Chinese word segmentation device to obtain word segmentation results;

1.2, performing topic modeling on the dialog after word segmentation by using an LDA topic model, and acquiring vector representation of the dialog;

1.3 clustering the customer service replies on a vector space by using a k-means clustering algorithm;

1.4 randomly selecting equal replies in each cluster, and performing deduplication operation on the replies to form a candidate reply set.

3. The customer service reply recommendation method according to claim 1, wherein: the specific steps of the standardized processing of the dialogue records in the step (3) are as follows:

3.1 deleting all format control symbols and non-artificially generated dialogs in the dialog record;

3.2 the dialog is divided into, according to the role that generates it: the method comprises the following steps that two types of conversations of a user and a customer service are adopted, and a plurality of conversation records of each service are processed as follows on the basis: the sequence of user, host, user, host, …, i.e. each set of dialogs starts with a user's dialog and the dialogs of user and customer service appear strictly alternating, where user stands for user and host stands for customer service.

4. The customer service reply recommendation method according to claim 1, wherein: the specific steps of encoding a single sentence by using a single sentence encoder in the step 4.3 are as follows:

5.1 according to the different roles of the sentences, assigning a role mark to each sentence, setting the role mark of the sentence generated by the user to be 0, setting the role mark of the sentence generated by the customer service or the candidate reply to be 1, and using a symbol r to represent the role mark;

5.2 replacing each word in the sentence by a word vector representation form corresponding to the word vector representation form by searching the word vector matrix trained in the step (2) to represent as

5.3 and the role mark is respectively cascaded with each word vector in the sentence, and the process can express thatComprises the following steps: v. of^w′＝[v^w；r]；

And 5.4, reading the processed word vector sequence by using a two-layer GRU network, and obtaining the output of the last moment as the coding result of the single sentence coder.

5. The customer service reply recommendation method according to claim 1, wherein: the specific steps of encoding the dialog sequence by using the context encoder in step 4.4 are as follows:

6.1 reading in the vector representation of each sentence in the dialogue record:

6.2 insert an all-zero vector of the same length at the head of the vector sequence:

6.3 use the length of 2 sliding window to splice the vectors in the sequence, get the output:

6.4 using the spliced vector sequence as input, using a double-layer GRU network to encode the vector sequence, and taking the output of each time and expressing as:

wherein

An output representing the ith time instant;

6.5 the context expression vector of each time is combined with the sentence vector corresponding to the time in a cascade way, and the output obtained at the ith time is as follows:

wherein

Represents the output of the GRU network at time i,

a vector code representing the ith utterance spoken by the user in the dialog sequence,

a vector code representing an ith utterance to be spoken by a customer in the dialog sequence;

6.6 remapping each spliced vector with a linear layer, the calculation is as follows

V_i′＝W^T·V_i

Wherein W^TIs a linear layer weight matrix, V_iFor the spliced vector, V_i' is the vector representation result for the context at that time.

6. The customer service reply recommendation method according to claim 1, wherein: the specific steps of selecting the recommended reply according to the current conversation history in the step (5) are as follows:

7.1 the set of candidate replies constructed in step (1) is represented as: a ═ a₀，a₁，…，a_nIn which a_iRepresenting a candidate reply text;

7.2 Using a single sentence encoder, each candidate reply sentence is represented in vector form, the output of which is represented as:

7.3 use the single sentence encoder to encode each dialog in the current dialog record, and express its processing result as:

7.4 use

Taking the last moment output as the current context vector coding result V as the input of the context coder^c；

7.5 vector encoding V over all candidate replies^aCalculating the matching degree of each candidate reply and the current context;

7.6 comparing the matching degree scores of all the candidate replies, and taking the highest score as the recommended reply.