CN111428448A - Text generation method and device, computer equipment and readable storage medium - Google Patents

Text generation method and device, computer equipment and readable storage medium

Info

Publication number
CN111428448A
CN111428448A (application CN202010136551.6A)
Authority
CN
China
Prior art keywords
sample
text
data
target
generator
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202010136551.6A
Other languages
Chinese (zh)
Other versions
CN111428448B (en)
Inventor
邓悦
金戈
徐亮
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Ping An Technology Shenzhen Co Ltd
Original Assignee
Ping An Technology Shenzhen Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Ping An Technology Shenzhen Co Ltd filed Critical Ping An Technology Shenzhen Co Ltd
Priority to CN202010136551.6A priority Critical patent/CN111428448B/en
Publication of CN111428448A publication Critical patent/CN111428448A/en
Priority to PCT/CN2020/118456 priority patent/WO2021174827A1/en
Application granted granted Critical
Publication of CN111428448B publication Critical patent/CN111428448B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • General Health & Medical Sciences (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Computational Linguistics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Evolutionary Computation (AREA)
  • Artificial Intelligence (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Health & Medical Sciences (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a text generation method and apparatus, a computer device, and a readable storage medium, belonging to the field of text processing. The text generation method, apparatus, computer device, and readable storage medium generate target text data from target guide data through a pre-trained text-generation adversarial network model, which solves the problem that discrete outputs cannot be used to update the network and makes it possible to generate text sentences from sentence-head data with the text-generation adversarial network model.

Description

Text generation method and device, computer equipment and readable storage medium
Technical Field
The present invention relates to the field of text processing, and in particular, to a text generation method, apparatus, computer device, and readable storage medium.
Background
In an intelligent interview scenario, an artificial intelligence (AI) system needs to ask a candidate preset questions, and also to pose open questions according to the actual situation so as to test the candidate's ability to respond on the spot. Open questioning requires the AI to generate question text with a generative model.
Existing generative models mainly use generative adversarial networks (GANs). Because a GAN must update its parameter variables based on continuous output data, it has mainly been applied to image processing: image generation tasks include unsupervised generation, labeled generation, super-resolution restoration, automatic colorization, street-view generation, and so on, and the generated images can be so realistic that the human eye cannot tell real from fake.
When a GAN is applied to a text generation task, the generator must, at each step, output a probability distribution over the vocabulary for the next word based on the text sequence generated so far, and then select a word. The output is therefore discrete data, and discrete data cannot be used to train and update the network. As a result, current GANs cannot be applied to text generation tasks.
Disclosure of Invention
To address the problem that existing generative adversarial networks support only continuous outputs, a text generation method, apparatus, computer device, and readable storage medium are provided, based on a text-generation adversarial network that can be updated from discrete data.
In order to achieve the above object, the present invention provides a text generating method, comprising the steps of:
acquiring answer data generated by a business object in a question-answer scenario;
extracting the answer data to obtain target guide data;
generating target text data from the target guide data through a pre-trained text-generation adversarial network model;
wherein the target guide data is the sentence-head data of the target text data.
In one embodiment, before the step of generating target text data from the target guide data through the pre-trained text-generation adversarial network model, the method includes:
acquiring a sample guide set and a sample text set, wherein the sample guide set comprises at least one sample guide datum, the sample text set comprises at least one sample text datum, and each sample guide datum is the sentence-head data of a sample text datum;
training an initial adversarial network model according to the sample guide set and the sample text set to obtain the text-generation adversarial network model.
In one embodiment, the initial adversarial network model includes a generator and a discriminator, and the step of training the initial adversarial network model according to the sample guide set and the sample text set to obtain the text-generation adversarial network model includes:
generating, by the generator, at least one sample text data according to at least one sample guide data in the sample guide set;
simulating the at least one sample text data by Monte Carlo simulation to obtain a plurality of sample simulated text data;
discriminating the plurality of sample simulated text data by the discriminator against the target text data in the sample text set, and updating the parameter values of the generator according to the discrimination result;
updating the discriminator according to a loss function, based on the updated generator;
and cyclically updating the generator and the discriminator until the initial adversarial network model meets a preset convergence condition, to obtain the text-generation adversarial network model formed by the updated generator.
In one embodiment, the step of generating, by the generator, at least one sample text data according to at least one sample guide data in the sample guide set comprises:
computing, by the generator, from the sample guide data the word with the maximum probability in a vocabulary as a first sample word, and appending the first sample word to the tail of the sample guide data;
and computing, by the generator, from the first sample word the word with the maximum probability in the vocabulary as a second sample word, appending the second sample word to the tail of the first sample word, and executing these steps cyclically until sample text data of a preset length is obtained.
In one embodiment, the step of simulating the at least one sample text data by Monte Carlo simulation and obtaining a plurality of sample simulated text data comprises:
simulating the words in each sample text data one by one using Monte Carlo simulation, and generating a plurality of sample simulated text data corresponding to that sample text data.
In one embodiment, the step of discriminating the plurality of sample simulated text data by the discriminator against the target text data in the sample text set, and updating the parameter values of the generator according to the discrimination result, includes:
discriminating the plurality of sample simulated text data by the discriminator against the target text data in the sample text set, and obtaining a state cost function according to the discrimination result;
and calculating an objective function from the state cost function, and updating the parameter values of the generator according to the objective function.
In one embodiment, the step of generating target text data from the target guide data through the pre-trained text-generation adversarial network model includes:
computing, by the generator of the text-generation adversarial network model, from the target guide data the word with the maximum probability in a vocabulary as a first sample word, and appending the first sample word to the tail of the target guide data;
and computing, by the generator, from the first sample word the word with the maximum probability in the vocabulary as a second sample word, appending the second sample word to the tail of the first sample word, and executing these steps cyclically until target text data of a preset length is obtained.
In order to achieve the above object, the present invention further provides a text generation apparatus, including:
a collection unit, used for collecting answer data generated by a business object in a question-answer scenario;
an acquisition unit, used for extracting the answer data to obtain target guide data;
and a generation unit, used for generating target text data from the target guide data through a pre-trained text-generation adversarial network model;
wherein the target guide data is the sentence-head data of the target text data.
To achieve the above object, the present invention further provides a computer device, which includes a memory, a processor, and a computer program stored on the memory and executable on the processor, and the processor implements the steps of the above method when executing the computer program.
To achieve the above object, the present invention also provides a computer-readable storage medium having stored thereon a computer program which, when being executed by a processor, carries out the steps of the above method.
The beneficial effects of the above technical solution are as follows:
In the above solution, the text generation method, apparatus, computer device, and readable storage medium generate target text data from target guide data (such as sentence-head data) through a pre-trained text-generation adversarial network model. This solves the problem that discrete outputs cannot be used to update the network, and achieves the goal of generating text sentences (such as text questions) from sentence-head data with the text-generation adversarial network model.
Drawings
FIG. 1 is a flowchart of an embodiment of a text generation method according to the present invention;
FIG. 2 is a flowchart of an embodiment of obtaining the text-generation adversarial network model;
FIG. 3 is a flowchart of an embodiment of training an initial adversarial network model on a sample guide set and a sample text set to obtain the text-generation adversarial network model;
FIG. 4 is a block diagram of an embodiment of a text generation apparatus according to the present invention;
FIG. 5 is a schematic diagram of a hardware architecture of an embodiment of a computer device according to the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention is described in further detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
The text generation method, apparatus, computer device, and readable storage medium of the present invention are suitable for business fields such as insurance and finance, and provide loan, insurance, and financial systems with a way to automatically generate open text questions for testing a candidate's thinking ability. The invention generates target text data from target guide data (such as sentence-head data) through a pre-trained text-generation adversarial network model, solving the problem that discrete outputs cannot be used to update the network and achieving the goal of generating text sentences (such as text questions) from sentence-head data.
Example one
Referring to fig. 1, a text generating method of the present embodiment includes the following steps:
s1, collecting answer data generated by a business object in a question and answer scene;
in this step, the business object may be a consultation user for consulting business, or a buyer of the online transaction platform, or an interviewer in the interviewing process. The answer data may be collected by a collection device (e.g., an audio receiving device, a microphone or a mobile terminal with a recording function, etc.).
The text generation method in this embodiment is mainly applied to a dialog scene (at least two users), and generates a question text based on the answer information of the target object, so that the target object can answer the question text, for example: when the text generation method is applied to an interview scene, an open text problem is generated according to keywords provided by interviewers.
S2, extracting the answer data to obtain target guide data;
In step S2, semantic analysis may be performed on the answer data to extract a keyword, which is used as the target guide data; alternatively, the answer data may be parsed to extract a noun, which is used as the target guide data.
It should be noted that the target guide data may be a keyword or the beginning word of a sentence.
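To make step S2 concrete, the following is a minimal sketch of extracting a keyword as target guide data. The use of jieba's TF-IDF keyword extraction, and the function name, are illustrative assumptions; the embodiment does not name a particular analysis library.

```python
# A minimal sketch of step S2, assuming jieba is available for Chinese text;
# the library choice and the top-1 keyword strategy are illustrative assumptions.
import jieba
import jieba.analyse

def extract_guide_data(answer_text: str) -> str:
    """Extract the highest-weighted keyword of the answer as target guide data."""
    keywords = jieba.analyse.extract_tags(answer_text, topK=1)
    if keywords:
        return keywords[0]
    # Fallback: use the first segmented word of the answer as the guide data.
    return next(iter(jieba.cut(answer_text)), "")
```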
S3, generating target text data from the target guide data through the pre-trained text-generation adversarial network model;
It should be noted that the target guide data is the sentence-head data of the target text data. For example, the target guide data is "today" and the target text data is "How is the weather today?". The target guide data may also be two or three words at the beginning of the sentence; this is not limited here.
Referring to fig. 2, before step S3 is performed, the text-generation adversarial network model may be obtained through the following steps:
S31, acquiring a sample guide set and a sample text set, wherein the sample guide set comprises at least one sample guide datum, the sample text set comprises at least one sample text datum, and each sample guide datum is the sentence-head data of a sample text datum;
In this embodiment, the sample guide set is a sequence of sample guide data (sentence-head data), and the sample text set is a sequence of real text data composed of sample text data (complete sentences). The sample guide data is the beginning of the real text data.
S32, training an initial adversarial network model according to the sample guide set and the sample text set to obtain the text-generation adversarial network model.
In image processing with generative adversarial networks, the pixel values of each point of the generated image are continuous, so the entire computation graph of the network, from the weights of the generator to its output, and on to the weights and output classification of the discriminator, is differentiable; errors can be back-propagated normally, and gradients and weights can be updated normally. In text generation, however, the generator actually outputs a sequence: at each round it outputs a probability distribution over the vocabulary for the next word based on the text sequence generated so far, and then selects the word with the highest probability. This "selection" step is non-differentiable: the generator outputs discrete tokens, and during training the errors back-propagated to those tokens cannot produce per-token gradient updates the way pixel values do in an image generation task, so the generator's weights cannot be updated. On the other hand, the discriminator can directly receive a complete text sequence as input and output whether the sentence is real or fake, but it cannot judge a half-finished sentence from the generator, so it cannot supervise the generator's training for every word of the generated text sequence.
Therefore, to solve the non-differentiability caused by the generator's discrete output during training of the adversarial network model, this embodiment treats the generation of a text sequence as a sequential decision process and adopts the policy gradient method from reinforcement learning: the judgment of the discriminator is treated as the reward, the partial text generated by the generator as the state, the generator as the agent, the prediction of the next word as the action, and the generator itself as the policy to be updated. This resolves the non-differentiable loss function caused by discrete outputs. To judge incomplete sequences, Monte Carlo search is used: the generator continues generating from the incomplete sequence until the sequence is complete, the discriminator judges it several times, and the mean of the final rewards is used as the reward estimate for the current incomplete sequence.
It should be noted that the initial adversarial network model comprises a generator and a discriminator. Referring to fig. 3, in step S32, the step of training the initial adversarial network model according to the sample guide set and the sample text set to obtain the text-generation adversarial network model includes:
By way of example and not limitation, the generator may be a sequence-output long short-term memory network (LSTM) that generates a text sequence from a given initial state, and the discriminator may be a binary-classification LSTM that receives the generator's output text and the real text and judges whether the output text is real or fake.
S321, generating, by the generator, at least one sample text data according to at least one sample guide data in the sample guide set;
Further, step S321 may include:
computing, by the generator, from the sample guide data the word with the maximum probability in a vocabulary as a first sample word, and appending the first sample word to the tail of the sample guide data;
and computing, by the generator, from the first sample word the word with the maximum probability in the vocabulary as a second sample word, appending the second sample word to the tail of the first sample word, and so on, until sample text data of a preset length is obtained.
In this step, the generator G_θ and the discriminator D_φ are initialized. The sample text set is a set of real texts S = {X_{1~T}}; the sentence length of each real text in the set is T, and the tail of each real text shorter than T is zero-padded. The sample guide set is a word set {Y_1}.
The word set {Y_1} is input to the generator G_θ. The input layer of G_θ maps each input word to the token corresponding to that word in the vocabulary (tokenization) and produces an embedded representation. In practice, the partial sentence (y_1, y_2, …, y_{t-1}) is fed to G_θ as input; based on this input, the softmax classifier of G_θ outputs, for each word in the vocabulary, the probability of its being the next word, and the word with the maximum probability is taken as y_t. This is repeated until the end of the sentence y_T, yielding a generated sample text set {Y_{1~T}} of length T (without zero-padding).
Here (y_1, y_2, …, y_{t-1}) denotes an incomplete sentence of t-1 words: y_1 is the 1st word of the sentence, y_2 the 2nd word, y_{t-1} the (t-1)-th word, and y_T the T-th word (the end of the sentence).
In this step, only a single word y_1 is fed to the generator G_θ; G_θ embeds it, passes it through the LSTM, and outputs the generated token sequence and the corresponding vocabulary words, giving the generated text sequence (y_1, y_2, …, y_T).
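A minimal sketch of this greedy decoding loop, assuming the Generator class from the sketch above; the stopping rule (a fixed length T, with no end-of-sentence token handling) is simplified for illustration.

```python
# Greedy decoding: starting from the guide token(s), repeatedly append the
# most probable next word until the sequence reaches length T.
import torch

@torch.no_grad()
def generate_greedy(generator, guide_tokens, T: int):
    """guide_tokens: (batch, t0) tensor of sentence-head token ids."""
    seq = guide_tokens
    while seq.size(1) < T:
        logits, _ = generator(seq)                     # (batch, seq_len, vocab)
        probs = torch.softmax(logits[:, -1], dim=-1)   # next-word distribution
        next_tok = probs.argmax(dim=-1, keepdim=True)  # maximum-probability word
        seq = torch.cat([seq, next_tok], dim=1)        # append to the tail
    return seq                                         # (batch, T)
```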
S322, simulating the at least one sample text data by Monte Carlo simulation to obtain a plurality of sample simulated text data;
Further, step S322 includes:
simulating the words in each sample text data one by one using Monte Carlo simulation, and generating a plurality of sample simulated text data corresponding to that sample text data.
In this implementation, for each sequence (y_1, y_2, …, y_T) in the sample text set {Y_{1~T}}, each word y_t in the sequence is traversed and N Monte Carlo simulations are performed. Unlike the earlier step, where the word with the highest probability was selected as y_t, here the generator G_θ samples each time from the multinomial distribution of output words, repeating until the end of the sentence y_T is reached. This yields N different complete sample simulated texts {Y_{1~T}^1, Y_{1~T}^2, …, Y_{1~T}^N}.
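The rollout itself might look like the following sketch, reusing the assumed Generator from earlier: unlike the greedy loop above, each continuation is drawn with torch.multinomial from the next-word distribution.

```python
# Monte Carlo rollout: complete one prefix N times by multinomial sampling.
import torch

@torch.no_grad()
def rollout(generator, prefix, T: int, N: int):
    """Return N completed sequences (N, T) for one prefix of shape (1, t)."""
    samples = []
    for _ in range(N):
        seq = prefix.clone()
        while seq.size(1) < T:
            logits, _ = generator(seq)
            probs = torch.softmax(logits[:, -1], dim=-1)
            next_tok = torch.multinomial(probs, num_samples=1)  # sample, not argmax
            seq = torch.cat([seq, next_tok], dim=1)
        samples.append(seq)
    return torch.cat(samples, dim=0)  # (N, T)
```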
It should be noted that the number of simulations for words at different positions in a sentence of the sample text set may be the same or different.
S323, discriminating the plurality of sample simulated text data by the discriminator against the target text data in the sample text set, and updating the parameter values of the generator according to the discrimination result;
Further, step S323 may include:
discriminating the plurality of sample simulated text data by the discriminator against the target text data in the sample text set, and obtaining a state cost function according to the discrimination result;
and calculating an objective function from the state cost function, and updating the parameter values of the generator according to the objective function.
In an embodiment, the obtained sample simulated text set {Y_{1~T}^1, Y_{1~T}^2, …, Y_{1~T}^N} is input to the discriminator D_φ for binary classification. Each sample simulated text is compared with the corresponding real text: if they are consistent, the sample simulated text generated by the generator is judged real (label 1); if not, it is judged fake (label 0). For a complete sentence, the output of the discriminator D_φ is used directly as the state value; for an incomplete sentence, the discrimination results of the N complete sentences obtained by Monte Carlo simulation are averaged. In summary, the state cost function can be expressed as

$$Q_{D_\phi}^{G_\theta}\big(s = Y_{1\sim t-1},\ a = y_t\big) = \begin{cases} \dfrac{1}{N}\displaystyle\sum_{i=1}^{N} D_\phi\big(Y_{1\sim T}^{\,i}\big), & t < T \\ D_\phi\big(Y_{1\sim T}\big), & t = T \end{cases}$$

where i indexes the N Monte Carlo simulations.
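In code, the state cost function above can be sketched as follows, assuming the Discriminator and rollout sketches given earlier; a batch of one prefix is used for simplicity.

```python
# Reward for a prefix: mean discriminator score of its N Monte Carlo
# completions if incomplete, otherwise the direct discriminator score.
import torch

@torch.no_grad()
def q_value(discriminator, generator, prefix, T: int, N: int) -> float:
    if prefix.size(1) < T:                              # incomplete sentence
        completions = rollout(generator, prefix, T, N)  # (N, T)
        return discriminator(completions).mean().item()
    return discriminator(prefix).item()                 # complete sentence
```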
The generator G_θ is updated according to the state cost function. The objective of the generator is to generate text realistic enough to fool the discriminator, that is, to maximize the reward obtained under its policy G_θ:

$$J(\theta) = \sum_{t=1}^{T} \sum_{y_t} G_\theta\big(y_t \mid Y_{1\sim t-1}\big)\, Q_{D_\phi}^{G_\theta}\big(Y_{1\sim t-1},\, y_t\big)$$

where the inner sum runs over all words y_t in the vocabulary; G_θ(y_t | Y_{1~t-1}) is the policy output, which is essentially a probability, namely the probability of outputting y_t from the vocabulary; and Y_{1~t-1} is everything that has appeared before y_t. The parameter θ is the weight parameter of the generator G_θ, and the generator is updated on J(θ); in other words, the policy gradient comes from J(θ):

$$\theta \leftarrow \theta + \alpha_\theta \nabla_\theta J(\theta)$$

where α_θ is the learning rate.
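A sketch of this update in PyTorch, under the same assumptions as the earlier sketches (the helper name is hypothetical): the REINFORCE-style surrogate weights each step's log-probability by its Q value, and gradient ascent on J(θ) is realized as descent on -J(θ).

```python
# Policy-gradient update of the generator on one generated sentence.
import torch

def generator_pg_step(generator, optimizer, seq, q_values):
    """seq: (1, T) generated token ids; q_values: (T,) per-step rewards."""
    logits, _ = generator(seq[:, :-1])                   # predicts tokens 2..T
    log_probs = torch.log_softmax(logits, dim=-1)
    taken = log_probs.gather(2, seq[:, 1:].unsqueeze(-1)).squeeze(-1)  # (1, T-1)
    # The first word comes from the guide data, so it is not scored here.
    loss = -(taken * q_values[1:].unsqueeze(0)).sum()    # minimize -J(theta)
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()                                     # theta += alpha * grad J
```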
S324, updating the discriminator according to a loss function based on the updated generator;
In this step, a set of text sequences {Y_{1~T}} is generated with the updated generator G_θ, and an equal number of text sequences {X_{1~T}} are selected from the real text set S = {X_{1~T}}; both are input to the discriminator D_φ for binary classification. The loss function is the binary logarithmic loss:

$$J(\phi) = -\,\mathbb{E}_{X_{1\sim T} \sim S}\big[\log D_\phi\big(X_{1\sim T}\big)\big] - \mathbb{E}_{Y_{1\sim T} \sim G_\theta}\big[\log\big(1 - D_\phi\big(Y_{1\sim T}\big)\big)\big]$$

D_φ is updated on J(φ):

$$\phi \leftarrow \phi - \alpha_\phi \nabla_\phi J(\phi)$$

where α_φ is the learning rate.
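One discriminator step under the same assumptions might look like this; real sentences are labelled 1 and generated ones 0, matching the binary log loss above.

```python
# One discriminator update on a batch of real and generated sentences.
import torch
import torch.nn.functional as F

def discriminator_step(discriminator, optimizer, real_seqs, fake_seqs):
    """real_seqs, fake_seqs: (batch, T) tensors of token ids."""
    scores_real = discriminator(real_seqs)   # D_phi(X), target 1
    scores_fake = discriminator(fake_seqs)   # D_phi(Y), target 0
    loss = F.binary_cross_entropy(scores_real, torch.ones_like(scores_real)) \
         + F.binary_cross_entropy(scores_fake, torch.zeros_like(scores_fake))
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()                         # phi -= alpha_phi * grad J(phi)
```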
S325, cyclically updating the generator and the discriminator until the initial adversarial network model meets a preset convergence condition, to obtain the text-generation adversarial network model formed by the updated generator.
In this step, each round of training repeats training the generator n_G times and the discriminator n_D times, until the model meets the preset convergence condition, for example n_D > n_G, to ensure that the discriminator can correctly guide the generator's updates.
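Putting the pieces together, a single training round under the assumptions of the earlier sketches could look like the following; n_G and n_D are the per-round counts just described, and a single-sentence batch is assumed so the Q-value helper applies directly.

```python
# One adversarial training round: n_G generator updates, then n_D
# discriminator updates (n_D > n_G as in the example condition above).
import torch

def train_round(generator, discriminator, g_opt, d_opt,
                guide, real_seq, T, N, n_G=1, n_D=2):
    """guide: (1, t0) sentence-head tokens; real_seq: (1, T) real sentence."""
    for _ in range(n_G):
        seq = generate_greedy(generator, guide, T)
        # Estimate a reward for every prefix of the generated sentence.
        q = torch.tensor([q_value(discriminator, generator, seq[:, :t + 1], T, N)
                          for t in range(T)])
        generator_pg_step(generator, g_opt, seq, q)
    for _ in range(n_D):
        fake = generate_greedy(generator, guide, T)
        discriminator_step(discriminator, d_opt, real_seq, fake)
```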
In step S3, the step of generating target text data from the target guide data through the pre-trained text-generation adversarial network model includes:
computing, by the generator of the text-generation adversarial network model, from the target guide data the word with the maximum probability in a vocabulary as a first sample word, and appending the first sample word to the tail of the target guide data;
and computing, by the generator, from the first sample word the word with the maximum probability in the vocabulary as a second sample word, appending the second sample word to the tail of the first sample word, and so on, until target text data of a preset length is obtained. In this way, target text data for questioning is generated from the answer data, achieving open question-and-answer based on the business object's responses and making it convenient to test the business object's ability to cope with open questions on the spot.
The text generation method in this embodiment is based on an adversarial long short-term memory network and policy gradients. Using an LSTM-based discriminator-generator structure, it can accurately perform the tasks of generating a text sequence and judging the authenticity of text. Through adversarial training, the parameters of the discriminator can be updated dynamically, continuously improving its recognition ability and providing appropriate guidance to the generator, so the quality of the generated text has more potential than text generation evaluated against other static benchmarks. Borrowing the idea of reinforcement learning, the sequence generation process is converted into a sequential decision process, which solves the problem that the loss function cannot be minimized because of discrete outputs and makes training the generative adversarial network possible. Monte Carlo search is used to obtain, by policy simulation, a complete sequence at each step together with its score from the discriminator, and the mean is used as the reward value of the current time step, solving the problem that a reward cannot be obtained directly for an unfinished sequence. In addition, only the generator part needs to be retained after the training stage; compared with other tricks for handling non-differentiable discrete outputs, such as Gumbel-softmax, no extra parameters are needed and the model footprint is smaller.
Example two
As shown in fig. 4, the present invention also provides a text generation apparatus 1, including a collection unit 11, an acquisition unit 12, and a generation unit 13, wherein:
the collection unit 11 is used for collecting answer data generated by a business object in a question-answer scenario;
The business object may be a user consulting about a business, a buyer on an online transaction platform, or a candidate in an interview. The answer data may be collected by a collection device (e.g., an audio receiving device, a microphone, or a mobile terminal with a recording function).
The text generation apparatus 1 in this embodiment is mainly applied to dialogue scenarios (with at least two users): a question text is generated based on the answer information of a target object so that the target object can answer it. For example, when the text generation apparatus 1 is applied to an interview scenario, an open text question is generated from keywords provided by the candidate.
The acquisition unit 12 is used for extracting the answer data to obtain target guide data;
The acquisition unit 12 may perform semantic analysis on the answer data to extract a keyword, which is used as the target guide data; alternatively, it may parse the answer data to extract a noun, which is used as the target guide data.
The generation unit 13 is used for generating target text data from the target guide data through a pre-trained text-generation adversarial network model;
wherein the target guide data is the sentence-head data of the target text data.
Specifically, the generation unit 13 computes, with the generator of the text-generation adversarial network model, from the target guide data the word with the highest probability in a vocabulary as a first sample word, and appends the first sample word to the end of the target guide data;
the generator then computes from the first sample word the word with the highest probability in the vocabulary as a second sample word, appends the second sample word to the end of the first sample word, and so on until target text data of a preset length is obtained.
In this embodiment, the text generation apparatus 1 is based on an adversarial long short-term memory network and policy gradients. Using an LSTM-based discriminator-generator structure, it can accurately perform the tasks of generating a text sequence and judging the authenticity of text. Through adversarial training, the parameters of the discriminator can be updated dynamically, continuously improving its recognition ability and providing appropriate guidance to the generator, so the quality of the generated text has more potential than text generation evaluated against other static benchmarks. Borrowing the idea of reinforcement learning, the sequence generation process is converted into a sequential decision process, which solves the problem that the loss function cannot be minimized because of discrete outputs and makes training the generative adversarial network possible. Monte Carlo search is used to obtain, by policy simulation, a complete sequence at each step together with its score from the discriminator, and the mean is used as the reward value of the current time step, solving the problem that a reward cannot be obtained directly for an unfinished sequence. In addition, only the generator part needs to be retained after the training stage; compared with other tricks for handling non-differentiable discrete outputs, such as Gumbel-softmax, no extra parameters are needed and the model footprint is smaller.
Example three
In order to achieve the above object, the present invention further provides a computer device 2. The computer device 2 may comprise a plurality of computer devices 2, and the components of the text generation apparatus 1 of the second embodiment may be distributed across different computer devices 2. The computer device 2 may be a smartphone, a tablet computer, a notebook computer, a desktop computer, a rack server, a blade server, a tower server, or a cabinet server (including an independent server or a server cluster composed of multiple servers) that executes a program, or the like. The computer device 2 of this embodiment includes at least, but is not limited to, a memory 21, a processor 23, a network interface 22, and the text generation apparatus 1, which can be communicatively connected to each other through a system bus (refer to fig. 5).
In this embodiment, the memory 21 includes at least one type of computer-readable storage medium, including a flash memory, a hard disk, a multimedia card, a card-type memory (e.g., SD or DX memory), a Random Access Memory (RAM), a Static Random Access Memory (SRAM), a Read-Only Memory (ROM), an Electrically Erasable Programmable Read-Only Memory (EEPROM), a Programmable Read-Only Memory (PROM), a magnetic memory, a magnetic disk, an optical disk, and the like. In some embodiments, the memory 21 may be an internal storage unit of the computer device 2, such as a hard disk or memory of the computer device 2. In other embodiments, the memory 21 may also be an external storage device of the computer device 2, such as a plug-in hard disk, a Smart Media Card (SMC), a Secure Digital (SD) card, or a Flash Card provided on the computer device 2. Of course, the memory 21 may also comprise both an internal storage unit of the computer device 2 and an external storage device thereof. In this embodiment, the memory 21 is generally used for storing the operating system installed on the computer device 2 and various types of application software, such as the program code of the text generation method of the first embodiment. Further, the memory 21 may also be used to temporarily store various types of data that have been output or are to be output.
The processor 23 may, in some embodiments, be a Central Processing Unit (CPU), a controller, a microcontroller, a microprocessor, or another data processing chip. The processor 23 is typically used to control the overall operation of the computer device 2, for example performing control and processing related to data interaction or communication with the computer device 2. In this embodiment, the processor 23 is used to run the program code stored in the memory 21 or to process data, for example to run the text generation apparatus 1.
The network interface 22 may comprise a wireless network interface or a wired network interface, and is typically used to establish communication connections between the computer device 2 and other computer devices 2. For example, the network interface 22 is used to connect the computer device 2 to an external terminal through a network, and to establish a data transmission channel and a communication connection between the computer device 2 and the external terminal. The network may be a wireless or wired network such as an intranet, the Internet, the Global System for Mobile communications (GSM), Wideband Code Division Multiple Access (WCDMA), a 4G network, a 5G network, Bluetooth, Wi-Fi, and the like.
It is noted that fig. 5 only shows the computer device 2 with components 21-23, but it is to be understood that not all of the shown components are required to be implemented, and that more or fewer components may be implemented instead.
In this embodiment, the text generation apparatus 1 stored in the memory 21 may be further divided into one or more program modules, which are stored in the memory 21 and executed by one or more processors (the processor 23 in this embodiment) to carry out the present invention.
Example four
To achieve the above object, the present invention also provides a computer-readable storage medium, which includes a plurality of storage media such as a flash memory, a hard disk, a multimedia card, a card-type memory (e.g., SD or DX memory), a Random Access Memory (RAM), a Static Random Access Memory (SRAM), a Read-Only Memory (ROM), an Electrically Erasable Programmable Read-Only Memory (EEPROM), a Programmable Read-Only Memory (PROM), a magnetic memory, a magnetic disk, an optical disk, a server, an App application store, and the like, on which a computer program is stored that implements corresponding functions when executed by the processor 23. The computer-readable storage medium of this embodiment is used for storing the text generation apparatus 1, which, when executed by the processor 23, implements the text generation method of the first embodiment.
The above-mentioned serial numbers of the embodiments of the present invention are merely for description and do not represent the merits of the embodiments.
Through the above description of the embodiments, those skilled in the art will clearly understand that the method of the above embodiments can be implemented by software plus a necessary general hardware platform, and certainly can also be implemented by hardware, but in many cases, the former is a better implementation manner.
The above description is only a preferred embodiment of the present invention, and not intended to limit the scope of the present invention, and all modifications of equivalent structures and equivalent processes, which are made by using the contents of the present specification and the accompanying drawings, or directly or indirectly applied to other related technical fields, are included in the scope of the present invention.

Claims (10)

1. A text generation method based on a question-answer scenario, characterized by comprising the following steps:
acquiring answer data generated by a business object in a question-answer scenario;
extracting the answer data to obtain target guide data;
generating target text data from the target guide data through a pre-trained text-generation adversarial network model;
wherein the target guide data is the sentence-head data of the target text data.
2. The text generation method of claim 1, wherein before the step of generating target text data from the target guide data through the pre-trained text-generation adversarial network model, the method comprises:
acquiring a sample guide set and a sample text set, wherein the sample guide set comprises at least one sample guide datum, the sample text set comprises at least one sample text datum, and each sample guide datum is the sentence-head data of a sample text datum;
training an initial adversarial network model according to the sample guide set and the sample text set to obtain the text-generation adversarial network model.
3. The text generation method of claim 2, wherein the initial adversarial network model comprises a generator and a discriminator, and the step of training the initial adversarial network model according to the sample guide set and the sample text set to obtain the text-generation adversarial network model comprises:
generating, by the generator, at least one sample text data according to at least one sample guide data in the sample guide set;
simulating the at least one sample text data by Monte Carlo simulation to obtain a plurality of sample simulated text data;
discriminating the plurality of sample simulated text data by the discriminator against the target text data in the sample text set, and updating the parameter values of the generator according to the discrimination result;
updating the discriminator according to a loss function, based on the updated generator;
and cyclically updating the generator and the discriminator until the initial adversarial network model meets a preset convergence condition, to obtain the text-generation adversarial network model formed by the updated generator.
4. The text generation method of claim 3, wherein the step of generating, by the generator, at least one sample text data according to at least one sample guide data in the sample guide set comprises:
computing, by the generator, from the sample guide data the word with the maximum probability in a vocabulary as a first sample word, and appending the first sample word to the tail of the sample guide data;
computing, by the generator, from the first sample word the word with the maximum probability in the vocabulary as a second sample word, and appending the second sample word to the tail of the first sample word;
and executing the above steps cyclically until sample text data of a preset length is obtained.
5. The text generation method of claim 3, wherein the step of simulating the at least one sample text data by Monte Carlo simulation and obtaining a plurality of sample simulated text data comprises:
simulating the words in each sample text data one by one using Monte Carlo simulation, and generating a plurality of sample simulated text data corresponding to that sample text data.
6. The text generation method of claim 3, wherein the step of discriminating the plurality of sample simulated text data by the discriminator against the target text data in the sample text set, and updating the parameter values of the generator according to the discrimination result, comprises:
discriminating the plurality of sample simulated text data by the discriminator against the target text data in the sample text set, and obtaining a state cost function according to the discrimination result;
and calculating an objective function from the state cost function, and updating the parameter values of the generator according to the objective function.
7. The text generation method of claim 3, wherein the step of generating target text data from the target guide data through the pre-trained text-generation adversarial network model comprises:
computing, by the generator of the text-generation adversarial network model, from the target guide data the word with the maximum probability in a vocabulary as a first sample word, and appending the first sample word to the tail of the target guide data;
computing, by the generator, from the first sample word the word with the maximum probability in the vocabulary as a second sample word, and appending the second sample word to the tail of the first sample word;
and executing the above steps cyclically until target text data of a preset length is obtained.
8. A text generation apparatus based on a question-answer scenario, characterized by comprising:
a collection unit, used for collecting answer data generated by a business object in a question-answer scenario;
an acquisition unit, used for extracting the answer data to obtain target guide data;
and a generation unit, used for generating target text data from the target guide data through a pre-trained text-generation adversarial network model;
wherein the target guide data is the sentence-head data of the target text data.
9. A computer device comprising a memory, a processor, and a computer program stored on the memory and executable on the processor, characterized in that: the processor, when executing the computer program, realizes the steps of the method of any one of claims 1 to 7.
10. A computer-readable storage medium having stored thereon a computer program, characterized in that: the computer program when executed by a processor implements the steps of the method of any one of claims 1 to 7.
CN202010136551.6A 2020-03-02 2020-03-02 Text generation method, device, computer equipment and readable storage medium Active CN111428448B (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN202010136551.6A CN111428448B (en) 2020-03-02 2020-03-02 Text generation method, device, computer equipment and readable storage medium
PCT/CN2020/118456 WO2021174827A1 (en) Text generation method and apparatus, computer device and readable storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010136551.6A CN111428448B (en) 2020-03-02 2020-03-02 Text generation method, device, computer equipment and readable storage medium

Publications (2)

Publication Number Publication Date
CN111428448A (en) 2020-07-17
CN111428448B CN111428448B (en) 2024-05-07

Family

ID=71553527

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010136551.6A Active CN111428448B (en) 2020-03-02 2020-03-02 Text generation method, device, computer equipment and readable storage medium

Country Status (2)

Country Link
CN (1) CN111428448B (en)
WO (1) WO2021174827A1 (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112036544A (en) * 2020-07-31 2020-12-04 五八有限公司 Image generation method and device
CN112861179A (en) * 2021-02-22 2021-05-28 中山大学 Method for desensitizing personal digital spatial data based on text-generated countermeasure network
WO2021174827A1 (en) * 2020-03-02 2021-09-10 平安科技(深圳)有限公司 Text generation method and apparatus, computer device and readable storage medium

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115481630A (en) * 2022-09-27 2022-12-16 深圳先进技术研究院 Electronic insurance letter automatic generation method and device based on sequence countermeasure and prior reasoning
CN116010609B (en) * 2023-03-23 2023-06-09 山东中翰软件有限公司 Material data classifying method and device, electronic equipment and storage medium
CN117933268A (en) * 2024-03-21 2024-04-26 山东大学 End-to-end unsupervised resistance text rewriting method and device

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106663125A (en) * 2014-08-21 2017-05-10 国立研究开发法人情报通信研究机构 Question sentence generation device and computer program
CN109522411A (en) * 2018-11-12 2019-03-26 南京德磐信息科技有限公司 A kind of writing householder method neural network based
CN110162595A (en) * 2019-03-29 2019-08-23 深圳市腾讯计算机***有限公司 For generating the method, apparatus, equipment and readable storage medium storing program for executing of text snippet
US20190384810A1 (en) * 2018-06-15 2019-12-19 Beijing Baidu Netcom Science And Technology Co., Ltd. Method of training a descriptive text generating model, and method and apparatus for generating descriptive text

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105654945B (en) * 2015-10-29 2020-03-06 乐融致新电子科技(天津)有限公司 Language model training method, device and equipment
CN110019732B (en) * 2017-12-27 2021-10-15 华为技术有限公司 Intelligent question answering method and related device
CN110619118B (en) * 2019-03-28 2022-10-28 中国人民解放军战略支援部队信息工程大学 Automatic text generation method
CN110196899B (en) * 2019-06-11 2020-07-21 中央民族大学 Low-resource language question-answer corpus generating method
CN111428448B (en) * 2020-03-02 2024-05-07 平安科技(深圳)有限公司 Text generation method, device, computer equipment and readable storage medium


Also Published As

Publication number Publication date
CN111428448B (en) 2024-05-07
WO2021174827A1 (en) 2021-09-10

Similar Documents

Publication Publication Date Title
CN111428448B (en) Text generation method, device, computer equipment and readable storage medium
CN111078836B (en) Machine reading understanding method, system and device based on external knowledge enhancement
CN111767366B (en) Question and answer resource mining method and device, computer equipment and storage medium
CN111444340A (en) Text classification and recommendation method, device, equipment and storage medium
CN111222305A (en) Information structuring method and device
CN110717023B (en) Method and device for classifying interview answer text, electronic equipment and storage medium
CN111581966A (en) Context feature fusion aspect level emotion classification method and device
KR102265573B1 (en) Method and system for reconstructing mathematics learning curriculum based on artificial intelligence
CN106649739B (en) Multi-round interactive information inheritance identification method and device and interactive system
CN112487139A (en) Text-based automatic question setting method and device and computer equipment
CN116824278B (en) Image content analysis method, device, equipment and medium
CN110929640B (en) Wide remote sensing description generation method based on target detection
CN112069329B (en) Text corpus processing method, device, equipment and storage medium
CN114048729A (en) Medical document evaluation method, electronic device, storage medium, and program product
CN110929532B (en) Data processing method, device, equipment and storage medium
CN110852071A (en) Knowledge point detection method, device, equipment and readable storage medium
CN113569018A (en) Question and answer pair mining method and device
CN115617974B (en) Dialogue processing method, device, equipment and storage medium
CN113342932B (en) Target word vector determining method and device, storage medium and electronic device
CN115687910A (en) Data processing method and device, computer equipment and readable storage medium
CN113610080A (en) Cross-modal perception-based sensitive image identification method, device, equipment and medium
CN113569091A (en) Video data processing method and device
CN117611845B (en) Multi-mode data association identification method, device, equipment and storage medium
CN113761837B (en) Entity relationship type determining method, device and equipment and storage medium
CN114723050B (en) Method and device for determining prompt vector of pre-training model and electronic equipment

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant