CN111241291B - Method and device for generating countermeasure sample by utilizing countermeasure generation network - Google Patents

Publication number
CN111241291B
CN111241291B (application number CN202010329630.9A)
Authority
CN
China
Prior art keywords
vector
sample
ith
category
generator
Prior art date
Legal status
Active
Application number
CN202010329630.9A
Other languages
Chinese (zh)
Other versions
CN111241291A (en
Inventor
任彦昆
Current Assignee
Alipay Hangzhou Information Technology Co Ltd
Original Assignee
Alipay Hangzhou Information Technology Co Ltd
Priority date
Filing date
Publication date
Application filed by Alipay Hangzhou Information Technology Co Ltd filed Critical Alipay Hangzhou Information Technology Co Ltd
Priority to CN202010329630.9A
Publication of CN111241291A
Application granted
Publication of CN111241291B
Legal status: Active
Anticipated expiration

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/35Clustering; Classification
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • G06F18/2411Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on the proximity to a decision surface, e.g. support vector machines
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • G06F18/2415Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on parametric or probabilistic models, e.g. based on likelihood ratio or false acceptance rate versus a false rejection rate

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Artificial Intelligence (AREA)
  • Probability & Statistics with Applications (AREA)
  • Databases & Information Systems (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

An embodiment of the present specification provides a method for generating countermeasure samples (adversarial examples) using a countermeasure generation network. The countermeasure generation network includes a pre-trained classifier that performs an N-class classification task for business objects, a generator that produces simulated samples corresponding to the real samples of each class, and N discriminators corresponding to the N classes, where the ith discriminator judges whether a sample input to it is a real sample of the ith class. The method trains the generator and the discriminators; the trained generator is then used to generate countermeasure samples that have a specified true class but are predicted by the classifier as other classes, so that large batches of high-quality countermeasure samples can be generated efficiently and quickly.

Description

Method and device for generating countermeasure sample by utilizing countermeasure generation network
Technical Field
Embodiments of this specification relate to the field of computer technology, and in particular to a method and an apparatus for generating countermeasure samples using a countermeasure generation network.
Background
A countermeasure sample (adversarial example) is an input sample formed by deliberately adding subtle perturbations to the input data, causing a machine learning model to output an erroneous result with high confidence. For example, in a text classification scenario, text that the text classification model originally identified as violating content is misclassified as non-violating after small modifications that leave the semantics essentially unchanged to a human reader. Likewise, in an image recognition scenario, a picture that the image processing model originally recognized as a panda is misclassified as a gibbon after a slight change imperceptible to the human eye is added.
Countermeasure samples can be used by attackers to attack a machine learning model and degrade its prediction accuracy. Therefore, a large number of countermeasure samples need to be generated in advance and used to train the machine learning model, so that the model can classify them correctly and resist external attacks.
However, generating countermeasure samples currently requires considerable manual intervention, and the number and quality of the samples obtained are quite limited. A solution is therefore needed to generate high-quality countermeasure samples in large batches quickly and efficiently.
Disclosure of Invention
One or more embodiments of this specification provide a method for generating countermeasure samples using a countermeasure generation network, enabling fast and efficient generation of large batches of high-quality countermeasure samples.
According to a first aspect, there is provided a method for generating countermeasure samples using a countermeasure generation network. The countermeasure generation network comprises a pre-trained classifier for performing an N-class classification task on business objects, a generator, and N discriminators corresponding to the N categories, where N is a positive integer greater than 1. The method comprises the following steps:
obtaining a first noise vector and an ith category vector corresponding to the ith category, where i is a positive integer not greater than N; inputting the first noise vector and the ith category vector together into the generator to obtain a first simulated sample corresponding to real samples of the ith category; inputting the first simulated sample into the ith discriminator to obtain a first probability that the first simulated sample is a real sample of the ith category; inputting an acquired first real sample belonging to the ith category into the ith discriminator to obtain a second probability that the first real sample is a real sample of the ith category; training the ith discriminator with the objective of decreasing the first probability and increasing the second probability; inputting the first simulated sample into the classifier to obtain a third probability that the first simulated sample belongs to the ith category; and training the generator with the objective of increasing the first probability and decreasing the third probability, the trained generator being used to generate target countermeasure samples that imitate real samples of a target category but are predicted by the classifier as other categories.
In one embodiment, obtaining the first noise vector comprises: randomly sampling a noise space conforming to a Gaussian distribution to obtain the first noise vector.
In one embodiment, obtaining the ith category vector corresponding to the ith category comprises: acquiring N category labels and one-hot encoding them to obtain N one-hot encoded vectors; and taking the N one-hot encoded vectors as the N category vectors, which include the ith category vector.
In one embodiment, inputting the first noise vector and the ith category vector together into the generator comprises: splicing (concatenating) the first noise vector and the ith category vector to obtain a spliced vector, and inputting the spliced vector into the generator; or summing the first noise vector and the ith category vector to obtain a summed vector, and inputting the summed vector into the generator.
In one embodiment, the business object is text and the generator is a Recurrent Neural Network (RNN); inputting the first noise vector and the ith category vector together into the generator to obtain the first simulated sample comprises: fusing the first noise vector and the ith category vector to obtain a fusion vector, which serves as the initial state vector of the hidden layer in the RNN; and taking a wildcard character for text as the initial input of the RNN to obtain the first simulated sample.
In one embodiment, the business object is text, a picture, or audio, and the trained generator is used to generate text countermeasure samples, picture countermeasure samples, or audio countermeasure samples, respectively.
In one embodiment, after training the generator, the method further comprises: obtaining a second noise vector and a target category vector corresponding to a target category, the target category belonging to the N categories; and inputting the second noise vector and the target category vector together into the trained generator to obtain the target countermeasure sample.
According to a second aspect, there is provided an apparatus for generating countermeasure samples using a countermeasure generation network comprising a pre-trained classifier for performing N classes of classification tasks for business objects; the countermeasure generation network further includes a generator and N discriminators corresponding to the N categories, where N is a positive integer greater than 1; the device comprises:
a noise vector acquisition unit configured to acquire a first noise vector; a category vector acquisition unit configured to acquire an ith category vector corresponding to the ith category, where i is a positive integer not greater than N; a simulated sample generating unit configured to input the first noise vector and the ith category vector together into the generator to obtain a first simulated sample corresponding to real samples of the ith category; a simulated sample discriminating unit configured to input the first simulated sample into the ith discriminator to obtain a first probability that the first simulated sample is a real sample of the ith category; a real sample discriminating unit configured to input an acquired first real sample belonging to the ith category into the ith discriminator to obtain a second probability that the first real sample is a real sample of the ith category; a discriminator training unit configured to train the ith discriminator with the objective of decreasing the first probability and increasing the second probability; a simulated sample classifying unit configured to input the first simulated sample into the classifier to obtain a third probability that the first simulated sample belongs to the ith category; and a generator training unit configured to train the generator with the objective of increasing the first probability and decreasing the third probability, the trained generator being used to generate target countermeasure samples that imitate real samples of a target category but are predicted by the classifier as other categories.
According to a third aspect, there is provided a computer readable storage medium having stored thereon a computer program which, when executed in a computer, causes the computer to perform the method provided in the first or second aspect.
According to a fourth aspect, there is provided a computing device comprising a memory having stored therein executable code and a processor that, when executing the executable code, implements the method provided in the first or second aspect.
In summary, with the method and apparatus for generating countermeasure samples provided by the embodiments of this specification, countermeasure samples that have a specified true category but are predicted by the classifier as other categories can be generated. Moreover, with the trained generator, large batches of high-quality countermeasure samples can be generated efficiently and quickly.
Drawings
In order to illustrate the technical solutions of the disclosed embodiments more clearly, the drawings needed in the description of the embodiments are briefly introduced below. The drawings described below show only some of the disclosed embodiments, and those skilled in the art can derive other drawings from them without creative effort.
FIG. 1 illustrates the content of a text countermeasure sample according to one example;
FIG. 2 illustrates an application architecture diagram of a countermeasure generation network according to one embodiment;
FIG. 3 illustrates a flow diagram of a method for generating countermeasure samples using a countermeasure generation network according to one embodiment;
FIG. 4 shows a schematic diagram of the structure of an RNN network according to one embodiment;
FIG. 5 is a schematic diagram of an apparatus for generating countermeasure samples using a countermeasure generation network according to an embodiment.
Detailed Description
Embodiments disclosed in the present specification are described below with reference to the accompanying drawings.
As mentioned above, generating countermeasure samples currently requires substantial manual intervention, and the number and quality of the samples obtained are very limited. For example, in the natural language field, countermeasure samples are currently generated mainly by replacing individual words in the original text. In one example, FIG. 1 shows the content of a text countermeasure sample: the struck-through words belong to the original text, and replacing them with other words yields a text countermeasure sample (or natural language countermeasure sample). Such replacement does not take the context of the replaced word into account, so the resulting sentence often violates grammatical rules and reads poorly. In addition, after words are modified, the meaning of the text may change substantially, so that a human would assign the modified text to a different category, which violates the principle of countermeasure sample generation; the quality of countermeasure samples generated by this replacement approach is therefore poor. Moreover, generating countermeasure samples by replacement requires a large amount of trial and verification, and only a small number of text countermeasure samples can be obtained.
For another example, in the field of image processing, countermeasure samples are currently generated mainly by the fast gradient method or the iterative gradient method. In the fast gradient method, an image is input into the model to be attacked to obtain a prediction result, and the original image is then modified by gradient descent so as to worsen that prediction; the iterative gradient method repeats this process several times on the same image and takes the last modified image as the countermeasure sample. However, the quality of countermeasure samples obtained by the fast gradient method is not high: although the confidence of the correct class is reduced, the sample is rarely assigned to another class with high confidence. The iterative gradient method, in turn, requires a large amount of computation to generate a countermeasure sample.
Furthermore, the inventors have found that with current approaches it is difficult to quickly generate good-quality countermeasure samples with a specified true category (that is, the category a human would assign).
Based on the above observations, the inventor designed a specific countermeasure generation network and further provides a method for generating countermeasure samples using it, so that large batches of high-quality countermeasure samples can be generated quickly and efficiently. It should be noted that the "countermeasure" in the countermeasure generation network and the "countermeasure" in the countermeasure sample are two different concepts.
Specifically, FIG. 2 shows an application architecture diagram of a countermeasure generation network according to an embodiment. The countermeasure generation network is used to process data related to a business object. As shown in FIG. 2, it includes a pre-trained classifier for performing an N-class classification task (N being a positive integer greater than 1), a generator for generating business object samples, and N discriminators corresponding to the N categories, where each discriminator determines whether a sample input to it is a real sample of the corresponding category or a fake sample produced by the generator. The trained generator can then be used to generate countermeasure samples that are highly similar to real samples of any specified one of the N categories but are identified by the classifier as other categories.
For the sake of understanding, the above-mentioned countermeasure generation network will be specifically described below.
In an embodiment, the business object targeted by the countermeasure generation network may be text; accordingly, the classifier, the generator, and the discriminators are a text classifier, a text generator, and text discriminators, respectively, and the countermeasure samples produced by the trained generator are text countermeasure samples. In a specific embodiment, the text may be text posted by users on a social platform; the classification task may be to identify whether the posted text contains violating content, with the N categories set as violating and non-violating, or to identify the user emotion expressed in the text, with the N categories set as, for example, happy, angry, and calm. In another specific embodiment, the text may be information texts on a content platform; the classification task may be to determine the domain to which an information text belongs, with the N categories set as sports, entertainment, popular science, and so on, or to identify the user's degree of interest in the information text, with the N categories set as not interested, very interested, and so on.
In another embodiment, the business object may be a picture; accordingly, the classifier, the generator, and the discriminators are a picture classifier, a picture generator, and picture discriminators, respectively, and the countermeasure samples produced by the trained generator are picture countermeasure samples. In a specific embodiment, the pictures may be pictures of animals and plants, the classification task may be to determine the species shown, and the N categories may include panda, tiger, lion, and so on. In another specific embodiment, the picture may be a face image, the classification task may be to identify the identity of the face, and the N categories may be N different user identities, where a user identity can be uniquely identified by a mobile phone number, an identity card number, or the like.
In yet another embodiment, the business object may be audio; accordingly, the classifier, the generator, and the discriminators are an audio classifier, an audio generator, and audio discriminators, respectively, and the countermeasure samples produced by the trained generator are audio countermeasure samples. In a specific embodiment, the audio may be user query speech recorded in a customer-service setting, and the classification task may be to map the query speech to a standard user question, with the N categories including standard questions such as how to activate Huabei (a credit product) and how to adjust its credit limit. In another specific embodiment, the audio may be verification speech used as a login password, and the classification task may be to identify the user identity corresponding to the verification speech.
In still another embodiment, the business object may also be a user, a merchant, a commodity, a business event, and the like. In a particular embodiment, the business event may include a social event (such as a session initiated through instant messaging software or a transfer initiated through a payment platform), a login event, and the like.
As can be seen from the above, the countermeasure generation network can be used to process relevant data of the business objects such as text, pictures, audio, users, merchants, business events, and the like.
On the other hand, in one embodiment, the classifier, the generator, and the discriminators in the countermeasure generation network may be implemented based on a Convolutional Neural Network (CNN) or a Deep Neural Network (DNN). In an embodiment, the network structures of the N discriminators may be the same or different.
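As a purely illustrative sketch (the patent does not prescribe a framework, layer sizes, or concrete architectures, so every module name and dimension below is an assumption), the three kinds of components could be instantiated as simple DNN-style networks in PyTorch as follows:

```python
import torch
import torch.nn as nn

class Generator(nn.Module):
    """Maps a noise vector fused with a category vector to a simulated business-object sample."""
    def __init__(self, noise_dim, num_classes, sample_dim):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(noise_dim + num_classes, 256), nn.ReLU(),
            nn.Linear(256, sample_dim),
        )

    def forward(self, z, c):
        # Fuse the noise vector and the category vector by concatenation (splicing).
        return self.net(torch.cat([z, c], dim=-1))

class Discriminator(nn.Module):
    """One such discriminator is built per category; it outputs the probability
    that its input is a real sample of that category."""
    def __init__(self, sample_dim):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(sample_dim, 128), nn.ReLU(),
            nn.Linear(128, 1), nn.Sigmoid(),
        )

    def forward(self, x):
        return self.net(x)

N, noise_dim, sample_dim = 3, 64, 100               # assumed sizes
generator = Generator(noise_dim, N, sample_dim)
discriminators = [Discriminator(sample_dim) for _ in range(N)]
# The classifier is assumed to be pre-trained elsewhere and is kept frozen during GAN training.
classifier = nn.Sequential(nn.Linear(sample_dim, N), nn.Softmax(dim=-1))
for p in classifier.parameters():
    p.requires_grad_(False)
```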
The countermeasure generation network has been described above. A method for generating countermeasure samples using this network is described below with reference to embodiments. Specifically, FIG. 3 shows a flowchart of a method for generating countermeasure samples using a countermeasure generation network according to an embodiment; the method can be executed by any apparatus, device, system, or platform with computing and processing capabilities. As shown in FIG. 3, the method comprises the following steps:
step S310, acquiring a first noise vector and an ith category vector corresponding to the ith category, where i is a positive integer not greater than N; step S320, inputting the first noise vector and the ith category vector together into the generator to obtain a first simulated sample corresponding to real samples of the ith category; step S330, inputting the first simulated sample into the ith discriminator to obtain a first probability that the first simulated sample is a real sample of the ith category; step S340, inputting an acquired first real sample belonging to the ith category into the ith discriminator to obtain a second probability that the first real sample is a real sample of the ith category; step S350, training the ith discriminator with the objective of decreasing the first probability and increasing the second probability; step S360, inputting the first simulated sample into the classifier to obtain a third probability that the first simulated sample belongs to the ith category; step S370, training the generator with the objective of increasing the first probability and decreasing the third probability, the trained generator being used to generate target countermeasure samples that imitate real samples of a target category but are predicted by the classifier as other categories.
In the above steps, it should first be noted that terms such as "first" in "first noise vector" and "first simulated sample", and "second" in "second probability", are used only to distinguish items of the same kind and impose no other limitation.
The steps are as follows:
first, in step S310, a first noise vector is obtained, and an ith class vector corresponding to the ith class is obtained, where i is a positive integer not greater than N.
For convenience of description, the noise vector acquired in this step is referred to as the first noise vector. Specifically, a noise space conforming to a specific distribution may be randomly sampled to obtain the first noise vector. In one embodiment, the specific distribution may be a Gaussian distribution (for example, a standard normal distribution). In another embodiment, the specific distribution may be a Laplacian distribution.
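For illustration only (the dimensionality is an assumed value), sampling the first noise vector from such a noise space could look like this:

```python
import torch

noise_dim = 64                # assumed dimensionality of the noise space
z = torch.randn(noise_dim)    # random sample from a standard normal (Gaussian) distribution
# For a Laplacian noise space one could instead sample, e.g.:
# z = torch.distributions.Laplace(0.0, 1.0).sample((noise_dim,))
```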
On the other hand, in an embodiment, obtaining the ith category vector may include: first acquiring N category labels and one-hot encoding them to obtain N one-hot encoded vectors; and then taking the N one-hot encoded vectors as the N category vectors, which include the ith category vector.
In a specific embodiment, the N category labels may be N category identifiers, each of which uniquely identifies a corresponding category. In one example, a category label may consist of letters, numbers, symbols, or the like. In another specific embodiment, the N category labels may be the category names of the N categories, such as "low risk", "medium risk", and "high risk".
It should be understood that one-hot encoding, also known as one-bit-effective encoding, uses an M-bit status register (M being a positive integer) to encode M states, each state having its own independent register bit, and only one bit is active at any time. On this basis, the acquired category labels can be encoded into N-dimensional vectors in which exactly one dimension takes a value different from all the others. In one example, assuming the N category labels are the category numbers 1, 2, and 3, the three labels may be encoded into the one-hot vectors (1, 0, 0), (0, 1, 0), and (0, 0, 1), respectively.
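A minimal sketch of this encoding, reusing the three category numbers from the example above (PyTorch's `one_hot` helper is used purely for illustration):

```python
import torch
import torch.nn.functional as F

labels = torch.tensor([0, 1, 2])             # category numbers 1, 2, 3 re-indexed from zero
one_hot = F.one_hot(labels, num_classes=3)   # rows: (1, 0, 0), (0, 1, 0), (0, 0, 1)
c_i = one_hot[1].float()                     # the ith category vector, here for category 2
```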
In another embodiment, N category vectors may be assigned to the N categories at random, provided the N category vectors are distinct from one another. In this way, the N category vectors corresponding to the N categories are obtained, and the ith category vector corresponding to the ith category is taken from them.
After the first noise vector and the ith category vector are obtained, in step S320 they are input together into the generator to obtain a first simulated sample corresponding to real samples of the ith category.
Specifically, the first noise vector and the ith category vector may be fused into a fusion vector, which is input into the generator. In one embodiment, the fusion process is a splicing (concatenation) operation, and the fusion vector is a spliced vector. In another embodiment, the fusion process is a summation operation, and the fusion vector is a summed vector.
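Both fusion variants are simple tensor operations. A sketch with assumed dimensions follows; note that the summation variant requires the two vectors to have the same length (for example by padding the category vector):

```python
import torch

z = torch.randn(64)                       # first noise vector
c_i = torch.zeros(64)                     # ith category vector, padded to the same length
c_i[1] = 1.0

concat_vec = torch.cat([z, c_i], dim=-1)  # splicing (concatenation): length 128
sum_vec = z + c_i                         # summation: length 64
```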
In one implementation, when the business object is text, the generator may be a Recurrent Neural Network (RNN). FIG. 4 shows a schematic diagram of the structure of an RNN network according to an embodiment. As shown in FIG. 4, the RNN predicts the next word word_{t+1} from the hidden state vector h_t and the current input word word_t, where t is a natural number. In particular, given an initial state vector h_0 and using a wildcard character <Go> (a special symbol indicating the beginning of a sentence) as the first word word_1, the trained RNN can generate the words of a sentence one after another, producing a natural language sentence that conforms to real-world language.
On this basis, the method may include: providing the fusion vector as the initial state vector of the hidden layer in the RNN, and taking a wildcard character for text (such as <Go>) as the initial input of the RNN, to obtain a passage of natural language text output by the RNN, namely the first simulated sample. Using the simulated text obtained in this way as a text countermeasure sample makes the sample better conform to human language habits, follow grammar, and read smoothly.
It should be noted that when the business object is text, the generator may also be another neural network derived from the RNN, such as a Long Short-Term Memory (LSTM) network or a Gated Recurrent Unit (GRU) network.
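As an illustrative sketch only (the vocabulary, dimensions, and greedy decoding strategy are assumptions rather than part of the patent), a GRU-based generator could seed its hidden state with the fusion vector and start decoding from the <Go> wildcard like this:

```python
import torch
import torch.nn as nn

vocab_size, embed_dim, hidden_dim, max_len = 5000, 128, 256, 20   # assumed sizes
GO_ID = 0                                   # assumed vocabulary id of the <Go> wildcard

embed = nn.Embedding(vocab_size, embed_dim)
rnn = nn.GRU(embed_dim, hidden_dim, batch_first=True)
out_proj = nn.Linear(hidden_dim, vocab_size)

def generate_text(fusion_vec):
    """Greedily generate one simulated sentence from the fused noise/category vector."""
    h = fusion_vec.view(1, 1, hidden_dim)   # fusion vector as the initial hidden state h_0
    token = torch.tensor([[GO_ID]])         # <Go> wildcard as the first input word
    words = []
    for _ in range(max_len):
        out, h = rnn(embed(token), h)       # predict word_{t+1} from h_t and word_t
        token = out_proj(out).argmax(dim=-1)
        words.append(token.item())
    return words

# A stand-in fusion vector; in the method it comes from the noise and category vectors.
simulated_sentence = generate_text(torch.randn(hidden_dim))
```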
The first simulated sample imitating real samples of the ith category is thus obtained. Then, in step S330, the first simulated sample is input into the ith discriminator to obtain a first probability that the first simulated sample is a real sample of the ith category. In step S340, an acquired first real sample belonging to the ith category is input into the ith discriminator to obtain a second probability that the first real sample is a real sample of the ith category. In one embodiment, the first real sample may be obtained by random sampling from the set of real samples corresponding to the ith category. In another embodiment, several real samples belonging to the ith category may be drawn from the total set of real samples of the N categories and used as first real samples.
It should be noted that the purpose of the ith discriminator is to distinguish, as well as possible, real samples of the ith category from the simulated (fake) samples that the generator produces to imitate them. In one embodiment, the discriminator outputs the probability that its input is a real sample, and this output can be used directly as the probability that the corresponding sample is real. In another embodiment, the discriminator outputs the probability that its input is a fake sample, and the probability that the sample is real is then one minus this output. Accordingly, in step S350, the ith discriminator is trained with the objective of decreasing the first probability and increasing the second probability.
Specifically, the discriminant training loss of the ith discriminator is determined from the first probability and the second probability, and the model parameters of the ith discriminator are then adjusted using this loss. In one embodiment, the discriminant training loss may be determined by:
$$ L_{D}^{i} \;=\; -\,\mathbb{E}_{x \sim P_{\mathrm{data}}^{i}}\!\left[\log D_i(x)\right] \;-\; \mathbb{E}_{\tilde{x} \sim P_{g}^{i}}\!\left[\log\left(1 - D_i(\tilde{x})\right)\right] \tag{1} $$

where $D_i(\cdot)$ denotes the discriminant function of the ith discriminator; $x$ and $\tilde{x}$ denote the first real sample and the first simulated sample, respectively; $D_i(x)$ and $D_i(\tilde{x})$ denote the second probability and the first probability, respectively; $P_{\mathrm{data}}^{i}$ denotes the distribution of real samples of the ith category to which $x$ conforms; $P_{g}^{i}$ denotes the distribution of simulated samples of the ith category to which $\tilde{x}$ conforms; and $\mathbb{E}[\cdot]$ denotes the expected value.
It should be noted that the discriminant training loss may also be determined by calculating the Wasserstein distance, which is not described in detail herein.
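For illustration, formula (1) can be computed directly from the two probabilities. The sketch below assumes the discriminator outputs the probability that its input is a real sample and uses placeholder values:

```python
import torch

def discriminator_loss(d_real, d_fake, eps=1e-8):
    """Formula (1): training to increase d_real (second probability) and decrease d_fake (first probability)."""
    return -(torch.log(d_real + eps).mean() + torch.log(1.0 - d_fake + eps).mean())

d_real = torch.tensor([0.6, 0.7])   # second probability: real samples of the ith category
d_fake = torch.tensor([0.4, 0.3])   # first probability: simulated samples for the ith category
loss_d = discriminator_loss(d_real, d_fake)
# Backpropagating loss_d would update only the parameters of the ith discriminator.
```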
The ith discriminator can thus be trained, and by analogy each of the N discriminators can be trained.
On the other hand, the generator also needs to be trained. It should be noted that the purpose of the generator is to fool the discriminators as far as possible, so that they judge the simulated samples it outputs to be real samples. The discriminators and the generator therefore confront each other and continuously adjust their parameters, until the discriminators can no longer tell whether a simulated sample output by the generator is real. Furthermore, it is desired that the first simulated sample, generated to imitate real samples of the ith category, be classified by the classifier into categories other than the ith category as far as possible. Accordingly, in step S360, the first simulated sample is input into the classifier to obtain a third probability that it belongs to the ith category. Then, in step S370, the generator is trained with the objective of increasing the first probability and decreasing the third probability.
Specifically, the generation training loss of the generator is determined from the first probability and the third probability, and the model parameters of the generator are adjusted using this loss. In one embodiment, the generation training loss may be determined by:
$$ L_{G} \;=\; -\,\mathbb{E}_{z \sim P_{z}}\!\left[\log D_i\!\left(G(z, c_i)\right)\right] \;+\; \mathbb{E}_{z \sim P_{z}}\!\left[\log F\!\left(y_i \mid G(z, c_i)\right)\right] \tag{2} $$

where $z$ denotes the first noise vector; $c_i$ denotes the ith category vector; $G(\cdot)$ denotes the generation function of the generator, so that $G(z, c_i)$ is the first simulated sample; $D_i(\cdot)$ denotes the discriminant function of the ith discriminator, so that $D_i(G(z, c_i))$ is the first probability; $P_z$ denotes the distribution of the noise space to which $z$ conforms; $F(\cdot)$ denotes the classification function of the classifier; $y_i$ denotes the ith category, so that $F(y_i \mid G(z, c_i))$ is the third probability that the first simulated sample is classified into $y_i$; and $\mathbb{E}[\cdot]$ denotes the expected value.
It should be noted that other forms of loss functions may also be used to determine the generation training loss.
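Correspondingly, formula (2) rewards simulated samples that the ith discriminator accepts as real while penalizing those the classifier still assigns to the ith category. A sketch with placeholder values:

```python
import torch

def generator_loss(d_fake, cls_prob_i, eps=1e-8):
    """Formula (2): training to increase d_fake (first probability) and decrease cls_prob_i (third probability)."""
    return -torch.log(d_fake + eps).mean() + torch.log(cls_prob_i + eps).mean()

d_fake = torch.tensor([0.4, 0.3])       # first probability from the ith discriminator
cls_prob_i = torch.tensor([0.2, 0.1])   # third probability: classifier score for the ith category
loss_g = generator_loss(d_fake, cls_prob_i)
# Backpropagating loss_g would update the generator only; the classifier stays frozen.
```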
Thus, by repeating steps S310 to S370, multiple rounds of iterative training of the discriminators and the generator in the countermeasure generation network can be carried out. The generator obtained after these rounds of iterative training can then be used to generate target countermeasure samples that imitate real samples of a target category but are predicted by the classifier as other categories. Specifically, in one embodiment, after step S370 the method further comprises: obtaining a second noise vector and a target category vector corresponding to a target category, the target category belonging to the N categories; and inputting the second noise vector and the target category vector together into the trained generator to obtain the target countermeasure sample. In this way, countermeasure samples with a specified true category can be generated.
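With a trained generator, producing a target countermeasure sample therefore amounts to a single forward pass; a sketch follows (the stand-in generator and sizes are assumptions):

```python
import torch

noise_dim, N, target = 64, 3, 2
# Stand-in for the trained generator; in practice this is the generator after multi-round training.
generator = torch.nn.Sequential(torch.nn.Linear(noise_dim + N, 100))

z2 = torch.randn(noise_dim)            # second noise vector
c_target = torch.zeros(N)              # target category vector (one-hot)
c_target[target] = 1.0
target_adv_sample = generator(torch.cat([z2, c_target]))
# The sample imitates a real sample of the target category, yet the classifier is
# expected to predict some other category for it.
```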
As for the training of the discriminators and the generator, an end-to-end scheme may be used, in which the discriminators and the generator are both updated in each training iteration; alternatively, the model parameters of the generator may first be fixed while the discriminators are trained several times, after which the discriminator parameters are fixed while the generator is trained, iterating in this way until the multiple rounds of training are completed. The number of iterations may be a manually set hyper-parameter, or training may simply continue until the models converge without presetting the number of iterations.
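A compact sketch of such an alternating schedule, using stand-in networks and assumed hyper-parameters (one possible realization, not the patent's prescribed implementation):

```python
import torch
import torch.nn as nn

N, noise_dim, sample_dim, k, rounds = 3, 64, 100, 5, 200      # assumed hyper-parameters

generator = nn.Sequential(nn.Linear(noise_dim + N, sample_dim))
discriminators = [nn.Sequential(nn.Linear(sample_dim, 1), nn.Sigmoid()) for _ in range(N)]
classifier = nn.Sequential(nn.Linear(sample_dim, N), nn.Softmax(dim=-1))   # pre-trained, frozen
for p in classifier.parameters():
    p.requires_grad_(False)

opt_g = torch.optim.Adam(generator.parameters(), lr=1e-4)
opt_d = [torch.optim.Adam(d.parameters(), lr=1e-4) for d in discriminators]
real_data = [torch.randn(32, sample_dim) for _ in range(N)]   # placeholder real samples per category

def simulate(i):
    """Build a batch of simulated samples for category i from noise and one-hot category vectors."""
    z = torch.randn(32, noise_dim)
    c = torch.zeros(32, N)
    c[:, i] = 1.0
    return generator(torch.cat([z, c], dim=-1))

for _ in range(rounds):
    i = torch.randint(N, (1,)).item()
    for _ in range(k):                                # several discriminator steps, generator fixed
        d_fake = discriminators[i](simulate(i).detach())
        d_real = discriminators[i](real_data[i])
        loss_d = -(torch.log(d_real + 1e-8).mean() + torch.log(1 - d_fake + 1e-8).mean())
        opt_d[i].zero_grad(); loss_d.backward(); opt_d[i].step()
    fake = simulate(i)                                # one generator step, discriminators fixed
    d_fake = discriminators[i](fake)
    p_i = classifier(fake)[:, i]
    loss_g = -torch.log(d_fake + 1e-8).mean() + torch.log(p_i + 1e-8).mean()
    opt_g.zero_grad(); loss_g.backward(); opt_g.step()
```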
In the above embodiments, the first probability and the second probability both refer to the probability that the corresponding sample is a real sample. In other embodiments, the first probability and the second probability may instead refer to the probability that the corresponding sample is a fake sample; it should be understood that such embodiments also fall within the scope of protection of the claims.
In summary, with the method for generating countermeasure samples using a countermeasure generation network provided by the embodiments of this specification, countermeasure samples that have a specified true category but are predicted by the classifier as other categories can be generated. Moreover, with the trained generator, large batches of high-quality countermeasure samples can be generated efficiently and quickly.
Corresponding to the generation method, the embodiment of the specification also discloses a generation device. In particular, fig. 5 shows a schematic structural diagram of an apparatus for generating countermeasure samples using a countermeasure generation network according to an embodiment, wherein the countermeasure generation network includes a pre-trained classifier for performing classification tasks of N classes for a business object; the countermeasure generation network further includes a generator and N discriminators corresponding to the N categories, where N is a positive integer greater than 1.
The above apparatus may be implemented by any platform, server, or device cluster with computing and processing capabilities. As shown in FIG. 5, the apparatus 500 includes:
a noise vector acquisition unit 510 configured to acquire a first noise vector; a category vector acquisition unit 520 configured to acquire an ith category vector corresponding to the ith category, where i is a positive integer not greater than N; a simulated sample generating unit 530 configured to input the first noise vector and the ith category vector together into the generator to obtain a first simulated sample corresponding to real samples of the ith category; a simulated sample discriminating unit 540 configured to input the first simulated sample into the ith discriminator to obtain a first probability that the first simulated sample is a real sample of the ith category; a real sample discriminating unit 550 configured to input an acquired first real sample belonging to the ith category into the ith discriminator to obtain a second probability that the first real sample is a real sample of the ith category; a discriminator training unit 560 configured to train the ith discriminator with the objective of decreasing the first probability and increasing the second probability; a simulated sample classifying unit 570 configured to input the first simulated sample into the classifier to obtain a third probability that the first simulated sample belongs to the ith category; and a generator training unit 580 configured to train the generator with the objective of increasing the first probability and decreasing the third probability, the trained generator being used to generate target countermeasure samples that imitate real samples of a target category but are predicted by the classifier as other categories.
In one embodiment, the noise vector acquisition unit 510 is specifically configured to randomly sample a noise space conforming to a Gaussian distribution to obtain the first noise vector.
In one embodiment, the category vector acquisition unit 520 is specifically configured to acquire N category labels and one-hot encode them to obtain N one-hot encoded vectors, and to take the N one-hot encoded vectors as the N category vectors, which include the ith category vector.
In one embodiment, the simulated sample generating unit 530 is specifically configured to splice the first noise vector and the ith category vector into a spliced vector and input the spliced vector into the generator to obtain the first simulated sample, or to sum the first noise vector and the ith category vector into a summed vector and input the summed vector into the generator to obtain the first simulated sample.
In one embodiment, the business object is text and the generator is a Recurrent Neural Network (RNN); the simulated sample generating unit 530 is specifically configured to fuse the first noise vector and the ith category vector into a fusion vector serving as the initial state vector of the hidden layer in the RNN, and to take a wildcard character for text as the initial input of the RNN to obtain the first simulated sample.
In one embodiment, the business object is text, a picture, or audio, and the trained generator is used to generate text countermeasure samples, picture countermeasure samples, or audio countermeasure samples, respectively.
In one embodiment, the apparatus 500 further comprises a countermeasure sample generation unit 590 configured to: obtain a second noise vector and a target category vector corresponding to a target category, the target category belonging to the N categories; and input the second noise vector and the target category vector together into the trained generator to obtain the target countermeasure sample.
In summary, with the apparatus for generating countermeasure samples using a countermeasure generation network provided by the embodiments of this specification, countermeasure samples that have a specified true category but are predicted by the classifier as other categories can be generated. Moreover, with the trained generator, large batches of high-quality countermeasure samples can be generated efficiently and quickly.
As above, according to an embodiment of yet another aspect, there is also provided a computer readable storage medium having stored thereon a computer program which, when executed in a computer, causes the computer to perform the method described in connection with fig. 3.
There is also provided, according to an embodiment of yet another aspect, a computing device comprising a memory having stored therein executable code, and a processor that, when executing the executable code, implements the method described in connection with fig. 3.
Those skilled in the art will recognize that, in one or more of the examples described above, the functions described in the embodiments disclosed herein may be implemented in hardware, software, firmware, or any combination thereof. When implemented in software, the functions may be stored on or transmitted over as one or more instructions or code on a computer-readable medium.
The above-mentioned embodiments, objects, technical solutions and advantages of the embodiments disclosed in the present specification are further described in detail, it should be understood that the above-mentioned embodiments are only specific embodiments of the embodiments disclosed in the present specification, and do not limit the scope of the embodiments disclosed in the present specification, and any modifications, equivalents, improvements and the like made on the basis of the technical solutions of the embodiments disclosed in the present specification should be included in the scope of the embodiments disclosed in the present specification.

Claims (16)

1. A method of generating a confrontation sample using a confrontation generating network comprising a pre-trained classifier for performing N classes of classification tasks on a business object; the countermeasure generation network further includes a generator and N discriminators corresponding to the N categories, where N is a positive integer greater than 1; the method comprises the following steps:
obtaining a first noise vector and obtaining an ith category vector corresponding to an ith category, wherein i is a positive integer not greater than N;
inputting the first noise vector and the ith category vector into the generator together to obtain a first simulation sample corresponding to the ith category real sample;
inputting the first simulation sample into an ith discriminator to obtain a first probability that the first simulation sample belongs to a real sample under the ith category;
inputting the obtained first real sample belonging to the ith category into the ith discriminator to obtain a second probability that the first real sample is a real sample under the ith category;
training the ith discriminator by taking the first probability as a reduction and the second probability as an increase as a target;
inputting the first simulation sample into the classifier to obtain a third probability that the first simulation sample belongs to the ith category;
training the generator with the aim of increasing the first probability and decreasing the third probability, wherein the trained generator is used for generating target confrontation samples which simulate real samples of target classes but are predicted as other classes by the classifier.
2. The method of claim 1, wherein obtaining a first noise vector comprises:
and randomly sampling a noise space conforming to Gaussian distribution to obtain the first noise vector.
3. The method of claim 1, wherein obtaining an ith class vector corresponding to the ith class comprises:
acquiring N category labels, and performing unique hot coding on the N category labels to correspondingly obtain N unique hot coding vectors;
treating the N unique hot coded vectors as N class vectors, including the ith class vector.
4. The method of claim 1, wherein inputting the first noise vector and the ith class vector together into the generator comprises:
splicing the first noise vector and the ith category vector to obtain a spliced vector, and inputting the spliced vector into the generator; or,
and summing the first noise vector and the ith category vector to obtain a summed vector, and inputting the summed vector into the generator.
5. The method of claim 1, wherein the business object is text and the generator is a Recurrent Neural Network (RNN); wherein, inputting the first noise vector and the ith category vector into the generator together to obtain a first simulation sample corresponding to the ith category real sample, comprising:
performing fusion processing on the first noise vector and the ith category vector to obtain a fusion vector which is used as an initial state vector of a hidden layer in the RNN;
taking wildcards for text characters as initial input of the RNN network to obtain the first simulation sample.
6. The method of claim 1, wherein the business object is text or a picture or audio, and the trained generator is configured to generate a text confrontation sample or a picture confrontation sample or an audio confrontation sample.
7. The method of claim 1, wherein after training the generator, the method further comprises:
obtaining a second noise vector, and obtaining a target class vector corresponding to a target class, the target class belonging to the N classes;
and inputting the second noise vector and the target category vector into the trained generator together to obtain the target confrontation sample.
8. An apparatus for generating countermeasure samples using an countermeasure generation network, the countermeasure generation network including a pre-trained classifier for performing N classes of classification tasks for a business object; the countermeasure generation network further includes a generator and N discriminators corresponding to the N categories, where N is a positive integer greater than 1; the device comprises:
a noise vector acquisition unit configured to acquire a first noise vector;
a category vector acquisition unit configured to acquire an ith category vector corresponding to an ith category, where i is a positive integer not greater than N;
the simulation sample generating unit is configured to input the first noise vector and the ith category vector into the generator together to obtain a first simulation sample corresponding to the ith category real sample;
the analog sample distinguishing unit is configured to input the first analog sample into an ith discriminator to obtain a first probability that the first analog sample belongs to a real sample under an ith category;
the real sample distinguishing unit is configured to input the acquired first real sample belonging to the ith category into the ith discriminator to obtain a second probability that the first real sample is a real sample under the ith category;
a discriminator training unit configured to train the i-th discriminator with a target of decreasing the first probability and increasing the second probability;
the analog sample classification unit is configured to input the first analog sample into the classifier to obtain a third probability that the first analog sample belongs to the ith class;
and a generator training unit configured to train the generator aiming at increasing the first probability and decreasing the third probability, wherein the trained generator is used for generating a target confrontation sample which simulates a target class real sample but is predicted as other classes by the classifier.
9. The apparatus according to claim 8, wherein the noise vector obtaining unit is specifically configured to:
and randomly sampling a noise space conforming to Gaussian distribution to obtain the first noise vector.
10. The apparatus according to claim 8, wherein the category vector obtaining unit is specifically configured to:
acquiring N category labels, and performing unique hot coding on the N category labels to correspondingly obtain N unique hot coding vectors;
taking the N one-hot coded vectors as N category vectors, wherein the ith category vector is included.
11. The apparatus of claim 8, wherein the analog sample generation unit is specifically configured to:
splicing the first noise vector and the ith category vector to obtain a spliced vector, and inputting the spliced vector into the generator to obtain the first analog sample; or,
and summing the first noise vector and the ith category vector to obtain a summed vector, and inputting the summed vector into the generator to obtain the first analog sample.
12. The apparatus of claim 8, wherein the business object is text and the generator is a Recurrent Neural Network (RNN); the analog sample generation unit is specifically configured to:
performing fusion processing on the first noise vector and the ith category vector to obtain a fusion vector which is used as an initial state vector of a hidden layer in the RNN;
taking wildcards for text characters as initial input of the RNN network to obtain the first simulation sample.
13. The apparatus of claim 8, wherein the business object is text or a picture or audio, and the trained generator is configured to generate a text confrontation sample or a picture confrontation sample or an audio confrontation sample.
14. The apparatus of claim 8, wherein the apparatus further comprises a challenge sample generation unit configured to:
obtaining a second noise vector, and obtaining a target class vector corresponding to a target class, the target class belonging to the N classes;
and inputting the second noise vector and the target category vector into the trained generator together to obtain the target confrontation sample.
15. A computer-readable storage medium, on which a computer program is stored, wherein the computer program, when executed in a computer, causes the computer to perform the method of any of claims 1-7.
16. A computing device comprising a memory and a processor, wherein the memory has stored therein executable code that when executed by the processor implements the method of any of claims 1-7.
CN202010329630.9A 2020-04-24 2020-04-24 Method and device for generating countermeasure sample by utilizing countermeasure generation network Active CN111241291B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010329630.9A CN111241291B (en) 2020-04-24 2020-04-24 Method and device for generating countermeasure sample by utilizing countermeasure generation network

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010329630.9A CN111241291B (en) 2020-04-24 2020-04-24 Method and device for generating countermeasure sample by utilizing countermeasure generation network

Publications (2)

Publication Number Publication Date
CN111241291A CN111241291A (en) 2020-06-05
CN111241291B true CN111241291B (en) 2023-01-03

Family

ID=70873601

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010329630.9A Active CN111241291B (en) 2020-04-24 2020-04-24 Method and device for generating countermeasure sample by utilizing countermeasure generation network

Country Status (1)

Country Link
CN (1) CN111241291B (en)

Families Citing this family (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111753091A (en) * 2020-06-30 2020-10-09 北京小米松果电子有限公司 Classification method, classification model training method, device, equipment and storage medium
CN112464548B (en) * 2020-07-06 2021-05-14 中国人民解放军军事科学院评估论证研究中心 Dynamic allocation device for countermeasure unit
CN112069795B (en) * 2020-08-28 2023-05-30 平安科技(深圳)有限公司 Corpus detection method, device, equipment and medium based on mask language model
CN112085279B (en) * 2020-09-11 2022-09-06 支付宝(杭州)信息技术有限公司 Method and device for training interactive prediction model and predicting interactive event
CN112181952B (en) * 2020-11-30 2021-12-14 中国电力科学研究院有限公司 Method, system, device and storage medium for constructing data model
CN112966112B (en) * 2021-03-25 2023-08-08 支付宝(杭州)信息技术有限公司 Text classification model training and text classification method and device based on countermeasure learning
CN113220553B (en) * 2021-05-13 2022-06-17 支付宝(杭州)信息技术有限公司 Method and device for evaluating performance of text prediction model
CN113204974B (en) * 2021-05-14 2022-06-17 清华大学 Method, device and equipment for generating confrontation text and storage medium
CN113988908A (en) * 2021-10-14 2022-01-28 同盾科技有限公司 Marketing crowd delivery method and device, electronic equipment and storage medium
CN114782670A (en) * 2022-05-11 2022-07-22 中航信移动科技有限公司 Multi-mode sensitive information identification method, equipment and medium

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108520282B (en) * 2018-04-13 2020-04-03 湘潭大学 Triple-GAN-based classification method
CN109697694B (en) * 2018-12-07 2023-04-07 山东科技大学 Method for generating high-resolution picture based on multi-head attention mechanism
CN109948660A (en) * 2019-02-26 2019-06-28 长沙理工大学 A kind of image classification method improving subsidiary classification device GAN
CN110647927A (en) * 2019-09-18 2020-01-03 长沙理工大学 ACGAN-based image semi-supervised classification algorithm
CN111027439B (en) * 2019-12-03 2022-07-29 西北工业大学 SAR target recognition method for generating confrontation network based on auxiliary classification

Also Published As

Publication number Publication date
CN111241291A (en) 2020-06-05

Similar Documents

Publication Publication Date Title
CN111241291B (en) Method and device for generating countermeasure sample by utilizing countermeasure generation network
CN111401558B (en) Data processing model training method, data processing device and electronic equipment
CN108228686B (en) Method and device for realizing image-text matching and electronic equipment
US20230041233A1 (en) Image recognition method and apparatus, computing device, and computer-readable storage medium
CN105426356B (en) A kind of target information recognition methods and device
CN111667066B (en) Training method and device of network model, character recognition method and device and electronic equipment
CN110580500A (en) Character interaction-oriented network weight generation few-sample image classification method
CN111241287A (en) Training method and device for generating generation model of confrontation text
CN110188829B (en) Neural network training method, target recognition method and related products
CN112395979B (en) Image-based health state identification method, device, equipment and storage medium
CN111209878A (en) Cross-age face recognition method and device
CN112784929B (en) Small sample image classification method and device based on double-element group expansion
CN110246198B (en) Method and device for generating character selection verification code, electronic equipment and storage medium
Singh et al. Steganalysis of digital images using deep fractal network
Ra et al. DeepAnti-PhishNet: Applying deep neural networks for phishing email detection
CN113011884A (en) Account feature extraction method, device and equipment and readable storage medium
Jami et al. Biometric template protection through adversarial learning
CN117558270B (en) Voice recognition method and device and keyword detection model training method and device
Nida et al. Video augmentation technique for human action recognition using genetic algorithm
CN111062019A (en) User attack detection method and device and electronic equipment
CN110674370A (en) Domain name identification method and device, storage medium and electronic equipment
CN111488950B (en) Classification model information output method and device
CN113435264A (en) Face recognition attack resisting method and device based on black box substitution model searching
CN112364198A (en) Cross-modal Hash retrieval method, terminal device and storage medium
Wang et al. Latent coreset sampling based data-free continual learning

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant