CN112419327B

CN112419327B - Image segmentation method, system and device based on generation countermeasure network

Info

Publication number: CN112419327B
Application number: CN202011438792.2A
Authority: CN
Inventors: 王奕; 孙毅
Original assignee: Fudan University Shanghai Cancer Center
Current assignee: Fudan University Shanghai Cancer Center
Priority date: 2020-12-10
Filing date: 2020-12-10
Publication date: 2023-08-04
Anticipated expiration: 2040-12-10
Also published as: CN112419327A

Abstract

The invention discloses an image segmentation method, system and device based on a generated countermeasure network, wherein a semantic segmentation model consists of a segmentation network S and a generated countermeasure model; the segmentation network S predicts a label probability map S (x) of each pixel point for the input data x; the generator G generates a tag probability map G (z) according to the noise z; the arbiter D separates the false tag probability map from the true tag probability map y by predicting a pixel level confidence map p. The algorithm synthesizes the marked three-dimensional medical image data through the game between the generator and the discriminator, and can solve the problem of lack of marked medical image data. The generated data does not relate to user privacy, and is beneficial to sharing medical data. The SEG-GAN segmentation model is used for distinguishing the label generated by the generator through the medical image and the real label by the discriminator, so that the segmentation of the medical image is obtained, the medical image without the label is used for assisting model training, and the segmentation effect of the model is effectively improved.

Description

Image segmentation method, system and device based on generation countermeasure network

Technical Field

The present invention relates to the field of image processing technologies, and in particular, to an image segmentation method, system, and apparatus based on generation of an countermeasure network.

Background

The labeled medical image data is severely starved. Because labeling of medical images requires a considerable level of medical literacy, labeling a complete medical image segmented dataset requires significant time and expense.

The GANs are increasingly receiving attention from the computer vision and medical community and have been used in many fields. With very small and very large two-player game games, the generators in the GANs will mimic the real data distribution at the discretion of the arbiter and achieve applications such as image translation, image synthesis, data enhancement, image completion, etc. Although GANs has been successful in many problems, its instability in the training process is its most fatal disadvantage, which is more exposedly evident when synthesizing high resolution images or three-dimensional voxels.

In the data enhancement problem of segmentation tasks, most of the work is limited by the cost of hardware equipment and training procedures, regarding the task of synthesizing three-dimensional voxels (e.g., MR images) as a sequence of two-dimensional slices in the z-axis. However, this approach may result in discontinuities in the z-axis of the synthesized three-dimensional voxel data, which may be detrimental to the three-dimensional segmentation network trained using such data. Whether relevance-enhanced or non-relevance-enhanced, they are subjectively imposed by the user, requiring the user to predefine transformation rules for the original data. The simplest and objective data enhancement method is to obtain more data from the real data distribution. However, this is not possible because the acquisition of data using medical devices is costly. Taking nuclear magnetic resonance technology as an example, a GE 1.5T exact HDXT price is between $150,000 and $250,000, and the cost of one scan is at least 500 Yuan RMB. Although hospitals and the like accumulate a large amount of data during clinical practice, there is no way to utilize such data for the protection of patient privacy.

In view of the foregoing, there is a particular need for a medical image segmentation method and apparatus based on generating an antagonistic network SEG-GAN to address the deficiencies of the prior art.

Disclosure of Invention

Aiming at the problems of heavy work and high cost of labeling brain tumor medical image data, and the problem of low segmentation accuracy of the existing method for segmenting the medical image by the supervised method, the invention provides an image segmentation method, system and device based on a generation countermeasure network.

In order to solve the technical problems existing in the prior art, the technical scheme of the invention comprises the following steps:

an image segmentation method based on generating an countermeasure network, comprising the steps of: s1, predicting a confidence map as a supervision signal by using a discriminant network which is pre-trained by using label data, and guiding cross entropy loss in a self-learning mode. The confidence map indicates which regions of the prediction distribution are close to the true tag map distribution so that these predictions can be trained by the segmentation network by masking cross entropy loss of other untrusted regions.

S2, as in the supervision setting, an antagonistic penalty is applied on the unlabeled data, which encourages the model' S prediction of unlabeled data to be close to the true label graph distribution.

S3, dividing the network S into a generator structure of the 3D-MedGAN. Given an input MR image x of one dimension h×w×d×1, the segmentation network outputs a semantic tag probability map S (x) of size h×w×d×c, where C is the number of semantic categories.

S4, the generator network G is responsible for generating a semantic tag probability map G (z) with the same size H multiplied by W multiplied by D multiplied by C as the input image according to a random vector z with fixed dimensions.

S5, the performance of the discriminator network D depends on a segmentation network and a generator network, wherein the segmentation network predicts a probability map S (x), the generator network generates a probability map G (z) and a one-hot map of a real label map y as inputs, and then a confidence map p with the size of H multiplied by W multiplied by D is output; each pixel on the confidence map p represents that the label of the input image x at the position corresponding thereto is a sample from the true label map y (p=1) or the false label map (p=0), including S (x) and G (z).

Preferably, the method uses a generated challenge model having two "generators" and a discriminator. The two "generators" are a segmentation network S that predicts the tag probability map for the incoming MR and a generator G that converts random noise into the tag probability map, respectively. The labels predicted by the segmentation network S and the generator G are used as false samples, and labels of the label data are used as true samples, so that the discriminator is trained to have the capability of separating the true labels from the false labels. When the arbiter has this capability, an indication matrix can be derived from its predictions. The indicator matrix may be used to keep a relatively reliable prediction of unlabeled exemplars by the segmentation network S as a supervisory signal for self-training. The better the performance of the discriminator, the more useful the retained supervisory signals, and the better the segmentation effect of the finally trained model on brain tumors.

Preferably, the input data x according to which the splitting network S is based comprises the marking data x _l And unlabeled data x _u Each marking data x _l All have corresponding label patterns y _l The label map y of H×W×D size here _l One-hot encoding is changed into a probability map y with C channels of discrete labels _l The labels at each location will map the probability map y _l The channel of which the upper represents the category is set to be 1, and the same positions on the rest other channels are all set to be 0; in the training process, when the marking data x is used _l In this case, the split network S is formed by a label probability map S (x _l ) And true label map y _l Standard cross entropy loss of (2)Distinguishing y by using a discriminator network D _pred Is->Guiding and updating each parameter in the network; training the segmentation network using a self-supervised learning method for unlabeled data, predicting an unlabeled image x from the segmentation network S _u Is (x) _u ) Then, a tag probability map S (x _u ) Confidence degree of each position is obtained, a confidence degree map p is obtained, the quality of the predicted segmentation area is indicated through the confidence degree map, and the result of the segmentation network S during training can be trusted; then, a region with high confidence is reserved by taking a threshold value on the confidence map p, and a predictive label probability map S (x _u ) The channel with the highest probability in all channels in the areas is used as the label of the area to obtain a pseudo label graph, and the pseudo label graph is subjected to one-hot coding to obtain y _u The method comprises the steps of carrying out a first treatment on the surface of the Then the network S predicts the label probability map S (x) with high confidence region segmentation _u ) And y _u Cross entropy loss between->Resistance loss with the arbiter network D>Training a segmentation network; the generator network generates a false tag probability map G (z) according to the random noise z, and obtains y by taking the channel with the largest value at the same position as the tag _g At the same time, the discriminator is used to calculate the generator loss->The arbiter network D will identify the tag probability map y that is tagged with the genuine data _l False label probability map y predicted by split network _pred Guided, y _u And authentication of true tag probability map y _l False tag probability map y generated by generator network _g Will result in minimized arbiter loss +.>Since the training process is to optimize the segmentation network S and the discriminant network D in turn, the resistance loss is +.>And discriminator loss->And will not be used simultaneously.

Preferably, the input data x comprises marking data x _l And unlabeled data x _u Each marking data x _l All have corresponding label patterns y _l The label map y of H×W×D size here _l One-hot encoding is changed into a probability map y with C channels of discrete labels _l The labels at each location will map the probability map y _l The channel whose category is represented by the upper is set to 1, and the same positions on the remaining other channels are all set to 0. For the sake of expression, the label probability map resulting from the label map one-hot coding is denoted hereinafter by a symbol.

Preferably, during the training processAll data is used. When using the marking data x _l In this case, the split network S is formed by a label probability map S (x _l ) And true label map y _l Standard cross entropy loss of (2)Distinguishing y by using a discriminator network D _pred Is->And guiding and updating each parameter in the network.

Preferably, the proposed self-supervised learning method is used to train the segmentation network on unlabeled data. In the prediction of unlabeled image x by segmentation network S _u Is (x) _u ) Then, a tag probability map S (x _u ) Confidence in each location, a confidence map p is obtained. The confidence map indicates the quality of the predicted segmented regions so that the results of the segmentation network during training can be trusted.

Preferably, the confidence region is then preserved by thresholding the confidence map p and taking the predictive label probability map S (x _u ) The channel with the highest probability in all channels in the areas is used as the label of the area to obtain a pseudo label graph, and the pseudo label graph is subjected to one-hot coding to obtain y _u 。

Preferably, the network predictive label probability map S (x) _u ) And y _u Cross entropy loss betweenResistance loss with the arbiter network>Together, the segmentation network is trained. Whether by marking data x _l Whether or not the data x is unlabeled _u Training the segmentation network and the discriminator network, the generator network generates false labels according to random noise zProbability map G (z), obtaining y by taking the channel with the largest value at the same position as the label _g At the same time, the discriminator is used to calculate the generator loss->

Preferably, the arbiter network D will identify the tag probability map y that is marked with the authentic data _l False label probability map y predicted by split network _pred ,y _u And authentication of true tag probability map y _l False tag probability map y generated by sum generator network _g Resulting discriminant lossSince the training process is to optimize the segmentation network S and the discriminant network D in turn, the resistance loss is +.>And discriminator loss->And will not be used simultaneously.

Loss in a network that a arbiter wants to minimizeThe method comprises the following steps:

the data input into the identifier network are unified into one-hot coding format, all the label probability graphs are obtained by taking the channel with the highest probability on each position as the label, then carrying out one-hot coding to obtain the data of HxW xD xC, and modifying the lossThe method comprises the following steps:

loss as the arbiter is continually optimizedContinuously decreasing, the generator also needs to be left with loss +.>Optimizing in order to generate a tag probability map y sufficient to fool the discriminant _g Make->Lifting. />The method comprises the following steps:

wherein the loss isAnd->The loss function of the standard GANs is used.

Preferably, for the segmentation network S, the gap between the predicted tag probability map and the true tag samples needs to be reduced, this distance for the tagged samples x _l Is a predictive probability map S (x _l ) One-hot coding diagram y of each pixel point and true mark _l Accumulation of cross entropy for each pixel point, for unlabeled exemplar x _u Then it is the predictive probability map S (x _u ) Pixel point on the image and reserved pseudo-marked one-hot coding diagram y _u Accumulation of cross entropy for pixel points on the display. Thus dividing lossThe method comprises the following steps:

preferably, the indication matrix I is trained with the marking data x _l Is set to a matrix with all elements 1, representing S (x _l ) And y is _l The cross entropy calculated at each point is used as a supervisory signal to train the segmentation network S. While training the unlabeled data x _u When the position (h, w, d) is binarized by taking a threshold value T according to the confidence map p, the point indicating the position (h, w, d) in the c-axis direction of the matrix is set to 0 or 1. According to the above description, the indication matrix I is:

whether with marked data x _l Or non-marking data x _u Training the segmentation network S, and predicting the result y of the segmentation network by using a discriminator D of the full convolution network _pred And y _u Calculating the countermeasures against lossTo guide the segmentation network to optimize the segmentation network, and predict the probability graph y which is closer to the true label _l And (3) a distributed result. Countering losses->The method comprises the following steps:

the training segmentation network S is:

another embodiment of the present invention provides an image segmentation system based on generating an antagonistic network SEG-GAN, the system comprising:

the data set acquisition module is used for acquiring a target image set, a reference image set and a pre-marked reference mark set corresponding to the reference image set; the target image set comprises a target image training set and a target image testing set.

And the network construction module is used for constructing a segmentation network and a discrimination network. Wherein the first objective loss function of the segmentation network comprises cross entropy loss of the objective image set and the reference label set, contrast loss of the objective image set, and semi-supervised loss between the objective image set and the reference image set.

The training module is used for inputting the target image training set and the reference image set into the segmentation network, correspondingly obtaining a target probability score graph and a reference probability score graph, and inputting the target probability score graph and the reference probability score graph into the discrimination network so as to perform joint training of the segmentation network and the discrimination network.

And the judging module is used for finishing training when the first target loss function of the segmentation network and the second target loss function of the judging network are converged.

And the test module is used for inputting the target image test set into the trained segmentation network to obtain a target segmentation image.

A further embodiment of the invention correspondingly provides a system using the image segmentation method based on the generation of the antagonism network SEG-GAN, the system comprising a processor, a memory and a computer program stored in the memory and configured to be executed by the processor, the processor implementing any one of the above image segmentation methods based on the generation of the antagonism network SEG-GAN when the computer program is executed.

The semantic segmentation model consists of a segmentation network S and a basic generated challenge model (G and D). The segmentation network S predicts a label probability map S (x) for each pixel for the input data x. The generator G generates a tag probability map G (z) from the noise z. The arbiter D attempts to separate the false tag probability map (i.e., S (x) and G (z)) from the true tag probability map y by predicting a pixel-level confidence map p.

Compared with the prior art, the image segmentation method, the system and the device based on the generation of the antagonism network SEG-GAN can solve the problem of lack of marked medical image data. And the generated data does not relate to user privacy, thereby being beneficial to sharing medical data. The 3D-MedGAN model is applied to semi-supervised medical image segmentation, an SEG-GAN segmentation model is provided, a label generated by a generator through a medical image and a real label are distinguished by a discriminator, so that segmentation of the medical image is obtained, the non-labeled medical image is utilized for assisting model training, the segmentation effect of the model is effectively improved, and the manpower labeling cost is greatly reduced.

Drawings

The invention is described in detail below with reference to the attached drawing figures and the detailed description:

FIG. 1 is a schematic flow chart of the present invention;

FIG. 2 is a schematic diagram of a network architecture of a discriminator of the invention;

FIG. 3 is a schematic diagram of a generated challenge model for semi-supervised learning of the present invention;

FIG. 4 is a schematic diagram of the prediction results of the present invention on a sample.

Detailed Description

The following description of the embodiments of the present invention will be made clearly and completely with reference to the accompanying drawings, in which it is apparent that the embodiments described are only some embodiments of the present invention, but not all embodiments. All other embodiments, which can be made by those skilled in the art based on the embodiments of the invention without making any inventive effort, are intended to be within the scope of the invention.

Referring to fig. 1, a flowchart of an image segmentation method based on generating an countermeasure network according to an embodiment of the present invention is shown, where the method includes steps S1 to S5 as follows.

S1, predicting a confidence map as a supervision signal by using a discriminant network which is pre-trained by using label data, and guiding cross entropy loss in a self-learning mode. The confidence map indicates which regions of the prediction distribution are close to the true tag map distribution so that these predictions can be trained by the segmentation network by masking cross entropy loss of other untrusted regions.

S2, as in the supervision setting, the resistance loss is applied to the unlabeled data, which encourages the prediction result of the unlabeled data by the model to be close to the real label graph distribution.

S5, the performance of the discriminator network D depends on a segmentation network and a generator network, wherein the segmentation network predicts a probability map S (x), the generator network generates a probability map G (z) and a one-hot map of a real label map y as inputs, and then a confidence map p with the size of H multiplied by W multiplied by D is output. Each pixel on the confidence map p represents that the label of the input image x at the position corresponding thereto is a sample from the true label map y (p=1) or the false label map (p=0), including S (x) and G (z).

The method uses a generated countermeasure model, which has two generators and a discriminator. The two "generators" are a segmentation network S that predicts the tag probability map for the incoming MR and a generator G that converts random noise into the tag probability map, respectively. The labels predicted by the segmentation network S and the generator G are used as false samples, and labels of the label data are used as true samples, so that the discriminator is trained to have the capability of separating the true labels from the false labels. When the arbiter has this capability, an indication matrix can be derived from its predictions. The indicator matrix may be used to keep a relatively reliable prediction of unlabeled exemplars by the segmentation network S as a supervisory signal for self-training. The better the performance of the discriminator, the more useful the retained supervisory signals, and the better the segmentation effect of the finally trained model on brain tumors.

Referring to fig. 2, the present invention generates a discriminator network structure in an antagonism network. The input data can be divided into two groups: false and true, both groups have the same number of samples. In the fake group, the image generated by the current generator would be randomly replaced by samples previously cached in the image pool. The samples in the true group are then the true MR images. The discriminator reduces the size of the input to one fourth by a 3 x 3 convolution with a step of 2 in the first two layers, the number of channels becomes 4 times the original, and then outputs a three-dimensional probability map with dimensions w x h x d by a 3 x 3 convolution with a step of 1 and a 1 x 1 convolution with a step of 1.

Expanding the discriminator network to three dimensions so that it is suitable for three-dimensional input data. The last layer of the discriminator D outputs a probability map of dimension w x h x D, the value of each position on the probability map representing the probability that the image block corresponding thereto belongs to true.

An image buffer pool is introduced to improve the stability of the countermeasure training. The image buffer pool buffers samples generated by some generators in the previous step. During training, samples in a part of the image cache pool are exchanged with the data currently synthesized by the generator to be delivered to the discriminator for judging true or false. This approach may stabilize the training of the discriminator, preventing the discriminator from "forgetting" the previously learned knowledge. In particular, four pairs of data, each pair consisting of MR image x and its tag y, will be cached in the image cache pool. For these four pairs of data (x _i ,y _i ) Is indicated (the subscript indicates its location in the image cache pool). At each iteration of training, it is first determined randomly whether the data in the image buffer pool is to be compared with the samples (x ^* ,y ^* ) Exchange is performed. The index i generated according to the uniform distribution is then used to buffer the samples (x _i ,y _i ) And (x) ^* ,y ^* ) Exchange, will (x _i ,y _i ) Is sent to a discriminator network D for authentication, and (x ^* ,y ^* ) Cached at location i.

The least square loss function proposed by the least square countermeasure generation network is used to replace the sigmoid cross entropy loss of the GAN. This improvement can improve the quality of the generated image, stabilizing the training process.

Where a and b are used to mark spurious and real data, respectively. c is a threshold matrix G such that D believes that the spurious data is real. A and c are initialized to a matrix of dimension w x h x d with all elements being 1, and b is initialized to a zero matrix of the same size.

Referring to fig. 3, in SEG-GAN, a generator in the 3D-MedGAN framework is used as a split network. Unlike a typical generator that trains to generate images from noise vectors, the segmentation network of the method outputs a probability map of the semantic label for each pixel point on a given input image. Under this setting, the result of forcing the output of the segmentation network is as close spatially as possible to the true label map, based on consistency (smoothness) constraint assumptions. For this purpose, an antagonist learning scheme is adopted, a discriminator formed by a convolutional neural network is used for learning to distinguish a real label image from a label image of a segmentation prediction, and an additional generator is used for synthesizing a generated label image according to a noise vector and requiring the discriminator to correctly divide the real label image from the generated label image. The SEG-GAN incorporates cross entropy loss in the segmentation task when trained with labeled data and uses the contrast loss to encourage the segmentation network to produce predictive probability maps in higher-order structures that approach true label maps. By further utilizing the above challenge learning scheme, these unlabeled data are utilized in conjunction with two semi-supervised loss terms when training with unlabeled data.

First, SEG-GAN predicts confidence graphs as supervisory signals using a network of discriminators previously pre-trained with tag data and directs cross entropy loss by way of self-learning. The confidence map indicates which regions of the prediction distribution are close to the true tag map distribution so that these predictions can be trained by the segmentation network by masking cross entropy loss of other untrusted regions. Second, as in the supervision setting, an antagonistic penalty is applied on the unlabeled data, which encourages the model's predictive outcome for the unlabeled data to be close to the true label graph distribution.

The method for generating the countermeasure model consists of three modules: divider S, generator G and arbiter D. Wherein the splitting network S is a generator structure of 3D-MedGAN. Given an input MR image x of one dimension h×w×d×1, the segmentation network outputs a semantic tag probability map S (x) of size h×w×d×c, where C is the number of semantic categories. The generator network G is responsible for generating a semantic tag probability map G (z) of the same size h×w×d×c as the input image from a random vector z of fixed dimension.

The input data x comprises marking data x _l And unlabeled data x _u Is obtained by the following steps.

Each marking data x _l All have corresponding label patterns y _l The label map y of H×W×D size here _l One-hot encoding is changed into a probability map y with C channels of discrete labels _l The labels at each location will map the probability map y _l The channel whose category is represented by the upper is set to 1, and the same positions on the remaining other channels are all set to 0. For the sake of expression, the label probability map resulting from the label map one-hot coding is denoted hereinafter by a symbol.

All data was used during the training process. When using the marking data x _l In this case, the split network S is formed by a label probability map S (x _l ) And true label map y _l Standard cross entropy loss of (2)Distinguishing y by using a discriminator network D _pred Is->And guiding and updating each parameter in the network.

The proposed self-supervised learning method is used to train the segmentation network on unlabeled data. In the prediction of unlabeled image x by segmentation network S _u Is (x) _u ) Then, a tag probability map S (x _u ) Confidence in each location, a confidence map p is obtained. The confidence map indicates the quality of the predicted segmented regions so that the results of the segmentation network during training can be trusted.

Then, a region with high confidence is reserved by taking a threshold value on the confidence map p, and a predictive label probability map S (x _u ) The channel with the highest probability in all channels in the areas is used as the label of the area to obtain a pseudo label graph, and the pseudo label graph is subjected to one-hot coding to obtain y _u 。

Then segment the network predictive label probability map S (x _u ) And y _u Cross entropy loss betweenResistance loss with the arbiter network>Together, the segmentation network is trained. Whether by marking data x _l Whether or not the data x is unlabeled _u Training the segmentation network and the discriminator network, generating a false tag probability graph G (z) by the generator network according to random noise z, and obtaining y by taking the channel with the largest value at the same position as a tag _g At the same time, the discriminator is used to calculate the generator loss->

Distinguishing device netThe complex D will be marked by a tag probability map y identifying the true data _l False label probability map y predicted by split network _pred ,y _u And authentication of true tag probability map y _l False tag probability map y generated by sum generator network _g Resulting discriminant lossSince the training process is to optimize the segmentation network S and the discriminant network D in turn, the resistance is lostAnd discriminator loss->And will not be used simultaneously.

Loss in a network that a arbiter wants to minimizeCan be expressed as the following formula:

wherein Y is _n A label representing a pixel on a label probability map of h×w×d that is located at (H, W, D) is true. When Y is _n When=0, i.e. false, the probability values representing all channels at that point are from the label probability map S (x) predicted by the segmentation network or the label probability map G (z) synthesized by the generator network, when Y _n When=1, it is true, the probability values representing all channels at that point are derived from the labeled sample x _l Is marked y of (2) _l . The subscript n used in the formula represents the nth sample in the small batch of data used in training.

The data input into the arbiter network are unified into a one-hot coding format, all the tag probability maps are obtained by taking the channel with the highest probability on each position as the tag, and then performing one-hot codingH×W×D×C data, modification lossThe method comprises the following steps:

wherein the loss isAnd->The loss function of the standard GANs is used.

For the segmentation network S, it is necessary to narrow the gap between the predicted tag probability map and the true tag samples, which is the case for the tagged samples x _l Is a predictive probability map S (x _l ) One-hot coding diagram y of each pixel point and true mark _l Accumulation of cross entropy for each pixel point, for unlabeled exemplar x _u Then it is to mask the low confidencePredictive probability map S (x) _u ) Pixel point on the image and reserved pseudo-marked one-hot coding diagram y _u Accumulation of cross entropy for pixel points on the display. Thus dividing lossThe method comprises the following steps:

indicating matrix I is trained with marked data x _l Is set to a matrix with all elements 1, representing S (x _l ) And y is _l The cross entropy calculated at each point is used as a supervisory signal to train the segmentation network S. While training the unlabeled data x _u When the position (h, w, d) is binarized by taking a threshold value T according to the confidence map p, the point indicating the position (h, w, d) in the c-axis direction of the matrix is set to 0 or 1. According to the above description, the indication matrix I is:

whether with marked data x _l Or non-marking data x _u Training the segmentation network S, and predicting the result y of the segmentation network by using a discriminator D of the full convolution network _pred And y _u Calculating the countermeasures against lossTo guide the segmentation network to optimize the segmentation network, and predict the probability graph y which is closer to the true label _l And (3) a distributed result. Countering losses->The calculation formula of (2) is as follows:

the training segmentation network S is:

wherein, the liquid crystal display device comprises a liquid crystal display device,and->Is for unmarked data x _u Division loss calculated during semi-supervised learning>And counter-loss->λ _adv And lambda (lambda) _semi Is the two weights used to minimize the proposed multitasking loss function.

For verification of the method of the present invention, see FIG. 4, is a prediction of the model on sample brets_tcia_pat 483_0001. AdvSemiSeg under semi-supervised learning setup performs better than the model described for the prediction of labels occupying smaller areas, i.e. enhanced tumors (green) and necrotic and non-enhanced tumors (red). The results predicted by the model herein are significantly better than AdvSemiSeg. The areas covered by the tumor edema labels are more, and the prediction of the enhanced tumor labels and the necrosis and non-enhanced tumor labels is more accurate.

The foregoing has shown and described the basic principles, principal features and advantages of the invention. It will be understood by those skilled in the art that the present invention is not limited to the embodiments described above, and that the above embodiments and descriptions are merely illustrative of the principles of the present invention, and various changes and modifications may be made without departing from the spirit and scope of the invention, which is defined in the appended claims. The scope of the invention is defined by the appended claims and equivalents thereof.

Claims

1. An image segmentation method based on generating an antagonistic network SEG-GAN, comprising the steps of:

s1, predicting a confidence map by using a discriminant network pre-trained with tag data, generating an antagonism network SEG-GAN, taking the confidence map as a supervision signal, and guiding cross entropy loss through self-learning; wherein the confidence map indicates which regions of the prediction distribution are close to the true tag map distribution, such that these predictions can be trained by the segmentation network by masking cross entropy loss of other untrusted regions;

s2: applying the resistance loss on the unlabeled data, and generating a prediction result of the resistance model on the unlabeled data to be close to the real label graph distribution;

s3: the segmentation network S is a generator structure of 3D-MedGAN, an input MR image x with one dimension H multiplied by W multiplied by D multiplied by 1 is given, and the segmentation network outputs a semantic tag probability map S (x) with the size H multiplied by W multiplied by D multiplied by C, wherein C is the number of semantic categories;

s4: the generator network G is responsible for generating a semantic tag probability map G (z) with the same size H multiplied by W multiplied by D multiplied by C as the input image according to a random vector z with fixed dimension;

s5: the discriminator network D depends on the segmentation network and the generator network, takes a probability map S (x) predicted by the segmentation network, a probability map G (z) generated by the generator network and a one-hot map of the real label map y as inputs, and then outputs a confidence map p with the size of H multiplied by W multiplied by D; each pixel on the confidence map p represents that the label of the input image x at the position corresponding thereto is a sample from the true label map y, p=1 or false label map, p=0, including S (x) and G (z);

wherein generating the challenge model includes two "generators" and a discriminator; one of the two generators is a segmentation network S for predicting a tag probability map for an input MR and a generator G for converting random noise into the tag probability map, and the tag probability map predicted by the segmentation network S and the generator G is used as a false sample, and marks of marking data are used as true samples, so that the discriminator is trained, and the discriminator has the capability of separating true marks from false marks; the arbiter obtains an indication matrix according to the prediction; the indication matrix is used for keeping the relative reliable prediction of the segmentation network S on the unlabeled samples as a supervision signal for self-training;

the input data x according to which the splitting network S is based comprises the marking data x _l And unlabeled data x _u Each marking data x _l All have corresponding label patterns y _l The label map y of H×W×D size here _l One-hot encoding is changed into a probability map y with C channels of discrete labels _l The labels at each location will map the probability map y _l The channel of which the upper represents the category is set to be 1, and the same positions on the rest other channels are all set to be 0;

in the training process, when the marking data x is used _l In this case, the split network S is formed by a label probability map S (x _l ) And true label map y _l Standard cross entropy loss of (2)Distinguishing y by using a discriminator network D _pred Is->Guiding and updating each parameter in the network;

training the segmentation network using a self-supervised learning method for unlabeled data, predicting an unlabeled image x from the segmentation network S _u Is (x) _u ) Then, a tag probability map S (x _u ) Confidence degree of each position is obtained, a confidence degree map p is obtained, the quality of the predicted segmentation area is indicated through the confidence degree map, and the result of the segmentation network S during training can be trusted;

then, a region with high confidence is reserved by taking a threshold value on the confidence map p, and a predictive label probability map S (x _u ) The channel with the highest probability in all channels in the areas is used as the label of the area to obtain a pseudo label graph, and the pseudo label graph is the same asCarrying out one-hot coding on the pseudo tag image to obtain y _u ；

Then the network S predicts the label probability map S (x) with high confidence region segmentation _u ) And y _u Cross entropy loss betweenResistance loss with the arbiter network D>Training a segmentation network; the generator network generates a false tag probability map G (z) according to the random noise z, and obtains y by taking the channel with the largest value at the same position as the tag _g At the same time, the discriminator is used to calculate the generator loss->

The arbiter network D will identify the tag probability map y that is tagged with the genuine data _l False label probability map y predicted by split network _pred Guided, y _u And identifying a true tag probability map yl and a false tag probability map y generated by the generator network _g Will result in minimized arbiter lossSince the training process is to optimize the segmentation network S and the discriminant network D in turn, the resistance loss is +.>And discriminator loss->And will not be used simultaneously.

2. The method of generating an image segmentation based on a countermeasure network according to claim 1, wherein minimized arbiter loss among network discriminatorsThe method comprises the following steps:

where n is the nth sample in the small lot data representing training, Y _n A label representing a pixel on a label probability map of h×w×d size at a position (H, W, D) is true; when Y is _n When=0, i.e. false, the probability values representing all channels at that point are from the label probability map S (x) predicted by the segmentation network or the label probability map G (z) synthesized by the generator network, when Y _n When=1, it is true, the probability values representing all channels at that point are derived from the labeled sample x _l Is marked y of (2) _l 。

3. The image segmentation method based on the generation of countermeasure network as set forth in claim 2, wherein the data input to the arbiter network D are unified into one-hot coding format, the tag probability map is modified by taking the channel with the highest probability on each position as the tag, and then performing one-hot coding to obtain the data of HxW xD xCThe method comprises the following steps:

4. a method of image segmentation based on generation of a countermeasure network as claimed in claim 3, wherein loss occurs as the arbiter is continually optimizedContinuously reducing the loss of the generator/>Optimizing in order to generate a tag probability map y of a fraud arbiter _g So that->Lifting, said generator losing->The method comprises the following steps:

wherein the loss isAnd loss->The loss function of the standard GANs is used.

5. The image segmentation method based on generation of countermeasure network according to claim 4, wherein for the segmentation network S, a gap between a predicted tag probability map and a true tag sample needs to be narrowed, the gap for a tagged sample x _l Is a predictive probability map S (x _l ) One-hot coding diagram y of each pixel point and true mark _l Accumulation of cross entropy for each pixel point, for unlabeled exemplar x _u Then it is the predictive probability map S (x _u ) Pixel point on the image and reserved pseudo-marked one-hot coding diagram y _u Accumulation of cross entropy of pixel points on the pixel, thus partition lossThe method comprises the following steps:

6. the method for generating an image segmentation based on a countermeasure network according to claim 5, wherein the indicator matrix I is trained with the marker data x _l Is set to a matrix with all elements 1, representing S (x _l ) And y is _l The cross entropy calculated by each point is used as a supervision signal to train the segmentation network S; while training the unlabeled data x _u When the position (h, w, d) is binarized by taking a threshold value T according to the confidence map p, the point of the position (h, w, d) in the c-axis direction of the indication matrix is set to be 0 or 1; the overall acquisition indication matrix I is:

with marked data x _l Or non-marking data x _u Training the segmentation network S, and predicting the result y of the segmentation network by adopting a discriminator D of the full convolution network _pred And y _u Calculating the countermeasures against lossTo guide the segmentation network to optimize the segmentation network, and predict the probability graph y which is closer to the true label _l Results of the distribution;

wherein, countering the lossThe calculation formula of (2) is as follows:

the training segmentation network S is:

7. a system based on an image segmentation method for generating an antagonistic network SEG-GAN, the system comprising:

the data set acquisition module is used for acquiring a target image set, a reference image set and a pre-marked reference mark set corresponding to the reference image set; the target image set comprises a target image training set and a target image testing set;

the network construction module is used for constructing a segmentation network and a discrimination network; wherein the first objective loss function of the segmentation network comprises cross entropy loss of the objective image set and the reference annotation set, contrast loss of the objective image set, and semi-supervised loss between the objective image set and the reference image set;

the training module is used for inputting the target image training set and the reference image set into the segmentation network, correspondingly obtaining a target probability score graph and a reference probability score graph, and inputting the target probability score graph and the reference probability score graph into the discrimination network so as to perform joint training of the segmentation network and the discrimination network;

the judging module is used for finishing training when the first target loss function of the segmentation network and the second target loss function of the judging network are converged;

the test module is used for inputting the target image test set into the trained segmentation network to obtain a target segmentation image;

the input data x according to which the splitting network S is based comprises the marking data x _l And unlabeled data x _u Each marking data x _l All have corresponding label patterns y _l The label map y of H×W×D size here _l One-hot encoding is changed into a probability map y with C channels of discrete labels _l The labels at each location will map the probability map y _l The channel whose category is represented by the upper is set to 1, and the same positions on the remaining other channels are all set to 0.

8. An apparatus for using an image segmentation method based on generating an antagonistic network SEG-GAN, characterized in that: the apparatus comprises a processor, a memory, and a computer program stored in the memory and configured to be executed by the processor, the processor implementing the method of generating an image segmentation based on a countermeasure network as claimed in any one of claims 1 to 6 when the computer program is executed.