CN112508817A - Blind image motion deblurring method based on a cycle-consistent generative adversarial network - Google Patents
- Publication number
- CN112508817A CN112508817A CN202011484067.9A CN202011484067A CN112508817A CN 112508817 A CN112508817 A CN 112508817A CN 202011484067 A CN202011484067 A CN 202011484067A CN 112508817 A CN112508817 A CN 112508817A
- Authority
- CN
- China
- Prior art keywords
- image
- network
- generator
- discriminator
- blur
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000000034 method Methods 0.000 title claims abstract description 28
- 238000012549 training Methods 0.000 claims abstract description 57
- 125000004122 cyclic group Chemical group 0.000 claims abstract description 14
- 230000004913 activation Effects 0.000 claims description 18
- 238000010606 normalization Methods 0.000 claims description 18
- 238000009826 distribution Methods 0.000 claims description 17
- 239000002131 composite material Substances 0.000 claims description 15
- 230000006870 function Effects 0.000 claims description 15
- 238000005457 optimization Methods 0.000 claims description 15
- 238000004422 calculation algorithm Methods 0.000 claims description 10
- 238000010586 diagram Methods 0.000 claims description 8
- 238000012360 testing method Methods 0.000 claims description 8
- 239000011159 matrix material Substances 0.000 claims description 6
- 238000011156 evaluation Methods 0.000 claims description 4
- 238000012545 processing Methods 0.000 claims description 4
- 230000003042 antagonistic effect Effects 0.000 claims description 3
- 238000000605 extraction Methods 0.000 claims description 3
- 238000007781 pre-processing Methods 0.000 claims description 3
- 230000000694 effects Effects 0.000 abstract description 8
- 238000011160 research Methods 0.000 description 3
- 238000013135 deep learning Methods 0.000 description 2
- 238000011161 development Methods 0.000 description 2
- 238000001914 filtration Methods 0.000 description 2
- 238000013528 artificial neural network Methods 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000013527 convolutional neural network Methods 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 238000013507 mapping Methods 0.000 description 1
- 230000000007 visual effect Effects 0.000 description 1
- 238000012800 visualization Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T5/00—Image enhancement or restoration
- G06T5/73—Deblurring; Sharpening
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/21—Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
- G06F18/214—Generating training patterns; Bootstrap methods, e.g. bagging or boosting
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
- G06N3/084—Backpropagation, e.g. using gradient descent
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10004—Still image; Photographic image
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20081—Training; Learning
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20084—Artificial neural networks [ANN]
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- General Physics & Mathematics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Artificial Intelligence (AREA)
- General Engineering & Computer Science (AREA)
- Evolutionary Computation (AREA)
- Biophysics (AREA)
- Computational Linguistics (AREA)
- Software Systems (AREA)
- Mathematical Physics (AREA)
- Health & Medical Sciences (AREA)
- Biomedical Technology (AREA)
- Computing Systems (AREA)
- Molecular Biology (AREA)
- General Health & Medical Sciences (AREA)
- Evolutionary Biology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Image Analysis (AREA)
Abstract
The invention discloses a blind image motion deblurring method based on a cycle-consistent generative adversarial network. The method first constructs a data set, collecting blurred and sharp images in a 1:1 quantity ratio as input data; it then builds a generator network and a discriminator network, and defines a loss function composed of an adversarial loss and an inter-domain cycle-invariant loss; finally, the discriminator network and the generator network are trained in sequence, and training is complete when the two networks reach a Nash equilibrium state. By exploiting the principle of the cycle-consistent generative adversarial network, the invention makes effective use of unpaired data for training in a deblurring task that lacks paired-data support, and produces good results.
Description
Technical Field
The invention belongs to the technical field of image processing, and particularly relates to a blind image motion deblurring method.
Background
With the continuous development of shooting equipment such as mobile phones and cameras, photography has become an indispensable part of daily life. During shooting, camera shake or relative motion of the photographed object often causes image blur. Blurred images not only seriously hinder the acquisition of information but also harm subsequent computer vision analysis, so image deblurring has great practical significance and research value.
Depending on whether the blur kernel is known, the deblurring task is divided into two broad categories: non-blind image deblurring and blind image deblurring. Most early research was based on and extended from non-blind image deblurring, mostly building on classical algorithms such as Wiener filtering and Tikhonov filtering and obtaining a sharp-image estimate through deconvolution. The basic principle of these algorithms is to estimate the image from the degraded observation under a certain criterion by mathematical optimization. In general, however, the blur kernel of a blurred image is unknown, and most early algorithms rely on heuristics, image statistics, and assumptions about the source of the blur. With the development of deep learning, researchers have begun to address image deblurring with deep networks, and deblurring algorithms based on deep convolutional neural networks achieve better performance and higher speed than traditional algorithms. However, deep learning methods need a large amount of paired data for training, and a motion-blurred image and the corresponding sharp image cannot be captured at the same time, so obtaining paired motion-blur data sets is very difficult. Some public data sets synthesize a blurred image from multiple sharp frames to obtain paired data, but such synthetic data does not simulate real-world blur very well.
Disclosure of Invention
In order to overcome the defects of the prior art, the invention provides a blind image motion deblurring method based on a cycle-consistent generative adversarial network. The method first constructs a data set, collecting blurred and sharp images in a 1:1 quantity ratio as input data; it then builds a generator network and a discriminator network, and defines a loss function composed of an adversarial loss and an inter-domain cycle-invariant loss; finally, the discriminator network and the generator network are trained in sequence, and training is complete when the two networks reach a Nash equilibrium state. By exploiting the principle of the cycle-consistent generative adversarial network, the invention makes effective use of unpaired data for training in a deblurring task that lacks paired-data support, and produces good results.
The technical scheme adopted by the invention for solving the technical problem comprises the following steps:
step 1: constructing and preprocessing a data set;
taking the blurred images from the first 11 scenes of the GOPRO training set as the blurred data set and the sharp images from the last 11 scenes as the sharp data set, and forming a new training set from the sharp data set and the blurred data set; randomly cropping all images in the new training set into patches of size 256 × 256 and standardizing them to obtain the input data set;
taking the test set of the GOPRO data set as a new test set;
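Step 1 can be sketched in a few lines of NumPy. This is a minimal illustration, not the patent's code: the crop size (256) is stated above, while the standardization to [-1, 1] is an assumption (the patent says only "standardization").

```python
# Hypothetical sketch of the Step-1 preprocessing: a random 256x256 crop
# followed by standardization. The [-1, 1] pixel range is an assumption.
import numpy as np

def random_crop(img: np.ndarray, size: int = 256) -> np.ndarray:
    """Randomly crop an HxWxC image to size x size."""
    h, w = img.shape[:2]
    top = np.random.randint(0, h - size + 1)
    left = np.random.randint(0, w - size + 1)
    return img[top:top + size, left:left + size]

def standardize(img: np.ndarray) -> np.ndarray:
    """Map uint8 pixels in [0, 255] to float32 values in [-1, 1]."""
    return img.astype(np.float32) / 127.5 - 1.0

# Example: one standardized patch from a 720p frame (GOPRO frames are 720p).
patch = standardize(random_crop(np.zeros((720, 1280, 3), dtype=np.uint8)))
```
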
step 2: constructing a generator network;
the generator network comprises two generators: a blur-to-sharp generator G_b2s and a sharp-to-blur generator G_s2b. Their roles are as follows: G_b2s maps an input blurred image to the corresponding sharp image, and G_s2b maps an input sharp image to the corresponding blurred image. The two generators share the same structure but do not share parameters; both use InstanceNorm as the normalization layer and LeakyReLU as the activation layer. The concrete structure is as follows:
(1) an input module: one convolution block consisting of a convolution layer with 64 channels and a 7 × 7 kernel, an instance normalization layer, and a ReLU activation layer;
(2) a feature extraction module: two identical convolution blocks, each consisting of a convolution layer with 128 channels and a 3 × 3 kernel, an instance normalization layer, and a ReLU activation layer;
(3) a residual dense learning module: contains 5 residual dense blocks (RDBs), each comprising four convolution blocks. The first three blocks share the same structure, each consisting of a convolution layer with a 3 × 3 kernel, an InstanceNorm layer, and a ReLU activation layer, and they are densely connected; the RDB input and the outputs of these three blocks together form 4 groups of feature maps. A 1 × 1 convolution then reduces the channel count of the 4 concatenated feature-map groups back to that of the RDB input, and the result is added to the RDB input to learn the residual;
(4) an image reconstruction module: two identical convolution blocks, each consisting of a convolution layer with 128 channels and a 3 × 3 kernel, an instance normalization layer, and a ReLU activation layer;
(5) an output module: one convolution block consisting of a convolution layer with 64 channels and a 7 × 7 kernel, an instance normalization layer, and a ReLU activation layer;
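The five modules above can be sketched in PyTorch. This is a hedged reconstruction, not the patent's implementation: the RDB growth width, all strides and paddings, the final 3-channel convolution after the 64-channel output module, and the Tanh output range are assumptions (the text does not specify them), and ReLU is used inside the blocks as each module's description states.

```python
# Sketch of the generator of Step 2, assuming stride-1 "same" convolutions,
# a growth width of 64 inside each RDB, and an extra RGB projection + Tanh
# at the end (none of these details are given in the patent text).
import torch
import torch.nn as nn

class RDB(nn.Module):
    """Residual dense block: three densely connected conv blocks,
    a 1x1 fusion conv, and a residual connection."""
    def __init__(self, ch: int = 128, growth: int = 64):
        super().__init__()
        def block(in_ch: int) -> nn.Sequential:
            return nn.Sequential(
                nn.Conv2d(in_ch, growth, 3, padding=1),
                nn.InstanceNorm2d(growth),
                nn.ReLU(inplace=True))
        self.b1 = block(ch)
        self.b2 = block(ch + growth)
        self.b3 = block(ch + 2 * growth)
        # 1x1 conv reduces the 4 concatenated feature groups back to ch
        self.fuse = nn.Conv2d(ch + 3 * growth, ch, 1)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        f1 = self.b1(x)
        f2 = self.b2(torch.cat([x, f1], dim=1))
        f3 = self.b3(torch.cat([x, f1, f2], dim=1))
        return x + self.fuse(torch.cat([x, f1, f2, f3], dim=1))

class Generator(nn.Module):
    """Five-module generator sketch following modules (1)-(5) above."""
    def __init__(self):
        super().__init__()
        def cbr(i: int, o: int, k: int) -> nn.Sequential:
            return nn.Sequential(
                nn.Conv2d(i, o, k, padding=k // 2),
                nn.InstanceNorm2d(o),
                nn.ReLU(inplace=True))
        self.net = nn.Sequential(
            cbr(3, 64, 7),                       # (1) input module
            cbr(64, 128, 3), cbr(128, 128, 3),   # (2) feature extraction
            *[RDB(128) for _ in range(5)],       # (3) 5 residual dense blocks
            cbr(128, 128, 3), cbr(128, 128, 3),  # (4) image reconstruction
            cbr(128, 64, 7),                     # (5) output module (64 ch)
            nn.Conv2d(64, 3, 3, padding=1),      # final RGB projection (assumed)
            nn.Tanh())                           # output in [-1, 1] (assumed)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.net(x)
```

Since every convolution is stride 1 with "same" padding, the output resolution matches the input, so a 256 × 256 patch maps to a 256 × 256 image.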
step 3: constructing a discriminator network;
the discriminator network comprises two discriminators: a sharpness discriminator D_s for judging sharp images and a blur discriminator D_b for judging blurred images. The two discriminators share the same structure but do not share parameters; the specific structure is shown in Table 1:
table 1: discriminator structure
step 4: defining a loss function;
the loss function of the network consists of two parts: the adversarial loss and the inter-domain cycle-invariant loss;
the adversarial loss (reconstructed here in the standard GAN form that matches the description below):
L_adv(G, D, b, s) = E_{s~p(s)}[log D(s)] + E_{b~p(b)}[log(1 - D(G(b)))]
wherein G denotes a generator, D denotes a discriminator, b and s are respectively a blurred image and a sharp image, and p (b) and p(s) are respectively a data distribution of the blurred image and a data distribution of the sharp image; the generator G aims to make the generated G (b) consistent with the distribution of s, and the discriminator aims to distinguish G (b) from s;
the inter-domain cycle-invariant loss (reconstructed as the standard L1 cycle-consistency term):
L_Cycle(G_b2s, G_s2b, s, b) = E_{b~p(b)}[||G_s2b(G_b2s(b)) - b||_1] + E_{s~p(s)}[||G_b2s(G_s2b(s)) - s||_1]
wherein b and s are respectively a blurred image and a sharp image, and p (b) and p(s) are respectively data distribution of the blurred image and data distribution of the sharp image;
the overall loss function is:
L(G_b2s, G_s2b, D_s, D_b, s, b) = L_adv(G_b2s, D_s, b, s) + L_adv(G_s2b, D_b, s, b) + λ·L_Cycle(G_b2s, G_s2b, s, b)
the overall loss function comprises three parts: the first is the adversarial loss of the blur-to-sharp generator G_b2s, the second is the adversarial loss of the sharp-to-blur generator G_s2b, and the third is the inter-domain cycle-invariant loss; λ controls the relative weight of the adversarial losses and the cycle-invariant loss;
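The three loss terms can be sketched as small PyTorch helpers. This is a minimal illustration assuming the binary cross-entropy GAN form and an L1 cycle term; λ = 10.0 is the patent's own preferred value, while the function names are hypothetical.

```python
# Minimal sketch of the loss terms, assuming sigmoid discriminator outputs,
# binary cross-entropy for L_adv, and L1 for the cycle-invariant term.
import torch
import torch.nn.functional as F

LAMBDA = 10.0  # preferred value of λ stated in the patent

def adv_loss_d(d_real: torch.Tensor, d_fake: torch.Tensor) -> torch.Tensor:
    """Discriminator side of L_adv: real images toward 1, synthesized toward 0."""
    return (F.binary_cross_entropy(d_real, torch.ones_like(d_real)) +
            F.binary_cross_entropy(d_fake, torch.zeros_like(d_fake)))

def adv_loss_g(d_fake: torch.Tensor) -> torch.Tensor:
    """Generator side of L_adv: fool the discriminator into predicting 1."""
    return F.binary_cross_entropy(d_fake, torch.ones_like(d_fake))

def cycle_loss(b: torch.Tensor, b_rec: torch.Tensor,
               s: torch.Tensor, s_rec: torch.Tensor) -> torch.Tensor:
    """λ-weighted inter-domain cycle-invariant loss: L1 distance between
    each input image and its round-trip reconstruction."""
    return LAMBDA * (F.l1_loss(b_rec, b) + F.l1_loss(s_rec, s))
```
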
and 5: inputting the input data set in the step 1 into a network, and performing optimization training on the network by adopting an Adam optimization algorithm;
step 5-1: training a discriminator network;
training on real data: a blurred image and a sharp image are taken from the input data set; the sharp image is input to the sharpness discriminator D_s and the blurred image to the blur discriminator D_b. The judgments of the two discriminators are obtained, a loss value is computed from the judgment results, and back-propagation adjusts the network parameters to optimize the network;
training on synthesized data: a blurred image and a sharp image are taken from the input data set and input to the blur-to-sharp generator G_b2s and the sharp-to-blur generator G_s2b respectively, yielding a synthesized sharp image and a synthesized blurred image. The two synthesized images are then input to the sharpness discriminator D_s and the blur discriminator D_b respectively; their judgments are obtained, a loss value is computed from the judgment results, and back-propagation adjusts the network parameters to optimize the network;
the discriminator network outputs a two-dimensional matrix; all values of this matrix are averaged, and the mean serves as the discriminator network's evaluation of the whole image;
step 5-2: training a generator network;
blur-sharp-blur training cycle: a blurred image b is selected from the input data set and input to the blur-to-sharp generator G_b2s to obtain a synthesized sharp image, which is fed to the sharpness discriminator D_s; the evaluation by D_s yields the adversarial loss value. The goal of the blur-to-sharp stage is that the image generated by G_b2s be judged by D_s to be a real sharp image. The synthesized sharp image is simultaneously input to the sharp-to-blur generator G_s2b, producing a synthesized blurred image G_s2b(G_b2s(b)), from which the inter-domain cycle-invariant loss value is computed. The goal of the blur-sharp-blur cycle is that the blurred image obtained by passing the input through G_b2s and then G_s2b be consistent with the originally input blurred image. Finally, the adversarial loss and the cycle-invariant loss are back-propagated to adjust the network parameters and optimize the network;
sharp-blur-sharp training cycle: a sharp image s is selected from the input data set and input to the sharp-to-blur generator G_s2b to obtain a synthesized blurred image, which is fed to the blur discriminator D_b; the evaluation by D_b yields the adversarial loss value. The goal of the sharp-to-blur stage is that the image generated by G_s2b be judged by D_b to be a real blurred image. The synthesized blurred image is simultaneously input to the blur-to-sharp generator G_b2s, producing a synthesized sharp image G_b2s(G_s2b(s)), from which the inter-domain cycle-invariant loss value is computed. The goal of the sharp-blur-sharp cycle is that the sharp image obtained by passing the input through G_s2b and then G_b2s be consistent with the originally input sharp image. Finally, the adversarial loss and the cycle-invariant loss are back-propagated to adjust the network parameters and optimize the network;
step 5-3: train the discriminator network and the generator network in sequence; training is complete when the two networks reach a Nash equilibrium state.
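The alternating procedure of steps 5-1 and 5-2 can be sketched as one PyTorch-style training iteration. This is a hedged sketch, not the patent's code: the generators, discriminators, optimizers, and loss helpers are assumed to be supplied by the caller, and synthesized images are detached during the discriminator update so it does not alter the generators.

```python
# One training iteration: update the discriminators first (step 5-1),
# then the generators via both cycles (step 5-2). All objects are assumed.
def train_step(b, s, G_b2s, G_s2b, D_s, D_b, opt_d, opt_g,
               adv_loss_d, adv_loss_g, cycle_loss):
    # Synthesize a sharp image from b and a blurred image from s.
    fake_s, fake_b = G_b2s(b), G_s2b(s)

    # --- Step 5-1: discriminators on real and synthesized images ---
    opt_d.zero_grad()
    d_loss = (adv_loss_d(D_s(s), D_s(fake_s.detach())) +
              adv_loss_d(D_b(b), D_b(fake_b.detach())))
    d_loss.backward()
    opt_d.step()

    # --- Step 5-2: generators via the blur-sharp-blur and
    # sharp-blur-sharp cycles (adversarial + cycle-invariant losses) ---
    opt_g.zero_grad()
    g_loss = (adv_loss_g(D_s(fake_s)) + adv_loss_g(D_b(fake_b)) +
              cycle_loss(b, G_s2b(fake_s), s, G_b2s(fake_b)))
    g_loss.backward()
    opt_g.step()
    return d_loss.item(), g_loss.item()
```
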
Preferably, λ is 10.0.
Preferably, the hyper-parameters of the Adam optimization in step 5 are set as follows: the learning rate is 1 × 10^-4; the number of training epochs over all samples is 300, with the learning rate held constant for the first 150 epochs and decayed linearly to zero over the last 150; the batch size is set to 1.
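The stated schedule maps naturally onto a LambdaLR scheduler. This is a sketch under assumptions: the exact decay step (per epoch, with a 151-step ramp so the factor reaches zero just after epoch 299) is one reasonable reading of "linearly decays to zero over the last 150 epochs".

```python
# Adam at 1e-4, constant for 150 epochs, then linear decay to zero.
import torch

params = [torch.nn.Parameter(torch.zeros(1))]  # stand-in model parameters
opt = torch.optim.Adam(params, lr=1e-4)

def lr_lambda(epoch: int) -> float:
    """Multiplicative LR factor: 1.0 through epoch 149, then linear decay."""
    return 1.0 - max(0, epoch - 149) / 151.0

sched = torch.optim.lr_scheduler.LambdaLR(opt, lr_lambda)
```

Calling `sched.step()` once per epoch reproduces the constant-then-linear schedule.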
The invention has the following beneficial effects:
1. The invention adopts an end-to-end deep neural network to accomplish blind motion deblurring without estimating a blur kernel.
2. Using the principle of the cycle-consistent generative adversarial network, the invention effectively trains on unpaired data in a deblurring task that lacks paired-data support and produces good results.
3. The generator adopts residual dense modules, which make the network more adaptable and improve its learning efficiency, yielding a better deblurring effect.
4. The inter-domain cycle-invariant loss defined in the loss function effectively constrains the mapping space of the two generators, thereby ensuring their effectiveness.
Drawings
FIG. 1 is a flow chart of an implementation of the present invention.
Fig. 2 is a diagram of a generator network architecture of the present invention.
Fig. 3 is a detailed block diagram of the residual dense block in the generator network of the present invention.
FIG. 4 is a diagram of the motion blur removal effect of the present invention, with the left column being the input blurred image, the middle column being the output after deblurring by the method of the present invention, and the right column being the true sharp image.
Detailed Description
The invention is further illustrated with reference to the following figures and examples.
The invention provides a blind image motion deblurring method based on a cycle-consistent generative adversarial network, realizing an end-to-end motion deblurring task without paired data.
As shown in fig. 1, the invention provides a blind image motion deblurring method based on a cycle-consistent generative adversarial network, which comprises the following steps:
step 1: constructing and preprocessing a data set;
taking the blurred images from the first 11 scenes of the GOPRO training set as the blurred data set and the sharp images from the last 11 scenes as the sharp data set, and forming a new training set from the sharp data set and the blurred data set; randomly cropping all images in the new training set into patches of size 256 × 256 and standardizing them to obtain the input data set;
taking the test set of the GOPRO data set as a new test set;
step 2: building a generator network, as shown in FIG. 2;
the generator network comprises two generators: a blur-to-sharp generator G_b2s and a sharp-to-blur generator G_s2b. Their roles are as follows: G_b2s maps an input blurred image to the corresponding sharp image, and G_s2b maps an input sharp image to the corresponding blurred image. The two generators share the same structure but do not share parameters; both use InstanceNorm as the normalization layer and LeakyReLU as the activation layer. The concrete structure is as follows:
(1) an input module: one convolution block consisting of a convolution layer with 64 channels and a 7 × 7 kernel, an instance normalization layer, and a ReLU activation layer;
(2) a feature extraction module: two identical convolution blocks, each consisting of a convolution layer with 128 channels and a 3 × 3 kernel, an instance normalization layer, and a ReLU activation layer;
(3) a residual dense learning module: as shown in fig. 3, it contains 5 residual dense blocks (RDBs), each comprising four convolution blocks. The first three blocks share the same structure, each consisting of a convolution layer with a 3 × 3 kernel, an InstanceNorm layer, and a ReLU activation layer, and they are densely connected; the RDB input and the outputs of these three blocks together form 4 groups of feature maps. A 1 × 1 convolution then reduces the channel count of the 4 concatenated feature-map groups back to that of the RDB input, and the result is added to the RDB input to learn the residual;
(4) an image reconstruction module: two identical convolution blocks, each consisting of a convolution layer with 128 channels and a 3 × 3 kernel, an instance normalization layer, and a ReLU activation layer;
(5) an output module: one convolution block consisting of a convolution layer with 64 channels and a 7 × 7 kernel, an instance normalization layer, and a ReLU activation layer;
step 3: constructing a discriminator network;
the discriminator network comprises two discriminators: a sharpness discriminator D_s for judging sharp images and a blur discriminator D_b for judging blurred images. The two discriminators share the same structure but do not share parameters; the specific structure is shown in Table 1:
table 1: discriminator structure
step 4: defining a loss function;
the loss function of the network consists of two parts: the adversarial loss and the inter-domain cycle-invariant loss;
the adversarial loss (reconstructed here in the standard GAN form that matches the description below):
L_adv(G, D, b, s) = E_{s~p(s)}[log D(s)] + E_{b~p(b)}[log(1 - D(G(b)))]
wherein G denotes a generator, D denotes a discriminator, b and s are respectively a blurred image and a sharp image, and p (b) and p(s) are respectively a data distribution of the blurred image and a data distribution of the sharp image; the generator G aims to make the generated G (b) consistent with the distribution of s, and the discriminator aims to distinguish G (b) from s;
the inter-domain cycle-invariant loss (reconstructed as the standard L1 cycle-consistency term):
L_Cycle(G_b2s, G_s2b, s, b) = E_{b~p(b)}[||G_s2b(G_b2s(b)) - b||_1] + E_{s~p(s)}[||G_b2s(G_s2b(s)) - s||_1]
wherein b and s are respectively a blurred image and a sharp image, and p (b) and p(s) are respectively data distribution of the blurred image and data distribution of the sharp image;
the overall loss function is:
L(G_b2s, G_s2b, D_s, D_b, s, b) = L_adv(G_b2s, D_s, b, s) + L_adv(G_s2b, D_b, s, b) + λ·L_Cycle(G_b2s, G_s2b, s, b)
the overall loss function comprises three parts: the first is the adversarial loss of the blur-to-sharp generator G_b2s, the second is the adversarial loss of the sharp-to-blur generator G_s2b, and the third is the inter-domain cycle-invariant loss; λ controls the relative weight of the adversarial losses and the cycle-invariant loss, and λ is set to 10.0;
and 5: inputting the input data set in the step 1 into a network, and adopting Adam optimalOptimizing and training the network by using a chemometric algorithm, wherein the hyper-parameters are set as: learning rate of 1 × 10-4The epochs training times for all training samples are 300, the learning rates of the first 150 epochs are unchanged, the learning rates of the last 150 epochs linearly decay to zero, and the amount of data batchsize in each batch of training is set to 1. (ii) a
Step 5-1: training a discriminator network;
training on real data: a blurred image and a sharp image are taken from the input data set; the sharp image is input to the sharpness discriminator D_s and the blurred image to the blur discriminator D_b. The judgments of the two discriminators are obtained, a loss value is computed from the judgment results, and back-propagation adjusts the network parameters to optimize the network;
training on synthesized data: a blurred image and a sharp image are taken from the input data set and input to the blur-to-sharp generator G_b2s and the sharp-to-blur generator G_s2b respectively, yielding a synthesized sharp image and a synthesized blurred image. The two synthesized images are then input to the sharpness discriminator D_s and the blur discriminator D_b respectively; their judgments are obtained, a loss value is computed from the judgment results, and back-propagation adjusts the network parameters to optimize the network;
the discriminator network outputs a two-dimensional matrix; all values of this matrix are averaged, and the mean serves as the discriminator network's evaluation of the whole image;
step 5-2: training a generator network;
blur-sharp-blur training cycle: a blurred image b is selected from the input data set and input to the blur-to-sharp generator G_b2s to obtain a synthesized sharp image, which is fed to the sharpness discriminator D_s; the evaluation by D_s yields the adversarial loss value. The goal of the blur-to-sharp stage is that the image generated by G_b2s be judged by D_s to be a real sharp image. The synthesized sharp image is simultaneously input to the sharp-to-blur generator G_s2b, producing a synthesized blurred image G_s2b(G_b2s(b)), from which the inter-domain cycle-invariant loss value is computed. The goal of the blur-sharp-blur cycle is that the blurred image obtained by passing the input through G_b2s and then G_s2b be consistent with the originally input blurred image. Finally, the adversarial loss and the cycle-invariant loss are back-propagated to adjust the network parameters and optimize the network;
sharp-blur-sharp training cycle: a sharp image s is selected from the input data set and input to the sharp-to-blur generator G_s2b to obtain a synthesized blurred image, which is fed to the blur discriminator D_b; the evaluation by D_b yields the adversarial loss value. The goal of the sharp-to-blur stage is that the image generated by G_s2b be judged by D_b to be a real blurred image. The synthesized blurred image is simultaneously input to the blur-to-sharp generator G_b2s, producing a synthesized sharp image G_b2s(G_s2b(s)), from which the inter-domain cycle-invariant loss value is computed. The goal of the sharp-blur-sharp cycle is that the sharp image obtained by passing the input through G_s2b and then G_b2s be consistent with the originally input sharp image. Finally, the adversarial loss and the cycle-invariant loss are back-propagated to adjust the network parameters and optimize the network;
step 5-3: train the discriminator network and the generator network in sequence; training is complete when the two networks reach a Nash equilibrium state.
The specific embodiment is as follows:
the method of the present invention is tested with the test set defined in step 1: the blurred images are input into the deblurring network of the invention to obtain deblurred sharp images, and the test results are shown in Table 2:
TABLE 2 evaluation of the results
As shown in Table 2, the experimental results of this example are measured by the PSNR and SSIM indexes and compared with three well-known advanced algorithms, Kim et al., Sun et al., and DeblurGAN, all evaluated on the GOPRO dataset. By comparison, the method of the invention achieves the best result on the PSNR index and a competitive result on the SSIM index, which fully demonstrates its effectiveness.
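For reference, PSNR can be computed directly from its definition. This is an illustrative sketch, not the evaluation code of the patent; `skimage.metrics.peak_signal_noise_ratio` provides an equivalent library routine.

```python
# Peak signal-to-noise ratio in dB for 8-bit images (max_val = 255).
import numpy as np

def psnr(ref: np.ndarray, test: np.ndarray, max_val: float = 255.0) -> float:
    """PSNR = 10 * log10(max_val^2 / MSE); infinite for identical images."""
    mse = np.mean((ref.astype(np.float64) - test.astype(np.float64)) ** 2)
    if mse == 0:
        return float("inf")
    return 10.0 * np.log10(max_val ** 2 / mse)
```
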
As shown in fig. 4, which presents the visualization results of this embodiment, the leftmost column is the input original (blurred) image, the middle column is the image output by the deblurring network, and the rightmost column is the corresponding sharp image. Three regions are marked in each figure with rectangular boxes and enlarged below the image so that the deblurring effect can be observed in detail. Visually, the method clearly achieves a better deblurring effect, which demonstrates its effectiveness.
Claims (3)
1. An image motion blind deblurring method based on a cycle-consistent generative adversarial network, characterized by comprising the following steps:
step 1: constructing and preprocessing a data set;
taking the blurred images in the first 11 scenes of the training set of the GOPRO data set as the blurred data set and the sharp images in the last 11 scenes as the sharp data set, the two data sets together forming a new training set; randomly cropping all images in the new training set into several patches of size 256 × 256 and standardizing them to obtain the input data set;
taking the test set of the GOPRO data set as a new test set;
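The cropping and standardization of step 1 can be sketched as below; normalizing to [-1, 1] is an assumption, since the claim only says "standardization processing".

```python
import torch

def preprocess(img, crop=256):
    """Randomly crop a (C, H, W) float tensor with values in [0, 1]
    to crop x crop and normalize it to [-1, 1] (the exact
    standardization is an assumption)."""
    _, h, w = img.shape
    top = torch.randint(0, h - crop + 1, (1,)).item()
    left = torch.randint(0, w - crop + 1, (1,)).item()
    patch = img[:, top:top + crop, left:left + crop]
    return patch * 2.0 - 1.0

img = torch.rand(3, 720, 1280)  # a GOPRO frame is 720 x 1280
x = preprocess(img)
```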
step 2: constructing a generator network;
the generator network comprises two generators: a blur-to-sharp generator Gb2s and a sharp-to-blur generator Gs2b; the roles of the two generators are respectively: the blur-to-sharp generator Gb2s outputs the input blurred image as the corresponding sharp image, and the sharp-to-blur generator Gs2b outputs the input sharp image as the corresponding blurred image; the two generators have the same structure but do not share parameters, and both use InstanceNorm as the normalization layer and LeakyReLU as the activation layer; the concrete structure is as follows:
(1) an input module: one convolution block comprising a convolution layer with 64 channels and a 7 × 7 convolution kernel, an instance normalization layer and a ReLU activation layer;
(2) a feature extraction module: two identical convolution blocks, each comprising a convolution layer with 128 channels and a 3 × 3 convolution kernel, an instance normalization layer and a ReLU activation layer;
(3) a residual dense learning module: 5 residual dense blocks (RDB), each of which contains four convolution blocks: the first three convolution blocks are identical, each consisting of a convolution layer with a 3 × 3 convolution kernel, an InstanceNorm normalization layer and a ReLU activation layer, and are densely connected; the input of the residual dense block and the outputs of the first three convolution blocks together form 4 feature maps; a 1 × 1 convolution kernel reduces the number of channels of these 4 feature maps to that of the residual dense block input, and the result is finally added to the residual dense block input to learn the residual;
(4) an image reconstruction module: two identical convolution blocks, each comprising a convolution layer with 128 channels and a 3 × 3 convolution kernel, an instance normalization layer and a ReLU activation layer;
(5) an output module: one convolution block comprising a convolution layer with 64 channels and a 7 × 7 convolution kernel, an instance normalization layer and a ReLU activation layer;
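A PyTorch sketch of this generator under stated assumptions: the RDB growth width, the padding, and the final 3-channel projection are not fixed by the claim and are chosen here for illustration, and the ReLU activations follow the per-module wording above.

```python
import torch
import torch.nn as nn

class RDB(nn.Module):
    """Residual dense block per the claim: three densely connected 3x3
    conv blocks, a 1x1 fusion conv bringing the 4 concatenated feature
    maps back to the input width, and a residual add. The internal
    channel width (growth) is an assumption."""
    def __init__(self, ch=128, growth=32):
        super().__init__()
        self.blocks = nn.ModuleList([
            nn.Sequential(
                nn.Conv2d(ch + i * growth, growth, 3, padding=1),
                nn.InstanceNorm2d(growth),
                nn.ReLU(inplace=True))
            for i in range(3)])
        self.fuse = nn.Conv2d(ch + 3 * growth, ch, 1)

    def forward(self, x):
        feats = [x]
        for block in self.blocks:          # dense connections
            feats.append(block(torch.cat(feats, dim=1)))
        return x + self.fuse(torch.cat(feats, dim=1))

def conv_block(cin, cout, k):
    return nn.Sequential(
        nn.Conv2d(cin, cout, k, padding=k // 2),
        nn.InstanceNorm2d(cout),
        nn.ReLU(inplace=True))

def make_generator():
    # Input 7x7/64, two 3x3/128 extraction blocks, 5 RDBs, two 3x3/128
    # reconstruction blocks, output 7x7/64; the final 1x1 projection to
    # 3 channels is an assumption the claim leaves implicit.
    return nn.Sequential(
        conv_block(3, 64, 7),
        conv_block(64, 128, 3), conv_block(128, 128, 3),
        *[RDB(128) for _ in range(5)],
        conv_block(128, 128, 3), conv_block(128, 128, 3),
        conv_block(128, 64, 7),
        nn.Conv2d(64, 3, 1))

G_b2s = make_generator()
y = G_b2s(torch.rand(1, 3, 64, 64))
```

Gb2s and Gs2b would be two independent calls to `make_generator()`, matching the claim that the structures are identical but the parameters are not shared.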
step 3: constructing a discriminator network;
the discriminator network comprises two discriminators: a sharpness discriminator Ds for discriminating sharp images and a blur discriminator Db for discriminating blurred images; the two discriminators have the same structure but do not share parameters, and the specific structure is shown in table 1:
table 1: discriminator structure
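Table 1 is not reproduced in this text, so the sketch below uses the common 70 × 70 PatchGAN layout as an assumed stand-in; the relevant property, relied on in step 5-1, is that the output is a two-dimensional matrix of patch scores rather than a single scalar.

```python
import torch
import torch.nn as nn

def make_patch_discriminator(in_ch=3, base=64):
    """PatchGAN-style discriminator (assumed layout, since table 1 is
    not reproduced here). Outputs a 2-D score matrix that step 5-1
    averages into a whole-image evaluation."""
    return nn.Sequential(
        nn.Conv2d(in_ch, base, 4, stride=2, padding=1),
        nn.LeakyReLU(0.2, inplace=True),
        nn.Conv2d(base, base * 2, 4, stride=2, padding=1),
        nn.InstanceNorm2d(base * 2), nn.LeakyReLU(0.2, inplace=True),
        nn.Conv2d(base * 2, base * 4, 4, stride=2, padding=1),
        nn.InstanceNorm2d(base * 4), nn.LeakyReLU(0.2, inplace=True),
        nn.Conv2d(base * 4, base * 8, 4, stride=1, padding=1),
        nn.InstanceNorm2d(base * 8), nn.LeakyReLU(0.2, inplace=True),
        nn.Conv2d(base * 8, 1, 4, stride=1, padding=1))

D_b = make_patch_discriminator()
score_map = D_b(torch.rand(1, 3, 256, 256))  # 2-D matrix of patch scores
image_score = score_map.mean()               # whole-image evaluation
```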
step 4: defining a loss function;
the loss function of the network consists of two parts: the adversarial loss and the inter-domain cycle-consistency loss;
the adversarial loss:
wherein G denotes a generator, D denotes a discriminator, b and s denote a blurred image and a sharp image respectively, and p(b) and p(s) denote the data distributions of blurred images and of sharp images respectively; the goal of the generator G is to make the distribution of the generated G(b) consistent with that of s, while the goal of the discriminator is to distinguish G(b) from s;
the inter-domain cycle-consistency loss:
wherein b and s denote a blurred image and a sharp image respectively, and p(b) and p(s) denote the data distributions of blurred images and of sharp images respectively;
the overall loss function is:
L(Gb2s,Gs2b,Ds,Db,s,b)=Ladv(Gb2s,Ds,b,s)+Ladv(Gs2b,Db,s,b)+λLCycle(Gb2s,Gs2b,s,b)
the overall loss function includes three parts: the first part is the adversarial loss of the blur-to-sharp generator Gb2s, the second part is the adversarial loss of the sharp-to-blur generator Gs2b, and the third part is the inter-domain cycle-consistency loss; λ controls the relative importance of the adversarial losses and the inter-domain cycle-consistency loss;
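The two component losses referenced above appear as images in the original publication; written out in the standard CycleGAN form consistent with the symbol definitions given here (an assumption, since the exact formulas are not reproduced in this text), they would read:

```latex
\begin{aligned}
L_{adv}(G_{b2s}, D_s, b, s) &= \mathbb{E}_{s\sim p(s)}\big[\log D_s(s)\big]
  + \mathbb{E}_{b\sim p(b)}\big[\log\big(1 - D_s(G_{b2s}(b))\big)\big] \\
L_{adv}(G_{s2b}, D_b, s, b) &= \mathbb{E}_{b\sim p(b)}\big[\log D_b(b)\big]
  + \mathbb{E}_{s\sim p(s)}\big[\log\big(1 - D_b(G_{s2b}(s))\big)\big] \\
L_{Cycle}(G_{b2s}, G_{s2b}, s, b) &= \mathbb{E}_{b\sim p(b)}\big[\lVert G_{s2b}(G_{b2s}(b)) - b\rVert_1\big]
  + \mathbb{E}_{s\sim p(s)}\big[\lVert G_{b2s}(G_{s2b}(s)) - s\rVert_1\big]
\end{aligned}
```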
step 5: inputting the input data set of step 1 into the network, and performing optimization training on the network with the Adam optimization algorithm;
step 5-1: training a discriminator network;
training on real data: a blurred image and a sharp image are taken from the input data set and input to the sharpness discriminator Ds and the blur discriminator Db respectively; the judgments of the discriminators on the two images are obtained, the loss value is computed from the judgment results, and back propagation is performed to adjust the network parameters and optimize the network;
training on synthetic data: a blurred image and a sharp image are taken from the input data set and input to the blur-to-sharp generator Gb2s and the sharp-to-blur generator Gs2b respectively; the two synthesized images are then input to the sharpness discriminator Ds and the blur discriminator Db respectively; the judgments of the discriminators on the two images are obtained, the loss value is computed from the judgment results, and back propagation is performed to adjust the network parameters and optimize the network;
the result obtained by the discriminator network is a two-dimensional matrix; all values in this matrix are averaged, and the obtained average is used as the discriminator network's evaluation of the whole image;
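A minimal sketch of one discriminator update using this averaging: a single strided convolution stands in for the discriminator, and the LSGAN-style 1/0 targets on the averaged score are an assumption, since the claim only states that a loss value is computed from the judgment and back-propagated.

```python
import torch
import torch.nn as nn

D_s = nn.Conv2d(3, 1, 4, stride=2)  # sharpness discriminator stand-in
opt = torch.optim.Adam(D_s.parameters(), lr=1e-4)
mse = nn.MSELoss()

real_s = torch.rand(1, 3, 64, 64)   # real sharp image
fake_s = torch.rand(1, 3, 64, 64)   # would come from G_b2s(b)

real_score = D_s(real_s).mean()     # average the 2-D score matrix
fake_score = D_s(fake_s.detach()).mean()

# Real images should score 1, synthetic images 0 (assumed targets).
d_loss = mse(real_score, torch.ones(())) + mse(fake_score, torch.zeros(()))

opt.zero_grad()
d_loss.backward()
opt.step()
```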
step 5-2: training a generator network;
blur-sharp-blur training cycle: a blurred image is selected from the input data set and input to the blur-to-sharp generator Gb2s to obtain the corresponding synthesized sharp image, which is then input to the sharpness discriminator Ds; the evaluation of Ds on the synthesized sharp image yields the adversarial loss value; the goal of the blur-to-sharp process is that the image generated by the blur-to-sharp generator Gb2s is identified by the sharpness discriminator Ds as a real sharp image; meanwhile, the synthesized sharp image is input to the sharp-to-blur generator Gs2b to obtain the synthesized blurred image generated from the synthesized sharp image, i.e. Gs2b(Gb2s(b)), and the value of the inter-domain cycle-consistency loss; the goal of the blur-sharp-blur training cycle is that a blurred image passed through the blur-to-sharp generator Gb2s and then through the sharp-to-blur generator Gs2b is consistent with the originally input blurred image; finally, the obtained adversarial loss and inter-domain cycle-consistency loss are back-propagated to adjust the network parameters and optimize the network;
sharp-blur-sharp training cycle: a sharp image is selected from the input data set and input to the sharp-to-blur generator Gs2b to obtain the corresponding synthesized blurred image, which is then input to the blur discriminator Db; the evaluation of Db on the synthesized image yields the adversarial loss value; the goal of the sharp-to-blur process is that the image generated by the sharp-to-blur generator Gs2b is identified by the blur discriminator Db as a real blurred image; meanwhile, the synthesized blurred image is input to the blur-to-sharp generator Gb2s to obtain the synthesized sharp image generated from the synthesized blurred image, i.e. Gb2s(Gs2b(s)), and the value of the inter-domain cycle-consistency loss; the goal of the sharp-blur-sharp training cycle is that a sharp image passed through the sharp-to-blur generator Gs2b and then through the blur-to-sharp generator Gb2s is consistent with the originally input sharp image; finally, the obtained adversarial loss and inter-domain cycle-consistency loss are back-propagated to adjust the network parameters and optimize the network;
step 5-3: and training the discriminator network and the generator network in sequence, and finishing the training when the discriminator network and the generator network reach a Nash equilibrium state.
2. The blind deblurring method for image motion based on loop-generated countermeasure network of claim 1, wherein λ is 10.0.
3. The image motion blind deblurring method based on the loop generation countermeasure network of claim 1, wherein the hyper-parameters for the optimization training of the network with the Adam optimization algorithm in step 5 are set as follows: the learning rate is 1 × 10^-4, the number of epochs (passes over all training samples) is 300, the learning rate is kept constant for the first 150 epochs and decays linearly to zero over the last 150 epochs, and the batch size of each training batch is set to 1.
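The schedule in claim 3 maps directly onto a LambdaLR scheduler; a sketch is below (the optimizer's β parameters are left at PyTorch defaults, which the claim does not specify).

```python
import torch

# Claim 3's schedule: lr = 1e-4, 300 epochs, constant for the first 150
# epochs, then linear decay to zero over the last 150; batch size 1.
params = [torch.nn.Parameter(torch.zeros(1))]
opt = torch.optim.Adam(params, lr=1e-4)

def lr_lambda(epoch):
    # multiplier applied to the base learning rate at a given epoch
    return 1.0 if epoch < 150 else (300 - epoch) / 150.0

sched = torch.optim.lr_scheduler.LambdaLR(opt, lr_lambda)

lrs = []
for _ in range(300):
    opt.step()  # one epoch of parameter updates would go here
    lrs.append(opt.param_groups[0]["lr"])
    sched.step()
```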
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011484067.9A CN112508817B (en) | 2020-12-16 | 2020-12-16 | Image motion blind deblurring method based on cyclic generation countermeasure network |
Publications (2)
Publication Number | Publication Date |
---|---|
CN112508817A true CN112508817A (en) | 2021-03-16 |
CN112508817B CN112508817B (en) | 2024-05-14 |
Family
ID=74972447
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202011484067.9A Active CN112508817B (en) | 2020-12-16 | 2020-12-16 | Image motion blind deblurring method based on cyclic generation countermeasure network |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN112508817B (en) |
Cited By (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113538263A (en) * | 2021-06-28 | 2021-10-22 | 江苏威尔曼科技有限公司 | Motion blur removing method, medium, and device based on improved DeblurgAN model |
CN113689348A (en) * | 2021-08-18 | 2021-11-23 | 中国科学院自动化研究所 | Multitask image restoration method, multitask image restoration system, electronic device and storage medium |
CN113763282A (en) * | 2021-09-22 | 2021-12-07 | 北京中电兴发科技有限公司 | Fuzzy image generation method for license plate image |
CN113947589A (en) * | 2021-10-26 | 2022-01-18 | 北京理工大学 | Missile-borne image deblurring method based on countermeasure generation network |
CN114820389A (en) * | 2022-06-23 | 2022-07-29 | 北京科技大学 | Face image deblurring method based on unsupervised decoupling representation |
CN114913095A (en) * | 2022-06-08 | 2022-08-16 | 西北工业大学 | Depth deblurring method based on domain adaptation |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108376387A (en) * | 2018-01-04 | 2018-08-07 | 复旦大学 | Image deblurring method based on polymerization expansion convolutional network |
CN110378844A (en) * | 2019-06-14 | 2019-10-25 | 杭州电子科技大学 | Motion blur method is gone based on the multiple dimensioned Image Blind for generating confrontation network is recycled |
CN111199522A (en) * | 2019-12-24 | 2020-05-26 | 重庆邮电大学 | Single-image blind motion blur removing method for generating countermeasure network based on multi-scale residual errors |
CN111223062A (en) * | 2020-01-08 | 2020-06-02 | 西安电子科技大学 | Image deblurring method based on generation countermeasure network |
CN111612703A (en) * | 2020-04-22 | 2020-09-01 | 杭州电子科技大学 | Image blind deblurring method based on generation countermeasure network |
2020-12-16: CN application CN202011484067.9A filed; granted as CN112508817B (status: active)
Non-Patent Citations (2)
Title |
---|
LUO QIBIN; CAI QIANG: "Blind motion deblurring of images using a dual-framework generative adversarial network", Journal of Graphics, no. 06, 15 December 2019 (2019-12-15) *
GU JING; WANG QIWEN; ZHANG MIN; WANG JINJIN: "Weld defect detection and recognition based on the DenseNet network", Transducer and Microsystem Technologies, no. 09, 26 August 2020 (2020-08-26) *
Cited By (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113538263A (en) * | 2021-06-28 | 2021-10-22 | 江苏威尔曼科技有限公司 | Motion blur removing method, medium, and device based on improved DeblurgAN model |
CN113689348A (en) * | 2021-08-18 | 2021-11-23 | 中国科学院自动化研究所 | Multitask image restoration method, multitask image restoration system, electronic device and storage medium |
CN113689348B (en) * | 2021-08-18 | 2023-12-26 | 中国科学院自动化研究所 | Method, system, electronic device and storage medium for restoring multi-task image |
CN113763282A (en) * | 2021-09-22 | 2021-12-07 | 北京中电兴发科技有限公司 | Fuzzy image generation method for license plate image |
CN113763282B (en) * | 2021-09-22 | 2023-07-14 | 北京中电兴发科技有限公司 | Fuzzy image generation method of license plate image |
CN113947589A (en) * | 2021-10-26 | 2022-01-18 | 北京理工大学 | Missile-borne image deblurring method based on countermeasure generation network |
CN114913095A (en) * | 2022-06-08 | 2022-08-16 | 西北工业大学 | Depth deblurring method based on domain adaptation |
CN114913095B (en) * | 2022-06-08 | 2024-03-12 | 西北工业大学 | Depth deblurring method based on domain adaptation |
CN114820389A (en) * | 2022-06-23 | 2022-07-29 | 北京科技大学 | Face image deblurring method based on unsupervised decoupling representation |
CN114820389B (en) * | 2022-06-23 | 2022-09-23 | 北京科技大学 | Face image deblurring method based on unsupervised decoupling representation |
Also Published As
Publication number | Publication date |
---|---|
CN112508817B (en) | 2024-05-14 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN112508817B (en) | Image motion blind deblurring method based on cyclic generation countermeasure network | |
CN110378844B (en) | Image blind motion blur removing method based on cyclic multi-scale generation countermeasure network | |
CN111861894B (en) | Image motion blur removing method based on generation type countermeasure network | |
CN112884671B (en) | Fuzzy image restoration method based on unsupervised generation countermeasure network | |
CN109087255A (en) | Lightweight depth image denoising method based on mixed loss | |
CN112435187A (en) | Single-image blind motion blur removing method for generating countermeasure network based on aggregation residual | |
CN112288627B (en) | Recognition-oriented low-resolution face image super-resolution method | |
CN111738953A (en) | Atmospheric turbulence degraded image restoration method based on boundary perception counterstudy | |
CN111462206A (en) | Monocular structure light depth imaging method based on convolutional neural network | |
CN111833267A (en) | Dual generation countermeasure network for motion blur restoration and operation method thereof | |
CN113822830A (en) | Multi-exposure image fusion method based on depth perception enhancement | |
CN109376787A (en) | Manifold learning network and computer visual image collection classification method based on it | |
Zhao et al. | A simple and robust deep convolutional approach to blind image denoising | |
Yue et al. | High iso jpeg image denoising by deep fusion of collaborative and convolutional filtering | |
CN117670961B (en) | Low-altitude remote sensing image multi-view stereo matching method and system based on deep learning | |
Lai et al. | Mixed attention network for hyperspectral image denoising | |
CN114283058A (en) | Image super-resolution reconstruction method based on countermeasure network and maximum mutual information optimization | |
CN116777745A (en) | Image super-resolution reconstruction method based on sparse self-adaptive clustering | |
CN114820389B (en) | Face image deblurring method based on unsupervised decoupling representation | |
CN111524060A (en) | System, method, storage medium and device for blurring portrait background in real time | |
CN113129237B (en) | Depth image deblurring method based on multi-scale fusion coding network | |
Xu et al. | Blind image quality assessment by pairwise ranking image series | |
Hua et al. | An Efficient Multiscale Spatial Rearrangement MLP Architecture for Image Restoration | |
CN113645419A (en) | Image processing method and device, electronic equipment and computer readable storage medium | |
Qian et al. | Dense connected residual generative adversarial network for single image deblurring |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||