CN111445465A

CN111445465A - Light field image snowflake or rain strip detection and removal method and device based on deep learning

Info

Publication number: CN111445465A
Application number: CN202010243333.2A
Authority: CN
Inventors: 晏涛; 丁宇阳; 李明悦
Original assignee: Jiangnan University
Current assignee: Jiangnan University
Priority date: 2020-03-31
Filing date: 2020-03-31
Publication date: 2020-07-24
Anticipated expiration: 2040-03-31
Also published as: CN111445465B

Abstract

The invention discloses a method and a system for detecting and removing snowflakes or rain belts of light field images based on deep learning, wherein the method comprises the following steps: detecting the position and the size of the snowflake or the rain strip in the acquired first light field image by using a 3D residual error network; removing the detected snowflakes or rain belts by adopting a 3D U type network to obtain a second light field image; acquiring a second light field image and a third light field image, and optimizing a discriminator and an objective function by judging the authenticity of the second light field image and the third light field image; evaluating the quality of the generated second light field image through the peak signal-to-noise ratio and the structural similarity index, and if the evaluation result does not meet the requirement, executing the steps S11, S13 and S15 again until the evaluation result meets the requirement; the scheme directly detects the data of the snowflakes or the rain strips, detects and removes the snowflakes or the rain strips by utilizing the characteristic that the light field image has rich three-dimensional structure information, automatically and iteratively updates the intermediate implicit parameters of the neural network model, and has good snowflake or rain strip removing effect.

Description

Light field image snowflake or rain strip detection and removal method and device based on deep learning

Technical Field

The invention relates to the technical field of computer image processing, in particular to a method and equipment for detecting and removing snowflakes or rain strips of a light field image based on deep learning.

Background

With the tremendous development of the field of computer science under the support of high-precision technology, the research in the field of computer vision also gets a great progress. In the field of computer vision, the light field imaging technology is widely applied. Compared with the traditional camera, the light field camera can acquire four-dimensional information of a scene, including two-dimensional space information and two-dimensional angle information, through single exposure, so that richer image information can be acquired in the image reconstruction process, and in addition, the problems of out-of-focus of images, excessive background targets and the like in special occasions can be solved through a digital refocusing technology; realizing 'perspective' monitoring by a synthetic aperture technology; after the method is fused with the microscopic technology, a multi-view large-depth-of-field microscopic image and a reconstructed three-dimensional stereogram can be obtained. With the intensive research on light field technology, light field image processing is gradually concerned by experts and scholars at home and abroad, and is used as an important technical means for three-dimensional scene perception in the industry represented by automatic driving. The research of the light field image processing focuses on aspects of depth estimation, super-resolution, image restoration and the like.

In the field of computer vision, due to the complexity of illumination, shading and the shape and color of the snowflake or the rain belt, a snowflake or rain belt removing algorithm on a single image can only detect and remove the snowflake or the rain belt according to the color and shape information in the image, so that the method has great limitation and poor snowflake or rain belt removing effect.

In summary, a scheme for removing snow or rain belts with a good snow or rain belt removing effect is lacked in the prior art.

Disclosure of Invention

The embodiment of the invention provides a method and a system for detecting and removing snowflakes or rain belts in a light field image based on deep learning, and aims to solve the technical problem that a scheme for removing the snowflakes or the rain belts with a good snowflakes or rain belt removing effect is lacked in the prior art.

In a first aspect, a method for detecting and removing snowflakes or rain strips in a light field image based on deep learning is provided according to an embodiment of the present application, and includes:

s11, detecting the position and the size of the snowflake or the rain belt in the acquired first light field image by the detector by using a 3D residual error network;

s13, removing the detected snowflakes or rain belts in the first light field image by self-learning by using a 3D U type network by a generator to obtain a second light field image;

step S15, the discriminator acquires the second light field image and the third light field image, and optimizes the objective function of the discriminator by judging and distinguishing the authenticity of the second light field image and the third light field image;

step S17, evaluating the quality of the generated second light field image through the peak signal-to-noise ratio and the structural similarity, and if the evaluation result does not meet the requirement, executing the steps S11, S13 and S15 again until the evaluation result meets the requirement;

wherein the third light field image is a light field image without snowflakes or rain strips in the same scene as the first light field image, and is also called a true value of the second light field image.

In one embodiment, before the step S11, the method further includes:

the first light field image and the third light field image are acquired.

In one embodiment, the detecting the position and size of the snowflakes or rain strips in the acquired first light field image by using the 3D residual network includes:

and obtaining a mask of the snow or rain strip data of the first light field image by using the 3D residual error network.

In one embodiment, the obtaining a mask of snowflake or rainstrip data of the first light-field image using the 3D residual network comprises:

setting a residual block according to the 3D residual network;

setting parameters of a first convolution layer and a second convolution layer, wherein each residual block uses the first convolution layer and the second convolution layer respectively;

after the first convolution layer and the second convolution layer are used, a standardization operation and a long-time and short-time memory network are added, and a mask of snow or rain strip data of the first light field image is obtained.

In one embodiment, the removing the detected snowflakes or rain strips from the first light-field image by using a 3D U type network to obtain a second light-field image includes:

from the mask, snow or rain flakes in the first light field image are removed using a 3D U type network, resulting in a second light field image.

In one embodiment, said removing snowflakes or rain strips from the first light field image using a 3D U type network to obtain a second light field image comprises:

dividing a 3D U type network into an encoder network and a decoder network according to the position relation;

respectively adopting an encoder network and a decoder network to encode and decode snowflake or rain belt data;

and adopting pooling layer down-sampling to set an output function to obtain a second light field image.

In one embodiment, the objective function of the discriminator is:

wherein, V_TIs the true value of the 3D snow-free EPI volume block for the virtual scene,

for the 3DEPI volume block with the added snowflake or raindrop data distribution mask, D represents the discriminator and G represents the generator.

In a second aspect, a snow or rain strip detection and removal system for a light field image based on deep learning is provided according to an embodiment of the present invention, including:

the detection module is used for detecting the position and the size of the snowflakes or the rain strips in the acquired first light field image by using the 3D residual error network;

the removing module is used for removing the detected snowflakes or rain belts in the first light field image by the self-learning by adopting a 3D U type network to obtain a second light field image;

the discrimination module is used for acquiring the second light field image and the third light field image and optimizing the objective function of the discriminator by judging and distinguishing the authenticity of the second light field image and the third light field image;

the evaluation module is used for evaluating the quality of the generated second light field image through the peak signal-to-noise ratio and the structural similarity, and if the evaluation result does not meet the requirement, the steps S11, S13 and S15 are executed again until the evaluation result meets the requirement;

and the third light field image is a light field image without snowflakes or rain strips under the same scene as the first light field image.

In a third aspect, a light field image snow or rain strip detection and removal device based on deep learning is provided according to an embodiment of the present invention, including:

the detector is used for detecting the position and the size of the snowflakes or the rain strips in the acquired first light field image by using the 3D residual error network;

the generator is used for removing the detected snowflakes or rain belts in the first light field image by adopting a 3D U type network to obtain a second light field image;

the discriminator acquires the second light field image and the third light field image, and optimizes the objective function of the discriminator by judging and distinguishing the truth of the second light field image and the third light field image;

the evaluator evaluates the quality of the generated second light field image through the peak signal-to-noise ratio and the structural similarity, and controls the detector, the generator and the discriminator to work again if the evaluation result does not meet the requirement until the evaluation result meets the requirement;

In a fourth aspect, according to an embodiment of the invention, a computer-readable storage medium is provided, on which a computer program is stored, which, when being executed by a processor, carries out the steps of the method according to any one of claims 1 to 7.

According to the method and the system for detecting and removing the snowflakes or the rain belts of the light field image based on the deep learning, the snowflakes or the rain belts of the first light field image are directly detected, the snowflakes or the rain belts are detected and removed by utilizing the characteristic that the light field image has rich three-dimensional structure information, and the snowflakes or the rain belts are better in data removal effect; according to the scheme, the snow or rain belt data are removed by adopting a deep learning self-learning method, intermediate parameters do not need to be estimated, the end-to-end snow or rain belt data removal can be realized by directly adopting a deep learning-based light field image snow or rain belt detection and removal system, and the snow or rain belt removal effect is better; meanwhile, in the detector and the generator, after each convolution layer, a batch of standardized operation and a long-time and short-time memory network are added, and the batch of standardized layers are used for regularizing the model, accelerating the training of the model and avoiding gradient disappearance; the long-time memory network is used for strengthening the relation between adjacent viewpoints, and the information of the adjacent viewpoints is used for better processing the shielded snowflake or rain zone area, so that the more complex texture information of the background can be recovered.

Drawings

The accompanying drawings, which are included to provide a further understanding of the invention and are incorporated in and constitute a part of this specification, illustrate embodiments of the invention and together with the description serve to explain the invention and not to limit the invention. In the drawings:

FIG. 1 is a flow chart of a deep learning-based light field image snowflake or rainstrip detection and removal method according to an embodiment of the present invention;

FIG. 2 is a flow chart of another method for detecting and removing snow or rain flakes from a light field image based on deep learning according to an embodiment of the present invention;

FIG. 3 is a flowchart of a snow or rain strip detection and removal method for a light field image based on deep learning according to another embodiment of the present invention;

FIG. 4 is a schematic structural diagram of a snow or rain strip detection and removal system for a light field image based on deep learning according to an embodiment of the present invention;

fig. 5 is a schematic structural diagram of a snow or rain strip detection and removal device for a light field image based on deep learning according to an embodiment of the present invention.

Detailed Description

The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are some, not all, embodiments of the present invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.

In a first aspect, a method for detecting and removing snowflakes or rain strips in a light field image based on deep learning is provided according to an embodiment of the present invention, as shown in fig. 1, which may include the following steps:

in an embodiment of the present application, the first light field image may be a light field image acquired by a light field camera under a light field of a virtual scene. And detecting the position and the size of the snowflakes or the rain strips in the acquired first light field image, wherein two continuous convolutions can be used, and batch standardization operation and a long-time and short-time memory network are added after each convolution, so that the position and the size of the snowflakes or the rain strips are detected.

S13, removing the detected snowflakes or rain belts in the first light field image by the generator by adopting a 3D U type network to obtain a second light field image;

in the present embodiment, the snow or rain flakes detected in step S11 are removed, and a 3D U type network is used, and similarly two successive convolutions are used, with a batch of normalization operations and a long-term memory network added after each convolution.

in this embodiment, the step of distinguishing the authenticity of the second light field image and the third light field image may adopt a plurality of convolution layers, and the generated output is compared with the third light field image at the last convolution layer to obtain the corresponding output parameter.

the third light field image is a light field image without snowflakes or rain strips in the same scene as the first light field image, and can also be called a true value of the second light field image.

In the embodiment of the application, a self-learning depth algorithm is adopted, and if the evaluation result of the step S17 does not meet the preset requirement, the steps S11, S13 and S15 are automatically and repeatedly executed until the evaluation result meets the preset requirement.

In the embodiment of the application, a self-learning depth algorithm is adopted, after the discriminator acquires the second light field image and the third light field image, the authenticity of the second light field image and the third light field image can be discriminated, the generated snow-free image (the second light field image) and the real snow-free image (the third light field image) can respectively output a value from 0 to 1 through the discriminator, the closer the value is to 1, the closer the second light field image and the third light field image are, namely, the better the snow or rain strip removing effect is. The quality of the generated snow-free image is quantitatively evaluated using a peak signal-to-noise ratio (PSNR) and a Structural Similarity (SSIM), and if the evaluation result does not satisfy the requirement of the objective function, steps S11, S13, S15 are performed again.

In the embodiment of the present application, referring to fig. 2, before the step S11, the method further includes:

the method comprises the steps of S10, acquiring a first light field image and a third light field image, constructing a light field data set, constructing a virtual light field camera by using a blender to render a virtual scene data set, dividing the data into preset sizes (such as 256 × 256) to obtain a final virtual scene light field data set, acquiring a virtual scene light field image to obtain a first light field image, and acquiring a real scene light field data set by using a light field camera to obtain a third light field image.

In this embodiment of the application, in step S11, the detecting, by using a 3D residual network, the position and size of the snowflakes or the rain strips in the acquired first light field image includes:

and obtaining a mask of the snow or rain strip data of the first light field image by using the 3D residual error network. The method specifically comprises the following steps:

1) setting a residual block according to the 3D residual network;

2) setting parameters of a first convolution layer and a second convolution layer, wherein each residual block uses the first convolution layer and the second convolution layer respectively;

after the first convolution layer and the second convolution layer are used, batch standardization operation and a long-time and short-time memory network are added, and a mask of snow or rain belt data of the first light field image is obtained.

Specifically, the residual block can be set according to preset requirements, then each residual block of the 3D residual network is set to use two continuous convolution layers, the convolution kernel size of each convolution layer can be set to be 3 × 3 × 3, the excitation function of the first convolution layer is set to be Relu, and the excitation function of the second convolution layer is set to be Sigmoid, and a standardized operation and a long-term memory network are added behind the first convolution layer and the second convolution layer respectively, wherein the standardized operation is used for regularizing the model, training of the model is accelerated, and gradient disappearance is avoided, the long-term memory network is used for reinforcing the relation between adjacent viewpoints, for an image, a common long-term memory network is difficult to extract corresponding features, because the image is three-dimensional, and the convolutional operation can just make up for the shortages of the long-term memory network, and hidden snow or rain belt regions are better processed through the information of the adjacent viewpoints, so that texture information with a more complex background can be recovered, and the formula is as follows:

wherein, represents the convolution of the data,

which means multiplication of corresponding elements of the matrix, where i, f, C, o, H, X are all three-dimensional tensors. X is the input, here representing a light field-line sub-viewpoint 3D EPI volume, f represents the forgetting gate, which reads H_t-1And X_tOutputting a value between 0 and 1 to each of the at least one neuron states C_t-1The number in (1) indicates complete retention, and 0 indicates complete rejection. Wherein H_t-1Representing the output of the last neuron, X_tThe input to the current neuron is indicated. i.e. i_tC_tForming input gates, output gates determining how much new information to add to the network, i_tRepresenting a decision which information needs to be updated, o_tH_tAnd forming an output gate, and determining what value is finally output, wherein sigma represents a Sigmoid function, W is a corresponding coefficient, and b is a deviation.

The loss function in step S11 is:

wherein,

is a row of sub-point 3DEPI volume blocks, V, of the t-axis of the fixed light field image_MIs a true value, F, of a snowflake or rain strip 3D mask volume block of a virtual scene_mIs the detector of the present invention, using L2 loss as a function of loss L_ML2 distance, L, of 3D EPI volume representing generated snowflake or rain strip distribution mask and 3D EPI of real snowflake or rain strip distribution mask_MThe smaller the representation, the closer the generated result is to the true value.

In this embodiment, referring to fig. 3, the step S13 may specifically include:

step S131, dividing a 3D U type network into an encoder network and a decoder network according to an operation principle; for example, the encoder is located on the left and the decoder network may be located on the right.

S132, respectively adopting an encoder network and a decoder network to encode and decode snowflake or rain strip data; each convolutional layer of the encoding part reduces the width and height of the 3D EPI volume to half of the original, respectively, increases the number of channels to twice of the original, and the decoder part restores the feature map.

If the size of each convolution kernel is set to be 3 × 3 × 3, the second light field image with the value range of 0 to 1 is obtained by the pooling layer down-sampling, the excitation function is Relu, the excitation function of the last layer is Sigmoid, and similarly, a standardized operation and a long-time and short-time memory network are added behind each convolution layer, wherein in the step S13, the loss function is as follows:

wherein

3DEPI volume block V added with snowflake or rain strip data distribution mask_TIs the true value of the 3D snow-free EPI volume for a virtual scene, Fs is the generator of the present invention, using L2 losses as the loss function L_SRepresentsThe L2 distance, L, of the resulting snow-free 3DEPI volume and the true snow-free 3D EPI is shown_SThe smaller the representation, the closer the generated result is to the true value. According to the scheme, the judgment network is used for judging the authenticity of the snow-free image generated by distinguishing so as to achieve the aim of obtaining a more real snow-free image.

In the embodiment of the present application, the discriminator is used for judging whether the snow-free image is true or false, and the adopted function model is as follows:

the snow-free 3D EPI volume is generated, wherein D in formula (3) is the discriminator and relevant parameters thereof, and G is the generator and relevant parameters thereof.

Once a satisfactory detector, generator and discriminator are obtained through learning training, in an actual test, only the steps performed by the detector and the generator need to be performed, so that a snow-free (rainstreak-free) second light field image as a target can be obtained from the first light field image. The arbiter need not be executed any more.

As follows, a specific example is illustrated:

firstly, a virtual light field camera is constructed by using a blender to render a virtual scene data set, the light field data is divided into 256 × 256 sizes to obtain a final virtual scene light field data set, and a first light field image is acquired;

then, the first light field image is processed as a stack of 3D EPI volume blocks, and four-dimensional parameters (depth, weight, height, channel) of the volume blocks are obtained, where depth is the number of sub-viewpoints of one line of the light field fixed t, set to 9, weight is the width of the sub-viewpoint image, set to 256, height is the height of the sub-viewpoint image, set to 256, and channel is the number of channels of the image, here, RGB three channels.

Continuously carrying out batch standardization on the light field data in the four-dimensional data form after convolution operation, and then strengthening the relation between adjacent viewpoints through a long-time and short-time memory network to obtain a mask of snowflake or rain belt data distribution in the first light field image; and adding the obtained mask to the original image 3DEPI volume block through connection operation, reducing the width and height of the first light field image to one half of the original width and height by convolution of each layer of the coding part, increasing the number of channels to two times of the original width and height, restoring the first light field image by the decoder part to finally obtain a second light field image with a value range of 0 to 1, and finally evaluating the second light field image.

The light field image snowflake or rainstrip detection and removal system based on deep learning provided by the embodiment of the invention directly detects the snowflake or rainstrip data of the first light field image, detects and removes the snowflake or rainstrip data by utilizing the characteristic that the light field image has rich three-dimensional structure information, and has a good snowflake or rainstrip data removal effect; according to the scheme, the snow or rain belt data are removed by adopting a deep learning self-learning method, intermediate parameters do not need to be estimated, the end-to-end snow or rain belt data removal can be realized by directly adopting a deep learning-based light field image snow or rain belt detection and removal system, and the snow or rain belt removal effect is better; meanwhile, in the detector and the generator, after each convolution layer, a batch of standardized operation and a long-time and short-time memory network are added, and the batch of standardized layers are used for regularizing the model, accelerating the training of the model and avoiding gradient disappearance; the long-time memory network is used for strengthening the relation between adjacent viewpoints, and the information of the adjacent viewpoints is used for better processing the shielded snowflake or rain zone area, so that the more complex texture information of the background can be recovered.

Fig. 4 is a schematic structural diagram of a snow or rain strip detection and removal system for a light field image based on deep learning according to an embodiment of the present invention. The system as shown in fig. 4 may include:

a detection module 41, configured to detect a position and a size of a snowflake or a rain strip in the acquired first light field image by using a 3D residual network;

a removing module 42, configured to remove the detected snowflakes or rain strips in the first light field image by using a 3D U-type network, so as to obtain a second light field image;

a distinguishing module 43, configured to obtain the second light field image and the third light field image, and optimize an objective function of the discriminator by determining whether the second light field image and the third light field image are true or false

The evaluation module 44 evaluates the quality of the generated second light field image through peak signal-to-noise ratio (PSNR) and Structural Similarity (SSIM), and if the evaluation result does not meet the requirement, performs steps S11, S13, and S15 again until the evaluation result meets the requirement;

The light field image snowflake or rain strip detection and removal system based on deep learning provided by the embodiment of the invention can realize each process in the method embodiments of fig. 1 to 3, and is not repeated here for avoiding repetition.

The light field image snowflake or raindrop detection and removal system based on deep learning provided by the embodiment of the invention directly detects the snowflake or raindrop data of the first light field image, detects and removes the snowflake or raindrop data by utilizing the characteristic that the light field image has rich three-dimensional structure information, and has a good snowflake or raindrop data removal effect; according to the scheme, the snow or rain belt data are removed by adopting a deep learning self-learning method, intermediate parameters do not need to be estimated, the end-to-end snow or rain belt data removal can be realized by directly adopting a deep learning-based light field image snow or rain belt detection and removal system, and the snow or rain belt removal effect is better; meanwhile, in the detector and the generator, after each convolution layer, a batch of standardized operation and a long-time and short-time memory network are added, and the batch of standardized layers are used for regularizing the model, accelerating the training of the model and avoiding gradient disappearance; the long-time memory network is used for strengthening the relation between adjacent viewpoints, and the information of the adjacent viewpoints is used for better processing the shielded snowflake or rain zone area, so that the more complex texture information of the background can be recovered.

Corresponding to the method and system for detecting and removing the snowflakes or the rain belts of the light field image based on the deep learning provided by the embodiment of the present invention, the embodiment of the present invention provides a device for detecting and removing the snowflakes or the rain belts of the light field image based on the deep learning, as shown in fig. 5, including a detector 51, a generator 52, a discriminator 53 and an evaluator 54, wherein:

a detector 51 for detecting the position and size of a snowflake or a rain strip in a first acquired light field image by using a 3D residual error network; the detector may use a 3D residual network to obtain a mask of snowflake or rainstrip data of the first light field image;

the generator 52 removes the detected snowflakes or rain bands in the first light field image by adopting a 3D U type network to obtain a second light field image, the generator can take a snowflake or rain band data mask output by the detector as one input, the first light field image as the other input, the generator network can be divided into a left encoder network and a right decoder network, the width and the height of the first light field image are reduced to one half of the original width and the height of the first light field image by convolution of each layer of the encoder network, the number of channels is increased to two times, the first light field image is reduced by the decoder part, the convolution kernel size of each layer is 3 × 3 × 3, the laminated image without the snowflakes or the rain bands is obtained by downsampling through layers of pooling, the excitation function is Relu, the excitation function of the last layer is Sigmoid, and the similar laminated image without the snowflakes or the rain bands with the value range of 0-1 is obtained, and batch standardization operation and the long-.

A discriminator 53 for acquiring the second light field image and the third light field image, and optimizing an objective function of the discriminator by judging whether the second light field image and the third light field image are true or false;

the evaluator 54 evaluates the quality of the generated second light field image through the peak signal-to-noise ratio and the structural similarity, and if the evaluation result does not meet the requirement, the steps S11, S13 and S15 are executed again until the evaluation result meets the requirement;

in the embodiment of the invention, the discriminator and the generator can be alternately trained until the discriminator can not distinguish the generated snow-free image from the real snow-free image; and the third light field image is a light field image without snowflakes or rain strips under the same scene as the first light field image.

The arbiter may contain 6 convolutional layers with convolutional kernels of 3 × 3 × 3, the last layer compares the resulting output with the original snow-free image using a fully connected layer, evaluates the detector-generator network for goodness and badness, and adjusts the network parameters accordingly.

The penalty function for the discriminator is:

where D is the discriminator and G is the generator.

The light field image snowflake or rain strip detection and removal device based on deep learning provided by the embodiment of the invention has the following advantages:

(1) the invention uses the light field image as input, detects and removes the snowflakes or the rain strips by using the characteristic that the light field image has abundant three-dimensional structure information, and has better effect than the snowflakes or the rain strips removing effect of a single image.

(2) The invention constructs a network model consisting of a detector, a generator and a discriminator based on deep learning, directly converts an image with snowflakes or a rain belt into an image without snowflakes or a rain belt through the network model, realizes end-to-end image snowflakes or rain belt removal, does not need to estimate intermediate parameters, and can obtain good snowflakes or rain belt removal effect.

(3) Adding batch standardization operation and a long-time memory network behind each convolution layer of the detector and the generator, wherein the batch standardization layer is used for regularizing the model, accelerating the training of the model and avoiding gradient disappearance; the long-time memory network is used for strengthening the relation between adjacent viewpoints, and the information of the adjacent viewpoints is used for better processing the shielded snowflake or rain zone area, so that the more complex texture information of the background can be recovered.

(4) The discriminator compares the generated image part with the original snow-free image to judge the authenticity of the output image, which is helpful for optimizing the network model and generating a more real target image.

It is noted that the method and system for detecting and removing snow or rain strips in a light field image based on deep learning provided by the embodiments of the present invention are not limited to be used only for detecting and removing snow or rain strips in a light field image, and can also be used for detecting and removing light field images containing attachments during capturing. Therefore, the technical solution of only changing the object attached to the light field image acquired during the capturing process without any creative labor still belongs to the protection scope of the present invention.

The embodiment of the present invention further provides a computer-readable storage medium, where a computer program is stored on the computer-readable storage medium, and when the computer program is executed by a processor, the computer program implements the processes of the method embodiments, and can achieve the same technical effects, and in order to avoid repetition, the details are not repeated here. The computer-readable storage medium may be a Read-Only Memory (ROM), a Random Access Memory (RAM), a magnetic disk or an optical disk.

It should be noted that, in this document, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising an … …" does not exclude the presence of other like elements in a process, method, article, or apparatus that comprises the element.

Through the above description of the embodiments, those skilled in the art will clearly understand that the method of the above embodiments can be implemented by software plus a necessary general hardware platform, and certainly can also be implemented by hardware, but in many cases, the former is a better implementation manner. Based on such understanding, the technical solutions of the present invention may be embodied in the form of a software product, which is stored in a storage medium (such as ROM/RAM, magnetic disk, optical disk) and includes instructions for enabling a terminal (such as a mobile phone, a computer, a server, an air conditioner, or a network device) to execute the method according to the embodiments of the present invention.

While the present invention has been described with reference to the embodiments shown in the drawings, the present invention is not limited to the embodiments, which are illustrative and not restrictive, and it will be apparent to those skilled in the art that various changes and modifications can be made therein without departing from the spirit and scope of the invention as defined in the appended claims.

Claims

1. A method for detecting and removing snowflakes or rain belts of light field images based on deep learning is characterized by comprising the following steps:

2. The method according to claim 1, wherein before the step S11, further comprising:

the first light field image and the third light field image are acquired.

3. The method according to claim 1, wherein the detecting the position and size of the snowflakes or rain strips in the acquired first light field image by using the 3D residual network comprises:

4. The method of claim 3, wherein said deriving a mask of snowflake or rainribbon data for the first light-field image using a 3D residual network comprises:

setting a residual block according to the 3D residual network;

5. The method of claim 4, wherein said removing detected snow or rain flakes from the first light field image using a 3D U type network to obtain a second light field image comprises:

6. The method of claim 5 wherein said removing snowflakes or rain flakes from the first light field image using a 3D U type network to obtain a second light field image comprises:

7. The method of claim 1, wherein the objective function of the discriminator is:

wherein, V_TIs a true value, V, of the 3D snow-free EPI volume block of the virtual scene_tm*The snow-free 3D EPI volume block is generated, wherein D in the formula is a discriminator and related parameters thereof, and G is the generator and related parameters thereof.

8. A light field image snowflake or rain strip detection and removal system based on deep learning is characterized by comprising:

9. A light field image snowflake or rain strip detection and removal device based on deep learning is characterized by comprising:

10. A computer-readable storage medium, characterized in that a computer program is stored on the computer-readable storage medium, which computer program, when being executed by a processor, carries out the steps of the method according to any one of claims 1 to 7.