CN110097522A

CN110097522A - A single outdoor image defogging method based on a multi-scale convolutional neural network

Info

Publication number: CN110097522A
Application number: CN201910397724.7A
Authority: CN
Inventors: 张世辉; 桑榆; 陈宇翔; 张健
Original assignee: Yanshan University; Beijing Institute of Computer Technology and Applications
Current assignee: Yanshan University; Beijing Institute of Computer Technology and Applications
Priority date: 2019-05-14
Filing date: 2019-05-14
Publication date: 2019-08-06
Anticipated expiration: 2039-05-14
Also published as: CN110097522B

Abstract

The invention discloses a single outdoor image defogging method based on a multi-scale convolutional neural network, belonging to the field of computer vision. The method comprises the following steps: constructing a training sample set according to the atmospheric scattering model; building a multi-scale convolutional neural network based on deep learning; constructing an objective function for the network that has been built; and training the multi-scale convolutional neural network with the constructed objective function. The invention does not need prior knowledge of the outdoor image, and it effectively preserves image information such as edges, texture, color, contrast, and saturation.

Description

A single outdoor image defogging method based on a multi-scale convolutional neural network

Technical field

The invention belongs to the field of computer vision, and more particularly relates to a single outdoor image defogging method based on a multi-scale convolutional neural network.

Background

Fog is a common meteorological phenomenon formed by particles such as water vapor, dust, and smoke. Fog blurs the images processed by a vision system, lowers their contrast, and shifts their saturation. It also hinders visual tasks such as classification, recognition, detection, and tracking, and can even cause such tasks to fail. How to remove fog from outdoor images has therefore become a difficult problem in computer vision and has attracted wide attention from researchers.

Existing defogging methods handle two main types of input: outdoor video and single outdoor images. Video-based defogging methods are relatively few, mainly because they must first split the video into frames and then defog the frames one by one, which in essence is still single-image processing. Existing defogging methods are therefore usually built on single outdoor images. However, existing single-image methods suffer from problems such as requiring prior knowledge in advance, loss of edges and texture, and distortion of color, contrast, and saturation.

The dark channel prior method proposed by K. M. He and J. Sun in "Single image haze removal using dark channel prior. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops: IEEE Computer Society, 2009: 1956-1963" is one of the most classic methods in the fog removal field, but it needs prior knowledge of the image in advance. The improved dark channel prior method proposed by Chen Shuzhen and Ren Zhanguang in "Single image defogging algorithm based on improved dark channel prior and guided filtering. Acta Automatica Sinica, 2016, 42(3): 455-465" cannot adaptively choose the dark channel threshold or the maximum brightness of the mixed dark channel, and the defogged image shows color distortion. For the method proposed by C. Z. He and C. D. Zhang in "A haze density aware adaptive perceptual single image haze removal algorithm. Proceedings of the IEEE International Conference on Information and Automation, IEEE Computer Society, 2016: 1933-1938", the recovery of edge and texture information after defogging still needs improvement. The method proposed by B. Cai and X. Xu in "DehazeNet: An End-to-End System for Single Image Haze Removal. IEEE Transactions on Image Processing, 2016, 25(11): 5187-5198" suffers from low contrast, low saturation, and distorted edge information. The method proposed by Z. G. Ling and G. F. Fan in "Perception oriented transmission estimation for high quality image dehazing. Neurocomputing, 2017, 224(2): 82-95" suffers from contrast and saturation distortion. With the method proposed by Z. G. Li and H. Jing in "Single Image De-Hazing Using Globally Guided Image Filtering. IEEE Transactions on Image Processing, 2018, 27(1): 442-450", the defogged outdoor image shows color distortion and low contrast.

Summary of the invention

In view of the problems of existing single outdoor image defogging methods, the purpose of the present invention is to propose a single outdoor image defogging method based on a multi-scale convolutional neural network. A multi-scale convolutional neural network for defogging is built to learn the mapping between foggy images and fog-free images, and the network is trained so that its parameters are optimized step by step, thereby enabling defogging of any single outdoor image.

To solve the above technical problems, the technical solution adopted by the present invention is a single outdoor image defogging method based on a multi-scale convolutional neural network, characterized by comprising the following steps:

(1) Training sample acquisition: obtain fog-free image samples, apply fogging to the fog-free image samples using the atmospheric scattering model to obtain foggy image samples, and use the fog-free image samples and the corresponding foggy image samples as training samples;

(2) Multi-scale convolutional neural network model: construct no fewer than three convolutional layers; the output of each convolutional layer is connected to a Max Pooling layer, and the output of each Max Pooling layer is connected to a nonlinear mapping layer based on the rectified linear unit (ReLU); the outputs of all nonlinear mapping layers are connected to a feature fusion layer; the output of the feature fusion layer is connected to a bilateral filtering layer that processes the transmission, and the transmission output by the bilateral filtering layer is used to defog the foggy image sample input to the convolutional layers;

(3) Multi-scale convolutional neural network model training: take the foggy image samples from step (1) as the input of the multi-scale convolutional neural network model and the fog-free image samples from step (1) as the reference against which the network output is judged; train the model with the aim of minimizing the objective function and solve for its parameters; the objective function is:

where c_i, w_i and h_i are the average RGB value, average contrast, and average saturation corresponding to the i-th sample, respectively; the parameters of the multi-scale convolutional neural network are Φ, the i-th foggy image sample is I_i, the fog-free image sample corresponding to the i-th foggy image sample is J_i, and the number of training samples is N;

(4) Use the multi-scale convolutional neural network model solved in step (3) to defog the foggy image to be processed.

In a further technical solution, there are three convolutional layers, with convolution kernels of 7 × 7, 5 × 5 and 3 × 3 respectively; or there are four convolutional layers, with convolution kernels of 11 × 11, 7 × 7, 5 × 5 and 3 × 3 respectively.

In a further technical solution, the objective function is constructed from the mean squared error (MSE) and the L-2 norm.

In a further technical solution, the objective function is minimized by stochastic gradient descent.

In a further technical solution, in the sample acquisition of step (1), the foggy image sample corresponding to a fog-free image sample is computed as follows:

I_T(x)=J_T(x)t_T(x)+α_T(1-t_T(x))

where J_T(x) is the fog-free image sample (that is, the collected image), t_T(x) is the transmission, α_T is the global atmospheric light value, and I_T(x) is the foggy image sample.

In a further technical solution, the convolutional layer has the following form:

where I(x) is the foggy image sample to be defogged, q is the convolution kernel size, * is the convolution operation, and the other terms denote the convolutional layer filter, the convolutional layer bias, and the output of the convolutional layer of the multi-scale convolutional neural network.

In a further technical solution, the Max Pooling layer has the following form:

where the two terms denote the output of the pooling layer and the output of the convolutional layer of the multi-scale convolutional neural network.

In a further technical solution, the nonlinear mapping layer applies a nonlinear mapping to the dimension-reduced features to obtain the multi-scale feature maps; the constructed activation layer has the following form:

where the terms denote the activation layer filter, the activation layer bias, the output of the activation layer of the multi-scale convolutional neural network, and the output of its pooling layer.
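The chain of convolutional layer, Max Pooling layer and ReLU nonlinear mapping layer described above can be illustrated with a minimal NumPy sketch of one branch per kernel size. Several details are assumptions made only for illustration: a single-channel input, zero padding, a 2 × 2 pooling window, randomly initialized kernels, and an activation layer reduced to a plain ReLU (the activation layer filter and bias mentioned in the text are omitted). The function names are hypothetical.

```python
import numpy as np
from numpy.lib.stride_tricks import sliding_window_view

def conv2d_same(img, kernel, bias=0.0):
    """Single-channel convolution (CNN-style cross-correlation), zero padded
    so that the output has the same height and width as the input."""
    q = kernel.shape[0]                      # kernel size, assumed odd
    padded = np.pad(img, q // 2)             # zero padding on every side
    windows = sliding_window_view(padded, (q, q))   # shape (H, W, q, q)
    return np.einsum("hwij,ij->hw", windows, kernel) + bias

def max_pool(feat, size=2):
    """Non-overlapping max pooling; rows/columns that do not fill a
    complete window are dropped."""
    h = feat.shape[0] // size * size
    w = feat.shape[1] // size * size
    blocks = feat[:h, :w].reshape(h // size, size, w // size, size)
    return blocks.max(axis=(1, 3))

def relu(x):
    return np.maximum(x, 0.0)

def branch(img, kernel, bias=0.0):
    """One scale: convolution -> Max Pooling -> ReLU."""
    return relu(max_pool(conv2d_same(img, kernel, bias)))

rng = np.random.default_rng(0)
img = rng.random((16, 16))                   # stand-in for a foggy image
# One feature map per kernel size q in {7, 5, 3}; weights are random here.
features = {q: branch(img, 0.1 * rng.standard_normal((q, q))) for q in (7, 5, 3)}
```

Because every branch pads to the input size before pooling, the three feature maps share one shape, which is what a later fusion step would need.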

In a further technical solution, the feature fusion layer fuses the multi-scale feature maps to obtain the transmission; the multi-scale feature maps are fused as follows:

where λ₁, λ₂, …, λ_n are the feature map weight coefficients; h^(q), c^(q) and s^(q) are the average RGB value, average contrast value, and average saturation value of the feature map at each scale, respectively; and the remaining term is the output of the pooling layer of the n-scale convolutional neural network.

In a further technical solution, the bilateral filtering layer applies bilateral filtering to the transmission to obtain the refined transmission t(x), computed as follows:

d(ξ, y) = ||ξ − y||₂

where y is a pixel in the transmission map, ξ is a pixel adjacent to y, c(ξ, y) is the spatial weight function, σ_c is the variance between the two pixels, and d(ξ, y) is the distance function between the two pixels. The formula also uses the transmissions formed by the 8-neighborhood pixel blocks centered on pixels ξ and y respectively, a similarity weight function over those blocks, the variance σ_s between the two transmissions, and a distance function between the two transmissions.
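As a rough illustration of how a spatial weight and a similarity weight combine when refining a transmission map, the following NumPy sketch implements the standard bilateral filter. It is not the exact formula of this document: the Gaussian shape of both weights, the window radius, and the default values of `sigma_c` and `sigma_s` are assumptions, and the similarity weight compares single pixel values rather than the 8-neighborhood pixel blocks described above.

```python
import numpy as np

def bilateral_filter(t, radius=2, sigma_c=2.0, sigma_s=0.1):
    """Standard bilateral filter on a 2-D transmission map t.

    sigma_c controls the spatial weight, based on d(xi, y) = ||xi - y||_2.
    sigma_s controls the similarity weight, based on the value difference.
    """
    h, w = t.shape
    padded = np.pad(t, radius, mode="edge")
    num = np.zeros((h, w), dtype=np.float64)
    den = np.zeros((h, w), dtype=np.float64)
    for dy in range(-radius, radius + 1):
        for dx in range(-radius, radius + 1):
            # Neighbor xi = y + (dy, dx) for every pixel y at once.
            shifted = padded[radius + dy:radius + dy + h,
                             radius + dx:radius + dx + w]
            spatial = np.exp(-(dy * dy + dx * dx) / (2.0 * sigma_c ** 2))
            similar = np.exp(-((shifted - t) ** 2) / (2.0 * sigma_s ** 2))
            weight = spatial * similar
            num += weight * shifted
            den += weight
    return num / den   # den >= 1 because the center pixel has weight 1

# A sharp edge in the transmission map survives the smoothing.
step = np.zeros((6, 6))
step[:, 3:] = 1.0
refined = bilateral_filter(step)
```

The similarity weight is what keeps edges intact: neighbors on the other side of the step receive an almost zero weight, so smoothing happens only within regions of similar transmission.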

Compared with the prior art, the present invention has the following advantages:

(1) A multi-scale convolutional neural network is built based on deep learning, and the characteristics of the feature map at each scale are fully exploited, so that the color, contrast, saturation, and other information of the defogged outdoor image stay close to those of the original foggy outdoor image.

(2) The transmission obtained by feature map fusion is processed with bilateral filtering, so that the edge and texture information of the defogged outdoor image is preserved intact.

(3) An objective function is constructed from the MSE and the L-2 norm so that the multi-scale convolutional neural network is fitted effectively, achieving more effective removal of fog from outdoor images.

Brief description of the drawings

Fig. 1 is a flow chart of the defogging method proposed by the invention;

Fig. 2 is a schematic diagram of some of the training samples;

Fig. 3 is a schematic diagram of the structure of the 3-scale convolutional neural network;

Fig. 4 is a schematic diagram of the structure of the 4-scale convolutional neural network.

Detailed description of the embodiments

To make the technical solution of the present invention clearer, the invention is further described below with reference to the accompanying drawings.

The embodiment of the present invention describes in detail a single outdoor image defogging method based on a multi-scale convolutional neural network, characterized by comprising the following steps:

In the embodiment of the present invention there are three convolutional layers, with convolution kernels of 7 × 7, 5 × 5 and 3 × 3 respectively; or there are four convolutional layers, with convolution kernels of 11 × 11, 7 × 7, 5 × 5 and 3 × 3 respectively.

The objective function in the embodiment of the present invention is constructed from the mean squared error (MSE) and the L-2 norm.

The objective function in the embodiment of the present invention is minimized by stochastic gradient descent.

In the embodiment of the present invention, in the sample acquisition of step (1), the foggy image sample corresponding to a fog-free image sample is computed as follows:

I_T(x)=J_T(x)t_T(x)+α_T(1-t_T(x))

The convolutional layer in the embodiment of the present invention has the following form:

The Max Pooling layer in the embodiment of the present invention has the following form:

The nonlinear mapping layer in the embodiment of the present invention applies a nonlinear mapping to the dimension-reduced features to obtain the multi-scale feature maps; the constructed activation layer has the following form:

The feature fusion layer in the embodiment of the present invention fuses the multi-scale feature maps to obtain the transmission; the multi-scale feature maps are fused as follows:

where λ₁, λ₂, …, λ_n are the feature map weight coefficients; h^(q), c^(q) and s^(q) are the average RGB value, average contrast value, and average saturation value of the feature map at each scale, respectively; and the remaining term is the output of the pooling layer of the n-scale convolutional neural network.

The bilateral filtering layer in the embodiment of the present invention applies bilateral filtering to the transmission to obtain the refined transmission t(x), computed as follows:

d(ξ, y) = ||ξ − y||₂

As shown in Fig. 1, the single outdoor image defogging method based on a multi-scale convolutional neural network in the embodiment of the present invention comprises the following steps:

Step 1: Obtain training samples according to the atmospheric scattering model and construct the training sample data set.

1.1) Collect 3000 fog-free outdoor images of different scenes from the Internet.

1.2) For the 3000 collected outdoor images, let J_T(x) be the fog-free outdoor image (that is, the collected image), t_T(x) the transmission, α_T the global atmospheric light value, and I_T(x) the foggy outdoor image. Because the training samples should cover as many situations as possible, different values of t_T(x) are chosen while the global atmospheric light value α_T is held at a fixed value. The foggy outdoor image I_T(x) is defined as

I_T(x)=J_T(x)t_T(x)+α_T(1-t_T(x)) (1)

1.3) Traverse the 3000 outdoor images and obtain 3000 foggy outdoor images according to the formula above as training samples, thereby constructing the training sample data set.

Some of the training samples in the data set and their ground truth are shown in Fig. 2. The first column shows the fog-free outdoor images obtained from the Internet, and the second column shows the foggy outdoor images computed with formula (1).
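Step 1 can be sketched in a few lines of NumPy. The sketch applies formula (1) to each fog-free image to produce a (foggy, fog-free) training pair. The random arrays standing in for collected images, the transmission range [0.2, 0.9], the fixed value α_T = 1.0, and the function name `synthesize_fog` are all assumptions for illustration; the text only states that t_T(x) varies and α_T is fixed.

```python
import numpy as np

def synthesize_fog(clear, t, alpha=1.0):
    """Formula (1): I_T(x) = J_T(x) * t_T(x) + alpha_T * (1 - t_T(x)).

    clear: fog-free image J_T, float array in [0, 1], shape (H, W, 3)
    t:     transmission t_T, a scalar or an (H, W) map with values in (0, 1]
    alpha: global atmospheric light alpha_T (1.0 is an assumed fixed value)
    """
    t = np.asarray(t, dtype=np.float64)
    if t.ndim == 2:
        t = t[..., None]          # broadcast the map over the color channels
    return clear * t + alpha * (1.0 - t)

rng = np.random.default_rng(0)
# Random arrays stand in for the collected fog-free outdoor images.
clear_images = [rng.random((8, 8, 3)) for _ in range(3)]
# One transmission value per image; the range [0.2, 0.9] is an assumption.
training_pairs = [
    (synthesize_fog(img, rng.uniform(0.2, 0.9)), img) for img in clear_images
]
```

A transmission of 1 leaves the image unchanged, and smaller values pull every pixel toward the atmospheric light, which is why varying t_T(x) yields fog of different densities.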

Step 2: Build the multi-scale convolutional neural network.

2.1) The convolutional layers of the multi-scale convolutional neural network are composed of convolution kernels at three different scales: 7 × 7, 5 × 5 and 3 × 3. Let the training sample to be defogged be I(x), let q denote the convolution kernel size with q ∈ {7, 5, 3}, and let * denote the convolution operation. Given the convolutional layer filter and the convolutional layer bias, the output of the convolutional layer of the multi-scale convolutional neural network can be expressed as

2.2) The pooling layers of the multi-scale neural network are built with Max Pooling. Their purpose is to reduce the dimensionality of the features computed by the convolutional layers so as to obtain features with translation invariance and rotation invariance. The output of the pooling layer of the multi-scale convolutional neural network can be expressed as

2.3) The multi-scale convolutional neural network constructs its activation layer from the rectified linear unit (ReLU), which applies a nonlinear mapping to the dimension-reduced features to obtain the multi-scale feature maps. Given the activation layer filter and the activation layer bias, the output of the activation layer of the multi-scale convolutional neural network can be expressed as

2.4) After the multi-scale feature maps are obtained, the color, saturation, and contrast characteristics of the feature map at each scale are fully exploited and the feature maps of all scales are fused, thereby computing the transmission corresponding to the input image I(x). The fusion function is defined as

where λ, μ and γ are the feature map weight coefficients, and h^(q), c^(q) and s^(q) are the average RGB value, average contrast value, and average saturation value of the feature map at each scale, respectively.

2.5) Because the transmission computed by existing defogging methods is rather coarse, outdoor images after defogging commonly suffer from incompletely preserved boundaries and texture. Bilateral filtering is therefore used to process the transmission obtained by feature map fusion in order to obtain the refined transmission t(x). The bilateral filtering of the transmission can be expressed as

2.6) After the refined transmission t(x) is obtained, the global atmospheric light value α is chosen as the maximum brightness value over the pixels of the input image I(x). The atmospheric scattering model is then rearranged to compute the defogged outdoor image J(x), as follows

The structure of the multi-scale convolutional neural network is shown in Fig. 3.
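The last sub-step of Step 2, recovering J(x) from the refined transmission, follows from solving the atmospheric scattering model I(x) = J(x)t(x) + α(1 − t(x)) for J(x), which gives J(x) = (I(x) − α(1 − t(x))) / t(x). The sketch below is a minimal NumPy version of that rearrangement. Taking α as the maximum value over all pixels and channels is one reading of "maximum brightness value", and the lower bound `t_min` on the transmission is an added assumption to avoid division by very small values; neither detail is confirmed by the text.

```python
import numpy as np

def recover_scene(hazy, t, t_min=0.1):
    """Invert I = J*t + alpha*(1 - t) for the fog-free image J.

    hazy:  foggy image I, float array in [0, 1], shape (H, W, 3)
    t:     refined transmission map, shape (H, W)
    t_min: assumed lower bound on t to keep the division stable
    """
    alpha = hazy.max()                        # global atmospheric light
    t = np.clip(t, t_min, 1.0)[..., None]     # broadcast over channels
    scene = (hazy - alpha * (1.0 - t)) / t
    return np.clip(scene, 0.0, 1.0)

# Round trip on synthetic data: fog an image, then remove the fog again.
rng = np.random.default_rng(1)
clear = rng.random((6, 6, 3))
clear[0, 0, :] = 1.0                          # brightest pixel equals alpha = 1
t_map = rng.uniform(0.3, 0.9, size=(6, 6))
hazy = clear * t_map[..., None] + 1.0 * (1.0 - t_map[..., None])
restored = recover_scene(hazy, t_map)
```

The round trip only reproduces the original exactly because the synthetic image contains a pixel as bright as the atmospheric light; on real images the estimate of α is approximate.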

Step 3: Construct the objective function from the mean squared error (MSE) and the L-2 norm.

Single outdoor image defogging is a typical supervised learning problem, and supervised learning requires establishing the mapping G between the input of the convolutional neural network (the foggy outdoor image) and its output (the fog-free outdoor image). Let the parameters of the multi-scale convolutional neural network be Φ, the i-th training sample be I_i, the ground truth corresponding to the i-th training sample be J_i, and the number of training samples be N. The parameters Φ of the multi-scale convolutional neural network can be obtained by minimizing the objective function, which is constructed from the MSE and the L-2 norm and has the form

where c_i, w_i and h_i are the average RGB value, average contrast, and average saturation corresponding to the i-th sample, respectively.
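For orientation only, the sketch below shows the plain form that an objective built from the MSE and the L-2 norm commonly takes: the mean over the N samples of the squared L-2 norm of the difference between the network output G(I_i; Φ) and the ground truth J_i. This is an assumed baseline, not the objective of this document, whose formula also involves the per-sample terms c_i, w_i and h_i in a way that cannot be recovered from the text. The function name is hypothetical.

```python
import numpy as np

def mse_objective(predicted, ground_truth):
    """Mean over N samples of ||G(I_i; Phi) - J_i||_2^2.

    predicted, ground_truth: arrays whose first axis indexes the N samples.
    The per-sample color, contrast and saturation terms are not modeled.
    """
    n = len(predicted)
    diff = predicted.reshape(n, -1) - ground_truth.reshape(n, -1)
    return float(np.mean(np.sum(diff ** 2, axis=1)))

# Two samples of four values each, every value off by exactly 1.
loss = mse_objective(np.zeros((2, 2, 2)), np.ones((2, 2, 2)))
```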

Step 4: Train the multi-scale convolutional neural network.

First, 20000 foggy image patches of 64 × 64 are randomly selected from the constructed training sample set, each with its corresponding ground truth. Then the constructed objective function is minimized by stochastic gradient descent. Finally, a threshold is set for the objective function; when the minimized objective value falls below this threshold, the parameters Φ of the multi-scale convolutional neural network are considered determined and the training of the network is complete, after which any outdoor image can be defogged. As shown in Fig. 3, the output of the multi-scale convolutional neural network is the processed image.
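The patch selection at the start of Step 4 can be sketched as follows: each foggy patch is cropped together with the ground truth patch at the same position, so that the pair stays aligned. The function name `sample_patch_pairs`, the uniform choice of image and position, and the fixed seed are assumptions for illustration; the text specifies only the patch size (64 × 64) and the count (20000).

```python
import numpy as np

def sample_patch_pairs(foggy, clear, n_patches, size=64, seed=0):
    """Randomly crop aligned (foggy patch, ground truth patch) pairs.

    foggy, clear: lists of images with matching shapes, each at least
                  size x size pixels.
    """
    rng = np.random.default_rng(seed)
    pairs = []
    for _ in range(n_patches):
        k = rng.integers(len(foggy))             # pick an image
        h, w = foggy[k].shape[:2]
        top = rng.integers(h - size + 1)         # pick a valid top-left corner
        left = rng.integers(w - size + 1)
        window = (slice(top, top + size), slice(left, left + size))
        pairs.append((foggy[k][window], clear[k][window]))
    return pairs

# Small demonstration with synthetic data (10 patches instead of 20000).
clear_set = [np.random.default_rng(k).random((80, 100, 3)) for k in range(3)]
foggy_set = [0.5 * c + 0.5 for c in clear_set]   # t = 0.5, alpha = 1.0
patch_pairs = sample_patch_pairs(foggy_set, clear_set, n_patches=10)
```

Training would then iterate stochastic gradient descent over such pairs and stop once the objective value drops below the chosen threshold.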

This embodiment is an improvement on the previous embodiment. In step 2, the convolutional layers of the multi-scale convolutional neural network are composed of convolution kernels at four different scales: 11 × 11, 7 × 7, 5 × 5 and 3 × 3. Let the training sample to be defogged be I(x) and let q denote the convolution kernel size with q ∈ {11, 7, 5, 3}; the multi-scale feature maps are fused as follows:

where λ, μ, γ and β are the feature map weight coefficients, and h^(q), c^(q) and s^(q) are the average RGB value, average contrast value, and average saturation value of the feature map at each scale, respectively. As shown in Fig. 4, the output of the four-scale convolutional neural network is the processed image.

The embodiments described above merely describe preferred embodiments of the present invention and do not limit its scope. Without departing from the spirit of the design of the present invention, all variations and improvements made to the technical solution of the present invention by those of ordinary skill in the art shall fall within the protection scope determined by the claims of the present invention.

Claims

1. A single outdoor image defogging method based on a multi-scale convolutional neural network, characterized by comprising the following steps:

(1) Training sample acquisition: obtain fog-free image samples, apply fogging to the fog-free image samples using the atmospheric scattering model to obtain foggy image samples, and use the fog-free image samples and the corresponding foggy image samples as training samples;

(2) Multi-scale convolutional neural network model: construct no fewer than three convolutional layers; the output of each convolutional layer is connected to a Max Pooling layer, and the output of each Max Pooling layer is connected to a nonlinear mapping layer based on the rectified linear unit (ReLU); the outputs of all nonlinear mapping layers are connected to a feature fusion layer; the output of the feature fusion layer is connected to a bilateral filtering layer that processes the transmission, and the transmission output by the bilateral filtering layer is used to defog the foggy image sample input to the convolutional layers;

(3) Multi-scale convolutional neural network model training: take the foggy image samples from step (1) as the input of the multi-scale convolutional neural network model and the fog-free image samples from step (1) as the reference against which the network output is judged; train the model with the aim of minimizing the objective function and solve for its parameters; the objective function is:

where c_i, w_i and h_i are the average RGB value, average contrast, and average saturation corresponding to the i-th sample, respectively; the parameters of the multi-scale convolutional neural network are Φ, the i-th foggy image sample is I_i, the fog-free image sample corresponding to the i-th foggy image sample is J_i, and the number of training samples is N;

(4) Use the multi-scale convolutional neural network model solved in step (3) to defog the foggy image to be processed.

2. The single outdoor image defogging method based on a multi-scale convolutional neural network according to claim 1, characterized in that there are three convolutional layers, with convolution kernels of 7 × 7, 5 × 5 and 3 × 3 respectively; or there are four convolutional layers, with convolution kernels of 11 × 11, 7 × 7, 5 × 5 and 3 × 3 respectively.

3. The single outdoor image defogging method based on a multi-scale convolutional neural network according to claim 1, characterized in that the objective function is constructed from the mean squared error (MSE) and the L-2 norm.

4. The single outdoor image defogging method based on a multi-scale convolutional neural network according to claim 1, characterized in that the objective function is minimized by stochastic gradient descent.

5. The single outdoor image defogging method based on a multi-scale convolutional neural network according to claim 1, characterized in that, in the sample acquisition of step (1), the foggy image sample corresponding to a fog-free image sample is computed as follows:

I_T(x)=J_T(x)t_T(x)+α_T(1-t_T(x))

where J_T(x) is the fog-free image sample (that is, the collected image), t_T(x) is the transmission, α_T is the global atmospheric light value, and I_T(x) is the foggy image sample.

6. The single outdoor image defogging method based on a multi-scale convolutional neural network according to claim 1 or 2, characterized in that the convolutional layer has the following form:

where I(x) is the foggy image sample to be defogged, q is the convolution kernel size, * is the convolution operation, and the other terms denote the convolutional layer filter, the convolutional layer bias, and the output of the convolutional layer of the multi-scale convolutional neural network.

7. The single outdoor image defogging method based on a multi-scale convolutional neural network according to claim 1, characterized in that the Max Pooling layer has the following form:

where the two terms denote the output of the pooling layer and the output of the convolutional layer of the multi-scale convolutional neural network.

8. The single outdoor image defogging method based on a multi-scale convolutional neural network according to claim 1, characterized in that the nonlinear mapping layer applies a nonlinear mapping to the dimension-reduced features to obtain the multi-scale feature maps, and the constructed activation layer has the following form:

where the terms denote the activation layer filter, the activation layer bias, the output of the activation layer of the multi-scale convolutional neural network, and the output of its pooling layer.

9. The single outdoor image defogging method based on a multi-scale convolutional neural network according to claim 1, characterized in that the feature fusion layer fuses the multi-scale feature maps to obtain the transmission, and the multi-scale feature maps are fused as follows:

where λ₁, λ₂, …, λ_n are the feature map weight coefficients; h^(q), c^(q) and s^(q) are the average RGB value, average contrast value, and average saturation value of the feature map at each scale, respectively; and the remaining term is the output of the pooling layer of the n-scale convolutional neural network.

10. The single outdoor image defogging method based on a multi-scale convolutional neural network according to claim 1, characterized in that the bilateral filtering layer applies bilateral filtering to the transmission to obtain the refined transmission t(x), computed as follows:

d(ξ, y) = ||ξ − y||₂

where y is a pixel in the transmission map, ξ is a pixel adjacent to y, c(ξ, y) is the spatial weight function, σ_c is the variance between the two pixels, and d(ξ, y) is the distance function between the two pixels. The formula also uses the transmissions formed by the 8-neighborhood pixel blocks centered on pixels ξ and y respectively, a similarity weight function over those blocks, the variance σ_s between the two transmissions, and a distance function between the two transmissions.