CN108230243A - Background blurring method based on a salient-region detection model - Google Patents

Background blurring method based on a salient-region detection model

Info

Publication number
CN108230243A
CN108230243A (application CN201810133575.9A; granted as CN108230243B)
Authority
CN
China
Prior art keywords
layer
size
convolutional
output
layers
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201810133575.9A
Other languages
Chinese (zh)
Other versions
CN108230243B (en)
Inventor
余春艳
徐小丹
陈立
杨素琼
王秀
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Fuzhou University
Original Assignee
Fuzhou University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Fuzhou University
Priority to CN201810133575.9A
Publication of CN108230243A
Application granted
Publication of CN108230243B
Legal status: Expired - Fee Related
Anticipated expiration


Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T3/00Geometric image transformations in the plane of the image
    • G06T3/04Context-preserving transformations, e.g. by using an importance map
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/10Segmentation; Edge detection
    • G06T7/11Region-based segmentation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/10Image acquisition modality
    • G06T2207/10004Still image; Photographic image

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • General Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • Biophysics (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • Artificial Intelligence (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • Biomedical Technology (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Health & Medical Sciences (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Image Analysis (AREA)
  • Toys (AREA)

Abstract

The invention discloses a background blurring method based on a salient-region detection model, comprising the following steps: obtain an original image; build a salient-region detection model as a convolutional network and obtain a saliency map of the original image; feed the saliency map into a fully connected conditional random field for training to obtain an optimized saliency map; binarize or segment the optimized saliency map into a 0-1 matrix, from which a foreground index matrix and a background index matrix are derived; blur the whole original image with a distance-weighted averaging algorithm; finally, splice the sharp foreground extracted from the original image with the blurred background to generate the background-blurred image. The method not only detects complete salient regions accurately, but also yields clearer saliency boundaries, so the features of the foreground are preserved, and its content is not damaged, when the background is blurred.

Description

Background blurring method based on a salient-region detection model
Technical field
The present invention relates to the field of digital image processing, and in particular to a background blurring method based on a salient-region detection model.
Background technology
Background blurring is a very common operation in tasks such as image rendering, beautification, and enhancement: it effectively highlights the target object and fades the background information, improving the visual effect. At present, some image-processing software performs this operation well, but it requires the foreground region to be marked manually, which costs considerable labour and does not scale to batch processing. Moreover, in the prior art the blur diffuses over regularly shaped regions, which adapts poorly to complex and varied image content. Existing automatic background-blurring techniques are immature at extracting foreground edges, leading to unclear boundaries, cropping of the wrong regions, and similar defects.
Invention content
To address problems such as the unclear boundaries of existing background-blurring methods, the present invention proposes a background blurring method based on a salient-region detection model. It detects entire salient regions and performs well in complex situations such as multiple salient objects or small-scale salient objects; it not only detects complete salient regions accurately but also produces clearer saliency boundaries, so that the features of the foreground can be preserved, and its content left undamaged, when the background is blurred.
To achieve the above object, the technical scheme of the present invention is a background blurring method based on a salient-region detection model, comprising the following steps:
Step S1: obtain the original image;
Step S2: build a salient-region detection model based on a convolutional neural network and obtain the saliency map of the original image;
Step S3: feed the saliency map into a fully connected conditional random field for training, obtaining the optimized saliency map;
Step S4: binarize or segment the optimized saliency map into a 0-1 matrix SBM, and derive the foreground index matrix IF and the background index matrix IB, defined as follows (the defining equations are rendered as images in the original; from context, IF equals SBM and IB is its complement):
where M × N denotes the all-ones matrix with the same resolution as the original image;
Step S5: blur the whole original image with a distance-weighted averaging algorithm, obtaining the globally blurred image;
Step S6: extract the sharp foreground from the original image using the foreground index matrix IF, and the blurred background from the globally blurred image using the background index matrix IB; finally splice the sharp foreground and the blurred background to obtain the background-blurred result.
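Steps S4 to S6 above can be sketched in a few lines of NumPy. This is an illustrative reconstruction, not the patent's implementation: the index-matrix definitions IF = SBM and IB = J − SBM (J the all-ones matrix) are inferred from context, since the defining equations are images in the original, and a plain box blur stands in for the distance-weighted average of step S5.

```python
import numpy as np

def blur_background(image, saliency, thresh=0.5):
    """Hypothetical sketch of steps S4-S6: binarize the saliency map,
    build foreground/background index matrices, globally blur the image,
    and composite the sharp foreground over the blurred background."""
    # Step S4: binarize saliency into the 0/1 matrix SBM;
    # then (assumed) IF = SBM and IB = J - SBM.
    sbm = (saliency >= thresh).astype(image.dtype)
    i_f = sbm
    i_b = 1.0 - sbm
    # Step S5 (stand-in): a simple box blur instead of the patent's
    # distance-weighted average, applied per channel.
    k = 5  # blur radius stand-in
    blurred = np.empty_like(image)
    padded = np.pad(image, ((k, k), (k, k), (0, 0)), mode="edge")
    for y in range(image.shape[0]):
        for x in range(image.shape[1]):
            blurred[y, x] = padded[y:y + 2*k + 1, x:x + 2*k + 1].mean(axis=(0, 1))
    # Step S6: sharp foreground plus blurred background.
    return i_f[..., None] * image + i_b[..., None] * blurred
```

With a float RGB image and a saliency map in [0, 1], foreground pixels come through unchanged while background pixels are replaced by their blurred values.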
Further, the specific network structure of the salient-region detection model is as follows:
Layer 1 is the input layer, which receives the original image;
Layer 2 consists of two convolutional layers: the first uses 64 convolution kernels of size (4, 4, 3) and the second 64 kernels of size (3, 3, 64); the activation function is ReLU;
Layer 3 is a pooling layer of size (2, 2); the activation function is ReLU;
Layer 4 consists of two convolutional layers: the first uses 128 kernels of size (3, 3, 64) and the second 128 kernels of size (3, 3, 128); the activation function is ReLU;
Layer 5 is a pooling layer of size (2, 2); the activation function is ReLU;
Layer 6 consists of three convolutional layers: the first uses 256 kernels of size (3, 3, 128), the second 256 kernels of size (3, 3, 256), and the third 256 kernels of size (3, 3, 256); the activation function is ReLU;
Layer 7 is a pooling layer of size (2, 2); the activation function is ReLU;
Layer 8 consists of three convolutional layers: the first uses 512 kernels of size (3, 3, 256), the second 512 kernels of size (3, 3, 512), and the third 512 kernels of size (3, 3, 512); the activation function is ReLU;
Layer 9 is a pooling layer of size (2, 2); the activation function is ReLU;
Layer 10 consists of three convolutional layers: the first uses 512 kernels of size (3, 3, 512), the second 512 kernels of size (3, 3, 512), and the third 512 kernels of size (3, 3, 512); the activation function is ReLU;
Layer 11 is a pooling layer of size (3, 3) with an edge padding of 1; the activation function is ReLU;
Layer 12 consists of two convolutional layers: the first uses 1024 kernels of size (3, 3, 512) and the second 512 kernels of size (3, 3, 1024); the activation function is ReLU;
Layer 13 consists of two convolutional layers and a normalization layer: the first convolutional layer uses 256 kernels of size (3, 3, 1024) and the second 512 kernels of size (3, 3, 256); the activation function is ReLU;
Layer 14 is a deconvolution module whose two inputs are the outputs of layer 13 and layer 12;
Layer 15 is a deconvolution module whose two inputs are the outputs of layer 14 and layer 8;
Layer 16 is a deconvolution module whose two inputs are the outputs of layer 15 and layer 6;
Layer 17 is a deconvolution module whose two inputs are the outputs of layer 16 and layer 4;
Layer 18 is a deconvolution module whose two inputs are the outputs of layer 17 and layer 2;
Layer 19 consists of a deconvolution layer, a convolutional layer, and a concatenation layer: the deconvolution layer takes the output of layer 14 and uses 1 kernel of size (4, 4, 512); the convolutional layer takes the output of layer 14 and uses 1 kernel of size (1, 1, 512); the concatenation layer joins the outputs of the deconvolution layer and the convolutional layer along the channel dimension; the activation function is ReLU;
Layer 20 consists of two deconvolution layers and a shear (crop) layer: the first deconvolution layer uses 1 kernel of size (8, 8, 2) and the second 1 kernel of size (8, 8, 1); the shear layer crops the deconvolution result to the size of the original image; the activation function is Sigmoid;
Layer 21 consists of a deconvolution layer, a convolutional layer, and a concatenation layer: the deconvolution layer takes the output of layer 19 and uses 2 kernels of size (4, 4, 2); the convolutional layer takes the output of layer 15 and uses 1 kernel of size (1, 1, 512); the concatenation layer joins the two outputs along the channel dimension; the activation function is ReLU;
Layer 22 consists of a deconvolution layer and a shear layer: the deconvolution layer uses 1 kernel of size (16, 16, 1), and the shear layer crops the result to the size of the original image; the activation function is Sigmoid;
Layer 23 consists of a deconvolution layer, a convolutional layer, and a concatenation layer: the deconvolution layer takes the output of layer 21 and uses 3 kernels of size (4, 4, 3); the convolutional layer takes the output of layer 16 and uses 1 kernel of size (1, 1, 256); the concatenation layer joins the two outputs along the channel dimension; the activation function is ReLU;
Layer 24 consists of a deconvolution layer and a shear layer: the deconvolution layer uses 1 kernel of size (8, 8, 2), and the shear layer crops the result to the size of the original image; the activation function is Sigmoid;
Layer 25 consists of a deconvolution layer, a convolutional layer, and a concatenation layer: the deconvolution layer takes the output of layer 23 and uses 4 kernels of size (4, 4, 4); the convolutional layer takes the output of layer 17 and uses 1 kernel of size (1, 1, 128); the concatenation layer joins the two outputs along the channel dimension; the activation function is ReLU;
Layer 26 consists of a deconvolution layer and a shear layer: the deconvolution layer uses 1 kernel of size (4, 4, 3), and the shear layer crops the result to the size of the original image; the activation function is Sigmoid;
Layer 27 consists of a deconvolution layer, a convolutional layer, and a concatenation layer: the deconvolution layer takes the output of layer 25 and uses 3 kernels of size (4, 4, 3); the convolutional layer takes the output of layer 17 and uses 1 kernel of size (1, 1, 256); the concatenation layer joins the two outputs along the channel dimension; the activation function is ReLU;
Layer 28 consists of a deconvolution layer and a shear layer: the deconvolution layer uses 1 kernel of size (2, 2, 4), and the shear layer crops the result to the size of the original image; the activation function is Sigmoid;
Layer 29 consists of a concatenation layer and a convolutional layer: the concatenation layer joins the outputs of layers 28, 26, 24, 22, and 20 along the channel dimension, and the convolutional layer uses 1 kernel of size (1, 1, 5); the activation function is Sigmoid; this yields the final output.
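For reference, the encoder portion of the structure above (layers 1 to 13) can be summarized as plain data. The tuple encoding is a hypothetical convenience, but the kernel counts and sizes are taken directly from the description:

```python
# Compact re-encoding of layers 1-13 (the VGG-16-like encoder); each
# entry is ("conv", num_kernels, kernel_size) or ("pool", size).
ENCODER = [
    ("conv", 64, (4, 4, 3)),    ("conv", 64, (3, 3, 64)),      # layer 2
    ("pool", (2, 2)),                                          # layer 3
    ("conv", 128, (3, 3, 64)),  ("conv", 128, (3, 3, 128)),    # layer 4
    ("pool", (2, 2)),                                          # layer 5
    ("conv", 256, (3, 3, 128)), ("conv", 256, (3, 3, 256)),
    ("conv", 256, (3, 3, 256)),                                # layer 6
    ("pool", (2, 2)),                                          # layer 7
    ("conv", 512, (3, 3, 256)), ("conv", 512, (3, 3, 512)),
    ("conv", 512, (3, 3, 512)),                                # layer 8
    ("pool", (2, 2)),                                          # layer 9
    ("conv", 512, (3, 3, 512)), ("conv", 512, (3, 3, 512)),
    ("conv", 512, (3, 3, 512)),                                # layer 10
    ("pool", (3, 3)),                                          # layer 11 (edge padding 1)
    ("conv", 1024, (3, 3, 512)), ("conv", 512, (3, 3, 1024)),  # layer 12
    ("conv", 256, (3, 3, 1024)), ("conv", 512, (3, 3, 256)),   # layer 13 (+ normalization)
]
```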
The deconvolution module consists of a deconvolution layer, a shear layer, an Eltwise layer, and a normalization layer, with the following structure. Let the two inputs be feature maps C1 and C2, of sizes (h1, w1, k1) and (h2, w2, k2) respectively, where C1 is smaller than C2. The first layer is a deconvolution layer with k2 kernels of size (4, 4, k1) and ReLU activation, whose input is C1. The second layer is a shear layer that crops the previous output to the size of C2. The third layer is an Eltwise layer that multiplies C2 with the previous output pixel by pixel, with ReLU activation. The fourth layer is a normalization layer that normalizes the previous output.
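The deconvolution module just described can be sketched as follows. This is a hypothetical NumPy approximation, not the trained module: nearest-neighbour upsampling stands in for the learned 4×4 deconvolution, min-max scaling stands in for the unspecified normalization layer, and C1 and C2 are assumed to have equal channel counts.

```python
import numpy as np

def deconv_module(c1, c2):
    """Sketch of the patent's deconvolution module: upsample the smaller
    map C1, crop it to C2's spatial size (shear layer), multiply
    elementwise with C2 (Eltwise layer), and normalize the result."""
    h2, w2, _ = c2.shape
    up = c1.repeat(2, axis=0).repeat(2, axis=1)   # stride-2 upsampling stand-in
    up = up[:h2, :w2, :]                          # shear (crop) layer
    fused = np.maximum(up * c2, 0.0)              # Eltwise product + ReLU
    lo, hi = fused.min(), fused.max()
    return (fused - lo) / (hi - lo) if hi > lo else fused  # normalization stand-in
```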
Further, step S3 specifically comprises the following. The fully connected conditional random field obtains the convolutional output of the saliency map through full connections and feeds this result into the conditional random field. Let x = (x1, x2, ..., xn) denote the observed input data sequence and y = (y1, y2, ..., yn) a state sequence. Given an input sequence, the linear-chain CRF model defines the joint conditional probability of the state sequence as (the equation is an image in the original; the standard linear-chain CRF form is)

p(y | x) = (1 / Z(x)) · exp( Σ_j w_j F_j(y, x) )

where Z(x) is the probability normalization factor conditioned on the input data sequence x, each F_j is an arbitrary feature function, w_j is the weight of the corresponding feature function, and the exponential factor is a strictly positive potential function.
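The linear-chain CRF probability above can be made concrete with a toy example that evaluates p(y | x) = exp(Σ_j w_j F_j(y, x)) / Z(x), computing Z(x) by brute-force enumeration over binary label sequences. The feature functions and weights below are illustrative assumptions, not the patent's:

```python
import numpy as np
from itertools import product

def crf_prob(y, x, w, feats):
    """Toy linear-chain CRF: score a label sequence with weighted feature
    functions, and normalize by Z(x) summed over all binary sequences."""
    def score(labels):
        return sum(wj * fj(labels, x) for wj, fj in zip(w, feats))
    n = len(y)
    z = sum(np.exp(score(cand)) for cand in product([0, 1], repeat=n))
    return float(np.exp(score(tuple(y))) / z)

# Illustrative features: label-observation agreement and label smoothness.
feats = [
    lambda ys, xs: sum(1.0 for yi, xi in zip(ys, xs) if yi == xi),
    lambda ys, xs: sum(1.0 for a, b in zip(ys, ys[1:]) if a == b),
]
```

Because the potentials are exponentials of real-valued scores, they are strictly positive, as the text requires.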
Further, step S5 specifically comprises the following. According to the importance of each pixel, different weights are assigned and averaged, yielding a weighted-average matrix for each of the three pixel matrices of the original image's RGB channels and thereby realizing the global blur of the original image.
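A minimal sketch of this distance-weighted averaging, assuming weights of the form 1/(1 + d) with d the distance from the window centre (the patent does not give the exact weight function):

```python
import numpy as np

def distance_weighted_blur(image, radius=3):
    """Blur each channel with a weighted mean whose weights fall off
    with distance from the window centre (assumed 1/(1+d) profile)."""
    ys, xs = np.mgrid[-radius:radius + 1, -radius:radius + 1]
    w = 1.0 / (1.0 + np.hypot(ys, xs))   # closer pixels weigh more
    w /= w.sum()                          # normalize so weights sum to 1
    h, wd, c = image.shape
    padded = np.pad(image, ((radius, radius), (radius, radius), (0, 0)), mode="edge")
    out = np.empty_like(image, dtype=float)
    for ch in range(c):                   # weighted average per RGB channel
        for y in range(h):
            for x in range(wd):
                out[y, x, ch] = (w * padded[y:y + 2*radius + 1,
                                            x:x + 2*radius + 1, ch]).sum()
    return out
```

Since the weights are normalized, a constant image is left unchanged, which is a quick sanity check for any weighted-average blur.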
Further, step S6 specifically comprises the following. Let the original image and the globally blurred image be denoted IO and IB respectively (the original text reuses the abbreviation IB for both the blurred image and the background index matrix). The sharp foreground ICF and the blurred background IBB are then extracted (the defining equations are images in the original; from context, ICF is the original image masked by the foreground index matrix, and IBB is the blurred image masked by the background index matrix), where i is the x-axis coordinate and j is the y-axis coordinate. The superposition of the sharp foreground ICF and the blurred background IBB is the final background-blurred image.
Compared with the prior art, the present invention has the following advantageous effects:
(1) it detects entire salient regions and performs well in complex situations such as multiple salient objects or small-scale salient objects;
(2) it not only detects complete salient regions accurately but also yields clearer saliency boundaries, so that the features of the foreground can be preserved, and its content left undamaged, when the background is blurred.
Description of the drawings
Fig. 1 is a flow diagram of the background blurring method based on a salient-region detection model of the present invention;
Fig. 2 is a schematic comparison of results for one embodiment of the invention.
Specific embodiments
The present invention is further described below with reference to the accompanying drawings and embodiments.
As shown in Fig. 1, the present invention provides a background blurring method based on a salient-region detection model, comprising steps S1 to S6 exactly as set out in the summary of the invention above.
The specific network structure of the salient-region detection model, including the deconvolution module, is identical to that already described above and is not repeated here.
Steps S3, S5, and S6 are carried out exactly as described above and are not repeated here.
Fig. 2 compares results obtained with the method of the present invention: the left column shows the original images and the right column the background-blurred results.
What has been described above is only a preferred embodiment of the present invention, and the invention is not limited to it. Improvements and variations that those skilled in the art can derive directly, or arrive at by association, without departing from the design of the present invention shall be considered to fall within its scope of protection.

Claims (5)

1. A background blurring method based on a salient-region detection model, characterized in that it comprises the following steps:
Step S1: obtain the original image;
Step S2: build a salient-region detection model based on a convolutional neural network and obtain the saliency map of the original image;
Step S3: feed the saliency map into a fully connected conditional random field for training, obtaining the optimized saliency map;
Step S4: binarize or segment the optimized saliency map into a 0-1 matrix SBM, and derive the foreground index matrix IF and the background index matrix IB, defined as follows (the defining equations are images in the original):
where M × N denotes the all-ones matrix with the same resolution as the original image;
Step S5: blur the whole original image with a distance-weighted averaging algorithm, obtaining the globally blurred image;
Step S6: extract the sharp foreground from the original image using the foreground index matrix IF, and the blurred background from the globally blurred image using the background index matrix IB; finally splice the sharp foreground and the blurred background to obtain the background-blurred result.
2. The background blurring method based on a salient region detection model according to claim 1, characterized in that the specific network structure of the salient region detection model is as follows:
The first layer is the input layer, which receives the original image;
The second layer consists of two convolutional layers: the first uses 64 convolution kernels of size (4, 4, 3) and the second uses 64 kernels of size (3, 3, 64); the activation function is ReLU;
The third layer is a pooling layer of size (2, 2) with ReLU activation;
The fourth layer consists of two convolutional layers: the first uses 128 kernels of size (3, 3, 64) and the second uses 128 kernels of size (3, 3, 128); the activation function is ReLU;
The fifth layer is a pooling layer of size (2, 2) with ReLU activation;
The sixth layer consists of three convolutional layers: the first uses 256 kernels of size (3, 3, 128), the second 256 kernels of size (3, 3, 256), and the third 256 kernels of size (3, 3, 256); the activation function is ReLU;
The seventh layer is a pooling layer of size (2, 2) with ReLU activation;
The eighth layer consists of three convolutional layers: the first uses 512 kernels of size (3, 3, 256), the second 512 kernels of size (3, 3, 512), and the third 512 kernels of size (3, 3, 512); the activation function is ReLU;
The ninth layer is a pooling layer of size (2, 2) with ReLU activation;
The tenth layer consists of three convolutional layers, each using 512 kernels of size (3, 3, 512); the activation function is ReLU;
The eleventh layer is a pooling layer of size (3, 3) with an edge padding of 1 and ReLU activation;
The twelfth layer consists of two convolutional layers: the first uses 1024 kernels of size (3, 3, 512) and the second uses 512 kernels of size (3, 3, 1024); the activation function is ReLU;
The thirteenth layer consists of two convolutional layers and a normalization layer: the first convolutional layer uses 256 kernels of size (3, 3, 1024) and the second uses 512 kernels of size (3, 3, 256); the activation function is ReLU;
The fourteenth layer is a deconvolution module whose two inputs are the outputs of the thirteenth and twelfth layers;
The fifteenth layer is a deconvolution module whose two inputs are the outputs of the fourteenth and eighth layers;
The sixteenth layer is a deconvolution module whose two inputs are the outputs of the fifteenth and sixth layers;
The seventeenth layer is a deconvolution module whose two inputs are the outputs of the sixteenth and fourth layers;
The eighteenth layer is a deconvolution module whose two inputs are the outputs of the seventeenth and second layers;
The nineteenth layer consists of a deconvolution layer, a convolutional layer and a concatenation layer: the deconvolution layer takes the fourteenth layer's output and uses 1 kernel of size (4, 4, 512); the convolutional layer takes the fourteenth layer's output and uses 1 kernel of size (1, 1, 512); the concatenation layer joins the deconvolution and convolution outputs along the channel dimension; the activation function is ReLU;
The twentieth layer consists of two deconvolution layers and a crop layer: the first deconvolution layer uses 1 kernel of size (8, 8, 2) and the second uses 1 kernel of size (8, 8, 1); the crop layer cuts the deconvolution result to the size of the original image; the activation function is Sigmoid;
The twenty-first layer consists of a deconvolution layer, a convolutional layer and a concatenation layer: the deconvolution layer takes the nineteenth layer's output and uses 2 kernels of size (4, 4, 2); the convolutional layer takes the fifteenth layer's output and uses 1 kernel of size (1, 1, 512); the concatenation layer joins the two outputs along the channel dimension; the activation function is ReLU;
The twenty-second layer consists of a deconvolution layer and a crop layer: the deconvolution layer uses 1 kernel of size (16, 16, 1), and the crop layer cuts the result to the size of the original image; the activation function is Sigmoid;
The twenty-third layer consists of a deconvolution layer, a convolutional layer and a concatenation layer: the deconvolution layer takes the twenty-first layer's output and uses 3 kernels of size (4, 4, 3); the convolutional layer takes the sixteenth layer's output and uses 1 kernel of size (1, 1, 256); the concatenation layer joins the two outputs along the channel dimension; the activation function is ReLU;
The twenty-fourth layer consists of a deconvolution layer and a crop layer: the deconvolution layer uses 1 kernel of size (8, 8, 2), and the crop layer cuts the result to the size of the original image; the activation function is Sigmoid;
The twenty-fifth layer consists of a deconvolution layer, a convolutional layer and a concatenation layer: the deconvolution layer takes the twenty-third layer's output and uses 4 kernels of size (4, 4, 4); the convolutional layer takes the seventeenth layer's output and uses 1 kernel of size (1, 1, 128); the concatenation layer joins the two outputs along the channel dimension; the activation function is ReLU;
The twenty-sixth layer consists of a deconvolution layer and a crop layer: the deconvolution layer uses 1 kernel of size (4, 4, 3), and the crop layer cuts the result to the size of the original image; the activation function is Sigmoid;
The twenty-seventh layer consists of a deconvolution layer, a convolutional layer and a concatenation layer: the deconvolution layer takes the twenty-fifth layer's output and uses 3 kernels of size (4, 4, 3); the convolutional layer takes the seventeenth layer's output and uses 1 kernel of size (1, 1, 256); the concatenation layer joins the two outputs along the channel dimension; the activation function is ReLU;
The twenty-eighth layer consists of a deconvolution layer and a crop layer: the deconvolution layer uses 1 kernel of size (2, 2, 4), and the crop layer cuts the result to the size of the original image; the activation function is Sigmoid;
The twenty-ninth layer consists of a concatenation layer and a convolutional layer: the concatenation layer joins the outputs of the twenty-eighth, twenty-sixth, twenty-fourth, twenty-second and twentieth layers along the channel dimension, and the convolutional layer uses 1 kernel of size (1, 1, 5); the activation function is Sigmoid, producing the final output;
The deconvolution module consists of a deconvolution layer, a crop layer, an Eltwise layer and a normalization layer, with the following specific structure: given two inputs, feature map C1 and feature map C2, of sizes (h1, w1, k1) and (h2, w2, k2) respectively, where C1 is smaller than C2, the first layer is a deconvolution layer with k2 kernels of size (4, 4, k1) and ReLU activation, taking C1 as input; the second layer is a crop layer that cuts the previous output to the size of C2; the third layer is an Eltwise layer that multiplies C2 with the previous output pixel by pixel, with ReLU activation; the fourth layer is a normalization layer that normalizes the previous output.
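For orientation, the encoder half of the structure above (layers 1 to 10) is VGG-style: convolution blocks of 64, 128, 256, 512 and 512 kernels separated by (2, 2) pooling layers. The sketch below traces feature-map shapes through it, assuming stride-1 'same' convolutions and stride-2 pooling; the claim does not state strides or padding, so these are assumptions.

```python
def trace_encoder_shapes(h, w):
    """Trace (height, width, channels) through the claim's encoder,
    assuming 'same' convolutions (stride 1) and (2, 2) pooling with
    stride 2 -- strides and padding are not stated in the claim."""
    shapes = [(h, w, 3)]                          # layer 1: input image
    channels = [64, 128, 256, 512, 512]           # conv-block output widths
    for c in channels[:-1]:                       # layers 2-9: conv block + pool
        h_cur, w_cur, _ = shapes[-1]
        shapes.append((h_cur, w_cur, c))          # conv block keeps H x W
        shapes.append((h_cur // 2, w_cur // 2, c))  # (2, 2) pool halves H, W
    h_cur, w_cur, _ = shapes[-1]
    shapes.append((h_cur, w_cur, 512))            # layer 10 conv block
    return shapes
```

Under these assumptions a 224 x 224 input reaches layer 10 at 14 x 14 x 512, which is why the decoder needs five deconvolution stages to climb back to the input resolution.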
3. The background blurring method based on a salient region detection model according to claim 1, characterized in that step S3 specifically comprises:
The fully connected conditional random field takes the convolved saliency-map output through full connections and feeds this output into the conditional random field. Let x = (x1, x2, ..., xn) denote the observed input data sequence and y = (y1, y2, ..., yn) denote a state sequence. Given an input sequence, the linear-chain CRF model defines the joint conditional probability of the state sequence as:
p(y|x) = (1/Z(x)) exp( Σ_i Σ_k w_k f_k(y_{i-1}, y_i, x, i) )
where Z(x) is the probability normalization factor conditioned on the input data sequence x; f_k is an arbitrary feature function; w_k is the weight of each feature function; and the exponential term is a strictly positive potential function.
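The linear-chain CRF probability above can be made concrete with a brute-force toy example. The two feature functions and their weights here are illustrative choices, not the patent's; Z(x) is computed by exhaustive enumeration, which is feasible only for short binary label sequences.

```python
import itertools
import math

def crf_prob(y, x, w_unary=1.0, w_pair=0.5):
    """p(y|x) for a toy linear-chain CRF with two feature functions:
    f1(y_i, x_i) = 1 if y_i == x_i  (observation agreement), and
    f2(y_{i-1}, y_i) = 1 if y_{i-1} == y_i  (label smoothness)."""
    def potential(labels):
        # exp(sum of weighted features): a strictly positive potential
        s = sum(w_unary for yi, xi in zip(labels, x) if yi == xi)
        s += sum(w_pair for a, b in zip(labels, labels[1:]) if a == b)
        return math.exp(s)
    # Z(x): normalization factor conditioned on the input sequence x
    z = sum(potential(c) for c in itertools.product([0, 1], repeat=len(x)))
    return potential(tuple(y)) / z
```

Because of the normalization by Z(x), the probabilities over all candidate label sequences sum to 1, and sequences that agree with the observations and vary smoothly score higher.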
4. The background blurring method based on a salient region detection model according to claim 1, characterized in that step S5 specifically comprises:
assigning each pixel a different weight according to its importance and averaging; the weighted-average matrix of the pixel matrix is computed separately for each of the three RGB channels of the original image, thereby realizing the global blur of the original image.
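A minimal sketch of such a distance-weighted average, applied to one channel. The claim does not fix the weighting scheme, so inverse-distance weights (normalized to sum to 1) are assumed here; in practice the same routine would be run once per RGB channel.

```python
import numpy as np

def distance_weight_kernel(radius):
    """Kernel whose weights fall off with distance from the centre pixel
    (the claim does not fix the weighting; inverse distance is assumed)."""
    ys, xs = np.mgrid[-radius:radius + 1, -radius:radius + 1]
    k = 1.0 / (1.0 + np.sqrt(xs**2 + ys**2))
    return k / k.sum()                            # normalize: weights sum to 1

def weighted_blur(channel, radius=2):
    """Apply the distance-weighted average to one image channel."""
    k = distance_weight_kernel(radius)
    h, w = channel.shape
    padded = np.pad(channel, radius, mode="edge")
    out = np.zeros(channel.shape, dtype=float)
    for dy in range(2 * radius + 1):
        for dx in range(2 * radius + 1):
            out += k[dy, dx] * padded[dy:dy + h, dx:dx + w]
    return out
```

Normalizing the kernel keeps overall brightness unchanged: a constant channel passes through untouched, while sharp details are spread over the neighbourhood.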
5. The background blurring method based on a salient region detection model according to claim 1, characterized in that step S6 specifically comprises:
Let the original image and the globally blurred image be denoted IO and IB, respectively. The sharp foreground ICF and the blurred background IBB are extracted by element-wise multiplication:
ICF(i, j) = IO(i, j) · IF(i, j),
and IBB(i, j) is obtained analogously by multiplying the blurred image with the background index matrix,
where i is the x-axis coordinate and j is the y-axis coordinate.
The superposition of the sharp foreground ICF and the blurred background IBB is the final background-blurred image.
CN201810133575.9A 2018-02-09 2018-02-09 Background blurring method based on salient region detection model Expired - Fee Related CN108230243B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810133575.9A CN108230243B (en) 2018-02-09 2018-02-09 Background blurring method based on salient region detection model


Publications (2)

Publication Number Publication Date
CN108230243A true CN108230243A (en) 2018-06-29
CN108230243B CN108230243B (en) 2021-04-27

Family

ID=62661349

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810133575.9A Expired - Fee Related CN108230243B (en) 2018-02-09 2018-02-09 Background blurring method based on salient region detection model

Country Status (1)

Country Link
CN (1) CN108230243B (en)


Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105513105A (en) * 2015-12-07 2016-04-20 天津大学 Image background blurring method based on saliency map
CN107169954A (en) * 2017-04-18 2017-09-15 华南理工大学 Image saliency detection method based on a parallel convolutional neural network
CN107247952A (en) * 2016-07-28 2017-10-13 哈尔滨工业大学 Visual saliency detection method based on a deeply supervised recurrent convolutional neural network
CN107564025A (en) * 2017-08-09 2018-01-09 浙江大学 Semantic segmentation method for power equipment infrared images based on a deep neural network
CN107610141A (en) * 2017-09-05 2018-01-19 华南理工大学 Remote sensing image semantic segmentation method based on deep learning


Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
EVAN SHELHAMER 等: "Fully Convolutional Networks for Semantic Segmentation", 《IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE》 *
NIAN LIU 等: "DHSNet: Deep Hierarchical Saliency Network for Salient Object Detection", 《2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR)》 *
ZHAOWEI CAI 等: "A Unified Multi-scale Deep Convolutional Neural Network for Fast Object Detection", 《ARXIV:1607.07155V1》 *
杜慧 (Du Hui): "Research on object detection methods using multiple spatial- and frequency-domain features based on visual saliency", China Masters' Theses Full-text Database, Information Science & Technology *

Cited By (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109089040A (en) * 2018-08-20 2018-12-25 Oppo广东移动通信有限公司 Image processing method, image processing apparatus and terminal device
CN109089040B (en) * 2018-08-20 2021-05-14 Oppo广东移动通信有限公司 Image processing method, image processing device and terminal equipment
CN109636764A (en) * 2018-11-01 2019-04-16 上海大学 Image style transfer method based on deep learning and saliency detection
CN109618173A (en) * 2018-12-17 2019-04-12 深圳Tcl新技术有限公司 Video compression method, device and computer readable storage medium
CN109618173B (en) * 2018-12-17 2021-09-28 深圳Tcl新技术有限公司 Video compression method, device and computer readable storage medium
CN109727264A (en) * 2019-01-10 2019-05-07 南京旷云科技有限公司 Image generation method, neural network training method, apparatus and electronic device
CN111680702A (en) * 2020-05-28 2020-09-18 杭州电子科技大学 Weakly supervised image saliency detection method using detection boxes
CN111680702B (en) * 2020-05-28 2022-04-01 杭州电子科技大学 Weakly supervised image saliency detection method using detection boxes
CN111861867A (en) * 2020-07-02 2020-10-30 泰康保险集团股份有限公司 Image background blurring method and device
CN111861867B (en) * 2020-07-02 2024-02-13 泰康保险集团股份有限公司 Image background blurring method and device
CN116582743A (en) * 2023-07-10 2023-08-11 荣耀终端有限公司 Shooting method, electronic equipment and medium

Also Published As

Publication number Publication date
CN108230243B (en) 2021-04-27

Similar Documents

Publication Publication Date Title
CN108230243A (en) One kind is based on salient region detection model background-blurring method
CN110473141A (en) Image processing method, device, storage medium and electronic equipment
CN104038807B (en) A kind of figure layer mixed method and device based on OpenGL
CN108364023A (en) Image-recognizing method based on attention model and system
CN104281864B (en) A kind of method and apparatus for generating Quick Response Code
CN112651438A (en) Multi-class image classification method and device, terminal equipment and storage medium
CN106372648A (en) Multi-feature-fusion-convolutional-neural-network-based plankton image classification method
CN111325271B (en) Image classification method and device
CN113344806A (en) Image defogging method and system based on global feature fusion attention network
CN108961253A (en) A kind of image partition method and device
CN106096668A (en) The recognition methods of watermarked image and the system of identification
CN104866755B (en) Setting method and device for background picture of application program unlocking interface and electronic equipment
CN110706151B (en) Video-oriented non-uniform style migration method
CN106650615A (en) Image processing method and terminal
CN103714313B (en) Two-dimensional code identification method and device
CN108898562A (en) A kind of mobile device image defogging method based on deep learning
CN109816100A (en) A kind of conspicuousness object detecting method and device based on two-way fusion network
CN109815948A (en) A kind of paper partitioning algorithm under complex scene
CN109117940A (en) To accelerated method, apparatus and system before a kind of convolutional neural networks
CN109636764A (en) A kind of image style transfer method based on deep learning and conspicuousness detection
Luo et al. An improved resnet algorithm based on cbam
CN112532882A (en) Image display method and device
CN106681735A (en) Method, device and apparatus for generating dynamic icons based fonts
CN114359679A (en) Text recognition method and text recognition system
CN105094615A (en) Information processing method and electronic equipment

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20210427

Termination date: 20220209