CN108549833A - An accurate and robust target extraction method - Google Patents
An accurate and robust target extraction method
- Publication number
- CN108549833A CN108549833A CN201810184857.1A CN201810184857A CN108549833A CN 108549833 A CN108549833 A CN 108549833A CN 201810184857 A CN201810184857 A CN 201810184857A CN 108549833 A CN108549833 A CN 108549833A
- Authority
- CN
- China
- Prior art keywords
- pixel
- super
- target
- foreground
- video frame
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Withdrawn
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/40—Scenes; Scene-specific elements in video content
- G06V20/41—Higher-level, semantic clustering, classification or understanding of video scenes, e.g. detection, labelling or Markovian modelling of sport events or news items
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/10—Segmentation; Edge detection
- G06T7/11—Region-based segmentation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/10—Segmentation; Edge detection
- G06T7/12—Edge-based segmentation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/10—Segmentation; Edge detection
- G06T7/194—Segmentation; Edge detection involving foreground-background segmentation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/20—Analysis of motion
- G06T7/254—Analysis of motion involving subtraction of images
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/40—Scenes; Scene-specific elements in video content
- G06V20/49—Segmenting video sequences, i.e. computational techniques such as parsing or cutting the sequence, low-level clustering or determining units such as shots or scenes
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10016—Video; Image sequence
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20112—Image segmentation details
- G06T2207/20156—Automatic seed setting
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20172—Image enhancement details
- G06T2207/20192—Edge enhancement; Edge preservation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V2201/00—Indexing scheme relating to image or video recognition or understanding
- G06V2201/07—Target detection
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Multimedia (AREA)
- Computing Systems (AREA)
- Computational Linguistics (AREA)
- Software Systems (AREA)
- Image Analysis (AREA)
Abstract
The invention discloses an accurate and robust target extraction method: a method for extracting foreground targets from video frames that realizes the extraction of external objects against a static or nearly static background. The method comprises three parts: 1) generating a superpixel map; 2) superpixel-based foreground target detection; 3) foreground target refinement. Compared with other existing methods, the advantages of the invention are that the target and its boundary can be extracted accurately, with high efficiency and good robustness.
Description
Technical field
The present invention relates to an accurate and robust target extraction method, and in particular to a method for detecting and extracting external objects in a scene captured by a fixed camera, belonging to the field of video analysis.
Background technology
Target extraction is a commonly required step in video-related applications; its purpose is to automatically extract targets of interest from video frames. It is widely used in applications such as video surveillance, video compression and target recognition. In these applications the camera is fixed and cannot move or rotate, and the background is almost static and unchanged; any external object entering the field of view is detected and extracted as a "foreground target". A background is considered "almost static and unchanged" under the following assumptions: 1) the background in the video is static and unchanged, with no pixel motion at all; 2) the scene may contain some fairly regular object motion, for example leaves in the background swaying in the wind, or the camera mount shaking because of wind or other vibrations; 3) outdoors, the slow change of scene brightness caused by the drift of sunlight; and so on.
A common target extraction method is background subtraction based on a Gaussian mixture model, which is suited to scenes that change slowly. The basic idea of background subtraction is that a pixel is considered foreground if it deviates significantly from the background model. The shortcomings of background subtraction are: 1) the extracted foreground object is very coarse, and sometimes only part of the target can be detected and extracted; 2) when the number of pixels is very large the algorithm is very inefficient; 3) it is very sensitive to, i.e. easily disturbed by, background noise.
Summary of the invention
The technical problem to be solved by the invention is to provide an accurate and robust target extraction method. The basic idea of the method is: 1) express each video frame in a sparse form, i.e. convert each frame into a superpixel map; 2) use background subtraction to assign each superpixel to the foreground class or the background class; 3) refine the detected foreground targets using the graph-cut method.
The present invention adopts the following technical scheme to solve the above technical problem:
The present invention provides an accurate and robust target extraction method, comprising the following steps:
Step 1: input the video stream and generate the superpixel map corresponding to each video frame;
Step 2: detect foreground targets based on superpixels;
Step 3: refine the foreground targets detected in step 2.
As a further technical solution of the present invention, in step 1 a clustering method groups the pixels of each video frame into superpixels, so that the frame is converted into a superpixel-map representation, specifically:
1) Initialization: for video frame t, randomly select N points as seeds, denoted s_k^t, k = 0, 1, ..., N-1;
2) Clustering: using the seeds s_k^t, cluster the pixels of video frame t into N superpixels;
3) Seed update: compute the geometric center of the k-th superpixel k_t and take it as the new seed s_k^{t+1}.
As a further technical solution of the present invention, in step 2 each superpixel is represented by a Gaussian mixture model, and foreground targets are detected by background subtraction, specifically:
1) Initialization: if video frame t is not the initial frame, go to step 2); otherwise, for the k-th superpixel k_t, establish the Gaussian mixture model G(x) = Σ_{i=1}^{L} w_i · g_i(x; μ_i, σ_i), where L is the number of Gaussian components, w_i is the weight coefficient of the i-th component, and g_i(x; μ_i, σ_i) is the i-th Gaussian function with mean μ_i and standard deviation σ_i;
2) Classification: for the k-th superpixel k_t, compute the Mahalanobis distance d_i from k_t to each component g_i(x; μ_i, σ_i); if d_i exceeds the threshold (e.g. 2σ_i) for every component, k_t belongs to the foreground: go to step 4); otherwise it belongs to the background: go to step 3);
3) Update: update the Gaussian mixture model with a moving average;
4) Post-processing: perform the foreground-target refinement operation.
As a further technical solution of the present invention, step 3 comprises two sub-steps: 1) foreground target recovery; 2) boundary refinement.
As a further technical solution of the present invention, foreground target recovery uses the graph-cut method, i.e. it segments the superpixels into the two classes foreground and background, specifically:
1) generate a bounding box based on the current superpixel-map detection result;
2) build a graph G over all superpixels inside the bounding box, such that each superpixel is a node and any two spatially adjacent nodes are connected by an edge;
3) improve the segmentation result on the graph G using the standard graph-cut method.
As a further technical solution of the present invention, the steps of boundary refinement are as follows:
1) for each foreground target recovered in the superpixel map, draw a narrow band along its boundary;
2) extract the fine pixel-level boundary on the original video frame using the graph-cut method;
3) output the boundary.
Adopting the above technical scheme, the present invention has the following technical effects compared with the prior art: the invention is a method for extracting foreground targets from video frames, realizing the extraction of external objects against a static or nearly static background. Its advantages are that the target and its boundary can be extracted accurately, with high efficiency and good robustness.
Description of the drawings
Fig. 1 is the flow chart of the method of the present invention.
Detailed description of the embodiments
The technical scheme of the present invention is described in further detail below with reference to the accompanying drawings:
The present invention proposes a method based on superpixels and graph cuts. Its basic idea is: 1) express each video frame in a sparse form, i.e. convert each frame into a superpixel map; 2) use background subtraction to assign each superpixel to the foreground class or the background class; 3) refine the detected foreground targets using the graph-cut method.
The technical scheme of the present invention is shown in Fig. 1:
A method for detecting external objects in a fixed-camera scene, suitable for execution on a computing device, comprising the following steps:
Step 1: convert each video frame into a superpixel-map representation.
In step 1, the pixels of a video frame are spatially clustered into groups according to their similarity at some scale; these groups are the so-called "superpixels". Clustering a large number of pixels into superpixels greatly reduces the data magnitude, so processing superpixels is far more efficient than processing individual pixels. In addition, superpixels describe an image more powerfully than pixels do. For this purpose, efficient superpixel clustering algorithms such as mean shift or the watershed method can be used.
The algorithm for generating the superpixel map is as follows:
1) Initialization: for video frame t, randomly select N points as seeds, denoted s_k^t, k = 0, 1, ..., N-1;
2) Clustering: using the seeds s_k^t, cluster the pixels of video frame t into N superpixels;
3) Seed update: compute the geometric center of the k-th superpixel k_t and take it as the new seed s_k^{t+1}.
For the k-th superpixel k_t in video frame t there is a corresponding superpixel k'_{t+1} in video frame t+1. To find the superpixel k'_{t+1} corresponding to k_t, we only need to track the superpixel generated by the seed s_k^{t+1} in video frame t+1, and this result is used to update the model in foreground detection.
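The seeded clustering loop of the algorithm above can be sketched in a few lines of numpy. This is only an illustrative sketch under assumed details — the feature vector (pixel position plus grey value), the fixed iteration count, and all function names are choices of this example, not of the patent:

```python
import numpy as np

def superpixel_segment(frame, n_seeds=16, n_iters=5, rng=None):
    """SLIC-like sketch of steps 1)-3): seed, cluster, re-seed.

    frame : 2-D array of grey values; each pixel's feature is
            (row, col, intensity), so clusters stay spatially compact.
    """
    rng = np.random.default_rng(rng)
    h, w = frame.shape
    rows, cols = np.mgrid[0:h, 0:w]
    feats = np.stack([rows.ravel(), cols.ravel(), frame.ravel()],
                     axis=1).astype(float)

    # 1) initialization: randomly choose N pixels as seeds
    seeds = feats[rng.choice(h * w, size=n_seeds, replace=False)]

    for _ in range(n_iters):
        # 2) clustering: assign every pixel to its nearest seed
        dists = np.linalg.norm(feats[:, None, :] - seeds[None, :, :], axis=2)
        labels = dists.argmin(axis=1)
        # 3) seed update: move each seed to its cluster's geometric center
        for k in range(n_seeds):
            members = feats[labels == k]
            if len(members):
                seeds[k] = members.mean(axis=0)
    return labels.reshape(h, w), seeds
```

Tracking a superpixel into frame t+1 then amounts to running the same loop on the next frame, initialized with the seeds returned for frame t.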
Step 2: detect foreground targets based on the superpixel map.
In superpixel-based foreground target detection, the existing pixel-based background subtraction approach can be reused. To this end each superpixel is represented by a Gaussian mixture model, i.e. a model formed by summing several Gaussian functions with certain weight ratios.
The k-th superpixel k_t in the background of video frame t is represented by a Gaussian mixture model (GMM) G_k^t. Under this model, if the superpixel k'_{t+1} in video frame t+1 corresponding to k_t is farther than some threshold (for example, twice the component's standard deviation) from every component of G_k^t, then k'_{t+1} is classified as foreground; otherwise it is classified as background and is used to update the model. Step 2 is described as follows:
1) Initialization: if video frame t is not the initial frame, go to step 2); otherwise, for the k-th superpixel k_t, establish the Gaussian mixture model G(x) = Σ_{i=1}^{L} w_i · g_i(x; μ_i, σ_i), where L is the number of Gaussian components, w_i is the weight coefficient of the i-th component, and g_i(x; μ_i, σ_i) is the i-th Gaussian function with mean μ_i and standard deviation σ_i.
2) Classification: for the k-th superpixel k_t, compute the Mahalanobis distance from k_t to each component g_i(x; μ_i, σ_i), denoted d_i. If d_i exceeds the threshold (e.g. 2σ_i) for every component, k_t belongs to the foreground: go to step 4); otherwise it belongs to the background: go to step 3).
3) Update: update the Gaussian mixture model with a moving average.
4) Post-processing: perform the foreground-target refinement operation.
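Sub-steps 1)-3) can be illustrated with one-dimensional Gaussian components over a superpixel's mean intensity. The class name, the choice of feature, and the fixed learning rate are assumptions of this sketch, not details fixed by the patent:

```python
import numpy as np

class SuperpixelGMM:
    """Mixture of 1-D Gaussians representing one background superpixel."""

    def __init__(self, means, stds, weights):
        self.means = np.asarray(means, dtype=float)
        self.stds = np.asarray(stds, dtype=float)
        # weights are kept for completeness; this sketch classifies
        # on distance alone, as in classification step 2)
        self.weights = np.asarray(weights, dtype=float)

    def is_foreground(self, value, k=2.0):
        # 1-D Mahalanobis distance |x - mu_i| / sigma_i to each component;
        # foreground only if the value is far from ALL components
        d = np.abs(value - self.means) / self.stds
        return bool(np.all(d > k))

    def update(self, value, alpha=0.05):
        # moving-average update of the closest component (background case)
        i = int(np.argmin(np.abs(value - self.means)))
        self.means[i] = (1 - alpha) * self.means[i] + alpha * value
```

A superpixel with mean intensity 150 against a single component N(100, 5²) is more than 2σ away and is classified as foreground; a value of 102 matches the component and instead updates it with the moving average.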
Step 3: refine the detected foreground targets.
Compared with traditional pixel-based target detection, the superpixel-based method has clear advantages in efficiency and robustness. It has, however, two drawbacks: 1) the detected target boundary is not very accurate, and often only part of the target is found; 2) the detected target boundary is very coarse. The boundary must therefore be refined to obtain a more accurate and finer result.
Refinement comprises two steps: 1) foreground target recovery; 2) boundary refinement. The purpose of foreground target recovery is to restore to the foreground those regions wrongly detected as background; the purpose of boundary refinement is to obtain a fine, pixel-level target boundary.
Foreground target recovery uses the graph-cut method, i.e. it segments the superpixels into the two classes foreground and background. In step 2 every superpixel is represented by its own Gaussian mixture model; here, all foreground superpixels together are represented by one Gaussian mixture model, and likewise all background superpixels by another. This representation describes the video frame well, so that superpixels wrongly classified as background can be restored to the foreground.
Foreground target recovery proceeds as follows:
1) generate a bounding box based on the current superpixel-map detection result;
2) build a graph G over all superpixels inside the bounding box, such that each superpixel is a node and any two spatially adjacent nodes are connected by an edge;
3) improve the segmentation result on the graph G using the standard graph-cut method.
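The recovery steps above can be illustrated with a tiny s-t min cut over superpixel nodes: terminal capacities encode how well each node fits the foreground or background mixture model, and pairwise edges encode smoothness between spatial neighbors. Production code would use an optimized max-flow solver (e.g. the Boykov-Kolmogorov algorithm); the plain Edmonds-Karp routine, the cost encoding, and all names below are illustrative assumptions:

```python
from collections import deque
import numpy as np

def min_cut_labels(n, t_links, n_links):
    """s-t min cut over n superpixel nodes (ids 0..n-1).

    t_links : list of (fg_cap, bg_cap) per node — capacities to the
              source (foreground) and sink (background) terminals.
    n_links : dict {(i, j): w} of pairwise smoothness weights.
    Returns a boolean array: True = foreground (source side of the cut).
    """
    S, T = n, n + 1
    cap = np.zeros((n + 2, n + 2))
    for i, (fg, bg) in enumerate(t_links):
        cap[S, i] = fg      # cost of labelling node i background
        cap[i, T] = bg      # cost of labelling node i foreground
    for (i, j), w in n_links.items():
        cap[i, j] = cap[j, i] = w

    flow = np.zeros_like(cap)
    while True:
        # BFS for an augmenting path in the residual graph (Edmonds-Karp)
        parent, q = {S: None}, deque([S])
        while q and T not in parent:
            u = q.popleft()
            for v in range(n + 2):
                if v not in parent and cap[u, v] - flow[u, v] > 1e-12:
                    parent[v] = u
                    q.append(v)
        if T not in parent:
            break
        path, v = [], T
        while parent[v] is not None:
            path.append((parent[v], v))
            v = parent[v]
        bottleneck = min(cap[u, v] - flow[u, v] for u, v in path)
        for u, v in path:
            flow[u, v] += bottleneck
            flow[v, u] -= bottleneck

    # nodes still reachable from the source are labelled foreground
    reach, q = {S}, deque([S])
    while q:
        u = q.popleft()
        for v in range(n + 2):
            if v not in reach and cap[u, v] - flow[u, v] > 1e-12:
                reach.add(v)
                q.append(v)
    return np.array([i in reach for i in range(n)])
```

With one node strongly preferring foreground, one strongly preferring background, and a weak link between them, the cut separates the two as expected.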
The result of foreground recovery is again a superpixel map, in which foreground superpixels are marked as foreground. Foreground recovery can usually restore almost the entire foreground target, but because it operates on superpixels the resulting boundary is not very fine, so the boundary must be refined further. The steps of boundary refinement are as follows:
1) for each foreground target recovered in the superpixel map, draw a narrow band along its boundary;
2) extract the fine pixel-level boundary on the original video frame using the graph-cut method;
3) output the boundary.
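One simple way to realize the narrow band of step 1) is to mark every pixel whose neighborhood contains both foreground and background, i.e. pixels straddling the current boundary; the pixel-level graph cut of step 2) is then restricted to this band. The window-based definition below is an assumption of this sketch, not the patent's prescription:

```python
import numpy as np

def boundary_band(mask, width=2):
    """Narrow band along the boundary of a binary foreground mask.

    A pixel belongs to the band when its (2*width+1)-square window
    contains both foreground and background pixels.
    """
    h, w = mask.shape
    pad = np.pad(mask.astype(bool), width, mode='edge')
    band = np.zeros((h, w), dtype=bool)
    size = 2 * width + 1
    for r in range(h):
        for c in range(w):
            win = pad[r:r + size, c:c + size]
            band[r, c] = bool(win.any() and not win.all())
    return band
```

Pixels deep inside the foreground or far out in the background fall outside the band, so the subsequent pixel-level cut only re-decides labels near the coarse superpixel boundary.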
The above are only specific embodiments of the present invention, but the scope of protection of the present invention is not limited thereto. Any transformation or replacement that a person skilled in the art can readily conceive within the technical scope disclosed by the invention shall be covered by the scope of protection of the present invention. The scope of protection of the invention shall therefore be subject to the scope of protection specified in the claims.
Claims (6)
1. An accurate and robust target extraction method, characterized by comprising the following steps:
Step 1: input the video stream and generate the superpixel map corresponding to each video frame;
Step 2: detect foreground targets based on superpixels;
Step 3: refine the foreground targets detected in step 2.
2. The accurate and robust target extraction method according to claim 1, characterized in that in step 1 a clustering method groups the pixels of each video frame into superpixels, so that the frame is converted into a superpixel-map representation, specifically:
1) Initialization: for video frame t, randomly select N points as seeds, denoted s_k^t, k = 0, 1, ..., N-1;
2) Clustering: using the seeds s_k^t, cluster the pixels of video frame t into N superpixels;
3) Seed update: compute the geometric center of the k-th superpixel k_t and take it as the new seed s_k^{t+1}.
3. The accurate and robust target extraction method according to claim 2, characterized in that in step 2 each superpixel is represented by a Gaussian mixture model and foreground targets are detected by background subtraction, specifically:
1) Initialization: if video frame t is not the initial frame, go to step 2); otherwise, for the k-th superpixel k_t, establish the Gaussian mixture model G(x) = Σ_{i=1}^{L} w_i · g_i(x; μ_i, σ_i), where L is the number of Gaussian components, w_i is the weight coefficient of the i-th component, and g_i(x; μ_i, σ_i) is the i-th Gaussian function with mean μ_i and standard deviation σ_i;
2) Classification: for the k-th superpixel k_t, compute the Mahalanobis distance d_i from k_t to each component g_i(x; μ_i, σ_i); if d_i exceeds the threshold (e.g. 2σ_i) for every component, k_t belongs to the foreground: go to step 4); otherwise it belongs to the background: go to step 3);
3) Update: update the Gaussian mixture model with a moving average;
4) Post-processing: perform the foreground-target refinement operation.
4. The accurate and robust target extraction method according to claim 1, characterized in that step 3 comprises two sub-steps: 1) foreground target recovery; 2) boundary refinement.
5. The accurate and robust target extraction method according to claim 4, characterized in that foreground target recovery uses the graph-cut method, i.e. it segments the superpixels into the two classes foreground and background, specifically:
1) generate a bounding box based on the current superpixel-map detection result;
2) build a graph G over all superpixels inside the bounding box, such that each superpixel is a node and any two spatially adjacent nodes are connected by an edge;
3) improve the segmentation result on the graph G using the standard graph-cut method.
6. The accurate and robust target extraction method according to claim 4, characterized in that the steps of boundary refinement are as follows:
1) for each foreground target recovered in the superpixel map, draw a narrow band along its boundary;
2) extract the fine pixel-level boundary on the original video frame using the graph-cut method;
3) output the boundary.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810184857.1A CN108549833A (en) | 2018-03-07 | 2018-03-07 | An accurate and robust target extraction method |
Publications (1)
Publication Number | Publication Date |
---|---|
CN108549833A true CN108549833A (en) | 2018-09-18 |
Family
ID=63516326
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201810184857.1A Withdrawn CN108549833A (en) | 2018-03-07 | 2018-03-07 | An accurate and robust target extraction method |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN108549833A (en) |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103164858A (en) * | 2013-03-20 | 2013-06-19 | 浙江大学 | Segmentation and tracking of merged crowds based on superpixels and a graph model |
CN103871076A (en) * | 2014-02-27 | 2014-06-18 | 西安电子科技大学 | Moving object extraction based on optical flow and superpixel segmentation |
KR101532320B1 (en) * | 2014-04-18 | 2015-07-22 | 국방과학연구소 | Method for detecting a moving object using a stereo camera installed in an autonomous vehicle |
CN106981068A (en) * | 2017-04-05 | 2017-07-25 | 重庆理工大学 | An interactive image segmentation method jointly using pixel pairs and superpixels |
CN107016691A (en) * | 2017-04-14 | 2017-08-04 | 南京信息工程大学 | Moving object detection based on superpixel features |
-
2018
- 2018-03-07 CN CN201810184857.1A patent/CN108549833A/en not_active Withdrawn
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Yao et al. | Multi-layer background subtraction based on color and texture | |
Chen et al. | Efficient hierarchical method for background subtraction | |
Desa et al. | Image subtraction for real time moving object extraction | |
CN110929593A (en) | Real-time significance pedestrian detection method based on detail distinguishing and distinguishing | |
Roy et al. | Real-time adaptive histogram min-max bucket (HMMB) model for background subtraction | |
Fu et al. | Context-aware and depthwise-based detection on orbit for remote sensing image | |
Angelo | A novel approach on object detection and tracking using adaptive background subtraction method | |
Devi et al. | A survey on different background subtraction method for moving object detection | |
CN112883827B (en) | Method and device for identifying specified target in image, electronic equipment and storage medium | |
CN108549833A (en) | An accurate and robust target extraction method | |
Wei et al. | An improved method of motion detection based on temporal difference | |
Tang et al. | Fast background subtraction using improved GMM and graph cut | |
He et al. | A Pedestrian Detection Method Using SVM and CNN Multistage Classification. | |
Chen et al. | Objects detecting based on adaptive background models and multiple cues | |
Lee et al. | Probabilistic background subtraction in a video-based recognition system. | |
Yu et al. | Background modeling with extracted dynamic pixels for pumping unit surveillance | |
Vaca-Castano et al. | Improving egocentric vision of daily activities | |
Davarpanah et al. | Using multi-scale filtering to initialize a background extraction model | |
Subetha et al. | Fusion-based Gaussian mixture model for background subtraction from videos | |
Fei et al. | A Review of Abnormal Personnel Behavior Detection for Safety Protection in Public Places | |
Zhong et al. | Robust background modeling via standard variance feature | |
Zhang et al. | Target Detection Algorithm Based on Region Segmentation in Moving Background | |
Lu et al. | Hierarchical foreground detection in dynamic background | |
Kiliroor et al. | Shadow Detection and Removal Using Dynamic Thresholding Approach for Driverless Car | |
Bulla et al. | Fake shadow detection using local HOG features |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
WW01 | Invention patent application withdrawn after publication | ||
Application publication date: 20180918 |