CN111915687A

CN111915687A - Background extraction method with depth information and color information

Info

Publication number: CN111915687A
Application number: CN202010669614.4A
Authority: CN
Inventors: 邱飞岳; 章国道; 陈宏�; 王丽萍; 姜弼君; 孔德伟; 潘毅; 李丽萍
Original assignee: Zhejiang University of Technology ZJUT
Current assignee: Zhejiang University of Technology ZJUT
Priority date: 2020-07-13
Filing date: 2020-07-13
Publication date: 2020-11-10
Anticipated expiration: 2040-07-13
Also published as: CN111915687B

Abstract

A background extraction method with depth information and color information belongs to the technical field of automatic detection of video sequences. It includes step 1, initialization, decomposing the image matrix with depth information in the first frame into image BGR matrix v¹Sum-depth image floating point type single-channel matrix d¹And use of v¹Initialize the ViBe algorithm, and d¹Attaching the matrix element by element to a depth model matrix D; step 2: sequentially reading in the t frame depth imageDecomposition of t-frame depth image into image BGR matrix v^tSum-depth image floating point single-channel type matrix d^tAnd processes both matrices. According to the method, whether the foreground depth distance is larger than the background depth distance or not can be judged through the background floating point type matrix with the depth information and the current floating point type matrix with the depth information, and then the ghost problem of the ViBe is detected; moreover, a high-quality background BGR matrix with color information can be obtained by utilizing a ViBe sample set; the method has extremely high calculation speed and is suitable for industrial application.

Description

Background extraction method with depth information and color information

Technical Field

The invention belongs to the technical field of automatic detection of video sequences, and particularly relates to a method for extracting a background under the condition of an image sequence with depth information.

Background

With the development of society, the number of mobile terminals with camera functions gradually increases, the related technologies of the terminal camera functions gradually increase, and with the development of machine vision, the subversive technologies of automatic driving, virtual reality and the like gradually develop, a 3D camera is adopted to perform object recognition, behavior recognition and more related applications of scene modeling.

Background modeling is fundamental work of sequence image analysis and is a hot problem for researchers at home and abroad at present. The modeling result has important influence on the subsequent processing of video image such as motion detection, moving target classification, tracking and behavior understanding. However, due to the difference of the actual application environments and the diversity of the background, it is difficult to establish a good background sample, so that in the actual application, different algorithms are needed to optimize the design scheme, and a good result can be obtained.

Since 2011 a paper of the ViBe algorithm is published, the ViBe algorithm is applied to moving target detection of a visual screen sequence, but the technique has a ghost problem (a detected moving object does not correspond to any real moving object), and the ghost problem can cause misjudgment of a downstream technique.

Disclosure of Invention

In view of the above problems in the prior art, an object of the present invention is to provide a background extraction method capable of extracting a background with color information and a background with depth information from an image sequence with depth information, and capable of well suppressing a ghost problem of a ViBe and efficiently updating a color information model.

The invention provides the following technical scheme:

a background extraction method with depth information and color information is characterized by comprising the following steps:

step 1, initialization, specifically comprising the following steps:

step 1.1, decomposing the image matrix with depth information in the first frame into an image BGR matrix v¹Sum-depth image floating point type single-channel matrix d¹；

Step 1.2, use v¹Initialize the ViBe algorithm, and d¹Attaching the matrix element by element to a model single-channel floating-point type matrix D with the same size;

step 2: sequentially reading in the depth image of the t-th frame, and decomposing the depth image of the t-th frame into an image BGR matrix v according to the operation in the step 1.1^tSum-depth image floating point single-channel type matrix d^tAnd processing the two matrixes according to the following steps:

step 2.1: using v^tUpdating the ViBe algorithm;

step 2.2: updating the matrix D;

step 2.3: judging whether the depth distance of the current pixel is closer than the depth distance of the background;

step 2.4: updating a ViBe sample space at the false detection position;

step 2.5: and (3) operating to obtain a color background BGR floating-point matrix B, and after the operation is finished, executing the step 2 again, wherein B is the background with color information, and D is the background with depth information.

The method for extracting the background with the depth information and the color information is characterized in that in the step 1.1, an image matrix v¹And depth image d¹The values of (A) and (B) are width × height.

The method for extracting the background with the depth information and the color information is characterized in that in the step 2.2, the matrix D is updated according to the following formula:

wherein,

representing depth floating point values of a depth moment floating point mono-channel matrix D at (x, y) at the t-th frame of a depth image sequence, D_x,yRepresenting the depth floating point value of the depth background matrix at (x, y), wherein alpha is a threshold value and is more than or equal to 0 and less than or equal to 1; wherein, α represents the weight of the current depth frame, and (1- α) is the weight of the background depth model.

The method for extracting the background with the depth information and the color information is characterized in that in the step 2.3, whether the current pixel depth distance is closer to the background depth distance is judged according to the following formula:

R_D＝(max-min)/T

wherein max and min are the maximum distance and the minimum distance measurable by the depth camera, respectively, T is the threshold value, when C is_x,yIf 0, the depth distance at (x, y) is farther than the background depth distance or close to the background depth, and if the place is judged as the foreground by the ViBe, the ViBe at the place will be judged as the foregroundThe background is misjudged as foreground, i.e. ghost.

The background extraction method with the depth information and the color information is characterized in that in the step 2.4, the ViBe sample space at the false detection position is updated according to the following formula:

wherein r is₁,r₂,r₃,r₄Are all random numbers, r₁,r₃∈[0,N)，r₂,r₄E [0, γ)), and omega (x, y) represents the coordinates of a random point in eight neighborhoods of coordinates (x, y), samples is the set of sample space matrices of ViBe,

represents the r-th sample value on (x, y), which is the color BGR value, N is the sample space size of each pixel of the ViBe algorithm, and γ is the sample update threshold of ViBe.

The background extraction method with depth information and color information is characterized in that in step 2.5, a color background BGR floating point matrix B is obtained according to the following formula:

wherein, B_x，yAnd (3) the BGR value of the color background BGR floating-point matrix B at the position (x, y) is a three-dimensional vector, the summation mode at the position is the addition of corresponding components, and after the operation is finished, the step 2 is executed again, wherein in the process, B is the background with color information, and D is the background with depth information.

By adopting the technology, compared with the prior art, the invention has the following beneficial effects:

the method can obtain the background floating point type matrix with the depth information in the image sequence with the depth information through calculation, judge whether the foreground depth distance is larger than the background depth distance through the background floating point type matrix with the depth information and the current floating point type matrix with the depth information, further detect the ghost problem of the ViBe, and then update the ViBe sample set to enable the color information in the ViBe sample to be closer to the background, so that the ViBe sample set is utilized to obtain the high-quality background BGR matrix with the color information.

Detailed Description

In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention is further described in detail with reference to the following embodiments. It should be understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention.

On the contrary, the invention is intended to cover alternatives, modifications, equivalents and alternatives which may be included within the spirit and scope of the invention as defined by the appended claims. Furthermore, in the following detailed description of the present invention, certain specific details are set forth in order to provide a better understanding of the present invention. It will be apparent to one skilled in the art that the present invention may be practiced without these specific details.

Step 1: initializing, specifically comprising the following steps:

step 1.1: we use RealSenseDepthParametric D435 depth camera to extract depth video, and decompose the image matrix with depth information in the first frame into image BGR matrix v through RealSensesSDK toolkit¹Sum-depth image floating point type single-channel matrix d¹The sizes of the two are width multiplied height;

step 1.2: using v¹Initialize the ViBe algorithm, and d¹Attaching the matrix element by element to a depth model single-channel floating-point matrix D with the same size;

step 2: according to step 1.1, reading in the t frame depth image in sequence and adding depthDecomposition of degree image into image BGR matrix v^tSum-depth image floating point single-channel type matrix d^tAnd processing the two matrixes according to the following steps:

step 2.1: using v^tUpdating the ViBe algorithm;

step 2.2: updating the matrix D according to equation (1):

wherein,

representing depth floating point values of a depth moment floating point mono-channel matrix D at (x, y) at the t-th frame of a depth image sequence, D_x,yRepresenting a depth floating point value of the depth background matrix at (x, y), where α is a threshold value, and α is greater than or equal to 0 and less than or equal to 1, and a value of α needs to be defined according to an actual application environment, and is usually defined as α being 0.15;

step 2.3: according to the formula (2), judging whether the depth distance of the current pixel is closer than the depth distance of the background:

R_D＝(max-min)/T (4)

wherein max and min are the longest distance and the shortest distance that can be measured by the depth camera, respectively, under the RealSenseDepthCameraD435 depth camera, max is 10, min is 0.1, T is the threshold, and T is 100, if C_x,yIf the distance of the depth at the position (x, y) is more distant than the depth of the background or close to the depth of the background, and if the position is judged as the foreground by the ViBe, the ViBe judges the background as the foreground by mistake, namely ghost image;

step 2.4: updating the ViBe sample space according to the formulas (5) and (6):

representing the r-th sample value on (x, y), which is a color BGR value, N is a sample space size of each pixel of the ViBe algorithm, N is 20 according to the ViBe algorithm, γ is a sample update threshold of the ViBe, and γ is 16 according to the ViBe algorithm;

step 2.5: the color background BGR floating-point matrix B is obtained according to the following formula:

wherein, B_x，yAnd (3) adding corresponding components for the BGR value of the color background BGR floating-point matrix B at (x, y), and executing the step 2 again after the operation is finished, wherein B is the background with color information, and D is the background with depth information.

The above description is only for the purpose of illustrating the preferred embodiments of the present invention and is not to be construed as limiting the invention, and any modifications, equivalents and improvements made within the spirit and principle of the present invention are intended to be included within the scope of the present invention.

Claims

1. A background extraction method with depth information and color information is characterized by comprising the following steps:

step 1, initialization, specifically comprising the following steps:

step 2.1: using v^tUpdating the ViBe algorithm;

step 2.2: updating the matrix D;

step 2.4: updating a ViBe sample space at the false detection position;

2. A background extraction method with depth information and color information as claimed in claim 1, wherein in step 1.1, the image matrix v is¹And depth image d¹The values of (A) and (B) are width × height.

3. A background extraction method with depth information and color information as claimed in claim 1, characterized in that in step 2.2, the matrix D is updated according to the following formula:

wherein,

4. The method as claimed in claim 1, wherein in step 2.3, it is determined whether the current pixel depth distance is closer than the background depth distance according to the following formula:

R_D＝(max-min)/T

wherein max and min are the maximum distance and the minimum distance measurable by the depth camera, respectively, T is the threshold value, when C is_x,yIf the distance of the depth at (x, y) is greater than the background depth distance or closer to the background depth, and if the place is judged as the foreground by the ViBe, the background is judged as the foreground by the ViBe, which is a ghost image.

5. The method as claimed in claim 1, wherein in step 2.4, the ViBe sample space at the false detection point is updated according to the following formula:

6. The method of claim 1, wherein in the step 2.5, the color background BGR floating point matrix B is obtained according to the following formula: