CN112464886A - Aircraft identification tracking method - Google Patents
- Publication number
- CN112464886A (application number CN202011465723.0A)
- Authority
- CN
- China
- Prior art keywords
- aircraft
- prediction
- boundary box
- information
- detection
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/40—Scenes; Scene-specific elements in video content
- G06V20/46—Extracting features or characteristics from the video content, e.g. video fingerprints, representative shots or key frames
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/40—Scenes; Scene-specific elements in video content
- G06V20/41—Higher-level, semantic clustering, classification or understanding of video scenes, e.g. detection, labelling or Markovian modelling of sport events or news items
- G06V20/42—Higher-level, semantic clustering, classification or understanding of video scenes, e.g. detection, labelling or Markovian modelling of sport events or news items of sport video content
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/60—Type of objects
- G06V20/62—Text, e.g. of license plates, overlay texts or captions on TV images
Abstract
The invention discloses an aircraft identification and tracking method, which comprises the following steps: detecting and identifying video data containing an aircraft by using YOLOv3, and if an aircraft is identified, recording its bounding-box information and center-point information; constructing a flight state vector of the aircraft, wherein the flight state vector comprises the center-point information and the bounding-box information; predicting the motion trajectory of the aircraft from the flight state vector through a Kalman filter; and correcting the motion-trajectory prediction of the aircraft. The invention determines a specific range in each data frame by comparing the position information detected in a given image frame with the information predicted by the Kalman filter, thereby improving tracking speed. In addition, when the predicted and detected bounding-box information are found to differ significantly, the bounding box can be adjusted in time; the method therefore applies a feedback mechanism to aircraft identification and tracking.
Description
Technical Field
The invention relates to the field of computer vision processing, in particular to a method for identifying and tracking objects in video data.
Background
The tracking and identification of aircraft is an important technology in the field of national aviation safety, and the identification and tracking of fighter aircraft in particular is of great significance: strengthening national airspace security through such scientific and technological means in the military field has attracted national attention. Target tracking, a very important part of computer vision, provides the technology for identifying and tracking an aircraft in acquired video images.
In recent years, with the development of deep learning techniques and computing power, applications based on deep learning have multiplied; deep learning algorithms are widely applied in computer vision, and deep-learning-based target detection, identification and tracking have developed rapidly. Compared with traditional methods, deep-learning-based recognition and tracking greatly improves both accuracy and tracking performance. Current mainstream object detection strategies fall into two categories: region-proposal-based (two-stage) methods and regression-based (one-stage) methods. However, the accuracy of aircraft tracking is mainly affected by complex environments: the aircraft moves fast, so the difference between consecutive frame images is large, and visual tracking algorithms still face challenging problems such as sudden motion, attitude change, object occlusion, background clutter, and illumination or viewpoint changes, all of which can degrade tracking performance or even cause tracking to fail; these are the external factors affecting the aircraft tracking effect. Therefore, how to identify and track aircraft more reliably and accurately is a problem to be solved at present.
Disclosure of Invention
The invention aims to solve the technical problem of how to more reliably and accurately identify and track an aircraft, and provides an aircraft identification and tracking method.
The invention solves the technical problems through the following technical scheme:
an aircraft identification tracking method, the method comprising:
detecting and identifying video data containing an aircraft by using YOLOv3, and if the video data are judged to be the aircraft, recording boundary box information and center point information of the aircraft;
constructing a flight state vector of the aircraft, wherein the flight state vector comprises the central point information and the boundary box information;
predicting the motion trail of the aircraft by the flight state vector through a Kalman filter:
when the trajectory of the center point between adjacent images of the aircraft is a straight line, a linear model is adopted for the position vector of the aircraft center point; the Kalman gain is calculated from the uncertainty of the prediction and that of the detection at the current moment, the prediction result of the previous moment is fused with the detection result of the current moment, and the uncertainty of the motion-trajectory prediction at the current moment is determined;
when the track of the central point between the adjacent images of the aircraft is a nonlinear track, an extended Kalman filter is adopted for prediction, a nonlinear model is adopted for the position vector of the central point of the aircraft, and then the uncertainty of the motion track prediction at the current moment is determined;
combining the linear model prediction part and the nonlinear model prediction part to predict the motion trail of the aircraft;
correcting the motion trajectory prediction of the aircraft;
judging whether the bounding box can detect the aircraft: when the deviation between the prediction result and the detection result is large, the effective range is limited according to the size of the target object to improve the detection speed; if the overlap between the predicted bounding box and the detected bounding box of the aircraft is larger than a predefined threshold, the position and size of the currently detected bounding box are modified according to the predicted bounding box of the previous frame; otherwise, the bounding box is re-determined with the aircraft at its center and input into the detection network for training.
Preferably, the video data is processed into a video with a fixed length, and is combined into a data set to train the detection network, where the data set includes: a training set for training and a validation set for validation.
Further, when the position vector of the aircraft center point is approximated by a linear model, the state transition matrix and the observation matrix are constants; when the position vector of the aircraft center point is approximated by a nonlinear model, the state transition matrix and the observation matrix are respectively the Jacobian matrices:

F_k = ∂f/∂x evaluated at (x̂_{k-1|k-1}, u_k), H_k = ∂h/∂x evaluated at x̂_{k|k-1}

wherein F_k is the state transition matrix and H_k is the observation matrix, f and h are the state transition and observation functions, x̂ is the state estimate, the subscript k denotes the k-th moment, and u_k is the control vector.
Further, the correction includes a bounding-box correction formula, wherein δ_b, δ_a respectively represent the confidences of the detection results in the adjacent images (the higher the confidence value, the higher the accuracy of the model); w_a, w_b, w_c respectively represent the widths of the bounding boxes in the adjacent images and of the corrected bounding box; h_a, h_b, h_c respectively represent the corresponding heights; and (x_a, y_a), (x_b, y_b), (x_c, y_c) respectively represent the center coordinates of the bounding boxes in the adjacent images and of the corrected bounding box.
On the basis of the common knowledge in the field, the above preferred conditions can be combined randomly to obtain the preferred embodiments of the invention.
The positive progress effects of the invention are as follows: by comparing the position information detected in a given image frame with the information predicted by the Kalman filter, a specific range is established within the data frame, which improves tracking speed. In addition, when the predicted and detected bounding-box information are found to differ significantly, the bounding box can be adjusted in time; the method thus applies a feedback mechanism to the identification and tracking of the aircraft.
Drawings
FIG. 1 is a flowchart of a method in an embodiment of a method for identifying and tracking an aircraft according to the invention;
fig. 2 is a schematic diagram of a frame correction method according to an embodiment of an aircraft identification and tracking method of the present invention.
Detailed Description
To facilitate an understanding of the present application, the present application will now be described more fully with reference to the accompanying drawings. Preferred embodiments of the present application are shown in the drawings. This application may, however, be embodied in many different forms and should not be construed as limited to the embodiments set forth herein. Rather, these embodiments are provided so that this disclosure will be thorough and complete.
Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this application belongs. The terminology used herein in the description of the present application is for the purpose of describing particular embodiments only and is not intended to be limiting of the application. As used herein, the term "and/or" includes any and all combinations of one or more of the associated listed items.
FIG. 1 shows a flowchart of a method for identifying and tracking an aircraft according to the present invention:
s01: detecting and identifying video data containing an aircraft by using YOLOv3, and if the video data are judged to be the aircraft, recording boundary box information and center point information of the aircraft;
in one example, YOLOv3 first resizes the input picture to 416×416 and extracts features through a feature extraction network to obtain a feature map of a certain size, such as 13×13; the input image is correspondingly divided into a 13×13 grid, and the grid cell in which the center coordinate of a ground-truth (GT) target falls is responsible for predicting that target. Each grid cell predicts 3 bounding boxes. The type of the detected object is then determined from the values of a bounding box, where the box contains four values v = (v_x, v_y, v_w, v_h) used for subsequent calculation: v_x, v_y are the coordinates of the center point of the object, and v_w, v_h are the width and height of the bounding box;
dividing the ROI (Region of Interest) into four sub-regions, i.e., upper left, lower left, upper right, and lower right, which serve as a score map, the vertices of the bounding box can be represented respectively as follows, wherein B_tl represents the vertex of the upper-left corner, B_tr the vertex of the upper-right corner, B_bl the vertex of the lower-left corner, and B_br the vertex of the lower-right corner.
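The vertex computation just described can be sketched in Python. The helper below is illustrative only (the function name and tuple layout are assumptions, not from the patent); it converts a detection v = (v_x, v_y, v_w, v_h) into the four vertices B_tl, B_tr, B_bl, B_br, assuming image coordinates with y increasing downward:

```python
def bbox_corners(v):
    """Convert a YOLO-style detection v = (v_x, v_y, v_w, v_h), i.e. the
    box center plus width/height, into its four corner vertices
    (upper-left, upper-right, lower-left, lower-right)."""
    vx, vy, vw, vh = v
    half_w, half_h = vw / 2.0, vh / 2.0
    b_tl = (vx - half_w, vy - half_h)  # upper-left corner
    b_tr = (vx + half_w, vy - half_h)  # upper-right corner
    b_bl = (vx - half_w, vy + half_h)  # lower-left corner
    b_br = (vx + half_w, vy + half_h)  # lower-right corner
    return b_tl, b_tr, b_bl, b_br
```

Each vertex then falls into one of the four ROI sub-regions used as the score map.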
S02: constructing a flight state vector of the aircraft, wherein the flight state vector comprises the central point information and the boundary box information;
in one example, a flight state vector of the aircraft is constructed to describe the motion state of the aircraft, in which the center-point components describe the position of the center point of the object, and the components s and r describe the scale of the bounding box and the aspect ratio of the aircraft.
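As a concrete illustration, the state vector can be built as below. The exact component layout is not reproduced in this text, so the sketch follows the common SORT-style convention as an assumption: center point (c_x, c_y), box area s as the scale, aspect ratio r, and velocity terms initialised to zero for a newly detected aircraft.

```python
def make_state_vector(cx, cy, w, h):
    """Build a flight state vector from a detected bounding box.
    Layout (assumed): [c_x, c_y, s, r, vc_x, vc_y, vs], where s = w*h is
    the box scale, r = w/h the aspect ratio, and the last three entries
    are the corresponding velocities, zero for a new track."""
    s = w * h            # scale: area of the bounding box
    r = w / float(h)     # aspect ratio of the bounding box
    return [cx, cy, s, r, 0.0, 0.0, 0.0]
```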
S03: predicting the motion trail of the aircraft by the flight state vector through a Kalman filter;
in one example, (1) the center-point position components can be approximated by a linear model, because the motion trajectory between adjacent images can be treated as a straight line; these parameters are extracted separately from the constructed state vector to describe the moving target.
x_k = A x_{k-1} + B u_k + w_k (1)

z_k = H x_k + v_k (2)
Wherein x represents the state vector of the system, z is the observed value, and A is the state transition matrix; because the invention is applied to an uncontrolled system, the control terms formed by B and u can be ignored. H is the observation model, i.e., the observation matrix, which maps the true state into the observation space. w and v represent the noise during state update and during observation, respectively. The subscript k on all of the above parameters denotes the k-th moment.
(2) According to the principle of the Kalman filter, the prediction step is:

x̂_{k|k-1} = A x̂_{k-1|k-1} + B u_k (3)

P_{k|k-1} = A P_{k-1|k-1} Aᵀ + Q (4)

wherein P is the error covariance matrix between the predicted value and the actual value, representing the uncertainty of the prediction result, and Q is the uncertainty newly added during the prediction process. Equation 3 states that the current state is determined jointly by the state at the previous moment and the external input, and Equation 4 states that the previously existing uncertainty is propagated and grows by the new uncertainty Q.
(3) The Kalman gain K and the state estimate x̂ at the k-th moment can be calculated from the following formulas:

K_k = P_{k|k-1} Hᵀ (H P_{k|k-1} Hᵀ + R)⁻¹ (5)

x̂_{k|k} = x̂_{k|k-1} + K_k (z_k - H x̂_{k|k-1}) (6)

Equation 5 computes the Kalman gain K from the uncertainty P_{k|k-1} of the prediction result and the uncertainty R of the observation. Equation 6 forms a weighted average of the prediction and the observation to obtain the state estimate at the current moment.
The components s and r, which are not suited to the linear model, are predicted with an extended Kalman filter. The state vector is created in the same way as in the second step; the state matrix is no longer a constant matrix, and the update is represented by:

F_k = ∂f/∂x evaluated at (x̂_{k-1|k-1}, u_k), H_k = ∂h/∂x evaluated at x̂_{k|k-1}

wherein the matrices F_k and H_k are derived as Jacobian matrices of the state transition function f and the observation function h: F_k corresponds to the matrix A in the Kalman filter, and H_k corresponds to the constant matrix H.
Adding the inference of the nonlinear part into the linear inference system then fully describes the motion state of the aircraft.
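For the nonlinear s and r components, F_k and H_k are Jacobians of the state-transition and observation functions. As an illustrative aid only (not part of the patent), the sketch below approximates such a Jacobian numerically by central differences, which is useful for prototyping or for checking an analytic derivation:

```python
import numpy as np

def numerical_jacobian(f, x, eps=1e-6):
    """Approximate J = df/dx at x by central differences, a stand-in for
    the analytic Jacobian used for the nonlinear EKF components."""
    x = np.asarray(x, dtype=float)
    fx = np.asarray(f(x))
    J = np.zeros((fx.size, x.size))
    for i in range(x.size):
        dx = np.zeros_like(x)
        dx[i] = eps
        J[:, i] = (np.asarray(f(x + dx)) - np.asarray(f(x - dx))) / (2 * eps)
    return J
```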
S04: correcting the motion trajectory prediction of the aircraft;
as shown in fig. 1 and 2, in one example, it is judged whether the bounding box correctly detects the airplane; if the overlap between the predicted bounding box of the airplane and the actually detected bounding box in the given image is greater than a predefined threshold, the state variable is updated again according to the detected bounding-box information, with the specific correction formula as follows:
wherein δ_b, δ_a respectively represent the confidences of the detection results in the front and rear images (the higher the confidence value, the higher the accuracy of the model); w_a, w_b, w_c respectively represent the widths of the bounding boxes in the front and rear images and of the corrected bounding box; h_a, h_b, h_c respectively represent the corresponding heights; and (x_a, y_a), (x_b, y_b), (x_c, y_c) respectively represent the center coordinates of the bounding boxes in the front and rear images and of the corrected bounding box.
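The correction formula itself is not reproduced in this text, so the following Python sketch shows one plausible reading under stated assumptions: the predicted box a and the detected box b are first tested for sufficient IoU overlap against the predefined threshold, then fused by a confidence-weighted average using δ_a and δ_b. The weighted-average form, the function names, and the (x, y, w, h) box layout are assumptions, not the patent's exact formula:

```python
def iou(a, b):
    """Overlap of two boxes given as (x, y, w, h) with (x, y) the center."""
    ax1, ay1, ax2, ay2 = a[0] - a[2] / 2, a[1] - a[3] / 2, a[0] + a[2] / 2, a[1] + a[3] / 2
    bx1, by1, bx2, by2 = b[0] - b[2] / 2, b[1] - b[3] / 2, b[0] + b[2] / 2, b[1] + b[3] / 2
    iw = max(0.0, min(ax2, bx2) - max(ax1, bx1))
    ih = max(0.0, min(ay2, by2) - max(ay1, by1))
    inter = iw * ih
    union = a[2] * a[3] + b[2] * b[3] - inter
    return inter / union if union > 0 else 0.0

def correct_box(box_a, delta_a, box_b, delta_b, threshold=0.5):
    """If predicted box_a and detected box_b overlap enough, fuse them with
    confidence weights (assumed form of the correction); otherwise return
    None, signalling that the box must be re-centred on the aircraft and
    fed back to the detection network."""
    if iou(box_a, box_b) <= threshold:
        return None
    wsum = delta_a + delta_b
    return tuple((delta_a * ca + delta_b * cb) / wsum
                 for ca, cb in zip(box_a, box_b))
```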
In one example, video data containing airplane motion is gathered, uniformly processed into fixed-length video, and composed into a data set, wherein 80% of the data is input into the detection network as a training set and verified through 20% of the data.
While specific embodiments of the invention have been described above, it will be appreciated by those skilled in the art that this is by way of example only, and that the scope of the invention is defined by the appended claims. Various changes and modifications to these embodiments may be made by those skilled in the art without departing from the spirit and scope of the invention, and these changes and modifications are within the scope of the invention.
Claims (4)
1. An aircraft identification tracking method, characterized in that the method comprises:
detecting and identifying video data containing an aircraft by using YOLOv3, and if the video data are judged to be the aircraft, recording boundary box information and center point information of the aircraft;
constructing a flight state vector of the aircraft, wherein the flight state vector comprises the central point information and the boundary box information;
predicting the motion trail of the aircraft by the flight state vector through a Kalman filter:
when the trajectory of the center point between adjacent images of the aircraft is a straight line, a linear model is adopted for the position vector of the aircraft center point; the Kalman gain is calculated from the uncertainty of the prediction and that of the detection at the current moment, the prediction result of the previous moment is fused with the detection result of the current moment, and the uncertainty of the motion-trajectory prediction at the current moment is determined;
when the track of the central point between the adjacent images of the aircraft is a nonlinear track, an extended Kalman filter is adopted for prediction, a nonlinear model is adopted for the position vector of the central point of the aircraft, and then the uncertainty of the motion track prediction at the current moment is determined;
combining the linear model prediction part and the nonlinear model prediction part to predict the motion trail of the aircraft;
correcting the motion trajectory prediction of the aircraft;
judging whether the bounding box can detect the aircraft: when the deviation between the prediction result and the detection result is large, the effective range is limited according to the size of the target object to improve the detection speed; if the overlap between the predicted bounding box and the detected bounding box of the aircraft is larger than a predefined threshold, the position and size of the currently detected bounding box are modified according to the predicted bounding box of the previous frame; otherwise, the bounding box is re-determined with the aircraft at its center and input into the detection network for training.
2. The aircraft identification and tracking method according to claim 1, wherein the video data is processed into fixed-length video and composed into a data set to train the detection network, the data set comprising: a training set for training and a validation set for validation.
3. The method of claim 2, wherein the state transition matrix and the observation matrix are constants when the position vector of the aircraft center point is approximated by a linear model; when the position vector of the aircraft center point is approximated by a nonlinear model, the state transition matrix and the observation matrix are respectively the Jacobian matrices F_k = ∂f/∂x and H_k = ∂h/∂x of the state transition and observation functions, evaluated at the current state estimate.
4. The method of claim 2, wherein said correction comprises a frame correction formula:
wherein δ_b, δ_a respectively represent the confidences of the detection results in the adjacent images (the higher the confidence value, the higher the accuracy of the model); w_a, w_b, w_c respectively represent the widths of the bounding boxes in the adjacent images and of the corrected bounding box; h_a, h_b, h_c respectively represent the corresponding heights; and (x_a, y_a), (x_b, y_b), (x_c, y_c) respectively represent the center coordinates of the bounding boxes in the adjacent images and of the corrected bounding box.
Priority Applications (1)
- CN202011465723.0A (priority date 2020-12-14, filing date 2020-12-14): Aircraft identification tracking method
Publications (1)
- CN112464886A (publication date 2021-03-09)
Family
- ID=74804071
Family Applications (1)
- CN202011465723.0A (filed 2020-12-14), publication CN112464886A, status: pending (CN)
Citations (6)
- CN108492324A (priority 2018-01-27, published 2018-09-04, 天津大学): Aircraft method for tracing based on fully-connected network and Kalman filter
- CN110532889A (priority 2019-08-02, published 2019-12-03, 南京理工大学): Track foreign matter detecting method based on rotor unmanned aircraft and YOLOv3
- CN110738690A (priority 2019-09-16, published 2020-01-31, 南京理工大学): Unmanned aerial vehicle video middle vehicle speed correction method based on multi-target tracking framework
- CN110751099A (priority 2019-10-22, published 2020-02-04, 东南大学): Unmanned aerial vehicle aerial video track high-precision extraction method based on deep learning
- CN110956653A (priority 2019-11-29, published 2020-04-03, 中国科学院空间应用工程与技术中心): Satellite video dynamic target tracking method with fusion of correlation filter and motion estimation
- CN111681259A (priority 2020-05-17, published 2020-09-18, 天津理工大学): Vehicle tracking model establishing method based on Anchor-free mechanism detection network
Non-Patent Citations (1)
- 庄建: 《深度学习图像识别技术》, 机械工业出版社, 29 February 2020
Legal Events
- PB01: Publication (application publication date: 2021-03-09)
- SE01: Entry into force of request for substantive examination
- RJ01: Rejection of invention patent application after publication