CN111723672B - Method and device for acquiring video recognition driving track and storage medium - Google Patents

Method and device for acquiring video recognition driving track and storage medium Download PDF

Info

Publication number
CN111723672B
Authority
CN
China
Prior art keywords
data
video
model
training
driving track
Prior art date
Legal status
Active
Application number
CN202010447021.3A
Other languages
Chinese (zh)
Other versions
CN111723672A (en
Inventor
王晓飞
郭凯
曾彦杰
Current Assignee
South China University of Technology SCUT
Original Assignee
South China University of Technology SCUT
Priority date
Filing date
Publication date
Application filed by South China University of Technology SCUT filed Critical South China University of Technology SCUT
Priority to CN202010447021.3A priority Critical patent/CN111723672B/en
Publication of CN111723672A publication Critical patent/CN111723672A/en
Application granted granted Critical
Publication of CN111723672B publication Critical patent/CN111723672B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Classifications

    (CPC, all under G: PHYSICS › G06: COMPUTING; CALCULATING OR COUNTING)
    • G06V20/41: Higher-level, semantic clustering, classification or understanding of video scenes, e.g. detection, labelling or Markovian modelling of sport events or news items
    • G06N3/045: Combinations of networks (neural network architecture, e.g. interconnection topology)
    • G06N3/08: Learning methods (neural networks)
    • G06V20/48: Matching video sequences
    • G06V20/52: Surveillance or monitoring of activities, e.g. for recognising suspicious objects

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Multimedia (AREA)
  • Software Systems (AREA)
  • General Health & Medical Sciences (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • Biophysics (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Biomedical Technology (AREA)
  • Mathematical Physics (AREA)
  • Artificial Intelligence (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Image Analysis (AREA)

Abstract

The invention discloses a method, a device and a storage medium for acquiring a driving track by video recognition, comprising a data acquisition step, a data set division step, a data calibration and preprocessing step, a deep learning model training step, and a step of evaluating the deep learning model with a validation set to determine an optimal convolutional neural network structure model. Unlike methods based on virtual driving, whose vehicle trajectories deviate substantially from reality, the method achieves higher accuracy.

Description

Method and device for acquiring video recognition driving track and storage medium
Technical Field
The invention relates to the field of traffic information processing, and in particular to a method, a device and a storage medium for acquiring a driving track by video identification.
Background
The driving track is of great significance to every driver: it reveals the specific route a vehicle has taken, and it can serve as evidence when adjudicating traffic accidents. In the fields of road design and road traffic safety, the driving track provides a theoretical basis for route design, and the degree to which trajectories deviate is closely related to road safety.
Current methods for acquiring a vehicle's driving track fall into two categories. The first acquires GPS information through terminal software, stores the data and stitches the GPS data into an image; this approach depends heavily on GPS precision, and centimeter-level GPS accuracy is currently difficult to achieve for a vehicle moving at high speed. The second builds a three-dimensional model of the real road environment and obtains vehicle trajectories through a multi-degree-of-freedom driving simulator; however, simulated driving experiments can hardly reproduce real road conditions completely, so the resulting data differ to some extent from natural driving experiments.
Accordingly, a new way of acquiring driving trajectories is urgently needed to overcome the above drawbacks.
Disclosure of Invention
To overcome the defects and shortcomings of the prior art, the invention provides a method, a device and a storage medium for acquiring a driving track by video recognition based on deep learning.
The invention adopts the following technical scheme:
a video identification driving track obtaining method comprises the following steps:
acquiring data, namely acquiring expressway video data, wherein the video data covers different monitoring time periods, monitoring scenes and monitoring line shapes;
dividing a data set, namely converting the video data into a picture data set at 20 frames per second, and dividing the picture data set into a training set for training the model, a validation set for preliminary evaluation of training results, and a test set;
calibrating and preprocessing the data, which comprises cutting the video into pictures at 20 frames per second, using a calibration tool to mark the serial number and position of each vehicle in a picture so as to obtain vehicle serial-number and coordinate information, and then preprocessing the obtained information;
training a deep learning model to preliminarily obtain a driving track recognition model;
and evaluating the deep learning model by adopting a verification set to determine an optimal convolutional neural network structure model.
Preferably, the deep learning model is a convolutional neural network structure model comprising convolutional layers, pooling layers and a fully connected layer: the convolutional layers generate feature data of the image, the pooling layers perform aggregation statistics on the feature data, and the fully connected layer combines the groups of pooled features into one group of signal data output for image category recognition.
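As an illustrative sketch (not part of the patent), the spatial size of the feature data produced by each convolution or pooling stage follows the standard output-size formula; the input size and layer parameters below are assumptions for illustration only:

```python
def conv_out(size, kernel, stride=1, pad=0):
    """Side length of the square output of a convolution/pooling stage."""
    return (size + 2 * pad - kernel) // stride + 1

# Hypothetical 224x224 input: a 3x3 convolution with padding 1 keeps the size,
# then a 2x2 pooling with stride 2 halves it.
size = 224
size = conv_out(size, kernel=3, pad=1)     # 224
size = conv_out(size, kernel=2, stride=2)  # 112
print(size)
```
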
Preferably, the loss function of the convolutional neural network structure model is the mean square error (MSE) between the accurate values $(X_i, Y_i)$ obtained by calibration of the training set and the predicted values $(X'_i, Y'_i)$ obtained from the preliminary model:

$$\mathrm{MSE} = \frac{1}{n}\sum_{i=1}^{n}\left[(X_i - X'_i)^2 + (Y_i - Y'_i)^2\right]$$
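A minimal sketch of this evaluation criterion, assuming the loss is the mean square error over calibrated and predicted (X, Y) coordinate pairs:

```python
def mse_loss(accurate, predicted):
    """Mean square error between calibrated (Xi, Yi) and predicted (X'i, Y'i) points."""
    n = len(accurate)
    return sum((x - xp) ** 2 + (y - yp) ** 2
               for (x, y), (xp, yp) in zip(accurate, predicted)) / n

# Two points, each predicted 1 pixel off in x only: MSE = 1.0
print(mse_loss([(0, 0), (2, 2)], [(1, 0), (3, 2)]))
```
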
Preferably, the monitoring time period refers to three periods, namely morning, noon and evening, covering different illumination and brightness conditions;
the monitoring scene refers to trajectory data of vehicles running under free-flow, stable-flow and congested-flow conditions;
the monitoring line shape refers to the road alignment, specifically straight, uphill and curved sections.
Preferably, the data preprocessing comprises feature extraction, feature dimension reduction, feature null value processing, feature conversion and feature normalization.
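Two of these steps, null-value handling and normalization, can be sketched as follows; the exact operations are not specified in the patent, so these are common choices (mean imputation and min-max scaling):

```python
def fill_nulls(values):
    """Replace missing (None) feature values with the mean of the present ones."""
    present = [v for v in values if v is not None]
    mean = sum(present) / len(present)
    return [mean if v is None else v for v in values]

def min_max_normalize(values):
    """Scale a feature column to the range [0, 1]."""
    lo, hi = min(values), max(values)
    return [(v - lo) / (hi - lo) for v in values]

xs = fill_nulls([10.0, None, 30.0])   # [10.0, 20.0, 30.0]
print(min_max_normalize(xs))          # [0.0, 0.5, 1.0]
```
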
Preferably, the deep learning model is evaluated by using a verification set, specifically:
preliminarily evaluating the training result of the deep learning model with the validation set, and adjusting parameters if over-fitting occurs, so as to determine an optimal model;
taking the coordinate data obtained by calibrating the pictures as accurate values and the coordinates output by the trained model as predicted values, calculating the loss function, and analyzing the evaluation result;
and adjusting the parameters along the inverse gradient of the loss function to determine the optimal convolutional neural network structure model.
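The parameter adjustment according to the inverse gradient of the loss function is ordinary gradient descent; a one-parameter sketch on an illustrative quadratic loss (not the patent's actual network) looks like this:

```python
def gradient_step(w, grad, lr=0.1):
    """Move a parameter a small step against the gradient of the loss."""
    return w - lr * grad

# Minimize loss(w) = (w - 3)^2, whose gradient is 2 * (w - 3).
w = 0.0
for _ in range(100):
    w = gradient_step(w, grad=2 * (w - 3))
print(round(w, 4))  # converges toward 3.0
```
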
Preferably, the data set is divided into a training set, a validation set and a test set in a ratio of 0.6:0.2:0.2.
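Such a 0.6/0.2/0.2 partition can be sketched as an order-preserving split (shuffling beforehand is a common addition that the text does not state):

```python
def split_dataset(items, ratios=(0.6, 0.2, 0.2)):
    """Split a picture data set into training / validation / test parts."""
    n = len(items)
    n_train = round(n * ratios[0])
    n_val = round(n * ratios[1])
    return items[:n_train], items[n_train:n_train + n_val], items[n_train + n_val:]

train, val, test = split_dataset(list(range(10)))
print(len(train), len(val), len(test))  # 6 2 2
```
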
A device for acquiring a driving track by video recognition, comprising:
the video image acquisition module acquires moving vehicle information in the traffic monitoring video through a camera on the highway;
the video image division module is used for dividing the acquired video information into a validation set, a test set and a training set;
the video image calibration and preprocessing module is used for calibrating and preprocessing the pictures in the video;
and the model training evaluation module is used for training the deep learning model to preliminarily obtain a driving track recognition model, evaluating the deep learning model by adopting a verification set and determining an optimal convolutional neural network structure model.
A storage medium stores a program which, when executed, implements the above method for acquiring a driving track by video recognition.
The invention has the beneficial effects that:
the method provided by the invention has the advantages that the error caused by the difference between the vehicle running track obtained by virtual driving and the reality is large, the method is more authentic, and the method has consistency with the vehicle running track in a natural driving experiment; according to the method provided by the invention, the acquired vehicle driving track can reach a certain precision by a relatively mature deep learning method, and the scientific research requirement is met; the method provided by the invention is novel and high in practicability, does not depend on excessive external equipment, and can obtain the driving track only by video and computer algorithms.
Drawings
FIG. 1 is a schematic view of a video capture of the present invention;
FIG. 2 is a flow chart of the operation of the present invention;
FIG. 3 is a graph of the recognition effect of the present invention.
Detailed Description
The present invention will be described in further detail with reference to examples and drawings, but the present invention is not limited to these examples.
Example 1
As shown in fig. 1 to 3, a method for acquiring a video identification driving track includes the following steps:
In the data acquisition step, expressway video data are obtained, with expressway surveillance video as the source. A sufficient amount of video is collected across the monitoring time periods (morning, noon and evening), monitoring scenes (free flow, stable flow and congested flow) and monitoring line shapes (straight, uphill and curved sections) to guarantee diversity of video styles, and the video data cover different days of the week. The diversified expressway video scenes provide varied training samples for the convolutional neural network structure model.
In the data set division and preprocessing step, after the videos are acquired, they are cut into frames at 20 frames per second to prepare the data set for the next stage. The acquired videos are classified, and the data set is divided into a training set, a validation set and a test set in a ratio of 0.6:0.2:0.2. Each set is calibrated: the vehicle type, serial number and position are extracted from the pictures, specifically the same serial number for the same vehicle across different pictures and the vehicle position in each picture, and a (serial number, coordinate) data set associated with the pictures is generated after calibration. In this step, a validation set for the parameters of the preliminary convolutional neural network structure model is also prepared, which is used to judge whether a model that reaches the target accuracy on the training set also reaches it on the validation set.
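Cutting a video into 20 pictures per second amounts to keeping every k-th frame. A sketch of the frame-index selection, assuming the source frame rate is known (the 60 fps value below is an assumption, not from the patent):

```python
def sample_frame_indices(total_frames, src_fps, target_fps=20):
    """Indices of frames to keep so roughly target_fps pictures per second remain."""
    step = src_fps / target_fps
    return [int(i * step) for i in range(int(total_frames / step))]

# A 2-second clip at 60 fps keeps every 3rd frame: 40 pictures.
idx = sample_frame_indices(total_frames=120, src_fps=60)
print(len(idx), idx[:4])  # 40 [0, 3, 6, 9]
```
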
The calibrated information is then preprocessed.
The deep learning model is trained to preliminarily obtain a driving track recognition model.
The deep learning model is specifically a convolutional neural network structure model comprising convolutional layers, pooling layers and a fully connected layer: the convolutional layers generate feature data of the image, the pooling layers perform aggregation statistics on the feature data, and the fully connected layer combines the groups of pooled features into one group of signal data output for image category recognition.
Since the method must measure the accuracy of the driving-track offset, an evaluation criterion has to be determined once the neural network is selected as the training model; the convolutional neural network uses the mean square error (MSE) between accurate and predicted coordinate values as its loss function:

$$\mathrm{MSE} = \frac{1}{n}\sum_{i=1}^{n}\left[(X_i - X'_i)^2 + (Y_i - Y'_i)^2\right]$$
A convolutional neural network structure model is selected and trained on the data set formed from the calibrated vehicle positions and serial numbers, preliminarily yielding a driving track recognition model, namely the preliminary convolutional neural network model.
The deep learning model is then evaluated with the validation set to determine the optimal convolutional neural network structure model.
The coordinate data obtained by calibrating the training set pictures are used as accurate values and the coordinates output by the trained model as predicted values; the loss function is calculated and the evaluation result analyzed. The parameters are adjusted by following the inverse gradient of the loss function, and each adjustment is compared with the previous model result and with the target. Erroneous data are extracted, the remaining problems are analyzed, the direction of adjustment is discussed, and the experimental results are recorded.
The training and evaluation steps are repeated until a convolutional-neural-network-based structure model meeting the target accuracy is obtained.
After model training is finished, the driving track is acquired with the convolutional neural network structure model as follows:
Video data collected from the surveillance video are converted into pictures; the preprocessed pictures are input into the model for moving-vehicle detection, tracking and feature extraction; when a vehicle is recognized with confidence above the threshold, its position coordinates are output and its trajectory is drawn. As shown in FIG. 3, the rectangular box marks the vehicle location, "car" is the vehicle type, and the percentage is the confidence.
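The final step, keeping detections above the confidence threshold and appending them to per-vehicle tracks, can be sketched as follows (the detection tuple format here is an assumption, not the patent's):

```python
def update_tracks(tracks, detections, threshold=0.5):
    """Append the (x, y) position of each confident detection to its vehicle's track."""
    for vehicle_id, x, y, confidence in detections:
        if confidence > threshold:
            tracks.setdefault(vehicle_id, []).append((x, y))
    return tracks

tracks = {}
update_tracks(tracks, [(1, 10, 20, 0.9), (2, 5, 5, 0.3), (1, 12, 21, 0.8)])
print(tracks)  # vehicle 2 is dropped, vehicle 1 has a two-point track
```
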
Example 2
A device for acquiring a driving track by video recognition comprises:
The video image acquisition module acquires moving vehicle information in the traffic monitoring video through a camera on the highway;
the video image division module is used for dividing the acquired video information into a validation set, a test set and a training set;
the video image calibration and preprocessing module is used for calibrating and preprocessing the pictures in the video;
and the model training evaluation module is used for training the deep learning model to preliminarily obtain a driving track recognition model, evaluating the deep learning model by adopting a verification set and determining an optimal convolutional neural network structure model.
Example 3
A storage medium stores a program which, when executed, implements the method for acquiring a driving track by video recognition.
The above embodiments are preferred embodiments of the present invention, but the present invention is not limited to the above embodiments, and any other changes, modifications, substitutions, combinations, and simplifications which do not depart from the spirit and principle of the present invention should be construed as equivalents thereof, and all such changes, modifications, substitutions, combinations, and simplifications are intended to be included in the scope of the present invention.

Claims (6)

1. A method for acquiring a driving track by video identification is characterized by comprising the following steps:
acquiring data, namely acquiring video data of the highway, wherein the video data comprises video data of different monitoring time periods, different monitoring scenes and different monitoring line shapes;
the monitoring time period refers to three time periods of morning, noon and evening under the condition of different illumination and brightness;
the monitoring scene refers to the track data of vehicle running under the conditions of free flow, stable flow and blocked flow;
the monitoring line shape refers to the shape of a road, specifically a straight line, an ascending slope and a curve section;
dividing a data set, namely converting the video data into a picture data set at 20 frames per second, and dividing the picture data set into a training set for training the model, a validation set for preliminary evaluation of training results, and a test set;
calibrating and preprocessing the data, including cutting the video into pictures at 20 frames per second, using a calibration tool to mark the serial number and position of each vehicle in a picture so as to obtain vehicle serial-number and coordinate information, and then preprocessing the obtained information;
the preprocessing comprises feature extraction, feature dimension reduction, feature null value processing, feature conversion and feature normalization;
training a deep learning model to preliminarily obtain a driving track recognition model;
evaluating the deep learning model by adopting a verification set to determine an optimal convolutional neural network structure model, which specifically comprises the following steps:
preliminarily evaluating the training result of the deep learning model with the validation set, and adjusting parameters if over-fitting occurs, so as to determine an optimal model;
taking the coordinate data obtained by calibrating the pictures as accurate values and the coordinates output by the trained model as predicted values, calculating the loss function, and analyzing the evaluation result;
and adjusting the parameters along the inverse gradient of the loss function to determine the optimal convolutional neural network structure model.
2. The video identification driving track acquisition method according to claim 1, wherein the deep learning model is a convolutional neural network structure model, and includes a convolutional layer, a pooling layer and a full connection layer, the convolutional layer generates feature data of an image, the pooling layer performs aggregation statistics on the feature data, and the full connection layer combines a plurality of pooled data features into a set of signal data to be output for image type identification.
3. The video identification driving track acquisition method according to claim 1, wherein the loss function of the convolutional neural network structure model is:

$$\mathrm{MSE} = \frac{1}{n}\sum_{i=1}^{n}\left[(X_i - X'_i)^2 + (Y_i - Y'_i)^2\right]$$

the mean square error (MSE) between the accurate values $(X_i, Y_i)$ obtained by calibration of the training set and the predicted values $(X'_i, Y'_i)$ obtained from the preliminary model serving as the evaluation criterion.
4. The video identification driving track acquisition method according to claim 1, wherein the data set is divided into a training set, a validation set and a test set in a ratio of 0.6:0.2:0.2.
5. An apparatus for implementing the method for acquiring a driving track for video recognition according to any one of claims 1 to 4, comprising:
the video image acquisition module acquires moving vehicle information in the traffic monitoring video through a camera on the highway;
the video image division module is used for dividing the acquired video information into a validation set, a test set and a training set;
the video image calibration and preprocessing module is used for calibrating and preprocessing the pictures in the video;
and the model training evaluation module is used for training the deep learning model to preliminarily obtain a driving track recognition model, evaluating the deep learning model by adopting a verification set and determining an optimal convolutional neural network structure model.
6. A storage medium storing a program for executing the video recognition driving track acquisition method of any one of claims 1 to 4.
CN202010447021.3A 2020-05-25 2020-05-25 Method and device for acquiring video recognition driving track and storage medium Active CN111723672B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010447021.3A CN111723672B (en) 2020-05-25 2020-05-25 Method and device for acquiring video recognition driving track and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010447021.3A CN111723672B (en) 2020-05-25 2020-05-25 Method and device for acquiring video recognition driving track and storage medium

Publications (2)

Publication Number | Publication Date
CN111723672A (en) | 2020-09-29
CN111723672B (en) | 2021-03-30

Family

ID=72564963

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010447021.3A Active CN111723672B (en) 2020-05-25 2020-05-25 Method and device for acquiring video recognition driving track and storage medium

Country Status (1)

Country Link
CN (1) CN111723672B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113850237A (en) * 2021-11-29 2021-12-28 华砺智行(武汉)科技有限公司 Internet vehicle target detection and evaluation method and system based on video and track data

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114566052B (en) * 2022-04-27 2022-08-12 华南理工大学 Method for judging rotation of highway traffic flow monitoring equipment based on traffic flow direction


Also Published As

Publication number Publication date
CN111723672A (en) 2020-09-29

Similar Documents

Publication Publication Date Title
CN111145545B (en) Road traffic behavior unmanned aerial vehicle monitoring system and method based on deep learning
CN111814623A (en) Vehicle lane departure visual detection method based on deep neural network
CN112700470B (en) Target detection and track extraction method based on traffic video stream
CN106935035B (en) Parking offense vehicle real-time detection method based on SSD neural network
CN111027430B (en) Traffic scene complexity calculation method for intelligent evaluation of unmanned vehicles
CN111079640B (en) Vehicle type identification method and system based on automatic amplification sample
CN111723672B (en) Method and device for acquiring video recognition driving track and storage medium
CN110427993B (en) High-speed train navigation blind area positioning method based on meteorological parameters
CN110969142B (en) Abnormal driving scene extraction method based on network-connected vehicle natural driving data
CN114170580A (en) Highway-oriented abnormal event detection method
CN111582339B (en) Vehicle detection and recognition method based on deep learning
CN113592905B (en) Vehicle driving track prediction method based on monocular camera
CN112256590B (en) Virtual scene effectiveness judgment method and device and automatic driving system
CN113327248B (en) Tunnel traffic flow statistical method based on video
CN111797738A (en) Multi-target traffic behavior fast extraction method based on video identification
CN115544888A (en) Dynamic scene boundary assessment method based on physical mechanism and machine learning hybrid theory
CN117152513A (en) Vehicle boundary positioning method for night scene
CN115953447A (en) Point cloud consistency constraint monocular depth estimation method for 3D target detection
CN112945244B (en) Rapid navigation system and navigation method suitable for complex overpass
CN117710843A (en) Intersection dynamic signal timing scheme detection method based on unmanned aerial vehicle video
CN113139615A (en) Unmanned environment target detection method based on embedded equipment
CN117198057A (en) Experimental method and system for road side perception track data quality inspection
CN116129327A (en) Infrared vehicle detection method based on improved YOLOv7 algorithm
CN114613144B (en) Method for describing motion evolution law of hybrid vehicle group based on Embedding-CNN
CN116434056A (en) Target identification method and system based on radar fusion and electronic equipment

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant