CN110414389A

CN110414389A - A kind of object detection method of the fast area search based on deep learning

Info

Publication number: CN110414389A
Application number: CN201910631921.0A
Authority: CN
Inventors: 段智文; 郑晓凯
Original assignee: Heilongjiang Yulin Bay Technology Co Ltd
Current assignee: Heilongjiang Yulin Bay Technology Co Ltd
Priority date: 2019-07-12
Filing date: 2019-07-12
Publication date: 2019-11-05

Abstract

A kind of object detection method of the fast area search based on deep learning, is related to algorithm of target detection field.The object in addition to train field cannot be identified to solve existing train control system, it cannot be remembered and be learnt for the object other than train field, and some emergency cases can not be handled, it can not judge that whether there are obstacles in front of train driving, the problem of generation so as to cause accident.Build the target detection network of the fast area search based on deep learning；The image for having obstacle target to be identified is obtained, and it is labeled, is divided into test sample collection and training sample set；Training sample set is input to target detection network and carries out propagated forward calculating, calculates error and by error back propagation, adjusts network weight；Network Recognition accuracy rate is assessed using test sample collection.If being unsatisfactory for requiring, if meeting the requirements, network parameter is saved, training is completed.The present invention is suitable for carrying out detection positioning to the target optical imagery.

Description

A kind of object detection method of the fast area search based on deep learning

Technical field

The present invention relates to algorithm of target detection fields, and in particular to a kind of mesh of the fast area search based on deep learning Mark detection method.

Background technique

With the fast development of railway construction in China, train running speed is continuously improved, and the requirement to traffic safety is not yet Disconnected to improve, existing column are controlled and ATP system using nonpassage of signal, close to response means guarantees traffic safety, but train control system It can only guarantee to monitor this cooperative target of traveling train, for invading limit event with the noncooperative targets such as sudden, unpredictable Generation cannot give warning in advance.Especially, it should be noted that construction of railways operation recruitment is engaged in, mechanical car leads under the conditions of construction operation Reason condition ground signal system is in closing or failure state, cannot play the role of monitoring train.In existing Train Control Object of the identification in addition to train field in system, such as pedestrian, automobile, animal etc., and cannot be to paroxysmal situation It makes and timely judging, lead to the generation of accident.

In conclusion existing train control system cannot identify the object in addition to train field, train cannot be directed to Object other than field is remembered and is learnt, and can not handle some emergency cases, whether can not judge train driving front There are barriers, so as to cause accident generation.

Summary of the invention

The present invention is to solve existing train control system to identify object in addition to train field, cannot be for column Object other than vehicle field is remembered and is learnt, and can not handle some emergency cases, can not judge that train driving front is No there are barriers, the problem of generation so as to cause accident, and propose a kind of mesh of fast area search based on deep learning Mark detection method.

A kind of object detection method of fast area search based on deep learning of the invention, specific algorithm are as follows:

Step 1: building the target detection network of the fast area search based on deep learning；

Step 2: obtaining the image for having obstacle target to be identified, and it is labeled, is divided into test sample Collection and training sample set；

Step 3: training sample set, which is input to target detection network, carries out propagated forward calculating, error is calculated and by error Backpropagation adjusts network weight；

Step 4: assessing Network Recognition accuracy rate using test sample collection.If being unsatisfactory for requiring, network parameter is saved, Training is completed；

Step 5: for detecting the target of region of search after the completion of training；

Further, target detection network structure in the step one is for first 16 layers in target detection network structure Feature extraction layer, structure are alternate 8 convolutional layers and 8 pond layer compositions, and layer output in pond connects two full articulamentums, Full articulamentum output two different peculiar task layers of connection, one is prediction classification output, another sits for predicted position Mark；

The main purpose of each convolutional layer and pond layer is feature extraction and combination to the image progress depth of input, Convolution kernel number is respectively 64,128,256,256,256,256,256,128, and convolution kernel size is 3*3, can be protected in this way Card extract details low-level image feature abundant and it be combined, a pond layer is all connected with after each convolutional layer, be used for pair The feature extracted carries out dimensionality reduction, and optimization caused by preventing parameter excessive is difficult, and pond layer is having a size of 2*2, step-length 2；

Full articulamentum is used to carry out the depth characteristic extracted further global combination, the input nerve of full articulamentum 1 Member has 256, and output neuron number is 200.The input neuron number of full articulamentum 2 is 200, output neuron number It is 128；

Two peculiar layers of task are absolute construction, are input with the output of full articulamentum 2, the peculiar layer 1 of task is 1 layer Full articulamentum, input neuron number are 128, and output neuron number is target class number to be identified, are used for predicted detection Target type out, the multi-layer perception (MLP) that the peculiar layer 2 of task is one 3 layers, input neuron are 128, output neuron 3, Output result is target start position, bounding box length, bounding box width；

Training sample set is input to target detection network and carries out propagated forward calculating, meter in the further step three It calculates error and by error back propagation, adjusts the specific of network weight and calculate that steps are as follows:

If training sample set is combined into For sample data set,For I-th of training sample, N_sIt is total for training sample,For with X^sCorresponding classification space,It is instructed for i-th Practice sampleClass label,For the position of the sample in the picture, if f (x) is target detection network model in the present invention, x generation Table input sample,y_iIndicate the classification of prediction, p_iIndicate the position in the corresponding image of classification of prediction, root It is predicted that value and true tag calculate loss function WhereinFor Classification Loss,For Area Prediction loss, λ is that weight factor is lost for control area prediction in overall loss Specific gravity in function, Classification Loss part d () indicate true tagY is predicted with classification_iBetween difference, calculation It is not unique, Ω | | ω | |₂For L2 regular terms, for preventing disaggregated model over-fitting.

Compared with the prior art, the invention has the following beneficial effects:

One, the shortcomings that the present invention overcomes the prior arts, it is fixed using detect to target interested in optical imagery Position, to realize that whether there are obstacles in judgement front.

Two, algorithm acquisition of the present invention largely has the image data of target to be identified, and is manually marked to it Then note is built the fast area search target detection network based on deep learning, is instructed using the image largely marked Practice, can be realized under complex background condition after the completion of training, accurate detection of obstacles.

Three, the shortcomings that the present invention overcomes the prior arts realizes and accurately judges in front of train driving with the presence or absence of obstacle Object avoids that accident occurs；

Detailed description of the invention

Fig. 1 is the heretofore described fast area search algorithm of target detection block diagram based on deep learning.

Specific embodiment

Specific embodiment 1: embodiment is described with reference to Fig. 1, one kind described in present embodiment is based on deep learning Fast area search object detection method, specific algorithmic procedure is as follows:

Step 5: for detecting the target of region of search after the completion of training.

Specific embodiment 2: embodiment is described with reference to Fig. 1, present embodiment is to described in specific embodiment one Detection algorithm further restriction, a kind of fast area search based on deep learning described in present embodiment Object detection method, target detection network structure in the step one are characterized for first 16 layers in target detection network structure Extract layer, structure are alternate 8 convolutional layers and 8 pond layer compositions, and layer output in pond meets two full articulamentums, Quan Lian Layer output two different peculiar task layer of connection are connect, one is prediction classification output, another is used for predicted position coordinate.

Specific embodiment 3: embodiment is described with reference to Fig. 1, present embodiment is to described in specific embodiment two Detection algorithm further restriction, a kind of fast area search based on deep learning described in present embodiment Object detection method, the main purpose of each convolutional layer and pond layer is to carry out depth to the image of input Feature extraction and combination, convolution kernel number are respectively 64,128,256,256,256,256,256 and 128, and convolution kernel size is equal For 3*3, it can guarantee to extract details low-level image feature abundant in this way and it is combined, be all connected with after each convolutional layer One pond layer, for carrying out dimensionality reduction to the feature extracted, optimization caused by preventing parameter excessive is difficult, pond layer size For 2*2, step-length 2.

Specific embodiment 4: embodiment is described with reference to Fig. 1, present embodiment is to described in specific embodiment three Detection algorithm further restriction, a kind of fast area search based on deep learning described in present embodiment Object detection method, the full articulamentum is used to carry out the depth characteristic extracted further global combination, complete to connect The input neuron of layer 1 has 256, and output neuron number is 200.The input neuron number of full articulamentum 2 is 200 A, output neuron number is 128；

Two peculiar layers of task are absolute construction, are input with the output of full articulamentum 2, the peculiar layer 1 of task is one layer Full articulamentum, input neuron number be 128, output neuron number be target class number to be identified, for predicting The target type detected, the multi-layer perception (MLP) that the peculiar layer 2 of task is one three layers, input neuron are 128, output mind It is 3 through member, output result is target start position, bounding box length, bounding box width.

Specific embodiment 5: embodiment is described with reference to Fig. 1, present embodiment is to described in specific embodiment one Detection algorithm further restriction, a kind of fast area search based on deep learning described in present embodiment Object detection method, training sample set is input to target detection network and carries out propagated forward calculating in the step three, calculates Error and by error back propagation, adjusting the specific calculating of network weight, steps are as follows:

Claims

1. a kind of object detection method of the fast area search based on deep learning, it is characterised in that: the target detection is calculated Method, specific algorithmic procedure are as follows:

Step 2: obtaining the image for having obstacle target to be identified, and it is labeled, be divided into test sample collection and Training sample set；

Step 3: training sample set, which is input to target detection network, carries out propagated forward calculating, calculating error is simultaneously reversed by error It propagates, adjusts network weight；

Step 4: assessing Network Recognition accuracy rate using test sample collection.If being unsatisfactory for requiring, if meeting the requirements, save Network parameter, training are completed；

2. a kind of object detection method of fast area search based on deep learning according to claim 1, feature Be: target detection network structure in the step one is characterized extract layer for first 16 layers in target detection network structure, Structure is alternate 8 convolutional layers and 8 pond layer compositions, and layer output in pond connects two full articulamentums, and full articulamentum output connects Two different peculiar task layers are connect, one is prediction classification output, another is used for predicted position coordinate.

3. a kind of object detection method of fast area search based on deep learning according to claim 2, feature Be: the main purpose of each convolutional layer and pond layer is to carry out the image of input the feature extraction and combination of depth, volume Product core number is respectively 64,128,256,256,256,256,256 and 128, and convolution kernel size is 3*3, can be guaranteed in this way It extracts details low-level image feature abundant and it is combined, a pond layer is all connected with after each convolutional layer, for mentioning The feature got carries out dimensionality reduction, and optimization caused by preventing parameter excessive is difficult, and pond layer is having a size of 2*2, step-length 2.

4. a kind of object detection method of fast area search based on deep learning according to claim 3, feature exist In: full articulamentum, which is used to carry out the depth characteristic extracted further global combination, the input neuron of full articulamentum 1, to be had 256, output neuron number is 200.The input neuron number of full articulamentum 2 is 200, output neuron number It is 128；

Two peculiar layers of task are absolute construction, are input with the output of full articulamentum 2, and the peculiar layer 1 of task is one layer complete Articulamentum, input neuron number are 128, and output neuron number is target class number to be identified, are used for predicted detection Target type out, the multi-layer perception (MLP) that the peculiar layer 2 of task is one three layers, input neuron is 128, and output neuron is 3, output result is target start position, bounding box length, bounding box width.

5. a kind of object detection method of fast area search based on deep learning according to claim 1, feature Be: training sample set is input to target detection network and carries out propagated forward calculating in the step three, and calculating error simultaneously will Error back propagation, adjust network weight it is specific calculating steps are as follows:

If training sample set is combined into For sample data set,It is i-th A training sample, N_sIt is total for training sample,For with X^sCorresponding classification space,For i-th of training SampleClass label,For the position of the sample in the picture, if f (x) is target detection network model in the present invention, x Input sample is represented,y_iIndicate the classification of prediction, p_iIndicate the position in the corresponding image of classification of prediction, root It is predicted that value and true tag calculate loss function WhereinFor Classification Loss,For Area Prediction loss, λ is that weight factor is lost for control area prediction in overall loss letter Specific gravity in number, Classification Loss part d () indicate true tagY is predicted with classification_iBetween difference, calculation is not Uniquely, Ω | | ω | |₂For L2 regular terms, for preventing disaggregated model over-fitting.