CN114581904A - End-to-end license plate detection and identification method based on deep learning - Google Patents
- Publication number
- Publication number: CN114581904A (application CN202210332461.3A)
- Authority
- CN
- China
- Prior art keywords
- license plate
- network
- obj
- activation function
- angle
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
- G06F18/241—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/048—Activation functions
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02A—TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
- Y02A90/00—Technologies having an indirect contribution to adaptation to climate change
- Y02A90/10—Information and communication technologies [ICT] supporting adaptation to climate change, e.g. for weather forecasting or climate simulation
Abstract
The invention discloses an end-to-end license plate detection and recognition method based on deep learning, comprising: step one, collecting license plate images and labeling and sorting a license plate image data set; step two, constructing the first half of the end-to-end license plate detection and recognition network, a detection network divided into a backbone network, a feature-extraction fully connected network that maps the extracted features onto the width, height, coordinate and angle vectors of the prediction frame, and an output layer network that, after activation, outputs the width, height, coordinate and angle information; step three, training the detection network with the labeled data set; step four, building the second half of the network, a recognition network, which locates the prediction frame on the backbone convolution layers, extracts the corresponding features, and performs pooling, splicing and character recognition on the extracted features; and step five, training the whole detection and recognition network. The invention eliminates the segmentation step, improving efficiency while also improving accuracy and speed.
Description
Technical Field
The invention relates to the technical field of pattern recognition and deep learning, in particular to an end-to-end license plate detection and recognition method based on deep learning.
Background
With the continuous development of deep learning and pattern recognition, license plate recognition methods have been continuously updated: from early methods based on image processing, to later HOG + SVM methods based on machine learning, to current methods based on deep learning, both recognition speed and recognition accuracy have improved. However, most existing license plate recognition adopts a two-step or three-step strategy: license plate detection followed by license plate recognition, or license plate detection, character segmentation and character recognition. Performing detection, segmentation and recognition step by step not only places very high demands on the speed and precision of each step, but also on the coupling between steps.
Disclosure of Invention
In order to overcome the defects of the prior art, the invention aims to provide an end-to-end license plate detection and recognition method based on deep learning. Based on pattern recognition and deep learning, the method first extracts image features with a convolutional neural network, then detects the license plate position and rotation angle from the extracted features, next uses the detection result to cut the license plate region out of the feature maps for ROI pooling and fusion, sends the fused features to a character recognition network for the next stage, and finally outputs the result.
In order to achieve the purpose, the invention adopts the technical scheme that:
an end-to-end license plate detection and identification method based on deep learning comprises the following steps;
acquiring license plate images under various weather conditions and scenes through a digital camera and a mobile phone camera, and labeling and sorting a license plate image data set;
step two, constructing a first half part, namely a detection network, in an end-to-end license plate detection and recognition network, wherein the network is divided into a backbone network and is used for extracting image characteristics of a license plate; the full-connection network is used for mapping the extracted image features of the license plate to the width, height, coordinates and angle vectors of the prediction frame; the output layer network is used for activating and finally outputting width, height, coordinate and angle information;
thirdly, training a detection network by using the license plate image data set which is arranged in the first step, training a basic license plate image data set in order to enable the network to be fitted more quickly, then adjusting the learning rate to train the rest data sets, and finally integrally training all the data sets;
step four, building a second half part-recognition network in the end-to-end license plate detection and recognition network, calculating the position of the prediction frame on the convolution layer of the main network on the basis of the prediction frame obtained in the step two, respectively taking out the characteristics of each characteristic diagram at the position, and performing pooling, splicing and character recognition on the taken out characteristics;
and step five, training, detecting and identifying the whole network.
The labeling and sorting of the license plate image data set in the first step specifically comprises the following steps:
step1, distinguishing all images according to categories, wherein the images comprise a basic license plate image, a license plate image rotating at a small angle and a license plate image rotating at a large angle;
the basic license plate image does not rotate, no complex scene exists, and a clear license plate image is shot on the front side;
the license plate image rotating at a small angle is rotated at a angle of-10 degrees, the license plate image is blurred, the license plate image with dark light and bright light is far away from the camera;
the rotation angle of the license plate image rotating at a large angle is-30 degrees, and the license plate image in rainy days, snowy days and foggy days and the license plate image without license plate are obtained;
step2, for license plate image data sets whose license plate targets are already labeled, proceeding directly to the next processing; for unlabeled license plate images, labeling the license plate targets with roLabelImg;
step3, carrying out normalization processing on the labeled image data set;
step4, encoding the license plate characters as corresponding numbers so that the loss can be conveniently calculated after deep learning inference; the first character (province abbreviation) is encoded as follows:
皖 | 沪 | 津 | 渝 | 冀 | 晋 | 蒙 | 辽 | 吉 | 黑 | 苏 | 浙
1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12
京 | 闽 | 赣 | 鲁 | 豫 | 鄂 | 湘 | 粤 | 桂 | 琼 | 川 | 贵
13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 | 21 | 22 | 23 | 24
云 | 藏 | 陕 | 甘 | 青 | 宁 | 新 | 警 | 学 | 0 | |
25 | 26 | 27 | 28 | 29 | 30 | 31 | 32 | 33 | 34 | |
The second character (downtown) encodes the numbers as follows:
the numbers of the third to seventh characters are as follows:
because the digit 0 and the letter O, and the digit 1 and the letter I, are difficult to distinguish, the third to seventh characters of a license plate contain no letters I or O under clause 5.9.1 of the motor vehicle license plate standard GA36-2007 of the People's Republic of China, so these two letters are omitted from the encoding; a second character of 0 generally indicates a police vehicle;
step5, storing the photo address and the label corresponding to each license plate image as one line of a file, in the following format:
photo address, x_obj, y_obj, W_obj, H_obj, Angle_obj, [code1, code2, code3, code4, code5, code6, code7]
where x_obj, y_obj, W_obj, H_obj, Angle_obj are the abscissa, ordinate, length, width and angle parameter values of the prediction frame after the Step3 conversion, and code1–code7 are the code values obtained from the Step4 license plate character encoding.
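As a concrete illustration of this storage format, the sketch below encodes a plate string with a partial version of the Step4 tables and writes one annotation line. The helper names, the partial province table, and the index offsets are assumptions for illustration, not the patent's own code.

```python
# Hypothetical sketch of the annotation-line format described above.
# PROVINCE_CODES is a partial stand-in for the Step4 province table.
PROVINCE_CODES = {"皖": 1, "沪": 2, "津": 3, "渝": 4, "苏": 11, "浙": 12, "京": 13}
LETTERS = "ABCDEFGHJKLMNPQRSTUVWXYZ"  # I and O excluded per GA36-2007

def encode_plate(plate):
    """Encode a 7-character plate string into numeric codes (offsets assumed)."""
    codes = [PROVINCE_CODES[plate[0]]]          # first char: province abbreviation
    codes.append(LETTERS.index(plate[1]) + 1)   # second char: city letter
    alphabet = "0123456789" + LETTERS           # chars 3-7: digits plus letters
    codes += [alphabet.index(c) + 1 for c in plate[2:]]
    return codes

def format_line(photo, box, codes):
    """One line: photo address, x_obj, y_obj, W_obj, H_obj, Angle_obj, [code1..code7]."""
    x, y, w, h, angle = box
    return f"{photo},{x},{y},{w},{h},{angle},{codes}"

line = format_line("img/0001.jpg", (0.41, 0.55, 0.18, 0.06, 0.12),
                   encode_plate("皖A12345"))
```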
The labeling of the license plate targets with roLabelImg is specifically as follows:
1) the data set labeled with the software roLabelImg is converted by the following formula;
c_x, c_y: the abscissa and ordinate of the center point of the license plate target;
w_x, h_x: the width and height of the license plate target;
angle: the rotation angle of the license plate target;
w, h: the width and height of the image containing the license plate target;
x_obj, y_obj: the converted abscissa and ordinate of the license plate target;
W_obj, H_obj: the converted width and height of the license plate target;
Angle_obj: the converted rotation angle of the license plate target, positive clockwise and negative counterclockwise;
For a data set labeled with four vertex coordinates, the following formula is required to convert it to the normalized input, the four points being the top-left corner p1(x1, y1), top-right corner p2(x2, y2), bottom-right corner p3(x3, y3) and bottom-left corner p4(x4, y4); the conversion formula is as follows:
x_obj, y_obj: the converted abscissa and ordinate of the license plate target center point;
W_obj, H_obj: the converted width and height of the license plate target;
k: the slope corresponding to the converted license plate target rotation angle;
Angle_obj: the converted rotation angle of the license plate target, positive clockwise and negative counterclockwise.
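The conversion formula itself is not reproduced in the text above, but a plausible reconstruction from the listed variable definitions might look like the following sketch; the diagonal-midpoint center, edge-length width/height, and top-edge slope are assumptions, and the top edge is assumed not to be vertical.

```python
import math

def corners_to_obj(p1, p2, p3, p4, w, h):
    """Convert four labeled vertices (clockwise from top-left) of a rotated
    plate into normalized (x_obj, y_obj, W_obj, H_obj, Angle_obj).
    A plausible reconstruction, not the patent's exact formula."""
    (x1, y1), (x2, y2), (x3, y3) = p1, p2, p3
    x_obj = (x1 + x3) / 2 / w      # center = midpoint of the p1-p3 diagonal,
    y_obj = (y1 + y3) / 2 / h      # normalized by image width and height
    W_obj = math.dist(p1, p2) / w  # top edge length -> plate width
    H_obj = math.dist(p2, p3) / h  # right edge length -> plate height
    k = (y2 - y1) / (x2 - x1)      # slope of the top edge
    angle = math.atan(k)           # clockwise positive (image y grows downward)
    return x_obj, y_obj, W_obj, H_obj, angle
```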
The structure of the detection network in the second step is specifically as follows:
With VGG16 as the backbone network, the structure of each convolutional layer is as follows:
L1: 64 convolution kernels of 3 × 3, with a LeakyReLU activation function;
L2: 64 convolution kernels of 3 × 3, with a LeakyReLU activation function and max pooling with stride 2;
L3: 128 convolution kernels of 3 × 3, with a LeakyReLU activation function;
L4: 128 convolution kernels of 3 × 3, with a LeakyReLU activation function and max pooling with stride 2;
L5: 256 convolution kernels of 3 × 3, with a LeakyReLU activation function;
L6: 256 convolution kernels of 3 × 3, with a LeakyReLU activation function;
L7: 256 convolution kernels of 3 × 3, with a LeakyReLU activation function and max pooling with stride 2;
L8: 512 convolution kernels of 3 × 3, with a LeakyReLU activation function;
L9: 512 convolution kernels of 3 × 3, with a LeakyReLU activation function;
L10: 512 convolution kernels of 3 × 3, with a LeakyReLU activation function and max pooling with stride 2;
L11: 512 convolution kernels of 3 × 3, with a LeakyReLU activation function;
L12: 512 convolution kernels of 3 × 3, with a LeakyReLU activation function;
L13: 512 convolution kernels of 3 × 3, with a LeakyReLU activation function and max pooling with stride 2; the resulting matrix is then flattened.
The fully connected network structure is as follows:
L14: a fully connected layer with input dimension 184832 and output dimension 4096, with a ReLU activation function;
L15: a fully connected layer with input dimension 4096 and output dimension 128, with a ReLU activation function.
The output layer network structure is as follows:
L16: a fully connected layer with input dimension 128 and output dimension 5, with a sigmoid activation function; the 5 output dimensions are respectively the length, width, abscissa, ordinate and rotation angle of the license plate prediction frame;
Angle_obj = (angle − 0.5) · π
k1 = tan(Angle_obj)
y3 = (x3 − x1) · k1 + y1
x4 = x1 + x2 − x3
y4 = y1 + y2 − y3
(x1, y1), (x2, y2), (x3, y3), (x4, y4): the coordinates of the four vertices of the license plate;
x, y: the abscissa and ordinate of the predicted license plate center point, as ratios relative to the whole image;
w, h: the predicted width and height of the license plate, as ratios relative to the whole image;
W, H: the actual width and height of the whole image;
angle: the predicted rotation angle of the license plate;
Angle_obj: the license plate rotation angle after conversion;
k1: the slope of the license plate after rotation-angle conversion;
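The layer sizes above can be checked with a small shape tracer. This illustrative sketch assumes each 3 × 3 convolution preserves H and W ("same" padding) and each stride-2 max pool halves them, and confirms that the flattened vector length matches the stated 184832 input dimension of L14.

```python
def backbone_shapes(input_hw=608):
    """Trace spatial size and channels through the VGG16-style backbone
    (L1-L13 above); pool flags mark the layers followed by stride-2 pooling."""
    spec = [  # (out_channels, pooled_after)
        (64, False), (64, True),
        (128, False), (128, True),
        (256, False), (256, False), (256, True),
        (512, False), (512, False), (512, True),
        (512, False), (512, False), (512, True),
    ]
    hw, shapes = input_hw, []
    for ch, pool in spec:
        if pool:
            hw //= 2          # stride-2 max pooling halves H and W
        shapes.append((hw, hw, ch))
    return shapes

shapes = backbone_shapes()
flat = shapes[-1][0] * shapes[-1][1] * shapes[-1][2]  # flattened vector length
# 19 * 19 * 512 = 184832, matching the stated input dimension of L14
```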
The training of the detection network in step three is specifically as follows:
First, the basic license plate data set is shuffled and trained for 100 batches;
Next, the remaining license plate image data sets that contain license plates are shuffled and trained for 200 batches;
Then, the data sets without license plates are shuffled and trained for 50 batches;
Finally, all data sets are trained together for 50 batches.
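The four-stage schedule might be sketched as follows; the learning-rate values and the `train_batches` helper are assumptions, since the patent specifies only the subset order and batch counts.

```python
import random

def staged_training(net, datasets, train_batches):
    """Sketch of the step-three schedule: fit the easy base subset first,
    then the harder subsets, then everything together.
    `train_batches(net, data, n, lr)` is an assumed helper that trains
    n batches at learning rate lr; the lr values below are placeholders."""
    plan = [
        ("base", 100, 1e-3),     # basic plates: quick initial fit
        ("rotated", 200, 1e-4),  # remaining plate images, adjusted lr
        ("no_plate", 50, 1e-4),  # images without license plates
        ("all", 50, 1e-5),       # final joint pass over every subset
    ]
    for name, n, lr in plan:
        data = datasets[name][:]
        random.shuffle(data)     # shuffle the subset before each stage
        train_batches(net, data, n, lr)
```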
The identification network in the fourth step is specifically as follows:
Converting the length, width, abscissa and ordinate of the L16 prediction box from step two into concrete feature coordinate information, with the following conversion formula:
(x1, y1), (x2, y2), (x3, y3), (x4, y4) are the four points on the feature map, w and h are the relative width and height, scale_j is the actual size of each feature map, and (x_i, y_i) is the actual coordinate point on each feature map;
The feature maps of L2, L4 and L7 are calculated by the above formula, and the regions requiring license plate recognition are cropped out as L2_crop, L4_crop and L7_crop, with feature sizes (608·w) × (608·h) × 64, (304·w) × (304·h) × 128 and (152·w) × (152·h) × 256, where w and h are the relative lengths of the predicted license plate width and height;
ROI pooling is performed on the cropped feature maps L2_crop, L4_crop and L7_crop to obtain three feature maps L2_pool, L4_pool and L7_pool of sizes 8 × 16 × 64, 8 × 16 × 128 and 8 × 16 × 256 respectively;
The three feature maps are spliced along the third (channel) dimension into a feature of size 8 × 16 × 448 and then flattened into the feature map F of size 1 × 57344.
Seven classifiers are constructed to classify the seven license plate characters:
Classifier 1 (classifier_1): a 57344 × 128 fully connected layer, a 128 × 34 fully connected layer and a sigmoid activation function;
Classifier 2 (classifier_2): a 57344 × 128 fully connected layer, a 128 × 25 fully connected layer and a sigmoid activation function;
Classifiers 3 to 7 (classifier_3 to classifier_7): each a 57344 × 128 fully connected layer, a 128 × 34 fully connected layer and a sigmoid activation function.
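The seven classifier heads can be sketched with plain NumPy as below. The random weights and the ReLU on the hidden layer are placeholders/assumptions, since the patent specifies only the layer dimensions and the sigmoid output.

```python
import numpy as np

rng = np.random.default_rng(0)

def make_head(n_classes, feat_dim=57344, hidden=128):
    """One classifier head: feat_dim->hidden and hidden->n_classes dense
    layers; the weights here are random placeholders, not trained values."""
    return (rng.standard_normal((feat_dim, hidden), dtype=np.float32) * 0.01,
            rng.standard_normal((hidden, n_classes), dtype=np.float32) * 0.01)

def run_head(F, head):
    w1, w2 = head
    h = np.maximum(F @ w1, 0)            # ReLU on the hidden layer (assumed)
    return 1 / (1 + np.exp(-(h @ w2)))   # sigmoid scores, one per class

# 34 classes for heads 1 and 3-7; 25 city-letter classes for head 2:
heads = [make_head(34), make_head(25)] + [make_head(34) for _ in range(5)]
F = rng.standard_normal((1, 57344), dtype=np.float32)   # stand-in feature map F
chars = [int(np.argmax(run_head(F, h))) for h in heads] # 7 predicted indices
```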
The invention has the beneficial effects that:
the method adopts a brand-new deep learning network structure, integrates the license plate detection and the license plate recognition into a large frame, reduces the steps of feature extraction frequency and license plate segmentation, and greatly improves the speed and the precision of the recognition.
Firstly, the traditional license plate recognition needs to be executed in multiple steps, namely license plate detection, license plate segmentation and license plate recognition, although the steps are clear, the continuity is not strong, and the middle of the license plate detection and the license plate recognition is interrupted by the license plate segmentation, so that the characteristics need to be extracted twice, the same steps are operated twice, and much time is wasted. Secondly, most of the traditional license plate recognition application scenes are provided with light supplement lamps and are shot from the front side in a short distance, and the method can collect license plate images for recognition from the side or a long distance scene and has a good recognition effect.
Drawings
FIG. 1 is a schematic view of a sorted data set according to the present invention.
FIG. 2 is a schematic diagram of a labeled data set according to the present invention.
FIG. 3 is a diagram illustrating the network results of the present invention.
Detailed Description
The present invention will be described in further detail with reference to the accompanying drawings.
As shown in fig. 1-3: the invention discloses an end-to-end license plate detection and identification method based on deep learning, which specifically comprises the following steps:
firstly, license plate images under various weather conditions and scenes are collected, as shown in figure 1, and a license plate image data set is marked and sorted as shown in figure 2.
First, the license plate images are uniformly processed into 608 × 608 by adaptive scaling; then the unlabeled license plate images are labeled, as shown in the left diagram of FIG. 2; the coordinate information of all labeled data sets is converted so that their representations are unified; finally, the converted data sets are sorted and stored so that labels and image information can be conveniently extracted in subsequent training, as shown in the right diagram of FIG. 2.
And step two, constructing a first half part, namely a detection network, in the end-to-end license plate detection and identification network, wherein the network structure is shown in figure 3.
Step1, an image of arbitrary size m × n is adaptively scaled: the image is first scaled so that both w and h are no larger than 608, then placed onto a 608 × 608 image filled with pixel value (128, 128, 128); the image is normalized, and the processed data inputData is fed into the network.
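A minimal sketch of this adaptive scaling, assuming the scaled image is centered on the gray canvas (the patent does not state the placement) and using nearest-neighbour resizing to stay dependency-free:

```python
import numpy as np

def letterbox(img, size=608, pad_value=128):
    """Adaptive scaling as in Step1: shrink an m x n image so both sides fit
    within `size`, paste it onto a (128,128,128) canvas, normalize to [0, 1].
    Nearest-neighbour resize; placement on the canvas is an assumption."""
    h, w = img.shape[:2]
    scale = min(size / h, size / w)
    nh, nw = int(h * scale), int(w * scale)
    # Nearest-neighbour index maps for the resize:
    ys = (np.arange(nh) / scale).astype(int).clip(0, h - 1)
    xs = (np.arange(nw) / scale).astype(int).clip(0, w - 1)
    resized = img[ys][:, xs]
    canvas = np.full((size, size, 3), pad_value, dtype=np.uint8)
    y0, x0 = (size - nh) // 2, (size - nw) // 2   # center the image (assumed)
    canvas[y0:y0 + nh, x0:x0 + nw] = resized
    return canvas.astype(np.float32) / 255.0      # normalize to [0, 1]
```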
Step2, features are first extracted with 64 convolution kernels of 3 × 3, then normalized, and finally activated with LeakyReLU; the feature map size is 608 × 608 × 64.
Step3, features are extracted with 64 convolution kernels of 3 × 3, normalized, activated with LeakyReLU, and finally down-sampled with max pooling; the convolution output size is 608 × 608 × 64 (304 × 304 × 64 after pooling).
Step4, features are extracted with 128 convolution kernels of 3 × 3, normalized, and activated with LeakyReLU; the feature map size is 304 × 304 × 128.
Step5, features are extracted with 128 convolution kernels of 3 × 3, normalized, activated with LeakyReLU, and down-sampled with max pooling; the convolution output size is 304 × 304 × 128 (152 × 152 × 128 after pooling).
Step6, features are extracted with 256 convolution kernels of 3 × 3, normalized, and activated with LeakyReLU; the feature map size is 152 × 152 × 256.
Step7, features are extracted with 256 convolution kernels of 3 × 3, normalized, activated with LeakyReLU, and down-sampled with max pooling; the convolution output size is 152 × 152 × 256 (76 × 76 × 256 after pooling).
Step8, features are extracted with 512 convolution kernels of 3 × 3, normalized, and activated with LeakyReLU; the feature map size is 76 × 76 × 512.
Step9, features are extracted with 512 convolution kernels of 3 × 3, normalized, activated with LeakyReLU, and down-sampled with max pooling; the convolution output size is 76 × 76 × 512 (38 × 38 × 512 after pooling).
Step10, features are extracted with 512 convolution kernels of 3 × 3, normalized, and activated with LeakyReLU; the feature map size is 38 × 38 × 512.
Step11, features are extracted with 512 convolution kernels of 3 × 3, normalized, activated with LeakyReLU, and down-sampled with max pooling; the convolution output size is 38 × 38 × 512 (19 × 19 × 512 after pooling).
Step12, the feature map Feature_out produced by Step11 has size 19 × 19 × 512; flattening it yields a vector Vector_1 of size 1 × 184832.
Step13, vector mapping with a fully connected layer maps Vector_1 to another vector Vector_2 of size 1 × 4096, using the LeakyReLU activation function.
Step14, vector mapping with a fully connected layer maps Vector_2 to another vector Vector_3 of size 1 × 128, using the LeakyReLU activation function.
Step15, for the vector mapping output, a sigmoid activation function is applied to Vector_3 to constrain each output value to the range [0, 1]; the output vector Vector_4 has size 1 × 5, and its components x, y, w, h and angle are converted by the following formulas.
Angle_obj = (angle − 0.5) · π
k1 = tan(Angle_obj)
y3 = (x3 − x1) · k1 + y1
x4 = x1 + x2 − x3
y4 = y1 + y2 − y3
(x1, y1), (x2, y2), (x3, y3), (x4, y4): the coordinates of the four vertices of the license plate;
x, y: the abscissa and ordinate of the predicted license plate center point, as ratios relative to the whole image;
w, h: the predicted width and height of the license plate, as ratios relative to the whole image;
W, H: the actual width and height of the image;
angle: the predicted rotation angle of the license plate;
Angle_obj: the license plate rotation angle after conversion;
k1: the slope after rotation-angle conversion;
and step three, training a detection network by using the marked data sets, firstly training a basic license plate image data set in order to enable the network to fit more quickly, then adjusting the learning rate to train the rest data sets, and finally integrally training all the data sets together.
And step four, constructing a second half part, namely an identification network, in the end-to-end license plate detection and identification network.
And on the basis of the prediction frame obtained in the step two, calculating the position of the prediction frame in the main network feature map, and respectively extracting the features of the part.
As shown in FIG. 3, the recognition network calculates the feature maps L2, L5 and L10 of the detection network using the formula of step two and crops out the regions requiring license plate recognition as L2_crop, L5_crop and L10_crop, with feature sizes (608·w) × (608·h) × 64, (304·w) × (304·h) × 128 and (152·w) × (152·h) × 256, where w and h are the relative width and height.
ROI pooling is performed on the cropped feature maps L2_crop, L5_crop and L10_crop to obtain three feature maps L2_pool, L5_pool and L10_pool of sizes 8 × 16 × 64, 8 × 16 × 128 and 8 × 16 × 256 respectively.
The three feature maps are spliced along the third (channel) dimension into a feature of size 8 × 16 × 448 and then flattened into the feature map F of size 1 × 57344.
Seven classifiers are constructed to classify the seven characters on the license plate.
Classifier 1 (classifier_1): a 57344 × 128 fully connected layer, a 128 × 34 fully connected layer and a sigmoid activation function.
Classifier 2 (classifier_2): a 57344 × 128 fully connected layer, a 128 × 25 fully connected layer and a sigmoid activation function.
Classifiers 3 to 7 (classifier_3 to classifier_7): each a 57344 × 128 fully connected layer, a 128 × 34 fully connected layer and a sigmoid activation function.
From each classifier's result, the position of the maximum value in the output vector is found and its index obtained; the corresponding character is then looked up from the tables in Step4 of step one using this index.
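The argmax lookup can be sketched as follows; the inverse tables are partial stand-ins for the Step4 tables, and the +1 index offset for the province head is an assumption.

```python
import numpy as np

# Partial inverse lookup tables for illustration (entries are stand-ins):
PROVINCES = {1: "皖", 2: "沪", 11: "苏", 12: "浙", 13: "京"}
CITY_LETTERS = "ABCDEFGHJKLMNPQRSTUVWXYZ"   # no I or O, per GA36-2007
TAIL = "0123456789" + CITY_LETTERS          # alphabet for characters 3-7

def decode_plate(scores):
    """Map the 7 classifier score vectors back to characters via argmax,
    mirroring the lookup described above (index offsets are assumptions)."""
    idx = [int(np.argmax(s)) for s in scores]
    plate = PROVINCES.get(idx[0] + 1, "?")            # head 1: province code
    plate += CITY_LETTERS[idx[1]]                     # head 2: city letter
    plate += "".join(TAIL[i] for i in idx[2:])        # heads 3-7
    return plate
```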
And step five, training, detecting and identifying the whole network.
The invention was tested on the public CCPD data set and a self-collected new energy license plate data set; the specific hardware and software environment is as follows:
TABLE 1 hardware and software environment parameter table
Table1.Parameters of the Hardware and Software
Environmental parameter | Description
CPU | Intel(R) Core(TM) i5-9600
Memory | 16.00 GB
Graphics card | GeForce GTX 1660 Super
Hard disk | SA400S37/480G
CUDA version | CUDA 10.0
Operating system platform | Ubuntu 16.04
Experiment simulation platform | Python 3.6
At present, the common indexes for evaluating target detection include the number of images processed per second (FPS) and the average precision (AP). Experiments were carried out on the data sets described above, with the results shown in the table:
Method | FPS | AP | Base | Rotate
SSD300 | 40 | 94.4 | 99.1 | 95.6
YOLOv3-416 | 42 | 93.1 | 98 | 94
Fast R-CNN | 15 | 92.9 | 98.1 | 91.8
MyNet | 64 | 90.5 | 99.3 | 93
It can be seen from the experimental results that the speed of the invention reaches 64 frames per second; although its average precision over all data sets is somewhat lower, the invention achieves the best result on the basic license plate data set while running markedly faster than the other methods.
Claims (8)
1. An end-to-end license plate detection and identification method based on deep learning is characterized by comprising the following steps;
acquiring license plate images under various weather conditions and scenes through a digital camera and a mobile phone camera, and labeling and sorting a license plate image data set;
step two, establishing a detection network which is the first half part of an end-to-end license plate detection and identification network, wherein the network is divided into a backbone network and is used for extracting the image characteristics of the license plate; the full-connection network is used for mapping the extracted image features of the license plate to the width, height, coordinates and angle vectors of the prediction frame; the output layer network is used for activating and finally outputting width, height, coordinate and angle information;
training a detection network by using the license plate image data set arranged in the step one, training a basic license plate image data set firstly in order to enable the network to be fitted more quickly, then training the rest data sets by adjusting the learning rate, and finally integrally training by using all the data sets;
step four, building a second half part-recognition network in the end-to-end license plate detection and recognition network, calculating the position of the prediction frame on the convolution layer of the main network on the basis of the prediction frame obtained in the step two, respectively taking out the characteristics of the part, and performing pooling, splicing and character recognition on the taken out characteristics;
and step five, training, detecting and identifying the whole network.
2. The end-to-end license plate detection and identification method based on deep learning of claim 1, wherein the labeling and sorting of the license plate image data set in the first step is specifically as follows:
step1, distinguishing all images according to categories, wherein the images comprise a basic license plate image, a license plate image rotating at a small angle and a license plate image rotating at a large angle;
the basic license plate images have no rotation and no complex scene, and are clear images shot from the front;
the license plate images rotated at a small angle have a rotation angle within ±10 degrees and include blurred license plate images, license plate images in dark or bright light, and license plate images taken far from the camera;
the license plate images rotated at a large angle have a rotation angle within ±30 degrees and include license plate images taken in rainy, snowy and foggy weather, as well as images containing no license plate;
Step 2, for license plate images whose targets are already labeled, proceed directly to the next processing; for license plate images that are not yet labeled, label the license plate target with roLabelImg;
Step 3, normalize the labeled image data set;
Step 4, encode the license plate characters as numbers so that the loss can be computed conveniently after deep-learning inference; the first character (province abbreviation) is encoded as follows:
The second character (city letter) is encoded as follows:
The third to seventh characters are encoded as follows:
Because the digit 0 and the letter O, and the digit 1 and the letter I, are difficult to distinguish, the letters I and O do not appear in the third to seventh characters of a license plate under section 5.9.1 of GA36-2007, the motor vehicle license plate standard of the People's Republic of China; these two letters are therefore omitted from the encoding, and a plate whose second character is 0 generally indicates a police vehicle;
Step 5, store the photo address and the label corresponding to each license plate image as one line of a file, in the following format:
photo address, x_obj, y_obj, W_obj, H_obj, Angle_obj, [code1, code2, code3, code4, code5, code6, code7]
where x_obj, y_obj, W_obj, H_obj, Angle_obj are the abscissa, ordinate, width, height, and angle parameter values of the target box after the Step 3 conversion, and code1 to code7 are the code values obtained from the Step 4 license plate character encoding.
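The encoding tables referenced above appear only in the patent's figures, so the orderings below are illustrative assumptions; only the exclusion of I and O (per GA36-2007 5.9.1) is taken from the text. A minimal Python sketch of the Step 4 encoding:

```python
# Sketch of the Step 4 character encoding. PROVINCES, CITY_LETTERS, and
# TAIL_ALPHABET are assumed orderings (the patent's actual code tables are
# in unreproduced figures); only the exclusion of I and O comes from the text.

PROVINCES = list("京津冀晋蒙辽吉黑沪苏浙皖闽赣鲁豫鄂湘粤桂琼渝川贵云藏陕甘青宁新")
CITY_LETTERS = list("ABCDEFGHJKLMNPQRSTUVWXYZ")            # 24 letters, no I or O
TAIL_ALPHABET = [str(d) for d in range(10)] + CITY_LETTERS  # positions 3-7

def encode_plate(plate: str) -> list:
    """Map a 7-character plate string to the code values stored in the label file."""
    return ([PROVINCES.index(plate[0]), CITY_LETTERS.index(plate[1])]
            + [TAIL_ALPHABET.index(ch) for ch in plate[2:]])
```

With these assumed tables, `encode_plate("京A12345")` yields `[0, 0, 1, 2, 3, 4, 5]`.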
3. The end-to-end license plate detection and identification method based on deep learning of claim 2, wherein labeling the license plate target with roLabelImg is specifically:
1) a data set labeled with the software roLabelImg is converted by the following formula;
c_x, c_y: the abscissa and ordinate of the center point of the license plate target;
w_x, h_x: the width and height of the license plate target;
angle: the rotation angle of the license plate target;
w, h: the width and height of the image containing the license plate target;
x_obj, y_obj: the converted abscissa and ordinate of the license plate target;
W_obj, H_obj: the converted width and height of the license plate target;
Angle_obj: the converted rotation angle of the license plate target, positive clockwise and negative counterclockwise;
2) a data set labeled with four vertex coordinates requires the following formula to convert it to the normalized input, the four points being the top left corner p1(x1, y1), top right corner p2(x2, y2), bottom right corner p3(x3, y3), and bottom left corner p4(x4, y4); the conversion formula is as follows:
x_obj, y_obj: the converted abscissa and ordinate of the license plate target center point;
W_obj, H_obj: the converted width and height of the license plate target;
k: the slope of the converted license plate target rotation angle;
Angle_obj: the converted rotation angle of the license plate target, positive clockwise and negative counterclockwise.
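The conversion formulas themselves appear only as images in the original filing, so the following is a plausible reconstruction from the variable definitions above (center from the mean of the four corners, width and height from the top and right edges, angle from the slope of the top edge), not the patent's exact formula:

```python
import math

def corners_to_rbox(p1, p2, p3, p4, img_w, img_h):
    """Reconstruct (x_obj, y_obj, W_obj, H_obj, Angle_obj) from the four
    labeled corners (top left, top right, bottom right, bottom left) and
    the image size. Assumptions: center = mean of the corners, width along
    edge p1-p2, height along edge p2-p3, angle from the slope k of the top
    edge (clockwise positive in image coordinates)."""
    xs = (p1[0], p2[0], p3[0], p4[0])
    ys = (p1[1], p2[1], p3[1], p4[1])
    x_obj = sum(xs) / 4 / img_w            # normalized center abscissa
    y_obj = sum(ys) / 4 / img_h            # normalized center ordinate
    w_obj = math.dist(p1, p2) / img_w      # normalized width
    h_obj = math.dist(p2, p3) / img_h      # normalized height
    k = (p2[1] - p1[1]) / (p2[0] - p1[0])  # slope of the top edge
    return x_obj, y_obj, w_obj, h_obj, math.atan(k)
```

For an unrotated 100 × 50 plate centered in a 200 × 100 image, this returns a zero angle and center ratios of 0.25.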
4. The end-to-end license plate detection and identification method based on deep learning of claim 1, wherein the structure of the detection network in the second step is specifically as follows:
VGG16 is adopted as the backbone network, whose convolutional layers are structured as follows:
L1: 64 3×3 convolution kernels, with a LeakyReLU activation function;
L2: 64 3×3 convolution kernels, a LeakyReLU activation function, and max pooling with stride 2;
L3: 128 3×3 convolution kernels, with a LeakyReLU activation function;
L4: 128 3×3 convolution kernels, a LeakyReLU activation function, and max pooling with stride 2;
L5: 256 3×3 convolution kernels, with a LeakyReLU activation function;
L6: 256 3×3 convolution kernels, with a LeakyReLU activation function;
L7: 256 3×3 convolution kernels, a LeakyReLU activation function, and max pooling with stride 2;
L8: 512 3×3 convolution kernels, with a LeakyReLU activation function;
L9: 512 3×3 convolution kernels, with a LeakyReLU activation function;
L10: 512 3×3 convolution kernels, a LeakyReLU activation function, and max pooling with stride 2;
L11: 512 3×3 convolution kernels, with a LeakyReLU activation function;
L12: 512 3×3 convolution kernels, with a LeakyReLU activation function;
L13: 512 3×3 convolution kernels, a LeakyReLU activation function, and max pooling with stride 2, yielding a matrix that is then flattened.
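As a sanity check on the layout above: with a 608 × 608 input (consistent with the (608·w) × (608·h) feature size quoted in claim 7), the five stride-2 pooling layers reduce the spatial size to 19 × 19, so the flattened output is 19 × 19 × 512 = 184832, matching the input dimension of the first fully connected layer in claim 5. A small sketch:

```python
# Spatial-size bookkeeping for the backbone: 3x3 convolutions with padding
# preserve the feature map size, and each stride-2 max pooling halves it.
# "M" marks the pooling after L2, L4, L7, L10, and L13.

VGG16_CFG = [64, 64, "M", 128, 128, "M", 256, 256, 256, "M",
             512, 512, 512, "M", 512, 512, 512, "M"]

def flattened_dim(size, cfg=VGG16_CFG):
    channels = 3
    for item in cfg:
        if item == "M":
            size //= 2        # max pooling with stride 2 halves the size
        else:
            channels = item   # conv layer: channels change, size preserved
    return size * size * channels

# 608 -> 304 -> 152 -> 76 -> 38 -> 19, so the flattened vector is 19*19*512
```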
5. The deep-learning-based end-to-end license plate detection and identification method of claim 1, wherein the fully connected layer network structure is as follows:
L14: a fully connected layer with input dimension 184832 and output dimension 4096, with a ReLU activation function;
L15: a fully connected layer with input dimension 4096 and output dimension 128, with a ReLU activation function.
The output layer network structure is as follows:
L16: a fully connected layer with input dimension 128 and output dimension 5, with a sigmoid activation function; the 5 output dimensions are the width, height, abscissa, ordinate, and rotation angle of the license plate prediction box;
Angle_obj = (angle - 0.5) * π
k1 = tan(Angle_obj)
y3 = (x3 - x1) * k1 + y1
x4 = x1 + x2 - x3
y4 = y1 + y2 - y3
(x1, y1), (x2, y2), (x3, y3), (x4, y4): the coordinates of the four vertices of the license plate;
x, y: the abscissa and ordinate of the predicted license plate center point, as ratios relative to the whole image;
w, h: the predicted width and height of the license plate, as ratios relative to the whole image;
W, H: the actual width and height of the whole image;
angle: the predicted rotation angle of the license plate;
Angle_obj: the converted license plate rotation angle;
k1: the slope corresponding to the converted license plate rotation angle.
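The formulas above can be implemented directly; note the claim gives the recovery of y3, x4, and y4, but not how x1 through x3 are obtained from (x, y, w, h), so those are taken as inputs in this sketch:

```python
import math

def decode_vertices(x1, y1, x2, y2, x3, angle):
    """Apply the claim's formulas: map the sigmoid output angle in (0, 1) to
    a rotation in (-pi/2, pi/2), take its slope k1, and recover y3 and the
    fourth vertex. How x1, y1, x2, y2, x3 themselves come from (x, y, w, h)
    is not reproduced in the text, so they are inputs here."""
    angle_obj = (angle - 0.5) * math.pi
    k1 = math.tan(angle_obj)
    y3 = (x3 - x1) * k1 + y1   # p3 lies on the line through p1 with slope k1
    x4 = x1 + x2 - x3          # fourth vertex per the claim's formulas
    y4 = y1 + y2 - y3
    return (x1, y1), (x2, y2), (x3, y3), (x4, y4)
```

For angle = 0.5 the rotation is zero (k1 = 0), so y3 collapses to y1 and the box is axis-aligned.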
6. The end-to-end license plate detection and recognition method based on deep learning of claim 1, wherein training the detection network in step three is specifically:
first, shuffle the basic license plate data set and train for 100 batches;
second, shuffle the remaining data sets containing license plates and train for 200 batches;
then, shuffle the data sets without license plates and train for 50 batches;
finally, train for 50 batches on all data sets.
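The staged curriculum above can be sketched as follows; `train_batches` and the data set arguments are hypothetical names, and "disturbing" is read as shuffling:

```python
import random

def curriculum(basic, rest_with_plates, no_plate, train_batches):
    """Claim 6 schedule: shuffle each data set, then train the stated number
    of batches. train_batches(dataset, n) is a hypothetical training callback;
    per-stage learning-rate adjustment is left to it."""
    stages = [(basic, 100),                               # basic plates
              (rest_with_plates, 200),                    # remaining plate images
              (no_plate, 50),                             # no-plate images
              (basic + rest_with_plates + no_plate, 50)]  # everything together
    for dataset, n_batches in stages:
        random.shuffle(dataset)
        train_batches(dataset, n_batches)
```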
7. The deep learning-based end-to-end license plate detection and identification method according to claim 1, wherein the identification network in the fourth step is specifically:
convert the width, height, abscissa, and ordinate of the L16 prediction box from step two into concrete feature-map coordinate information, with the conversion formula as follows:
(x1, y1), (x2, y2), (x3, y3), (x4, y4) are the four points on the feature map; w and h are the relative width and height; scale_j is the actual size of each feature map; (x_i, y_i) is the actual coordinate point on each feature map;
compute the above formula on the feature maps of L2, L4, and L7, and crop out the feature maps needed for license plate recognition, L2_crop, L4_crop, and L7_crop, with feature sizes (608·w) × (608·h) × 64, (304·w) × (304·h) × 128, and (152·w) × (152·h) × 256 respectively, where w and h are the predicted relative width and height of the license plate;
perform ROI pooling on the cropped feature maps L2_crop, L4_crop, and L7_crop, finally obtaining three feature maps L2_pooling, L4_pooling, and L7_pooling of sizes 8 × 16 × 64, 8 × 16 × 128, and 8 × 16 × 256 respectively;
concatenate the three feature maps along the third (channel) dimension into size 8 × 16 × 448, then flatten to form the feature F of size 1 × 57344.
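ROI pooling here reduces each cropped map, whatever its spatial size, to a fixed 8 × 16 grid, so the channel-wise concatenation always yields 8 × 16 × (64 + 128 + 256) = 57344 values. A pure-Python sketch of the per-channel adaptive max pooling (a real implementation would operate on tensors):

```python
def roi_pool(fmap, out_h=8, out_w=16):
    """Adaptive max pooling of a 2-D feature map (list of rows) down to a
    fixed out_h x out_w grid, applied per channel."""
    in_h, in_w = len(fmap), len(fmap[0])
    pooled = []
    for i in range(out_h):
        r0 = i * in_h // out_h
        r1 = max((i + 1) * in_h // out_h, r0 + 1)   # at least one row per bin
        row = []
        for j in range(out_w):
            c0 = j * in_w // out_w
            c1 = max((j + 1) * in_w // out_w, c0 + 1)
            row.append(max(fmap[r][c] for r in range(r0, r1)
                                      for c in range(c0, c1)))
        pooled.append(row)
    return pooled
```

Pooling a 16 × 32 single-channel crop this way gives an 8 × 16 grid whose last cell holds the maximum of the bottom-right 2 × 2 bin.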
8. The deep-learning-based end-to-end license plate detection and recognition method of claim 1, wherein the seven characters of the license plate are classified by constructing 7 classifiers:
Classifier 1 (classifier_1): a 57344 × 128 fully connected layer, a 128 × 34 fully connected layer, and a sigmoid activation function;
Classifier 2 (classifier_2): a 57344 × 128 fully connected layer, a 128 × 25 fully connected layer, and a sigmoid activation function;
Classifier 3 (classifier_3): a 57344 × 128 fully connected layer, a 128 × 34 fully connected layer, and a sigmoid activation function;
Classifier 4 (classifier_4): a 57344 × 128 fully connected layer, a 128 × 34 fully connected layer, and a sigmoid activation function;
Classifier 5 (classifier_5): a 57344 × 128 fully connected layer, a 128 × 34 fully connected layer, and a sigmoid activation function;
Classifier 6 (classifier_6): a 57344 × 128 fully connected layer, a 128 × 34 fully connected layer, and a sigmoid activation function;
Classifier 7 (classifier_7): a 57344 × 128 fully connected layer, a 128 × 34 fully connected layer, and a sigmoid activation function.
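A shape check for the seven heads: each consumes the 1 × 57344 feature F through two fully connected layers and a sigmoid. The output width 34 for positions 3 to 7 matches 10 digits plus 24 letters with I and O excluded; the 25 classes of classifier 2 and the 34 of classifier 1 are not broken down in the text, so those counts are simply taken as given:

```python
# Dimension bookkeeping for the seven classifier heads (claim 8).
FEATURE_DIM = 8 * 16 * (64 + 128 + 256)       # flattened feature F: 57344
HEAD_CLASSES = [34, 25, 34, 34, 34, 34, 34]   # output widths, classifiers 1-7

def head_params(c_out, hidden=128, d_in=FEATURE_DIM):
    """Parameter count of one head: two fully connected layers with biases."""
    return (d_in * hidden + hidden) + (hidden * c_out + c_out)
```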
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210332461.3A CN114581904A (en) | 2022-03-31 | 2022-03-31 | End-to-end license plate detection and identification method based on deep learning |
Publications (1)
Publication Number | Publication Date |
---|---|
CN114581904A true CN114581904A (en) | 2022-06-03 |
Family
ID=81778812
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114898352A (en) * | 2022-06-29 | 2022-08-12 | 松立控股集团股份有限公司 | Method for simultaneously realizing image defogging and license plate detection |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||