CN107657275B

CN107657275B - License plate pre-detection method based on improved BING algorithm

Info

Publication number: CN107657275B
Application number: CN201710854432.2A
Authority: CN
Inventors: 马争; 解梅; 李佩伦; 秦方
Original assignee: University of Electronic Science and Technology of China
Current assignee: Houpu Clean Energy Group Co ltd
Priority date: 2017-09-20
Filing date: 2017-09-20
Publication date: 2020-07-21
Anticipated expiration: 2037-09-20
Also published as: CN107657275A

Abstract

The invention discloses a license plate pre-detection method based on an improved BING algorithm, and belongs to the field of license plate recognition. According to the invention, the BING detection window is improved according to the specific shape of the license plate, the BING detection characteristics are enriched, and the recommendation frame given by the BING can completely contain the whole license plate.

Description

License plate pre-detection method based on improved BING algorithm

Technical Field

The invention belongs to the field of computer vision, and particularly relates to the technical field of license plate recognition.

Background

With the rapid development of economy, more and more Chinese people own private cars. Therefore, an intelligent transportation system is needed to help people manage traffic order. The license plate is the identity card of the motor vehicle, and the only corresponding motor vehicle can be found when the license plate is found, so the license plate recognition system is an important component of the intelligent traffic system. The license plate detection is the first step of license plate recognition and is also the most important step, and a good license plate detection algorithm requires high detection speed and high recall ratio. The traditional license plate detection mainly comprises the following methods:

1) the method based on the edge detection mainly depends on the abundant texture characteristics of a license plate area, and if the background environment is complex or the license plate part is too close to an automobile exhaust outlet area (also has abundant textures), the license plate is difficult to detect.

2) According to the color-based license plate positioning method, most license plates are blue or yellow, so that the color area can be used for detecting the license plates, the effect of the license plates is good in many times by using color information, but the blue license plates or yellow license plates of blue cars cannot be completely detected, and meanwhile, the algorithm is sensitive to illumination and cannot detect the (white) license plates of police cars.

3) The BING algorithm detects objects under different scales by using 8-by-8 sliding windows, and the result of 8-by-8 is 64, so that the characteristics of a 64-bit computer can be fully utilized to improve the detection speed. Because the existing BING algorithm adopts 8 × 8 sliding windows, the detected recommendation frame (the position identifier of the detection target, usually a rectangular frame covering the detection target, whose coordinates are located by the coordinates of the upper left corner and the lower right corner of the rectangle) does not just include a complete object, but usually only includes most of the region of the target, and the detection rate accuracy is extremely low when the number of recommendation frames of a single picture reaches a certain number, for example, when the number of recommendation frames of a single picture reaches 5000, the recall rate (recall rate) of the target object can reach 99.5%.

Disclosure of Invention

The invention aims to: aiming at the existing problems, the method can quickly detect all regions which may be the license plate, reduce the subsequent detection range and further accelerate the speed of license plate recognition.

The invention discloses a license plate pre-detection method based on an improved BING algorithm, which comprises the following steps:

step 1: training a classifier:

101: inputting a training sample set, wherein the training sample set comprises an image containing a license plate and license plate position marking information in the image;

102: extracting a positive sample: extracting a license plate section image from the training sample image, and carrying out size normalization processing on the license plate section image to scale the size to 24 x 8; calculating a gradient image of the license plate section image after size normalization to obtain a gradient license plate section image, and carrying out image column vectorization to obtain a positive sample;

103: extracting a negative sample: randomly extracting slice images of non-license plates from training sample images, scaling the slice images to be 24 x 8, then calculating gradient images of the slice images, and vectorizing the gradient image columns to obtain negative samples;

104: training a classifier: training a classifier based on the positive and negative samples, and after removing a bias term of the classifier, arranging the classifier into 24 × 8 template pictures;

105: dividing the template picture trained in the step 104 into 3 sub-templates with the size of 8 x 8, and calculating the binary gradient amplitude of the pixel in each sub-template

Wherein b is_kRepresenting the kth bit plane, N_gAn estimated hyper-parameter representing a gradient image;

bit surface: the jth bit of each pixel in 8 bit/pixel of a gray level image is extracted to obtain a binary image called a bit plane.

Step 2: detection treatment:

step 201: inputting a picture to be pre-detected;

step 202: constructing an 8-layer image pyramid of a picture to be pre-detected, wherein the scaling coefficient of the image pyramid is 0.8;

step 203: calculating the binarization gradient amplitude of each pixel point of the 8-layer image pyramid, wherein the calculation formula of the binarization gradient amplitude of each layer is

Where l denotes the l-th layer image, b_k,lK-th bit plane, N, representing the l-th layer image_gEstimated hyper-parameters, N, representing a gradient image_gGenerally, 4 is taken to represent the bit plane of the upper 4 bits.

Adopting 24 × 8 sliding windows to perform sliding window detection on a picture to be detected, extracting the binary gradient amplitude of each pixel point in the window every time of sliding window, sequentially dividing the sliding window result into 3 8 × 8 sub-sliding window results according to the direction of the sliding window, and inputting the sub-sliding window results into a classifier (namely, inputting the sub-sliding window results into the classifier)The sub sliding window results are respectively matched with each sub template of the classifier), and the final judgment score s of each sliding window is obtained_l；

Wherein

Symbol<·>Representing an inner product operation;

step 204: final decision score s for each sliding window_lAnd performing non-maximum suppression processing, and outputting a recommended frame of the picture to be pre-detected, namely the position of the sliding window left after the non-maximum suppression processing.

In order to further increase the detection speed, an 8-bit char type is used for storing the gradient amplitude value g of a single pixel point_lAnd when the binary gradient amplitude of each pixel point in the window is extracted every time, only the first four digits (high four digits) are extracted.

In summary, due to the adoption of the technical scheme, the invention has the beneficial effects that: the innovation points of the invention are as follows:

the invention has the innovation that according to the specific shape of the license plate, the BING detection window is improved (three BING detections are superposed), and the BING detection characteristics are enriched, so that the given recommendation frame can almost completely contain the whole license plate. Because the BING detection form is changed, the detection is not carried out under images with different scales like the BING original method, and a good detection effect can be obtained only by adopting the traditional pyramid method; the detection mode of combining a plurality of BING classifiers is suitable for detecting rigid objects with any aspect ratio (by changing the combination form of the BING), expands the size of the sliding window, fully utilizes the operation performance of a modern 64-bit computer and improves the detection speed.

Drawings

FIG. 1 is a training flow diagram of an embodiment of the present invention;

FIG. 2 is a detection process according to an embodiment of the invention;

Detailed Description

In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention will be described in further detail with reference to the following embodiments and accompanying drawings.

The invention relates to a license plate pre-detection method based on an improved BING algorithm, which comprises the following specific implementation steps of:

step 1: training the classifier, see fig. 1, which comprises the following steps, respectively:

step 101: preparing a training sample set: in the embodiment, the training sample set comprises 1900 pictures containing the license plate and 1900 markup files, and the real position of the license plate in each picture is recorded in the markup files.

Step 102: extracting a positive sample: and respectively intercepting license plate region parts in 1900 pictures, wherein in the specific embodiment, only one license plate in each picture is limited, and then obtaining a slice image of the 1900 license plates.

And (3) carrying out size normalization processing on the slice images: each slice is scaled to a size of 24 x 8. The gradient of the slice after size normalization is calculated to obtain a gradient slice, and 24 × 8 pixels are aligned in a row to form a positive sample.

Step 103: extracting a negative sample: pictures are randomly cut out from 1900 images (the cut-out part cannot be overlapped with the positive sample), 10 slices are cut out from each picture, and 19000 negative sample slices are obtained. Similarly, these slices are scaled to a 24 × 8 picture size, the gradient is calculated, and 24 × 8 pixels of the gradient slice are aligned in a row to form a negative sample.

Step 104: training a classifier: after steps 102 and 103, the total training samples are 1900 (positive) +19000 (negative) ═ 20900. It is put into librinear (open source training program, applicable to training linear svm) for training, resulting in a 193-dimensional classifier, and the final dimension (bias term) is removed to become 192-dimensional. The 192-dimensional classifiers are then arranged into 24-by-8 template pictures.

Step 105: cutting by a classifier: the classifier adopted by the invention is 24 × 8 in size, so the classifier is divided into 3 sub-classifiers 8 × 8 in size, the dimension of each sub-classifier is 64, and each sub-classifier is represented by w. I.e. dividing the template picture into 3 8 x 8 sizesAnd calculating the binary gradient amplitude of the pixel in each sub-template

Wherein b is_kRepresenting the kth bit plane, N_gRepresenting the estimated hyper-parameters of the gradient image.

Step 2: detection processing, referring to fig. 2, the detection processing specifically comprises the steps of:

step 201: inputting a picture to be pre-detected;

step 202: constructing an image pyramid: since the specific sizes of the license plates in different pictures are different, in order to search in images with different scales, an image pyramid with a scaling coefficient of 0.8 is used in the invention. For example, if the original picture size is 640 x 320, the first level of pyramids is 640 x 320, the second level of pyramids is 512 x 256, the third level of pyramids is 410 x 405, and so on. The pyramid has 8 layers in total, and the scaling coefficient can balance the detection speed and the detection effect.

Step 203: since the linear model w can be represented as the sum of a series of binary basis vectors

Wherein N is_wThe estimated hyperparameter representing the linear classifier, generally taken as 4, represents the approximate estimate of the linear classifier's value with the high 4-bit plane, β_j(real number) denotes a calibration coefficient, a_j∈{-1,1}⁶⁴，a_jCan be further expressed as a binary vector and its complement:

the test model (object to be pre-detected) can be expressed as:

where b is the gradient magnitude after binarization.

Calculating the binarization gradient amplitude of each pixel point of the image pyramid obtained in the step 202

Where l denotes the l-th layer image, b_k,lK-th bit plane, N, representing the l-th layer image_gGenerally, 4 is taken to represent the bit plane of the upper 4 bits.

And an 8bits char type is used for storing the gradient amplitude g of a single pixel point_lFor an 8 × 8 sliding window (the sub-classifier of the present invention), it can be regarded as an 8 × 8 storage cube, and the operation of fetching bit faces can be performed, so as to obtain 8 × 8bit faces. In order to accelerate the operation, only the first four layers (high four bits) are put into the classifier trained in the step 1 for sliding window method detection, namely three sub-classifiers are simultaneously subjected to sliding window detection, and the sliding window scores are added, so that the final judgment score s is obtained_l：

Wherein c is_j,kCan be obtained quickly by using bit operation and SSE (sequencing SIMD extensions) instruction operation.

Step 204: carrying out non-maximum suppression processing on the sliding window detection result:

after step 203 is executed, a score map can be obtained, a higher score can be obtained at the correct position of the license plate, and the score near the correct position is not too low, so that a non-maximum value is required to inhibit and exclude a sliding window with the peripheral score not being the highest, the number of the finally output recommended frames is obviously reduced, and the accuracy is improved.

Step 205: and outputting the detection result, namely a recommendation box.

The pre-detection method can effectively detect whether the blue license plate, the yellow license plate or the white license plate is detected, has extremely high detection speed and low omission factor, and can obtain correct detection results only by carrying out one-time screening (non-maximum inhibition treatment) on the detection results in the subsequent process.

Claims

1. The license plate pre-detection method based on the improved BING algorithm is characterized by comprising the following steps of:

step 1: training a classifier:

step 2: detection treatment:

step 201: inputting a picture to be pre-detected;

Where l denotes the l-th layer image, b_k,lRepresenting the kth ratio of the l layer imageSpecial face, N_gAn estimated hyper-parameter representing a gradient image;

adopting 24 × 8 sliding windows to perform sliding window detection on a picture to be detected in advance, extracting the binary gradient amplitude of each pixel point in the window every time of sliding window, sequentially dividing the sliding window result into 3 8 × 8 sub-sliding window results according to the sliding window direction, inputting the sub-sliding window results into a classifier, and obtaining the final judgment score s of each sliding window_l：

Wherein

N_wRepresenting estimated hyper-parameters of the classifier, β_jRepresenting calibration coefficients, signs<·>Representing an inner product operation;

step 204: final decision score s for each sliding window_lAnd carrying out non-maximum suppression processing and outputting a recommendation frame of the picture to be pre-detected.

2. The method of claim 1, wherein the estimated hyper-parameter N of the gradient image_g＝4。