CN108388920B - HOG and LBPH characteristic fused identity card copy detection method - Google Patents

HOG and LBPH characteristic fused identity card copy detection method Download PDF

Info

Publication number
CN108388920B
CN108388920B (application CN201810172048.9A)
Authority
CN
China
Prior art keywords
representing
lambda
pixel
abscissa
image
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201810172048.9A
Other languages
Chinese (zh)
Other versions
CN108388920A (en)
Inventor
柯逍
卢安琪
牛玉贞
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Fuzhou University
Original Assignee
Fuzhou University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Fuzhou University filed Critical Fuzhou University
Priority to CN201810172048.9A priority Critical patent/CN108388920B/en
Publication of CN108388920A publication Critical patent/CN108388920A/en
Application granted granted Critical
Publication of CN108388920B publication Critical patent/CN108388920B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • G06F18/2411Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on the proximity to a decision surface, e.g. support vector machines
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/25Fusion techniques
    • G06F18/253Fusion techniques of extracted features
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/40Extraction of image or video features
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/40Extraction of image or video features
    • G06V10/50Extraction of image or video features by performing operations within image blocks; by using histograms, e.g. histogram of oriented gradients [HoG]; by summing image-intensity values; Projection analysis
    • G06V10/507Summing image-intensity values; Histogram projection analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/40Extraction of image or video features
    • G06V10/46Descriptors for shape, contour or point-related descriptors, e.g. scale invariant feature transform [SIFT] or bags of words [BoW]; Salient regional features
    • G06V10/467Encoded features or binary features, e.g. local binary patterns [LBP]
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V2201/00Indexing scheme relating to image or video recognition or understanding
    • G06V2201/07Target detection

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Artificial Intelligence (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • General Engineering & Computer Science (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Multimedia (AREA)
  • Image Analysis (AREA)

Abstract

The invention discloses an identity card copy detection method fusing HOG and LBPH features, comprising the following steps: first, a large number of identity card and non-identity card pictures are selected as the positive and negative samples of a training sample set; HOG features and LBPH features are extracted from the training set and used to train SVMs, yielding a first classifier and a second classifier; the first classifier performs target detection on the test image, and LBPH features are computed for the detection results; the second classifier then judges each detection by its LBPH features, keeping only the targets judged true. Because the HOG classifier's detections are re-checked by the LBPH classifier, the method is simple, fast, and efficient, with high detection accuracy.

Description

HOG and LBPH characteristic fused identity card copy detection method
Technical Field
The invention relates to the technical field of pattern recognition, in particular to an identity card copy detection method fusing HOG and LBPH characteristics.
Background
With the development of science, technology, and the economy, government agencies, enterprises, and public institutions face growing workloads, wider ranges of office activities, and stricter market demands on the speed and accuracy of office work, yet little identity card copy detection software is available on the market. Most procedures for checking identity card copies are still entirely manual, which wastes a great deal of time, manpower, and material resources, and the results are strongly affected by human factors. Traditional identity card copy detection software mostly relies on a single feature, giving low detection accuracy. Both approaches have significant disadvantages. These organizations urgently need office automation software that can automatically, quickly, and accurately verify procedures involving identity card copies, so that copies can be checked rapidly and accurately with less manpower, material, and time. Identity card copies must be provided when handling business in the financial industry, such as debit card, securities account, and financial insurance transactions. As economic activity grows and financial management receives more attention, the workload of business personnel in the financial industry has increased sharply, and identity card copies must be checked regularly.
Traditional processing methods detect using a single feature; few identity card copy detection methods exist on the market, and detection still relies almost entirely on manual inspection.
Disclosure of Invention
Aiming at the low detection efficiency and accuracy of traditional approaches, which either check identity card copies manually or detect them using only a single feature, the invention provides an identity card copy detection method fusing HOG and LBPH features.
In order to achieve the purpose, the technical scheme of the invention is as follows: an identity card copy detection method fusing HOG and LBPH characteristics comprises the following steps:
step S1: selecting a large number of identity card and non-identity card pictures as positive and negative samples of a training sample set, and carrying out scale normalization on each picture in the training sample set;
step S2: extracting HOG characteristics of the training samples after the scale normalization, training the SVM based on the HOG characteristics, and obtaining a first classifier;
step S3: extracting the LBPH characteristic of the training sample after the scale normalization, training the SVM based on the LBPH characteristic, and obtaining a second classifier;
step S4: preprocessing the test image of the identity card copy;
step S5: carrying out target detection on the preprocessed test image by using a first classifier;
step S6: calculating the LBP characteristics of the target detection result of the step S5, and generating the LBPH characteristics according to the obtained LBP characteristics;
step S7: judging with the second classifier according to the LBPH features generated in step S6, and keeping the targets judged true, thereby obtaining the identity card copies in the test image.
Further, the method for extracting the HOG features of the training samples after the scale normalization in step S2 specifically includes:
normalizing gamma space and color space and calculating the gradient of each pixel of the training sample, and using the following equation:
H(s,t) = H(s,t)^gamma
G_s(s,t) = H(s+1,t) - H(s-1,t)
G_t(s,t) = H(s,t+1) - H(s,t-1)
G(s,t) = sqrt(G_s(s,t)^2 + G_t(s,t)^2)
α(s,t) = arctan(G_t(s,t) / G_s(s,t))
where s denotes the abscissa on the training sample image, t the ordinate, G_s(s,t) the horizontal gradient at pixel (s,t), G_t(s,t) the vertical gradient at pixel (s,t), G(s,t) the gradient magnitude at pixel (s,t), and α(s,t) the gradient direction at pixel (s,t); the parameter gamma is 0.5 and H(s,t) denotes the pixel value at pixel (s,t);
the training sample is divided into cell cells, a histogram of gradient direction is constructed for each cell, then the cells are combined into blocks, the gradient histograms need to be normalized in the blocks, and finally the HOG feature vector of the training sample is generated.
Further, the method for training the SVM based on the HOG feature in step S2 to obtain the first classifier specifically includes:
using a linear kernel function:
κ(x_ε, x_l) = x_ε^T · x_l
where x_ε denotes the HOG feature vector of sample ε, x_l the HOG feature vector of sample l, κ the kernel function, and x_ε^T the transpose of x_ε; the training result is stored in an XML file;
an array alpha, an array of support vectors, and a floating-point number rho are read from the XML file; alpha is multiplied by the support vectors to obtain a row vector, the row vector is multiplied by -1, and rho is appended at its end, giving the first classifier.
Further, the method for extracting the LBPH feature of the training sample after the scale normalization in the step S3 specifically includes:
calculating the LBP features of the training sample: the LBP image is divided into several coded regions, and the pixel value of each coded image is calculated using the following formulas:
dx_n = -radius · sin(2π · n / neighbors)
dy_n = radius · cos(2π · n / neighbors)
gray(x, y)_n = gray(x + dx_n, y + dy_n)
lbp(x, y)_n = 1 if gray(x, y)_n >= gray(x, y), and 0 otherwise
lbp(x, y) = Σ_{n=0}^{neighbors-1} lbp(x, y)_n · 2^n
where x represents the abscissa on the image, y the ordinate, radius the sampling radius, neighbors the neighborhood size, and the parameter n is an integer; dx_n denotes the pixel-offset abscissa of the nth neighbor of pixel (x, y) and dy_n its pixel-offset ordinate; gray(x, y) denotes the gray value at pixel (x, y), gray(x, y)_n the gray value of the nth neighbor of pixel (x, y), lbp(x, y) the coded value at pixel (x, y), and lbp(x, y)_n the coded value of the nth neighbor of pixel (x, y); w denotes the width of each region of the LBP coded image and h its height; the LBPH feature of the training sample is then generated, and the width and height of each grid cell are obtained using the following formulas:
grad_w = LBP_i.cols / grid_x
grad_h = LBP_i.rows / grid_y
where grid_x denotes the number of cells in the width direction, grid_y the number of cells in the height direction, LBP_i the i-th coded image region in the LBP code map, cols the number of columns, rows the number of rows, LBP_i.cols the number of columns of the i-th coded image region in the LBP code map, LBP_i.rows its number of rows, grad_w the width of the grid cell, and grad_h its height; the count of each histogram value is accumulated within each grid cell in row order, and the results are stored in order into the rows of the corresponding histogram matrix; the histogram heights are then normalized; the histogram matrix is then converted, row-major, into a vector of 1 row and M · 2^neighbors columns, where M denotes the total number of regions; finally, the local histograms are concatenated to obtain the histogram of the whole training sample.
Further, the step S4 specifically includes:
step S41: inputting an identity card copy test image;
step S42: the test image is scale-normalized using a bilinear interpolation algorithm, using the following equation:
f(λ+u, j+v) = (1-u)(1-v)·f(λ, j) + (1-u)·v·f(λ, j+1) + u·(1-v)·f(λ+1, j) + u·v·f(λ+1, j+1)
where λ denotes the abscissa on the test image, j the ordinate, λ and j are integers, u and v are fractions greater than or equal to 0 and less than 1, and f(λ, j) denotes the pixel value of pixel (λ, j) on the test image;
step S43: the test image is converted to a gray scale map using the following formula:
Gray(λ,j)=[Red(λ,j)+Green(λ,j)+Blue(λ,j)]/3
where Red(λ, j) denotes the Red channel value at pixel (λ, j), Green(λ, j) the Green channel value, Blue(λ, j) the Blue channel value, and Gray(λ, j) the gray value at pixel (λ, j) in the grayscale image;
step S44: gaussian smoothing is performed using the following equation:
Gauss(λ, j) = Gray(λ, j) * (1/(2π·σ^2)) · exp(-(λ^2 + j^2)/(2σ^2))
where σ denotes the variance of the Gaussian function, * denotes convolution, and Gauss(λ, j) denotes the pixel value at pixel (λ, j) of the test image after Gaussian filtering.
Further, the step S5 specifically includes:
step S51: reading the first classifier, and performing target detection on the test image;
step S52: removing the area with the internal and external inclusion relation in the detection result, and utilizing the following formula:
(Rect_e & Rect_ψ) == Rect_e
where Rect_e denotes rectangular box e, Rect_ψ denotes rectangular box ψ, and & denotes rectangle intersection; if the formula evaluates true, rectangular box e and rectangular box ψ are in an inside-outside containment relation (box e lies inside box ψ), and the larger region is kept;
step S53: judging whether the detection results are intersected or not, and utilizing the following formula:
x_c1 = max(x_a1, x_b1)
y_c1 = max(y_a1, y_b1)
x_c2 = min(x_a2, x_b2)
y_c2 = min(y_a2, y_b2)
x_c1 <= x_c2 and y_c1 <= y_c2
where x_a1 and y_a1 denote the abscissa and ordinate of the upper-left corner of rectangular box a, x_a2 and y_a2 those of its lower-right corner, x_b1 and y_b1 those of the upper-left corner of rectangular box b, and x_b2 and y_b2 those of its lower-right corner; x_c1 is the maximum of the upper-left abscissas of boxes a and b, y_c1 the maximum of their upper-left ordinates, x_c2 the minimum of their lower-right abscissas, and y_c2 the minimum of their lower-right ordinates; if all the formulas above hold, rectangular boxes a and b intersect;
step S54: fusing the crossed areas in the detection result, if the rectangular frames are crossed, solving the crossed area, and if the crossed area is larger than a threshold value, fusing the two rectangular frames by using the following formula:
x_d1 = min(x_g1, x_r1)
y_d1 = min(y_g1, y_r1)
x_d2 = max(x_g2, x_r2)
y_d2 = max(y_g2, y_r2)
where x_d1 and y_d1 denote the abscissa and ordinate of the upper-left corner of the fused rectangle, x_d2 and y_d2 those of its lower-right corner, x_g1 and y_g1 those of the upper-left corner of rectangular box g, x_g2 and y_g2 those of its lower-right corner, x_r1 and y_r1 those of the upper-left corner of rectangular box r, and x_r2 and y_r2 those of its lower-right corner.
Compared with the prior art, the invention has the following beneficial effects: first, an SVM is trained on HOG features and saved as the first classifier; second, an SVM is trained on LBPH features to obtain the second classifier; the identity card is then preliminarily located by the first classifier; finally, the preliminary targets are passed to the second classifier for a second round of detection and classification to determine the final targets. The invention detects automatically and rapidly; combining HOG and LBPH features locates the identity card accurately, improving detection accuracy while avoiding manual inspection, saving time and effort, and eliminating the errors of manual operation.
Drawings
FIG. 1 is a flow chart of the method for detecting the copy of the identity card with the HOG and LBPH characteristics fused.
Detailed Description
The invention is further explained below with reference to the drawings and the embodiments.
As shown in fig. 1, a method for detecting an identity card copy with the fused HOG and LBPH features comprises the following steps:
step S1: selecting a large number of identity card and non-identity card pictures as positive and negative samples of a training sample set, and carrying out scale normalization on each picture in the training sample set;
step S2: extracting HOG characteristics of the training samples after the scale normalization, training the SVM based on the HOG characteristics, and obtaining a first classifier;
the method for extracting the HOG characteristics of the training sample after the scale normalization specifically comprises the following steps:
normalizing gamma space and color space and calculating the gradient of each pixel of the training sample, and using the following equation:
H(s,t) = H(s,t)^gamma
G_s(s,t) = H(s+1,t) - H(s-1,t)
G_t(s,t) = H(s,t+1) - H(s,t-1)
G(s,t) = sqrt(G_s(s,t)^2 + G_t(s,t)^2)
α(s,t) = arctan(G_t(s,t) / G_s(s,t))
where s denotes the abscissa on the training sample image, t the ordinate, G_s(s,t) the horizontal gradient at pixel (s,t), G_t(s,t) the vertical gradient at pixel (s,t), G(s,t) the gradient magnitude at pixel (s,t), and α(s,t) the gradient direction at pixel (s,t); the parameter gamma is 0.5 and H(s,t) denotes the pixel value at pixel (s,t);
the training sample is divided into cell cells, a histogram of gradient direction is constructed for each cell, then the cells are combined into blocks, the gradient histograms need to be normalized in the blocks, and finally the HOG feature vector of the training sample is generated.
The method for training the SVM based on the HOG features to obtain the first classifier specifically comprises the following steps:
using a linear kernel function:
κ(x_ε, x_l) = x_ε^T · x_l
where x_ε denotes the HOG feature vector of sample ε, x_l the HOG feature vector of sample l, κ the kernel function, and x_ε^T the transpose of x_ε; the training result is stored in an XML file;
an array alpha, an array of support vectors, and a floating-point number rho are read from the XML file; alpha is multiplied by the support vectors to obtain a row vector, the row vector is multiplied by -1, and rho is appended at its end, giving the first classifier.
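The described reading of alpha, the support vectors, and rho from the XML file matches the common OpenCV pattern for collapsing a trained linear SVM into a single detector vector usable with HOGDescriptor; a minimal sketch under that assumption (OpenCV's ml module; the file name is illustrative):

```python
import cv2
import numpy as np

def build_hog_detector(svm_xml_path):
    """Collapse a trained linear SVM into one detector vector:
    multiply alpha by the support vectors, negate the row vector,
    and append the scalar rho at the end."""
    svm = cv2.ml.SVM_load(svm_xml_path)
    sv = svm.getSupportVectors()               # one support vector per row
    rho, alpha, _ = svm.getDecisionFunction(0)
    detector = -np.dot(alpha, sv).ravel()      # row vector multiplied by -1
    return np.append(detector, rho).astype(np.float32)

# usage sketch: hog.setSVMDetector(build_hog_detector("hog_svm.xml"))
```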
Step S3: extracting the LBPH characteristic of the training sample after the scale normalization, training the SVM based on the LBPH characteristic, and obtaining a second classifier;
the method for extracting the LBPH characteristic of the training sample after the scale normalization specifically comprises the following steps:
calculating the LBP features of the training sample: the LBP image is divided into several coded regions, and the pixel value of each coded image is calculated using the following formulas:
dx_n = -radius · sin(2π · n / neighbors)
dy_n = radius · cos(2π · n / neighbors)
gray(x, y)_n = gray(x + dx_n, y + dy_n)
lbp(x, y)_n = 1 if gray(x, y)_n >= gray(x, y), and 0 otherwise
lbp(x, y) = Σ_{n=0}^{neighbors-1} lbp(x, y)_n · 2^n
where x represents the abscissa on the image, y the ordinate, radius the sampling radius, neighbors the neighborhood size, and the parameter n is an integer; dx_n denotes the pixel-offset abscissa of the nth neighbor of pixel (x, y) and dy_n its pixel-offset ordinate; gray(x, y) denotes the gray value at pixel (x, y), gray(x, y)_n the gray value of the nth neighbor of pixel (x, y), lbp(x, y) the coded value at pixel (x, y), and lbp(x, y)_n the coded value of the nth neighbor of pixel (x, y); w denotes the width of each region of the LBP coded image and h its height;
the LBPH feature of the training sample is then generated, and the width and height of each grid cell are obtained using the following formulas:
grad_w = LBP_i.cols / grid_x
grad_h = LBP_i.rows / grid_y
where grid_x denotes the number of cells in the width direction, grid_y the number of cells in the height direction, LBP_i the i-th coded image region in the LBP code map, cols the number of columns, rows the number of rows, LBP_i.cols the number of columns of the i-th coded image region in the LBP code map, LBP_i.rows its number of rows, grad_w the width of the grid cell, and grad_h its height; the count of each histogram value is accumulated within each grid cell in row order, and the results are stored in order into the rows of the corresponding histogram matrix; the histogram heights are then normalized, i.e., each histogram is divided by grad_w · grad_h;
the histogram matrix is then converted, row-major, into a vector of 1 row and M · 2^neighbors columns, where M denotes the total number of regions; finally, the local histograms are concatenated to obtain the histogram of the whole training sample.
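A minimal NumPy sketch of these two steps follows: first the circular LBP code map with bilinearly sampled neighbors, then the grid histograms. The values radius=1, neighbors=8, and grid_x=grid_y=8 are assumptions, and the loop-based sampling favors clarity over speed.

```python
import numpy as np

def elbp(gray, radius=1, neighbors=8):
    """Circular LBP code map following the dx_n/dy_n formulas above,
    sampling each neighbor with bilinear interpolation."""
    gray = gray.astype(np.float32)
    h, w = gray.shape
    out = np.zeros((h - 2 * radius, w - 2 * radius), dtype=np.int32)
    for n in range(neighbors):
        dx = -radius * np.sin(2.0 * np.pi * n / neighbors)
        dy = radius * np.cos(2.0 * np.pi * n / neighbors)
        fx, fy = int(np.floor(dx)), int(np.floor(dy))
        cx, cy = int(np.ceil(dx)), int(np.ceil(dy))
        tx, ty = dx - fx, dy - fy
        # bilinear weights of the four pixels around the sampling point
        w1, w2 = (1 - tx) * (1 - ty), tx * (1 - ty)
        w3, w4 = (1 - tx) * ty, tx * ty
        for y in range(radius, h - radius):
            for x in range(radius, w - radius):
                t = (w1 * gray[y + fy, x + fx] + w2 * gray[y + fy, x + cx]
                     + w3 * gray[y + cy, x + fx] + w4 * gray[y + cy, x + cx])
                # lbp(x,y)_n = 1 when the neighbor is at least as bright
                out[y - radius, x - radius] |= int(t >= gray[y, x]) << n
    return out

def lbph(lbp_map, grid_x=8, grid_y=8, neighbors=8):
    """Split the code map into grid_x x grid_y cells, histogram each cell,
    normalize, and concatenate the local histograms row-major."""
    bins = 2 ** neighbors
    gh, gw = lbp_map.shape[0] // grid_y, lbp_map.shape[1] // grid_x
    parts = []
    for gy in range(grid_y):
        for gx in range(grid_x):
            cell = lbp_map[gy * gh:(gy + 1) * gh, gx * gw:(gx + 1) * gw]
            hist, _ = np.histogram(cell, bins=bins, range=(0, bins))
            parts.append(hist / float(gh * gw))   # normalize by cell area
    return np.concatenate(parts).astype(np.float32)  # 1 x (M * 2^neighbors)
```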
The SVM is trained based on the LBPH features, using a linear kernel function:
κ(x_ε, x_l) = x_ε^T · x_l
where x_ε denotes the LBPH feature vector of sample ε, x_l the LBPH feature vector of sample l, κ the kernel function, and x_ε^T the transpose of x_ε; training yields the second classifier.
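Training either SVM then reduces to the same call; a sketch assuming OpenCV's ml module, with C-SVC and the linear kernel named in the text (file names and the +1/-1 label convention are illustrative):

```python
import cv2
import numpy as np

def train_linear_svm(features, labels, out_xml):
    """Train a linear-kernel SVM (kappa(x_e, x_l) = x_e^T x_l) on stacked
    feature rows (+1 = identity card, -1 = non-identity card) and save it."""
    svm = cv2.ml.SVM_create()
    svm.setType(cv2.ml.SVM_C_SVC)
    svm.setKernel(cv2.ml.SVM_LINEAR)
    svm.train(np.float32(features), cv2.ml.ROW_SAMPLE, np.int32(labels))
    svm.save(out_xml)
    return svm

# e.g. train_linear_svm(hog_feats, labels, "hog_svm.xml") for the first
# classifier and train_linear_svm(lbph_feats, labels, "lbph_svm.xml")
# for the second.
```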
Step S4: preprocessing the test image of the identity card copy;
the method specifically comprises the following steps:
step S41: inputting an identity card copy test image;
step S42: the test image is scale-normalized using a bilinear interpolation algorithm, using the following equation:
f(λ+u, j+v) = (1-u)(1-v)·f(λ, j) + (1-u)·v·f(λ, j+1) + u·(1-v)·f(λ+1, j) + u·v·f(λ+1, j+1)
where λ denotes the abscissa on the test image, j the ordinate, λ and j are integers, u and v are fractions greater than or equal to 0 and less than 1, and f(λ, j) denotes the pixel value of pixel (λ, j) on the test image;
step S43: the test image is converted to a gray scale map using the following formula:
Gray(λ,j)=[Red(λ,j)+Green(λ,j)+Blue(λ,j)]/3
where Red(λ, j) denotes the Red channel value at pixel (λ, j), Green(λ, j) the Green channel value, Blue(λ, j) the Blue channel value, and Gray(λ, j) the gray value at pixel (λ, j) in the grayscale image;
step S44: gaussian smoothing is performed using the following equation:
Gauss(λ, j) = Gray(λ, j) * (1/(2π·σ^2)) · exp(-(λ^2 + j^2)/(2σ^2))
where σ denotes the variance of the Gaussian function, * denotes convolution, and Gauss(λ, j) denotes the pixel value at pixel (λ, j) of the test image after Gaussian filtering.
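Steps S42-S44 map directly onto a few array operations; a sketch using OpenCV, where the target size and sigma are illustrative assumptions rather than values fixed by the patent:

```python
import cv2
import numpy as np

def preprocess(img_bgr, size=(640, 480), sigma=1.0):
    """S42: bilinear scale normalization; S43: averaging grayscale
    conversion Gray = (R + G + B) / 3; S44: Gaussian smoothing."""
    img = cv2.resize(img_bgr, size, interpolation=cv2.INTER_LINEAR)
    gray = img.mean(axis=2).astype(np.uint8)      # equal-weight channel mean
    return cv2.GaussianBlur(gray, (0, 0), sigma)  # kernel size derived from sigma
```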
Step S5: carrying out target detection on the preprocessed test image by using a first classifier;
the method specifically comprises the following steps:
step S51: reading the first classifier, and performing target detection on the test image;
step S52: removing the area with the internal and external inclusion relation in the detection result, and utilizing the following formula:
(Rect_e & Rect_ψ) == Rect_e
where Rect_e denotes rectangular box e, Rect_ψ denotes rectangular box ψ, and & denotes rectangle intersection; if the formula evaluates true, rectangular box e and rectangular box ψ are in an inside-outside containment relation (box e lies inside box ψ), and the larger region is kept;
step S53: judging whether the detection results are intersected or not, and utilizing the following formula:
x_c1 = max(x_a1, x_b1)
y_c1 = max(y_a1, y_b1)
x_c2 = min(x_a2, x_b2)
y_c2 = min(y_a2, y_b2)
x_c1 <= x_c2 and y_c1 <= y_c2
where x_a1 and y_a1 denote the abscissa and ordinate of the upper-left corner of rectangular box a, x_a2 and y_a2 those of its lower-right corner, x_b1 and y_b1 those of the upper-left corner of rectangular box b, and x_b2 and y_b2 those of its lower-right corner; x_c1 is the maximum of the upper-left abscissas of boxes a and b, y_c1 the maximum of their upper-left ordinates, x_c2 the minimum of their lower-right abscissas, and y_c2 the minimum of their lower-right ordinates; if all the formulas above hold, rectangular boxes a and b intersect;
step S54: fusing the crossed areas in the detection result, if the rectangular frames are crossed, solving the crossed area, and if the crossed area is larger than a threshold value, fusing the two rectangular frames by using the following formula:
x_d1 = min(x_g1, x_r1)
y_d1 = min(y_g1, y_r1)
x_d2 = max(x_g2, x_r2)
y_d2 = max(y_g2, y_r2)
where x_d1 and y_d1 denote the abscissa and ordinate of the upper-left corner of the fused rectangle, x_d2 and y_d2 those of its lower-right corner, x_g1 and y_g1 those of the upper-left corner of rectangular box g, x_g2 and y_g2 those of its lower-right corner, x_r1 and y_r1 those of the upper-left corner of rectangular box r, and x_r2 and y_r2 those of its lower-right corner.
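The three rectangle rules of steps S52-S54 can be written down almost verbatim; a sketch over (x1, y1, x2, y2) boxes, where the overlap-area threshold is an assumed free parameter that the patent leaves unspecified:

```python
def intersect(a, b):
    """S53: intersection of boxes a and b via the x_c1/y_c1/x_c2/y_c2
    formulas, or None when they do not intersect."""
    xc1, yc1 = max(a[0], b[0]), max(a[1], b[1])
    xc2, yc2 = min(a[2], b[2]), min(a[3], b[3])
    return (xc1, yc1, xc2, yc2) if xc1 <= xc2 and yc1 <= yc2 else None

def fuse(g, r):
    """S54: bounding box of two intersecting boxes."""
    return (min(g[0], r[0]), min(g[1], r[1]), max(g[2], r[2]), max(g[3], r[3]))

def merge_detections(boxes, area_thresh=0):
    """S52: drop boxes contained in a larger box; S53/S54: fuse pairs whose
    overlap area exceeds the threshold, repeating until stable."""
    boxes = [a for a in boxes
             if not any(a is not b and intersect(a, b) == a for b in boxes)]
    fused = True
    while fused:
        fused = False
        for i in range(len(boxes)):
            for j in range(i + 1, len(boxes)):
                inter = intersect(boxes[i], boxes[j])
                if inter and (inter[2] - inter[0]) * (inter[3] - inter[1]) > area_thresh:
                    boxes[j] = fuse(boxes[i], boxes[j])
                    del boxes[i]
                    fused = True
                    break
            if fused:
                break
    return boxes
```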
Step S6: calculating the LBP characteristics of the target detection result of the step S5, and generating the LBPH characteristics according to the obtained LBP characteristics;
step S7: judging with the second classifier according to the LBPH features generated in step S6, and keeping the targets judged true, thereby obtaining the identity card copies in the test image.
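Putting the pieces together, a minimal end-to-end sketch of steps S5-S7 follows, assuming the helper functions sketched above (hog, build_hog_detector, preprocess, merge_detections, elbp, lbph); the file names, the crop size fed to the LBPH stage, and the positive-label convention are all assumptions:

```python
import cv2

hog.setSVMDetector(build_hog_detector("hog_svm.xml"))  # first classifier
svm2 = cv2.ml.SVM_load("lbph_svm.xml")                 # second classifier

img = preprocess(cv2.imread("id_copy_page.jpg"))       # step S4
rects, _ = hog.detectMultiScale(img)                   # step S5: HOG detection
boxes = merge_detections([(x, y, x + w, y + h) for (x, y, w, h) in rects])

accepted = []
for (x1, y1, x2, y2) in boxes:                         # steps S6-S7
    crop = cv2.resize(img[y1:y2, x1:x2], (64, 64))     # match training scale
    feat = lbph(elbp(crop)).reshape(1, -1)             # LBP -> LBPH feature
    if svm2.predict(feat)[1][0, 0] > 0:                # keep targets judged true
        accepted.append((x1, y1, x2, y2))
```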
The above are preferred embodiments of the present invention; any change made according to the technical scheme of the present invention that produces equivalent functional effects without exceeding the scope of the technical scheme falls within the protection scope of the present invention.

Claims (1)

1. A method for detecting an identity card copy fused with HOG and LBPH characteristics is characterized by comprising the following steps:
step S1: selecting a large number of identity card and non-identity card pictures as positive and negative samples of a training sample set, and carrying out scale normalization on each picture in the training sample set;
step S2: extracting HOG characteristics of the training samples after the scale normalization, training the SVM based on the HOG characteristics, and obtaining a first classifier;
step S3: extracting the LBPH characteristic of the training sample after the scale normalization, training the SVM based on the LBPH characteristic, and obtaining a second classifier;
step S4: preprocessing the test image of the identity card copy;
step S5: carrying out target detection on the preprocessed test image by using a first classifier;
step S6: calculating the LBP characteristics of the target detection result of the step S5, and generating the LBPH characteristics according to the obtained LBP characteristics;
step S7: judging by using a second classifier according to the LBPH characteristics generated in the step S6, and reserving a target with a true judgment result to obtain an identity card copy in the test image;
in step S2, the method for training the SVM based on the HOG feature to obtain the first classifier specifically includes:
using a linear kernel function:
κ(x_ε, x_l) = x_ε^T · x_l
where x_ε denotes the HOG feature vector of sample ε, x_l the HOG feature vector of sample l, κ the kernel function, and x_ε^T the transpose of x_ε; the training result is stored in an XML file;
an array alpha, an array of support vectors, and a floating-point number rho are read from the XML file; alpha is multiplied by the support vectors to obtain a row vector, the row vector is multiplied by -1, and rho is appended at its end, giving the first classifier;
the step S4 specifically includes:
step S41: inputting an identity card copy test image;
step S42: the test image is scale-normalized using a bilinear interpolation algorithm, using the following equation:
f(λ+u, j+v) = (1-u)(1-v)·f(λ, j) + (1-u)·v·f(λ, j+1) + u·(1-v)·f(λ+1, j) + u·v·f(λ+1, j+1)
where λ denotes the abscissa on the test image, j the ordinate, λ and j are integers, u and v are fractions greater than or equal to 0 and less than 1, and f(λ, j) denotes the pixel value of pixel (λ, j) on the test image;
step S43: the test image is converted to a gray scale map using the following formula:
Gray(λ,j)=[Red(λ,j)+Green(λ,j)+Blue(λ,j)]/3
where Red(λ, j) denotes the Red channel value at pixel (λ, j), Green(λ, j) the Green channel value, Blue(λ, j) the Blue channel value, and Gray(λ, j) the gray value at pixel (λ, j) in the grayscale image;
step S44: gaussian smoothing is performed using the following equation:
Gauss(λ, j) = Gray(λ, j) * (1/(2π·σ^2)) · exp(-(λ^2 + j^2)/(2σ^2))
where σ denotes the variance of the Gaussian function, * denotes convolution, and Gauss(λ, j) denotes the pixel value at pixel (λ, j) of the test image after Gaussian filtering;
the step S5 specifically includes:
step S51: reading the first classifier, and performing target detection on the test image;
step S52: removing the area with the internal and external inclusion relation in the detection result, and utilizing the following formula:
(Rect_e & Rect_ψ) == Rect_e
where Rect_e denotes rectangular box e, Rect_ψ denotes rectangular box ψ, and & denotes rectangle intersection; if the formula evaluates true, rectangular box e and rectangular box ψ are in an inside-outside containment relation (box e lies inside box ψ), and the larger region is kept;
step S53: judging whether the detection results are intersected or not, and utilizing the following formula:
x_c1 = max(x_a1, x_b1)
y_c1 = max(y_a1, y_b1)
x_c2 = min(x_a2, x_b2)
y_c2 = min(y_a2, y_b2)
x_c1 <= x_c2 and y_c1 <= y_c2
where x_a1 and y_a1 denote the abscissa and ordinate of the upper-left corner of rectangular box a, x_a2 and y_a2 those of its lower-right corner, x_b1 and y_b1 those of the upper-left corner of rectangular box b, and x_b2 and y_b2 those of its lower-right corner; x_c1 is the maximum of the upper-left abscissas of boxes a and b, y_c1 the maximum of their upper-left ordinates, x_c2 the minimum of their lower-right abscissas, and y_c2 the minimum of their lower-right ordinates; if all the formulas above hold, rectangular boxes a and b intersect;
step S54: fusing the crossed areas in the detection result, if the rectangular frames are crossed, solving the crossed area, and if the crossed area is larger than a threshold value, fusing the two rectangular frames by using the following formula:
x_d1 = min(x_g1, x_r1)
y_d1 = min(y_g1, y_r1)
x_d2 = max(x_g2, x_r2)
y_d2 = max(y_g2, y_r2)
where x_d1 and y_d1 denote the abscissa and ordinate of the upper-left corner of the fused rectangle, x_d2 and y_d2 those of its lower-right corner, x_g1 and y_g1 those of the upper-left corner of rectangular box g, x_g2 and y_g2 those of its lower-right corner, x_r1 and y_r1 those of the upper-left corner of rectangular box r, and x_r2 and y_r2 those of its lower-right corner;
the method for extracting the HOG features of the training samples after the scale normalization in the step S2 specifically includes:
normalizing gamma space and color space and calculating the gradient of each pixel of the training sample, and using the following equation:
H(s,t) = H(s,t)^gamma
G_s(s,t) = H(s+1,t) - H(s-1,t)
G_t(s,t) = H(s,t+1) - H(s,t-1)
G(s,t) = sqrt(G_s(s,t)^2 + G_t(s,t)^2)
α(s,t) = arctan(G_t(s,t) / G_s(s,t))
where s denotes the abscissa on the training sample image, t the ordinate, G_s(s,t) the horizontal gradient at pixel (s,t), G_t(s,t) the vertical gradient at pixel (s,t), G(s,t) the gradient magnitude at pixel (s,t), α(s,t) the gradient direction at pixel (s,t), and H(s,t) the pixel value at pixel (s,t);
dividing the training sample into cells, constructing a gradient-direction histogram for each cell, grouping the cells into blocks, normalizing the gradient histograms within each block, and finally generating the HOG feature vector of the training sample;
the method for extracting the LBPH feature of the training sample after the scale normalization in the step S3 specifically includes:
calculating the LBP features of the training sample: the LBP image is divided into several coded regions, and the pixel value of each coded image is calculated using the following formulas:
dx_n = -radius · sin(2π · n / neighbors)
dy_n = radius · cos(2π · n / neighbors)
gray(x, y)_n = gray(x + dx_n, y + dy_n)
lbp(x, y)_n = 1 if gray(x, y)_n >= gray(x, y), and 0 otherwise
lbp(x, y) = Σ_{n=0}^{neighbors-1} lbp(x, y)_n · 2^n
where x represents the abscissa on the image, y the ordinate, radius the sampling radius, neighbors the neighborhood size, and the parameter n is an integer; dx_n denotes the pixel-offset abscissa of the nth neighbor of pixel (x, y) and dy_n its pixel-offset ordinate; gray(x, y) denotes the gray value at pixel (x, y), gray(x, y)_n the gray value of the nth neighbor of pixel (x, y), lbp(x, y) the coded value at pixel (x, y), and lbp(x, y)_n the coded value of the nth neighbor of pixel (x, y); w denotes the width of each region of the LBP coded image and h its height;
generating the LBPH feature of the training sample, with the width and height of each grid cell obtained using the following formulas:
grad_w = LBP_i.cols / grid_x
grad_h = LBP_i.rows / grid_y
where grid_x denotes the number of cells in the width direction, grid_y the number of cells in the height direction, LBP_i the i-th coded image region in the LBP code map, cols the number of columns, rows the number of rows, LBP_i.cols the number of columns of the i-th coded image region in the LBP code map, LBP_i.rows its number of rows, grad_w the width of the grid cell, and grad_h its height; the count of each histogram value is accumulated within each grid cell in row order, and the results are stored in order into the rows of the corresponding histogram matrix; the histogram heights are then normalized; the histogram matrix is then converted, row-major, into a vector of 1 row and M · 2^neighbors columns, where M denotes the total number of regions; finally, the local histograms are concatenated to obtain the histogram of the whole training sample.
CN201810172048.9A 2018-03-01 2018-03-01 HOG and LBPH characteristic fused identity card copy detection method Active CN108388920B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810172048.9A CN108388920B (en) 2018-03-01 2018-03-01 HOG and LBPH characteristic fused identity card copy detection method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810172048.9A CN108388920B (en) 2018-03-01 2018-03-01 HOG and LBPH characteristic fused identity card copy detection method

Publications (2)

Publication Number Publication Date
CN108388920A CN108388920A (en) 2018-08-10
CN108388920B true CN108388920B (en) 2022-04-08

Family

ID=63070166

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810172048.9A Active CN108388920B (en) 2018-03-01 2018-03-01 HOG and LBPH characteristic fused identity card copy detection method

Country Status (1)

Country Link
CN (1) CN108388920B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109377624B (en) * 2018-11-23 2020-09-08 深圳市宝诚通科技发展有限公司 Intelligent door opening system based on face image recognition
CN112800968B (en) * 2021-01-29 2024-05-14 江苏大学 HOG blocking-based feature histogram fusion method for identifying identity of pigs in drinking area

Family Cites Families (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102169544A (en) * 2011-04-18 2011-08-31 苏州市慧视通讯科技有限公司 Face-shielding detecting method based on multi-feature fusion
US9330336B2 (en) * 2011-09-16 2016-05-03 Arizona Board of Regents, a body corporate of the State of Arizona, acting for and on behalf of, Arizona State University Systems, methods, and media for on-line boosting of a classifier
CN103634519A (en) * 2012-08-28 2014-03-12 北京博威康技术有限公司 Image display method and device based on dual-camera head
CN104063722B (en) * 2014-07-15 2018-03-23 国家电网公司 A kind of detection of fusion HOG human body targets and the safety cap recognition methods of SVM classifier
CN104268514A (en) * 2014-09-17 2015-01-07 西安交通大学 Gesture detection method based on multi-feature fusion
CN105825243A (en) * 2015-01-07 2016-08-03 阿里巴巴集团控股有限公司 Method and device for certificate image detection
CN104978567B (en) * 2015-06-11 2018-11-20 武汉大千信息技术有限公司 Vehicle checking method based on scene classification
CN105447503B (en) * 2015-11-05 2018-07-03 长春工业大学 Pedestrian detection method based on rarefaction representation LBP and HOG fusion
US10740912B2 (en) * 2016-05-19 2020-08-11 Intel Corporation Detection of humans in images using depth information
CN106250936A (en) * 2016-08-16 2016-12-21 广州麦仑信息科技有限公司 Multiple features multithreading safety check contraband automatic identifying method based on machine learning
CN106570523B (en) * 2016-10-25 2019-07-26 浙江工业大学 Multi-feature combined robot football recognition method
CN106682641A (en) * 2017-01-05 2017-05-17 北京细推科技有限公司 Pedestrian identification method based on image with FHOG- LBPH feature
CN106940791B (en) * 2017-03-09 2019-11-26 中南大学 A kind of pedestrian detection method based on low-dimensional histograms of oriented gradients
CN107563377A (en) * 2017-08-30 2018-01-09 江苏实达迪美数据处理有限公司 It is a kind of to detect localization method using the certificate key area of edge and character area

Also Published As

Publication number Publication date
CN108388920A (en) 2018-08-10

Similar Documents

Publication Publication Date Title
CN108647681B (en) A kind of English text detection method with text orientation correction
CN111401372B (en) Method for extracting and identifying image-text information of scanned document
CN108596166A (en) A kind of container number identification method based on convolutional neural networks classification
CN110648322B (en) Cervical abnormal cell detection method and system
CN111539330B (en) Transformer substation digital display instrument identification method based on double-SVM multi-classifier
CN106096610A (en) A kind of file and picture binary coding method based on support vector machine
CN107256547A (en) A kind of face crack recognition methods detected based on conspicuousness
CN112307919B (en) Improved YOLOv 3-based digital information area identification method in document image
CN102547079B (en) Night scene light source detection device and method
CN109740572A (en) A kind of human face in-vivo detection method based on partial color textural characteristics
CN108334955A (en) Copy of ID Card detection method based on Faster-RCNN
CN113688821B (en) OCR text recognition method based on deep learning
CN108388920B (en) HOG and LBPH characteristic fused identity card copy detection method
CN110689003A (en) Low-illumination imaging license plate recognition method and system, computer equipment and storage medium
CN110443184A (en) ID card information extracting method, device and computer storage medium
CN111626145A (en) Simple and effective incomplete form identification and page-crossing splicing method
WO2022111247A1 (en) Report analysis method and apparatus
CN117315670B (en) Water meter reading area detection method based on computer vision
CN113313678A (en) Automatic sperm morphology analysis method based on multi-scale feature fusion
CN112215266A (en) X-ray image contraband detection method based on small sample learning
CN113159027B (en) Seven-segment digital display instrument identification method based on minimum external rectangular variant
CN116259008A (en) Water level real-time monitoring method based on computer vision
Gyamfi et al. Pixel-based unsupervised classification approach for information detection on optical markup recognition sheet
CN112308141B (en) Scanning bill classification method, system and readable storage medium
Sathisha Bank automation system for Indian currency-a novel approach

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant