CN107527018A

CN107527018A - Momentum method for detecting human face based on BP neural network

Info

Publication number: CN107527018A
Application number: CN201710617052.7A
Authority: CN
Inventors: 蒋林华; 蒋云良; 曹书慧; 林晓; 胡文军; 龙伟
Original assignee: Huzhou University
Current assignee: Huzhou University
Priority date: 2017-07-26
Filing date: 2017-07-26
Publication date: 2017-12-29

Abstract

Gabor characteristic and factor of momentum back-propagation algorithm are combined by the momentum method for detecting human face based on BP neural network, this method.The Gabor characteristic of training set is extracted first, and is entered into factor of momentum reverse transmittance nerve network and is trained.Then, go to detect in input picture using the system trained and whether there is face, if there is then being marked with rectangle.In order to improve the training effect of conventional counter propagation algorithm, factor of momentum is added in the algorithm, effectively slows down concussion trend of the neutral net in training, algorithm can be avoided to be absorbed in local minimum.In addition, increased factor of momentum can be adaptively adjusted the weighted value of every layer of reverse transmittance nerve network.It is substantial amounts of test result indicates that, compared with classics or state-of-the-art Face datection model, our experimental program is effective and has a competitiveness.

Description

Momentum method for detecting human face based on BP neural network

Technical field

The present invention relates to image real time transfer field, more particularly to a kind of momentum Face datection based on BP neural network Method.

Background technology

Face datection is a problem for being worth research in computer vision.In past many years, scientific research personnel Substantial amounts of experience has been launched in the field of Face datection.The essence of Face datection is by face rectangle mark present in image Mark out and.With the increase that Face datection is applied, Face datection is developing progressively as an independent research topic, by The concern of researcher.

Generally, Face datection can be divided into two classes：One kind is the face inspection on still image (gray scale or coloured image) Survey, according to the number of face on image, single or multiple faces can be detected on image.Another kind of is in dynamic image Upper carry out Face datection, also referred to as target with.This paper research is based on the Face datection in coloured image.Face datection Process be actually to the comprehensive descision of Face pattern feature.The image of input may include substantial amounts of pattern feature, these Feature can be divided into two classes by color attribute：One kind is skin color feature, and another kind of is gray feature.

Application of Neural Network was just occurred in Face datection early in 1994, occurs using convolutional Neural afterwards Network is to train grader, retina Connection Neural Network to improve obverse face detection, detection has offset angle in the picture The face of degree and the method for newest convolutional neural networks cascade are used for Face datection, and this neural network is in Face datection In identification, all more or less there is the problem of neural network algorithm convergence time is long, and go out in network training process Now certain concussion trend, makes algorithm be absorbed in local minimum.Ultimately result in the unstability of neural network algorithm.

In face identification method, Gabor transformation is applied in neutral net what prior art was related to, Gabor Conversion is a window fourier transform.It is related on different directions that Gabor functions can be extracted in different scale in frequency domain Feature.Gabor cores can be in the feature of specific frequency abstraction image.

Similar China Patent No.：The CN201210057616 neutral net based on local feature Gabor wavelet；Shanghai " face identification method based on Gabor wavelet conversion and local binary patterns optimization " patent of university's application, Publication No. CN102024141A.The patent is converted using one kind based on Gabor wavelet and local binary patterns are excellent is merged；Tsing-Hua University " face component feature and the face identification method and its device of the fusion of Gabor face characteristics " patent of university's application, publication number For CN101276421, Gabor wavelet is converted with face component Fusion Features to get up.Above-mentioned prior art, not only data meter Calculation amount is bigger, and convergence time is also long.

The content of the invention

The technical problem to be solved in the present invention is：Design is a kind of to solve neutral net convergence time ratio in Face datection It is longer, and the problem of concussion is easily produced in training.

In order to solve the above-mentioned technical problem, the present invention proposes a kind of momentum method for detecting human face based on BP neural network.

Step 1：Extract the image Gabor characteristic of training set；

Step 2：It is entered into factor of momentum reverse transmittance nerve network and is trained；

Step 3：Go to detect in input picture using the system trained and whether there is face, if there is then using rectangle Mark.

As a kind of preferred：The method of Gabor characteristic extraction in step 1 is to have selected five yardsticks and eight directions On Gabor cores be used for extracting Gabor characteristic in image, input picture is subjected to convolution with 5*8 Gabor cores, generated not The characteristics of image of 40 different scales under same frequency.

Beneficial effect of the present invention：

1st, in order to improve the training effect of conventional counter propagation algorithm, factor of momentum is added in the algorithm, effectively subtracted Slow concussion trend of the neutral net in training, can avoid algorithm from being absorbed in local minimum.In addition, increased factor of momentum The weighted value of every layer of reverse transmittance nerve network can be adaptively adjusted.It is substantial amounts of test result indicates that, with classics or most Advanced Face datection model is compared, and our experimental program is effective.

2nd, extracted for the Gabor characteristic of image, convolution carried out using 5*8 Gabor core, can to complicated image, Particularly there is color texture, or heterochromia degree is relatively low, and the unconspicuous image of characteristic value is effectively extracted, effectively Lift the rapidity of follow-up BP neural network training.

Brief description of the drawings

Accompanying drawing 1：The process of the inventive method BP back-propagation algorithms.

Accompanying drawing 2：The inventive method detects the design sketch of single face figure.

Accompanying drawing 3：The inventive method detects the design sketch of plurality of human faces figure.

Accompanying drawing 4：The inventive method and the comparison figure of BP mean consumption times.

Accompanying drawing 5：The schematic diagram of 40 Gabor filters.

Accompanying drawing 6：Imitated using extraction of 40 Gabor filters in the inventive method to the characteristics of image of a secondary portrait Fruit demonstration graph；

Embodiment

The Gabor cores that the present invention have selected on five yardsticks and eight directions first are used for extracting in input picture Gabor characteristic, input picture and 5*8 Gabor cores are subjected to convolution, as shown in figure 5, different using these Gabor karyogenesis The characteristics of image of 40 different scales under frequency, it is referred to as 40 Gabor filters.Row represents eight differences in figure Direction, row represent five different yardsticks.Detailed process is as follows：

Given input picture is as corresponding input signal f_in, first transformed to frequency domain with Fourier transformation

Coordinate on (x, y) representation space domain.Then, the result of spacing wave is used to be multiplied by obtain to be filtered by Gabor The Fourier transform of the Gabor cores of the result images of ripple device filtering.

It is as follows using convolution theorem formula (2),

Wherein Gabor cores and input signal carry out convolution, obtain the response of the input signal close to some neighborhood.

Fig. 6 gives a kind of utilization of formula (2), and wherein 6- (a) is the original image of input.6- (b) illustrates 5*8 Individual different frequency, different scale Gabor cores (i.e.：40 Gabor filters) original image 6- (a) is got in different frequencies Response condition in rate and different scale, it is extracted using this response results as characteristics of image.

The two-dimensional complex number ripple represented with formula (3) is multiplied by the two-dimensional Gaussian function being calculated in formula (4), so as to obtain Obtain the Gabor cores in formula (5).

S (x, y)=exp (i (2 π (u₀x+v₀y))+p) (3)

Wherein, influences of the initial phase p to Gabor cores is little, it is convenient to omit.

δ_xAnd δ_y" spread " situation of the Gaussian function on x, y directions is controlled respectively.

(x₀,y₀) be Gaussian kernel central point, θ is the direction of rotation of Gaussian kernel, (δ_x,δ_y) it is Gaussian kernel in x, y directions On yardstick, (u₀,v₀) it is frequency domain coordinates, K is the amplitude proportional of Gaussian kernel.

Basis of the BP neural network being widely used as the present invention.The shortcomings that in order to overcome BP algorithm, momentum term can For improving algorithm the convergence speed, algorithm is avoided to be absorbed in local minimum.It is proposed that face detection system in nerve net The overall step of network algorithm is shown in that Fig. 1 schematically show the workflow of neutral net.Next it is reverse factor of momentum to be discussed in detail Propagation Neural Network algorithm.

The image Gabor characteristic 1 of extraction is used as the input of neutral net, and neutral net is the network structure connected entirely. As shown in figure 1, one layer is input layer 2, another layer is hidden layer 3.

Shown in the transmission function equation below (6) of hidden layer neuron

In this function, net is the input data from input layer.Each input is considered as x_iIf given n Individual input data, wherein, 1≤i≤n, then, net formula is such as shown in (7)：

Net=x₁w₁+x₂w₂+…+x_nw_n (7)

Wherein, w_iIt is the initial weight of neutral net.In addition, in neutral net, the number of the node of input layer and output layer Mesh is known, and the nodes h in hidden layer is determined by formula (8).

Wherein, m and n represents the number of nodes of input layer and output layer respectively, and A represents the adjustable constant between 1 and 10 Value.

Fig. 1 adds a factor of momentum for being used as feedback parameter by BP neural network output valve 4, is missed in forwarding feedback Increase feedback parameter during difference signal, improve the training performance of traditional BP neural network.

The process of forward-propagating can be calculated as follows by formula (9).

Wherein, x_j=f (S_j), net=S_j。

BP core concept is exactly with some form by hidden layer to input layer successively anti-pass by output error.Initial power Weight w can be used during the forward-propagating of counterpropagation network, and it is generated by random initializtion.But with ideal Output compare, actual output may produce larger error.In order to constantly adjust w, can be obtained in formula (10) To error function, wherein y_jRepresent reality output.

The training set of input is calculated as follows step and is trained during the training period.

1. the error term formula (11) in each unit of each layer is calculated from reverse output layer.

error_i=y_j(1-y_j)(d_j-y_j) (11)

2. calculate the error formula (12) for hiding node layer.

error_h=y_h(1-y_h)error_h (12)

3. each weight equation (13) of renewal.

w_ik=w_ik+μ·error_k·x_ik (13)

Wherein, Δ w_ik=μ error_k·x_ikBe weight renewal rule, x_ikRepresent input value, w_ikIt is node i and section Corresponding weighted value between point k.

These are mathematical thoughts most basic in neutral net, can train positive example and negative examples, have phase Longer time is consumed.It is the improvement formula (14) of neutral net below.

Δw_ik(n)=μ δ_kx_ik+αΔw_ik(n-1) (14)

Wherein 0≤α ＜ 1 are momentum terms.And mono- global parameter of α, it can be determined by the output valve tested with output layer. When the direction that gradient maintains like, then it can increase step-length and and guide Iterative path to point to minimum target value.Generally this is that have very much Necessary, (i.e. α close to 1) needs to reduce learning parameter μ when α is very big.If gradient direction is changing always, move The addition of quantifier can make it that the change of Iterative path is smoothened.

This mainly in the case where neural metwork training is bad, i.e., is differed, this is just formed in the curvature of different directions The narrow curved surface paddy of length of different sizes.To most points on curved surface paddy, the gradient of Iterative path is not implied that to minimum value Direction, so as to be shaken between the continuous step-length of Iterative path from one side to other one side, Iterative path, converge to minimum Speed during value is also significantly slack-off, if increase momentum term, can effectively weaken and shake and greatly improve convergence rate.

In formula (14), the weight of nth iteration depends on the weight of (n-1)th iteration.Increase momentum to a certain extent Item improves the effect of search step, and this can be such that algorithm quickly restrains.On the other hand, due to multitiered network because loss Function may cause algorithmic statement to local minimum, therefore momentum term can cross over some Local Minimums to a certain extent Value, avoids algorithm from being absorbed in the situation of local minimum.

Experimental verification：

Choose The Face Detection Data Set human face datas collection and about containing 5000 front faces Face database of the CMU and Harvard face databases as training set.Wherein CMU and Harvard face databases are about Include 5000 face samples, the face in these facial images has some deviation angles and simple background.Meanwhile We also using 250 landscape images without face using boot mode obtains 2500 or so non-face images and made For the non-face storehouse of training set.

Fig. 2 and Fig. 3 is shown carries out dough figurine face using momentum Face datection in Partheenpan test data sets Some result images obtained after detection.In Fig. 2 2- (a), we have selected front face image as input.Fig. 3's In 3- (a), each input picture includes multiple faces., all will detection in output image in Fig. 2 2- (b) and Fig. 3 3- (b) To face marked with rectangle.

The momentum face detection system that we train tests some common face databases：

1Partheenpan Data Set (Partheenpan),

2The Annotated Faces in the Wild (AFW),

3The Face Detection Data Set&Benchmark(FDD-B)

4CMU Data set(CMU)

In Fig. 4, the average detected time of the inventive method and classic BP neural algorithm are detected in above-mentioned 4 kinds of data The average time of face has done a comparison.

Because can have multiple faces and complex image to carry on the back in every image in AFW and CMU two datasets Scape, so performance improvement becomes apparent in the detection on the two data sets.

Table 1 below gives the verification and measurement ratio on four groups of difference face databases and error detection.Including picture size Diversity and the limitation that brings of image definition, the inventive method have certain false detection rate, but all in the prior art It is in a leading position.

Verification and measurement ratio and error detection situation on 1 four groups of difference face database test sets of table

For example in AFW test sets, there are verification and measurement ratio=(451-65)/451=85.6%, false detection rate=39/ (39+ 451)=7.95%.

General principle, principal character and the advantages of the present invention of the present invention has been shown and described above.The technology of the industry Personnel are it should be appreciated that the present invention is not limited to the above embodiments, and the simply explanation described in above-described embodiment and specification is originally The principle of invention, various changes and modifications of the present invention are possible without departing from the spirit and scope of the present invention, these changes Change and improvement all fall within the protetion scope of the claimed invention.The claimed scope of the invention by appended claims and Its equivalent defines.

Claims

1. the momentum method for detecting human face based on BP neural network, it is characterised in that：

Step 1：Extract the image Gabor characteristic of training set；

Step 3：Go to detect in input picture using the system trained and whether there is face, if there is then being marked with rectangle.

2. according to the method for claim 1, it is characterised in that：In described step 1 Gabor characteristic extraction method be It has selected the Gabor cores on five yardsticks and eight directions to be used for extracting the Gabor characteristic in image, by input picture 5*8 Individual Gabor cores carry out convolution, generate the characteristics of image of 40 different scales under different frequency.

3. according to the method for claim 1, it is characterised in that：Factor of momentum in described step 2 is Δ w_ik(n)=μ δ_kx_ik+αΔw_ik(n-1) wherein 0≤α ＜ 1 are momentum terms；In above formula, the weight of nth iteration depends on (n-1)th iteration Weight.