CN105550634A - Facial pose recognition method based on Gabor features and dictionary learning - Google Patents

Facial pose recognition method based on Gabor features and dictionary learning Download PDF

Info

Publication number
CN105550634A
CN105550634A CN201510796987.7A CN201510796987A CN105550634A CN 105550634 A CN105550634 A CN 105550634A CN 201510796987 A CN201510796987 A CN 201510796987A CN 105550634 A CN105550634 A CN 105550634A
Authority
CN
China
Prior art keywords
human face
face posture
attitude
dictionary
gabor characteristic
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201510796987.7A
Other languages
Chinese (zh)
Other versions
CN105550634B (en
Inventor
陈友斌
廖海斌
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangdong Micropattern Software Co Ltd
Original Assignee
Guangdong Micropattern Software Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Guangdong Micropattern Software Co Ltd filed Critical Guangdong Micropattern Software Co Ltd
Priority to CN201510796987.7A priority Critical patent/CN105550634B/en
Publication of CN105550634A publication Critical patent/CN105550634A/en
Application granted granted Critical
Publication of CN105550634B publication Critical patent/CN105550634B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
    • G06V40/16Human faces, e.g. facial parts, sketches or expressions
    • G06V40/172Classification, e.g. identification
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition
    • G06V30/19Recognition using electronic means
    • G06V30/192Recognition using electronic means using simultaneous comparisons or correlations of the image signals with a plurality of references
    • G06V30/194References adjustable by an adaptive method, e.g. learning
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
    • G06V40/16Human faces, e.g. facial parts, sketches or expressions
    • G06V40/168Feature extraction; Face representation
    • G06V40/171Local features and components; Facial parts ; Occluding parts, e.g. glasses; Geometrical relationships
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
    • G06V40/16Human faces, e.g. facial parts, sketches or expressions
    • G06V40/174Facial expression recognition

Landscapes

  • Engineering & Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Oral & Maxillofacial Surgery (AREA)
  • General Physics & Mathematics (AREA)
  • Physics & Mathematics (AREA)
  • General Health & Medical Sciences (AREA)
  • Multimedia (AREA)
  • Theoretical Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Databases & Information Systems (AREA)
  • Image Analysis (AREA)
  • Image Processing (AREA)

Abstract

The invention discloses a facial pose recognition method based on Gabor features and dictionary learning. The facial pose recognition method comprises the following steps that firstly a facial pose is discretized into different subspaces, and a child dictionary is trained for each subspace by using K-SVD so that each subspace is enabled to be corresponding to one category; then all the dictionaries are combined into a super complete dictionary; and finally pose classification is performed by adopting a method based on the gabor features and sparse expression. A shielding face dictionary is reconstructed in order to enhance robustness of the algorithm so that the problem of face shielding in facial pose recognition can be solved. The problems of illumination, noise and shielding in facial pose estimation can be solved so that the front face, head raising, nodding, left deflection, the left side face, right deflection and the right side face can be rapidly and robustly recognized. The facial pose recognition method can be greatly applied to the field of safe driving, human-computer and face recognition.

Description

Based on the human face posture recognition methods of Gabor characteristic and dictionary learning
Technical field
The invention belongs to image procossing, pattern-recognition, computer vision and human-computer interaction technique field, relate to a kind of human face posture recognition methods, be specifically related to a kind of human face posture recognition methods based on dictionary learning and rarefaction representation.
Background technology
Human face modeling has huge application prospect in intelligent video monitoring, recognition of face, man-machine interaction and field of virtual reality.Such as, in intelligent video monitoring, human face modeling can be applied to driving supervisory system, identifies that whether driver focuses one's attention on driving, avoids the generation of collision situation by the human face posture change monitoring driver.In addition, the accuracy of human face modeling to recognition of face has a great impact, many face recognition algorithms can reach good discrimination to front face image, but for the non-frontal facial image of multi-pose, its recognition accuracy meeting degradation, and be a kind of important channel solving plurality of human faces gesture recognition by human face posture pre-estimation.
Current existing human face posture detection method can be divided three classes substantially: texture subspace method, 3D method, other class methods.First kind method is by realizing the Detection and estimation of attitude based on the learning method of 2D face outward appearance.Wherein more typically there are principal component analysis (PCA) and linear discriminant (LDA) etc.Because PCA is a kind of linear dimension reduction method, and human face posture 3D to rotate in the degree that alters a great deal be a kind of nonlinearities change.Therefore scholars use kernel principal component analysis (KPCA), and Manifold learning method solves this nonlinearities change problem.But kernel method and Manifold learning method have a defect: along with face training sample increases, it is difficult to isolate identity and attitude.This just means, when face training storehouse is enough large, the accuracy rate of Attitude estimation can change according to the difference of people.The maximum feature of first kind method is that processing speed is fast, easily realizes, but needs the training by great amount of samples, comparatively responsive to the change such as illumination, expression of face, particularly declines obviously to its accuracy rate of video human face image of illumination extreme difference.
Equations of The Second Kind method thinks that human face posture detects an inherently 3D problem, only has the essential characteristic that could be characterized human face posture by 3D information.Therefore these class methods characterize different attitude often through extraction 3D feature, or utilize the multiple image under different visual angles, rebuild the detection of the 3D model realization attitude of face in three dimensions.These class methods often to the size of image and quality requirements very high, and a large amount of operation time can be spent.Equations of The Second Kind method can obtain very high-accuracy by 3D method, but real-time is not high, simultaneously to the ultra-low resolution in video monitoring with to block facial image effect be not fine.
3rd class methods are some non-mainstream methods, can only solve subproblem in human face modeling and maybe can only be applied to some specific occasion.Such as, Rafael the face pose estimation of polyphaser is proposed Deng people.In order to correctly estimate human face posture, the 6 width images needing to utilize 6 cameras all around to take pictures in their method carry out fusion and differentiate.The method that the people such as J.Nuevo propose block cluster carries out human face modeling, achieves good effect, but the attitude that their method is estimated is limited in scope (attitudes vibration that can only identify 45 degree of scopes).The people such as the Chen Zhenxue of Shandong University propose leg-of-mutton face pose estimation, obtain the accuracy rate of about 91%, but their method can only deflect effectively around Y and Z axis face, and invalid for rotating human face posture around X-axis, namely loses efficacy to the upper and lower rotational case of face.
Want computing machine to possess the gesture recognition ability the same with the mankind at present to be also difficult to, main cause is illumination, noise, block, the change of the factor such as resolution, identity, expression all can produce tremendous influence to the accuracy of Attitude estimation, the impact how eliminating these factors is the problem needing solution at present badly.
Summary of the invention
The object of the invention is to the deficiency overcoming above method, a kind of human face posture recognition methods based on Gabor characteristic and dictionary learning is proposed, the problem such as solve illumination, the noise in human face posture identification and block, the identifying front, come back, nod of robustness, left avertence turns, left side face, right avertence turn and right side face.
Object of the present invention is achieved through the following technical solutions:
Based on a human face posture recognition methods for Gabor characteristic and dictionary learning, comprise the following steps:
Gabor characteristic extraction is carried out to the human face posture image to be identified of online input, builds Gabor characteristic vector y;
Use the complete dictionary of attitude to carry out linear combination expression to described Gabor characteristic vector y, set up sparse representation model and solve coefficient vector, wherein Gabor characteristic vector m is Gabor characteristic vector dimension;
Coefficient vector according to the above-mentioned linear combination solved carries out human face posture Classification and Identification.
Further, described human face posture is divided into 7 different attitude classifications, is defined as that left avertence turns, left side face, right avertence turn respectively, right side face, front, comes back and nod, the subspace that its each correspondence is different.
Further, the complete dictionary of attitude is used to carry out before linear combination represents to described Gabor characteristic vector y, also comprise the training step of the complete dictionary of described attitude, the complete dictionary of wherein said attitude comprise corresponding unscreened human face posture the first attitude complete dictionary D and to there being the complete dictionary D of human face posture second attitude blocked e, described first attitude complete dictionary D and the complete dictionary D of described second attitude etraining independently complete.
Further, the training process of described first attitude complete dictionary D is specific as follows:
Collect the human face posture image pattern of each attitude classification respectively, and the human face posture Gabor characteristic training set that Gabor filtering process and feature extraction dyad form each attitude classification is carried out to described human face posture image pattern;
Use K-SVD to carry out training optimization to the human face posture Gabor characteristic training set of every class attitude classification and draw best sub-dictionary D respectively i, i=1,2 ..., 7;
By sub-for all kinds of the best dictionary D iform the complete dictionary D=of the first attitude [D 1, D 2..., D 7].
Further, the Gabor characteristic of described human face posture image is extracted as:
S = ( a 0,0 ( ρ ) ; a 1,0 ( ρ ) ; . . . ; a 7,4 ( ρ ) )
Wherein, by the mould to Gabor filter factor carry out ρ sampling and the column vector that obtains, μ, v are direction and the yardstick of Gabor filter, adopt the Gabor filter of direction and Scale invariant to describe the feature of human face posture image, for face pose presentation and Gabor core ψ μ, vconvolution, Gabor core is defined as:
ψ μ , v ( z ) = | | k μ , v | | 2 σ 2 e - | | k μ , v | | 2 | | z | | 2 / 2 σ 2 ( e i k μ , v z - e - σ 2 / 2 )
Wherein, z (x, y) represents pixel; for small echo item, k v=k max/ f v, φ μ=π μ/8, σ=1.5 π controls the ratio of Gauss's window width and wavelength.
Further, when the human face posture image to be identified of described online input is unscreened human face posture image, extract the Gabor characteristic vector of described human face posture image to be estimated m is Gabor characteristic vector dimension, y is regarded as the complete dictionary of described first attitude linear combination represent: wherein N is the total number of dictionary atom, and described sparse representation model is
Further, described sparse representation model
Solve by being with the least square method of sparse constraint:
x = min x | | y - Dx | | F 2 + λ | | x | | 1
Wherein, λ is balance factor, plays Equilibrium fitting error and openness effect.
Further, the human face posture image to be identified of described online input is when having the human face posture image blocked, and extracts the Gabor characteristic vector of described human face posture image to be estimated m is Gabor characteristic vector dimension, is regarded as by y by the described first complete dictionary D and the complete dictionary of the second attitude linear combination represent:
y = y 0 + e 0 = Dx + e 0 = [ D , D e ] x x e = Bω ,
Wherein unobstructed image y 0with block error image e 0respectively can by the first attitude complete dictionary D and the complete dictionary of the second attitude rarefaction representation, D efor orthogonal matrices, described sparse representation model is
Further, described sparse representation model following optimization problem is changed into by there being the human face posture problem of image recognition of blocking:
This problem can be solved by the linearity specifications method of standard.
Further, the described coefficient vector according to the above-mentioned linear combination solved carries out human face posture Classification and Identification and adds up specifically by carrying out validity to the coefficient vector of the described linear combination solved, using the maximum foundation as judging classification of aggregate-value, namely
max i p i ( y ) = Σ i δ i ( f ( x ^ 1 ) )
Wherein, f () is special function, and the negative factor for the expression coefficient by sparse representation model sets to 0, selection of Function, for only selecting the expression coefficient of sparse representation model in expression coefficient corresponding to the i-th class subspace matrices, and other is represented that coefficient sets to 0, p iy () is that the validity of the expression coefficient of subspace that in training sample set, the i-th class attitude classification the is corresponding sparse representation model corresponding with it adds up the factor.
The present invention has following advantage and effect relative to prior art:
1) current most of face gesture recognition method is only classified to face deflection, and the present invention not only can classify to face deflection, with carrying out upper and lower attitude classification.The present invention can identify the roughly left and right deflection state up and down of facial image at about 0.3 second, accuracy of identification, more than 95%, can be applied to the field such as safe driving and man-machine interaction preferably.
2) owing to the present invention is based on dictionary learning and rarefaction representation principle, so illumination, noise in human face posture identification can be solved, express one's feelings and the difficult problem such as to block simultaneously;
3) because Gabor filter can from the local feature of multiple dimensioned multi-direction extraction image.Therefore, the present invention adopts the Gabor characteristic of facial image as the atom vector of super complete dictionary.The method improves the stability of human face posture identification.
Accompanying drawing explanation
In order to be illustrated more clearly in the technical scheme in the embodiment of the present invention, be briefly described to the accompanying drawing used required in embodiment or description of the prior art below, apparently, accompanying drawing in the following describes is only some embodiments of the present invention, for those of ordinary skill in the art, under the prerequisite not paying creative work, other accompanying drawing can also be obtained according to these accompanying drawings.
The division classification schematic diagram of human face posture image in Fig. 1 the present invention;
The main-process stream schematic diagram of Fig. 2 human face posture recognition methods based on Gabor characteristic and dictionary learning disclosed by the invention;
In Fig. 3 the present invention, human face posture image Gabor characteristic extracts schematic diagram;
Based on the human face posture classification schematic diagram of rarefaction representation in Fig. 4 human face posture recognition methods disclosed by the invention.
Embodiment
The technological means realized for making the present invention, creation characteristic, reach object and effect is easy to understand, developing simultaneously referring to accompanying drawing, the present invention is described in more detail for embodiment.Should be appreciated that specific embodiment described herein only in order to explain the present invention, be not intended to limit the present invention.
Term " first ", " second ", " the 3rd " and " the 4th " etc. in instructions of the present invention and claims and above-mentioned accompanying drawing are for distinguishing different object, instead of for describing particular order.In addition, term " comprises " and " having " and their any distortion, and intention is to cover not exclusive comprising.Such as contain the process of series of steps or unit, method, system, product or equipment and be not defined in the step or unit listed, but also comprise the step or unit do not listed alternatively, or also comprise alternatively for other intrinsic step of these processes, method, product or equipment or unit.
To be described in detail respectively according to embodiment below.
Embodiment
Based on the human face posture recognition methods of Gabor characteristic and dictionary learning disclosed in the embodiment of the present invention, mainly carry out human face posture classification according to the thought of dictionary learning and rarefaction representation.
In advance human face posture is divided, turn to different subspaces by discrete for human face posture, the corresponding a kind of human face posture classification of each word space.The definition of the division subspace corresponding to it of 7 kinds of inhomogeneity attitude classifications is adopted in the embodiment of the present invention, by discrete for human face posture be 7 different subspace, i.e. left side 1, left side 2, right side 1 and right side 2, front, new line, first-class 7 the attitude classifications of point, as shown in Figure 1, be defined as that left avertence turns, left side face, right avertence turn respectively, right side face, front, come back and nod.
The training of attitude dictionary and attitude Classification and Identification two large divisions is comprised based on the human face posture recognition methods of Gabor characteristic and dictionary learning disclosed in the embodiment of the present invention.Wherein attitude dictionary training part comprises again Gabor characteristic extraction, and K-SVD dictionary is optimized and built three steps such as the complete dictionary of attitude; Attitude Classification and Identification part comprises Gabor characteristic and extracts, and sparse representation model coupling solves and 3 steps such as attitude Classification and Identification.One by one each step is described in detail below, based on the human face posture recognition methods of Gabor characteristic and dictionary learning flow process as shown in Figure 2:
Step S1: attitude dictionary training
This part is that off-line carries out.First, collect the human face posture image pattern of each attitude classification respectively, and the human face posture Gabor characteristic training set that Gabor filtering process and feature extraction dyad form each attitude classification is carried out to described human face posture image pattern, human face posture is divided into 7 different subspace in the present embodiment, i.e. left side 1, left side 2, right side 1 and right side 2, front, new line, first-class 7 the attitude classifications of point, therefore respectively to quantizing the different human face posture Gabor characteristic training set of generation 7.Then, adopt K-SVD method to carry out training optimization to Different categories of samples collection and obtain all kinds of sub-dictionary.Finally, all kinds of sub-dictionary is merged the complete attitude dictionary of composition.
Wherein the complete dictionary of attitude comprises the first attitude complete dictionary D and the complete dictionary D of the second attitude e, wherein the first attitude complete dictionary D corresponding unscreened human face posture, wherein complete dictionary D of the second attitude ethe human face posture that correspondence is blocked, has another name called and blocks corrosion dictionary D e.Described first attitude complete dictionary D and the complete dictionary D of described second attitude etraining independently complete.
Concrete steps and embodiment comprise:
Step S1a:Gabor feature extraction
The Gabor characteristic of human face posture image is extracted as:
S = ( a 0,0 ( ρ ) ; a 1,0 ( ρ ) ; . . . ; a 7,4 ( ρ ) )
Wherein, by the mould to Gabor filter factor carry out ρ sampling and the column vector that obtains, μ, v are direction and the yardstick of Gabor filter, adopt the Gabor filter of direction and Scale invariant to describe the feature of human face posture image. for face pose presentation and Gabor core ψ μ, vconvolution.Gabor core is defined as:
ψ μ , v ( z ) = | | k μ , v | | 2 σ 2 e - | | k μ , v | | 2 | | z | | 2 / 2 σ 2 ( e i k μ , v z - e - σ 2 / 2 )
Wherein, z (x, y) represents pixel; for small echo item, k v=k max/ f v, φ μ=π μ/8, σ=1.5 π controls the ratio of Gauss's window width and wavelength.The value of each parameter of the present invention is k max=pi/2, ρ ≈ 40, μ=0 ..., 7}, v={0 ..., 4}.The example that Gabor characteristic is extracted as shown in Figure 3.
Step S1b:K-SVD optimizes training
First, in employing, one step process carries out Gabor characteristic extraction composition training set to all training samples, then, adopts K-SVD method to be optimized training obtain the complete dictionary D of attitude to training set.K-SVD is a kind of dictionary training algorithm of classics, according to error minimum principle, carries out SVD decomposition to error term, selects the decomposition item making error minimum as the dictionary atom upgraded and corresponding atom coefficient, through continuous iteration thus the solution be optimized.
Detailed process is: first, uses K-SVD to carry out training optimization draw best sub-dictionary D to every class attitude sample set i; Then, by all kinds of sub-dictionary D iform the complete dictionary D=of the first attitude [D 1, D 2..., D 7].
The present invention adopts K-SVD to carry out training optimization and draws best sub-dictionary D i.K-SVD is the process of a kind of cross-iteration rarefaction representation and dictionary updating.For having n iindividual sample training collection, the objective function of K-SVD is:
min D , X | | Y - D i X | | F 2 s . t . ∀ i , | | x i | | 0 ≤ T 0
Wherein, for the sparse coefficient collection of sample, be i-th training sample y irarefaction representation coefficient vector, k is dictionary D iatom number.K-SVD algorithm comprises two processes.In first process, fixing dictionary D i, therefore above formula objective function becomes the optimization problem solving rarefaction representation coefficient, has many kinds of method for solving at present.Second process is that the sparse coefficient utilizing the first process to solve upgrades dictionary D i.This process is by progressive updating D ioften row d kwith i-th row of X realize.And d k, solve, be realized by svd (SVD).
Step S1c: build the complete dictionary of attitude
The sub-dictionary D after all kinds of optimization is obtained by step 1 (b) i, all sub-dictionaries are merged the super complete dictionary of composition.Because human face posture is divided into 7 kinds by the present invention, therefore the complete dictionary of human face posture is: wherein, m is sample characteristics dimension, and n is the total number of dictionary atom.
Step 2: attitude Classification and Identification
This part is at line process.First, to online its Gabor characteristic vector of input human face posture image zooming-out.Then, use the complete dictionary of attitude built to carry out linear combination expression to the Gabor characteristic vector of the human face posture image of online input, set up sparse representation model, and adopt Lars-Lasso algorithm to carry out Model Matching to solve.Finally, utilize and solve coefficient and carry out attitude classification.Concrete steps and embodiment comprise:
Step S2a:Gabor feature extraction
This part is identical with step S1a, extracts the Gabor characteristic vector y of the human face posture image of online input.
Step S2b: sparse representation model mates and solves
After extracting the Gabor characteristic vector y of the human face posture image of online input, the complete dictionary of attitude built is used to carry out linear combination expression, vectorization composition human face posture Gabor characteristic training set.
Wherein the complete dictionary of attitude comprises the first attitude complete dictionary D and the complete dictionary D of the second attitude e, wherein the first attitude complete dictionary D corresponding unscreened human face posture, wherein complete dictionary D of the second attitude eto the human face posture blocked should be had, have another name called and block corrosion dictionary D e.Extract the Gabor characteristic vector of human face posture image to be estimated m is Gabor characteristic vector dimension, y is regarded as the complete dictionary of the first attitude the linear combination of (N is the total number of dictionary atom) represents:
Wherein it is a very sparse linear combination coefficient vector.Ideally, if test sample y belongs to the i-th class attitude, so x except i-th class of its correspondence be not except 0, remaining is all 0.The attitude classification of test pattern y can be obtained according to coefficient vector x.Therefore, human face posture identification problem is converted into the problem of solve linear equations.If m > is N, system of equations is overdetermination, and x has unique solution or without solution.If m < is N, x has multiple solution.In human face posture identification application, often m < N, namely needs to solve underdetermined system of equations problem, follows the trail of, the achievement in research of compressed sensing and sparse representation method shows according to atom, if the solution of the above-mentioned underdetermined system of equations is enough sparse, and just can be by the minimization problem of norm regularization is obtained:
This problem can be solved by the linearity specifications method of standard.
Namely sparse representation model coupling solves following system of linear equations:
(unobstructed) or (blocking)
Only having the coefficient vector x (for unobstructed) of linear combination or ω (to block) in above-mentioned system of equations for unknown, therefore can solve by being with the least square method of sparse constraint:
x = min x | | y - Dx | | F 2 + &lambda; | | x | | 1
Wherein, λ is balance factor, and play Equilibrium fitting error and openness effect, the present invention gets λ=0.01.Above-mentioned minimum Optimized model the present invention adopts minimum angles to return (LeastAngleRegression, LAR) Lars-Lasso algorithm carries out solving (B.Efron, T.Hastie, I.JohnstoneandR.Tibshirani, " LeastAngleRegression ", AnnalsofStatistics, 32,407-499,2004).Similar for there being circumstance of occlusion to solve.
Step S2c: human face posture identification
Coefficient vector x according to the linear combination solved carries out human face posture Classification and Identification.In theory should only with the related intimate of the test sample book of a certain class attitude in training sample, the sign coefficient non-zero of its correspondence.Therefore, can clearly classify to this attitude to be measured.But, because noise and modeling error can cause the part of sample to be tested uncorrelated categorized representation coefficient to occur the nonzero element that numerical value is very little, bring impact to correct classification.Considering that sample to be tested closes rarefaction representation based on entirety training set at training sample set, considering that sample to be tested is carried out validity at the expression coefficient that training sample set closes to be added up accordingly, using the maximum foundation as judging classification of aggregate-value.
The coefficient vector x solved according to step S2b can obtain the attitude classification of human face posture image y to be estimated.As shown in Figure 4, test human face posture belongs to the 4th class, and therefore its non-zero coefficient mainly concentrates on x 4in, the coefficient overwhelming majority of other classification is 0.
In order to reduce the impact of noise and modeling error; Considering that sample to be tested closes rarefaction representation based on entirety training set at training sample set simultaneously, considering that the expression coefficient validity by sample to be tested closes at training sample set adds up accordingly, using the maximum foundation as judging classification of aggregate-value.Therefore the attitude based on rarefaction representation is categorized as:
max i p i ( y ) = &Sigma; i &delta; i ( f ( x ^ 1 ) )
Wherein, f () is a special function, represents and the negative factor of rarefaction representation coefficient is set to 0; be a Selection of Function, only select rarefaction representation coefficient in expression coefficient corresponding to the i-th class subspace matrices, and other to be set to 0; p iy () is that the validity of the rarefaction representation coefficient that in training sample set, the i-th class subspace is corresponding with it adds up the factor.
Correspondence blocks human face posture process, solves human face posture image occlusion issue by increasing the method for blocking attitude face dictionary.Due to, above-mentioned human face posture recognition methods make use of rarefaction representation sorting technique principle dexterously.Therefore, to human face light, noise, expression and change resolution, there is robustness.In order to solve human face posture occlusion issue: first, setting up one again according to step S1 and blocking corrosion dictionary D e.Then, the Gabor characteristic vector of human face posture image to be estimated is extracted according to step S2a and step S2b m is Gabor characteristic vector dimension, is regarded as by y by unobstructed dictionary D and blocks corrosion dictionary collinearity combination represents that (the dictionary learning model in corresponding step S2b) is:
y = y 0 + e 0 = Dx + e 0 = [ D , D e ] x x e = B&omega;
Wherein, unobstructed image y 0with block error image e 0by dictionary D and dictionary can be blocked respectively rarefaction representation.D efor orthogonal matrices.Finally, block human face posture identification problem and change into following optimization problem:
This problem can be solved by the linearity specifications method of standard.Namely sparse representation model coupling solves following system of linear equations:
(blocking)
The coefficient vector ω (to block) of linear combination is only had for unknown in above-mentioned system of equations.
Arriving this, blocking dictionary D by increasing emethod successfully solve face occlusion issue.
Above-described embodiment is the present invention's preferably embodiment; but embodiments of the present invention are not restricted to the described embodiments; change, the modification done under other any does not deviate from Spirit Essence of the present invention and principle, substitute, combine, simplify; all should be the substitute mode of equivalence, be included within protection scope of the present invention.

Claims (10)

1., based on a human face posture recognition methods for Gabor characteristic and dictionary learning, it is characterized in that, comprise the following steps:
Gabor characteristic extraction is carried out to the human face posture image to be identified of online input, builds Gabor characteristic vector y;
Use the complete dictionary of attitude to carry out linear combination expression to described Gabor characteristic vector y, set up sparse representation model and solve coefficient vector, wherein Gabor characteristic vector m is Gabor characteristic vector dimension;
Coefficient vector according to the above-mentioned linear combination solved carries out human face posture Classification and Identification.
2. the human face posture recognition methods based on Gabor characteristic and dictionary learning according to claim 1, is characterized in that,
Described human face posture is divided into 7 different attitude classifications, is defined as that left avertence turns, left side face, right avertence turn respectively, right side face, front, comes back and nod, the subspace that its each correspondence is different.
3. the human face posture recognition methods based on Gabor characteristic and dictionary learning according to claim 1, is characterized in that,
The complete dictionary of attitude is used to carry out before linear combination represents to described Gabor characteristic vector y, also comprise the training step of the complete dictionary of described attitude, the complete dictionary of wherein said attitude comprise corresponding unscreened human face posture the first attitude complete dictionary D and to there being the complete dictionary D of human face posture second attitude blocked e, described first attitude complete dictionary D and the complete dictionary D of described second attitude etraining independently complete.
4. the human face posture recognition methods based on Gabor characteristic and dictionary learning according to claim 3, it is characterized in that, the training process of described first attitude complete dictionary D is specific as follows:
Collect the human face posture image pattern of each attitude classification respectively, and the human face posture Gabor characteristic training set that Gabor filtering process and feature extraction dyad form each attitude classification is carried out to described human face posture image pattern;
Use K-SVD to carry out training optimization to the human face posture Gabor characteristic training set of every class attitude classification and draw best sub-dictionary D respectively i, i=1,2 ..., 7;
By sub-for all kinds of the best dictionary D iform the complete dictionary D=of the first attitude [D 1, D 2..., D 7].
5. the human face posture recognition methods based on Gabor characteristic and dictionary learning according to claim 4, it is characterized in that, the Gabor characteristic of described human face posture image is:
S = ( a 0 , 0 ( &rho; ) , a 1 , 0 ( &rho; ) , ... , a 7 , 4 ( &rho; ) )
Wherein, by the mould to Gabor filter factor carry out ρ sampling and the column vector that obtains, μ, v are direction and the yardstick of Gabor filter, for face pose presentation and Gabor core ψ μ, vconvolution, Gabor core is defined as:
&psi; &mu; , v ( z ) = | | k &mu; , v | | 2 &sigma; 2 e - | | k &mu; , v | | 2 | | z | | 2 / 2 &sigma; 2 ( e ik &mu; , v z - e - &sigma; 2 / 2 )
Wherein, z (x, y) represents pixel; for small echo item, k v=k max/ f v, φ μ=π μ/8, σ=1.5 π controls the ratio of Gauss's window width and wavelength.
6. the human face posture recognition methods based on Gabor characteristic and dictionary learning according to claim 3, it is characterized in that, when the human face posture image to be identified of described online input is unscreened human face posture image, extract the Gabor characteristic vector of described human face posture image to be estimated m is Gabor characteristic vector dimension, y is regarded as the complete dictionary of described first attitude linear combination represent: wherein N is the total number of dictionary atom, and described sparse representation model is
7. the human face posture recognition methods based on Gabor characteristic and dictionary learning according to claim 6, is characterized in that, described sparse representation model
Solve by being with the least square method of sparse constraint:
x = m i n x | | y - D x | | F 2 + &lambda; | | x | | 1
Wherein, λ is balance factor, plays Equilibrium fitting error and openness effect.
8. the human face posture recognition methods based on Gabor characteristic and dictionary learning according to claim 3, it is characterized in that, the human face posture image to be identified of described online input is when having the human face posture image blocked, and extracts the Gabor characteristic vector of described human face posture image to be estimated m is Gabor characteristic vector dimension, is regarded as by y by the first described attitude complete dictionary D and the complete dictionary of the second attitude collinearity combination represents:
y = y 0 + e 0 = D x + e 0 = &lsqb; D , D e &rsqb; x x e = B &omega; ,
Wherein unobstructed image y 0with block error image e 0respectively can by the first attitude complete dictionary D and the complete dictionary of the second attitude rarefaction representation, D efor orthogonal matrices, described sparse representation model is
9. the human face posture recognition methods based on Gabor characteristic and dictionary learning according to claim 8, is characterized in that, described sparse representation model following optimization problem is changed into by there being the human face posture problem of image recognition of blocking:
(l 1):ω=argmin||ω|| 1s.t.Bω=y,
This problem can be solved by the linearity specifications method of standard.
10. the human face posture recognition methods based on Gabor characteristic and dictionary learning according to claim 1, it is characterized in that, the described coefficient vector according to the above-mentioned linear combination solved carries out human face posture Classification and Identification and adds up specifically by carrying out validity to the coefficient vector of the described linear combination solved, using the maximum foundation as judging classification of aggregate-value, namely
m a x i p i ( y ) = &Sigma; i &delta; i ( f ( x ^ 1 ) )
Wherein, f () is special function, and the negative factor for the expression coefficient by sparse representation model sets to 0, selection of Function, for only selecting the expression coefficient of sparse representation model in expression coefficient corresponding to the i-th class subspace matrices, and other is represented that coefficient sets to 0, p iy () is that the validity of the expression coefficient of subspace that in training sample set, the i-th class attitude classification the is corresponding sparse representation model corresponding with it adds up the factor.
CN201510796987.7A 2015-11-18 2015-11-18 Human face posture recognition methods based on Gabor characteristic and dictionary learning Active CN105550634B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510796987.7A CN105550634B (en) 2015-11-18 2015-11-18 Human face posture recognition methods based on Gabor characteristic and dictionary learning

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510796987.7A CN105550634B (en) 2015-11-18 2015-11-18 Human face posture recognition methods based on Gabor characteristic and dictionary learning

Publications (2)

Publication Number Publication Date
CN105550634A true CN105550634A (en) 2016-05-04
CN105550634B CN105550634B (en) 2019-05-03

Family

ID=55829817

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510796987.7A Active CN105550634B (en) 2015-11-18 2015-11-18 Human face posture recognition methods based on Gabor characteristic and dictionary learning

Country Status (1)

Country Link
CN (1) CN105550634B (en)

Cited By (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106066992A (en) * 2016-05-13 2016-11-02 哈尔滨工业大学深圳研究生院 Differentiation dictionary learning algorithm based on adaptive local constraint and face identification system
CN106980825A (en) * 2017-03-15 2017-07-25 广东顺德中山大学卡内基梅隆大学国际联合研究院 A kind of face posture sorting technique based on normalization pixel difference feature
CN107066955A (en) * 2017-03-24 2017-08-18 武汉神目信息技术有限公司 A kind of method that whole face is reduced from local facial region
CN107239732A (en) * 2017-04-18 2017-10-10 广东工业大学 A kind of tired expression recognition method based on Gabor characteristic and rarefaction representation
CN107330412A (en) * 2017-07-06 2017-11-07 湖北科技学院 A kind of face age estimation method based on depth rarefaction representation
CN107908999A (en) * 2017-06-23 2018-04-13 广东工业大学 A kind of tired expression recognition method of architectural feature stratification
CN108090409A (en) * 2017-11-06 2018-05-29 深圳大学 Face identification method, device and storage medium
CN108182429A (en) * 2018-02-01 2018-06-19 重庆邮电大学 The method and device of facial image feature extraction based on symmetry
CN109766813A (en) * 2018-12-31 2019-05-17 陕西师范大学 Dictionary learning face identification method based on symmetrical face exptended sample
CN109858342A (en) * 2018-12-24 2019-06-07 中山大学 A kind of face pose estimation of fusion hand-designed description son and depth characteristic
CN110955879A (en) * 2019-11-29 2020-04-03 腾讯科技(深圳)有限公司 Device control method, device, computer device and storage medium
CN111462238A (en) * 2020-04-03 2020-07-28 清华大学 Attitude estimation optimization method and device and storage medium
CN112215034A (en) * 2019-07-10 2021-01-12 重庆邮电大学 Occlusion face recognition method based on robust representation and classification of matrix decomposition and Gabor features
CN113343885A (en) * 2021-06-23 2021-09-03 杭州天翼智慧城市科技有限公司 Feature point reconstruction method for complex human face posture

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101079103A (en) * 2007-06-14 2007-11-28 上海交通大学 Human face posture identification method based on sparse Bayesian regression
US20090074259A1 (en) * 2005-07-29 2009-03-19 Madalina Baltatu Automatic biometric identification based on face recognition and support vector machines
CN104392246A (en) * 2014-12-03 2015-03-04 北京理工大学 Inter-class inner-class face change dictionary based single-sample face identification method

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090074259A1 (en) * 2005-07-29 2009-03-19 Madalina Baltatu Automatic biometric identification based on face recognition and support vector machines
CN101079103A (en) * 2007-06-14 2007-11-28 上海交通大学 Human face posture identification method based on sparse Bayesian regression
CN104392246A (en) * 2014-12-03 2015-03-04 北京理工大学 Inter-class inner-class face change dictionary based single-sample face identification method

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
廖海斌 等: ""基于稀疏表示的人脸姿态估计研究"", 《电视技术》 *
杜海顺 等: ""基于Gabor低秩恢复稀疏表示分类的人脸图像识别方法"", 《电子学报》 *
詹曙 等: ""基于Gabor特征和字典学习的高斯混合稀疏表示图像识别"", 《电子学报》 *

Cited By (20)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106066992A (en) * 2016-05-13 2016-11-02 哈尔滨工业大学深圳研究生院 Differentiation dictionary learning algorithm based on adaptive local constraint and face identification system
CN106980825A (en) * 2017-03-15 2017-07-25 广东顺德中山大学卡内基梅隆大学国际联合研究院 A kind of face posture sorting technique based on normalization pixel difference feature
CN106980825B (en) * 2017-03-15 2020-11-13 广东顺德中山大学卡内基梅隆大学国际联合研究院 Human face posture classification method based on normalized pixel difference features
CN107066955B (en) * 2017-03-24 2020-07-17 武汉神目信息技术有限公司 Method for restoring whole human face from local human face area
CN107066955A (en) * 2017-03-24 2017-08-18 武汉神目信息技术有限公司 A kind of method that whole face is reduced from local facial region
CN107239732A (en) * 2017-04-18 2017-10-10 广东工业大学 A kind of tired expression recognition method based on Gabor characteristic and rarefaction representation
CN107908999A (en) * 2017-06-23 2018-04-13 广东工业大学 A kind of tired expression recognition method of architectural feature stratification
CN107330412B (en) * 2017-07-06 2021-03-26 湖北科技学院 Face age estimation method based on depth sparse representation
CN107330412A (en) * 2017-07-06 2017-11-07 湖北科技学院 A kind of face age estimation method based on depth rarefaction representation
CN108090409A (en) * 2017-11-06 2018-05-29 深圳大学 Face identification method, device and storage medium
CN108090409B (en) * 2017-11-06 2021-12-24 深圳大学 Face recognition method, face recognition device and storage medium
CN108182429A (en) * 2018-02-01 2018-06-19 重庆邮电大学 The method and device of facial image feature extraction based on symmetry
CN108182429B (en) * 2018-02-01 2022-01-28 重庆邮电大学 Method and device for extracting facial image features based on symmetry
CN109858342A (en) * 2018-12-24 2019-06-07 中山大学 A kind of face pose estimation of fusion hand-designed description son and depth characteristic
CN109858342B (en) * 2018-12-24 2021-06-25 中山大学 Human face posture estimation method integrating manual design descriptor and depth feature
CN109766813A (en) * 2018-12-31 2019-05-17 陕西师范大学 Dictionary learning face identification method based on symmetrical face exptended sample
CN112215034A (en) * 2019-07-10 2021-01-12 重庆邮电大学 Occlusion face recognition method based on robust representation and classification of matrix decomposition and Gabor features
CN110955879A (en) * 2019-11-29 2020-04-03 腾讯科技(深圳)有限公司 Device control method, device, computer device and storage medium
CN111462238A (en) * 2020-04-03 2020-07-28 清华大学 Attitude estimation optimization method and device and storage medium
CN113343885A (en) * 2021-06-23 2021-09-03 杭州天翼智慧城市科技有限公司 Feature point reconstruction method for complex human face posture

Also Published As

Publication number Publication date
CN105550634B (en) 2019-05-03

Similar Documents

Publication Publication Date Title
CN105550634A (en) Facial pose recognition method based on Gabor features and dictionary learning
CN107657279B (en) Remote sensing target detection method based on small amount of samples
CN107122752B (en) Human body action comparison method and device
US20190318158A1 (en) Multi-pose face feature point detection method based on cascade regression
CN104680144B (en) Based on the lip reading recognition methods and device for projecting very fast learning machine
CN105138998B (en) Pedestrian based on the adaptive sub-space learning algorithm in visual angle recognition methods and system again
CN111325146B (en) Truck type and axle type identification method and system
CN103218609B (en) A kind of Pose-varied face recognition method based on hidden least square regression and device thereof
Li et al. Fabric defect detection based on biological vision modeling
CN103530633B (en) Semantic mapping method of local invariant feature of image and semantic mapping system
CN102136062B (en) Human face retrieval method based on multi-resolution LBP (local binary pattern)
CN108509833B (en) Face recognition method, device and equipment based on structured analysis dictionary
CN103699874B (en) Crowd abnormal behavior identification method based on SURF (Speed-Up Robust Feature) stream and LLE (Locally Linear Embedding) sparse representation
CN103065158B (en) The behavior recognition methods of the ISA model based on relative gradient
CN106909946A (en) A kind of picking system of multi-modal fusion
CN104834941A (en) Offline handwriting recognition method of sparse autoencoder based on computer input
CN102332086A (en) Facial identification method based on dual threshold local binary pattern
CN109726671B (en) Action recognition method and system from global to category feature expression learning
CN104408405A (en) Face representation and similarity calculation method
CN105205449A (en) Sign language recognition method based on deep learning
Zamani et al. Saliency based alphabet and numbers of American sign language recognition using linear feature extraction
CN110826462A (en) Human body behavior identification method of non-local double-current convolutional neural network model
He et al. Automatic recognition of traffic signs based on visual inspection
CN102855486A (en) Generalized image target detection method
CN104794498A (en) Image classification method based on combination of SRC and MFA

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant