CN100492399C - Method for making human face posture estimation utilizing dimension reduction method - Google Patents

Method for making human face posture estimation utilizing dimension reduction method Download PDF

Info

Publication number
CN100492399C
CN100492399C CNB2007100380763A CN200710038076A CN100492399C CN 100492399 C CN100492399 C CN 100492399C CN B2007100380763 A CNB2007100380763 A CN B2007100380763A CN 200710038076 A CN200710038076 A CN 200710038076A CN 100492399 C CN100492399 C CN 100492399C
Authority
CN
China
Prior art keywords
boltzmann machine
limited boltzmann
dimension
node
image
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CNB2007100380763A
Other languages
Chinese (zh)
Other versions
CN101021900A (en
Inventor
杨杰
杜春华
张田昊
署光
杨晓超
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shanghai Jiaotong University
Original Assignee
Shanghai Jiaotong University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shanghai Jiaotong University filed Critical Shanghai Jiaotong University
Priority to CNB2007100380763A priority Critical patent/CN100492399C/en
Publication of CN101021900A publication Critical patent/CN101021900A/en
Application granted granted Critical
Publication of CN100492399C publication Critical patent/CN100492399C/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Image Analysis (AREA)

Abstract

This invention discloses a method for estimating man-face gestures by a dimensionality reduction method including the following steps: 1, preprocessing man-face image trained samples of different gestures, 2, carrying out PCA process to processed data, 3, initializing limited nerve network of a Boltzmann machine, 4, pre-training the limited Boltzmann machine nerve network with the data processed by PCA, 5, adjusting the limited Boltzmqnn nerve network parameters, 6, identifying gestures of new man-face images, which reduces the error rate even further compared with the present technology and reduces its dimensions greatly.

Description

Utilize dimension reduction method to carry out the human face posture estimation approach
Technical field
The present invention relates to the method in a kind of image recognition technology field, specifically is that a kind of dimension reduction method that combines principal component analysis (PCA) (PCA) and limited Boltzmann machine neural network carries out the human face posture estimation approach.
Background technology
The research emphasis of recognition of face at present from the 2 D extension to the three-dimensional, and in the three-dimensional face identification the most the technology of core be exactly how to estimate the 3 d pose of this people's face according to two-dimension human face image.It is a classification problem in essence that attitude is estimated, judges promptly which attitude the people's face in the width of cloth facial image belongs to.But facial image is typical high dimensional data, and general sorting technique can not be applied directly to attitude to be estimated, therefore at first must carry out dimensionality reduction to higher-dimension people face data, and then the data behind the dimensionality reduction is carried out attitude estimate.
Find by prior art documents, G.G.Hilton etc. are at " Science " (science magazine, 2006 the 313rd volume the 504th page of the 5786th phase) " the Reducing the Dimensionality of Data withNeural Networks " that delivers (reducing the data dimension) with neural network, this article has proposed a kind of Nonlinear Dimension Reduction method based on limited Boltzmann machine neural network, experimental results show that this method can be used to attitude and estimate.Utilize the facial image data of a large amount of different attitudes that network is trained in advance and obtain initial weight, and then make network reach optimum condition to the weights adjustment of whole network with raw data.But the training time of this method sharply increases along with the rising of the number of training sample and dimension.But in practical application, facial image is subjected to extraneous interference and has comprised the information and the noise of a lot of redundancies in generative process, information that these are redundant and noise not only can increase computing time, also can influence the accuracy of identification in later stage, therefore be necessary to remove these redundant information and noises in real neural network that these data are utilized before carrying out dimension-reduction treatment, limited in addition Boltzmann machine neural network dimension reduction method also has defectives such as speed is slow, precision is low simultaneously.
Summary of the invention
The objective of the invention is to overcome deficiency of the prior art, provide a kind of dimension reduction method that utilizes to carry out the human face posture estimation approach, promptly at first raw data is carried out PCA (principal component analysis (PCA)) thereby processing removal redundant information and noise, and then utilize the data after neural net method is handled principal component analysis (PCA) to carry out the dimension-reduction treatment operation.This new dimension reduction method is applied to human face posture identification, thereby can provides human face posture information can under multi-angle, carry out recognition of face.
The present invention is achieved by the following technical solutions, the present invention at first carries out principal component analysis (PCA) to training sample and handles, remove redundant information and noise in the raw data, and the data after obtaining PCA and handling, use these data that limited Boltzmann machine neural network is trained in advance then, obtain pre-training network parameter, and adjust whole network parameter with the gradient descending method.The facial image of estimating for pending attitude carries out principal component analysis (PCA) to it and handles at last, and the data after again principal component analysis (PCA) being handled are sent into and carried out the gesture recognition classification in this neural network of succeeding in school.
The present invention specifically comprises the steps:
(1) the facial image training sample to different attitudes carries out pretreatment operation;
(2) pretreated data being carried out principal component analysis (PCA) handles;
(3) the limited Boltzmann machine neural network of initialization;
(4) with the limited Boltzmann machine neural network of the pre-training of the data after the principal component analysis (PCA) processing;
(5) adjust limited Boltzmann machine neural network parameter;
(6) new facial image is carried out gesture recognition;
Described step (1) is meant: for each facial image training sample, at first its be scaled high for h pixel, widely be the image of w pixel.Then the facial image behind this convergent-divergent is transformed to gray level image, and the gray-scale value of all pixels is normalized to [0,1], at last they are pulled into the vector that length is h * w, dimension that promptly should vector is h * w.
Described step (2), be meant: the pretreated data in the step (1) are carried out the principal component analysis (PCA) operation, remove redundant information and noise, keep 98% information, all data are all dropped to the s dimension from h * w, obtain average vector x and proper vector P simultaneously, if the vector data of original h * w dimension is expressed as X, s dimension data behind the dimensionality reduction is expressed as b, and X can be expressed as so: X=x+Pb, the data b behind the dimensionality reduction can be expressed as: b=P T(X-x).
Described step (3) is meant: setting this neural network has the L layer.The nodal point number of each layer is respectively N1, N2 ..., NL.The classification number is C, trains in advance and the number of times of adjusting parameter is respectively Pt and Pc.Number according to the network number of plies and every layer of node is determined network structure, and the random number between the generation [0,1] is as connecting weights between the network node simultaneously.
Described step (4), be meant: the node that the visual layers of the limited Boltzmann machine of ground floor in the network is set is corresponding one by one with s value in the middle vector of step (2), the nodal point number that is the visual layers of the limited Boltzmann machine of ground floor is s, train the weighting parameter between this limited Boltzmann machine visual layers node and the hidden layer node, train Pt time altogether.And then with the hidden layer node of the limited Boltzmann machine of ground floor as the limited Boltzmann machine visual layers of second layer node, train the weighting parameter between this limited Boltzmann machine visual layers node and the hidden layer node equally, also train Pt time.The rest may be inferred, i.e. the visual layers node of the limited Boltzmann machine of one deck is to train down the limited Boltzmann machine of one deck under the hidden layer node of the limited Boltzmann machine of the last layer conduct.So just finish the pre-training of whole network, also obtained simultaneously the parameter of the limited Boltzmann machine of each layer that train in advance.
Described step (5) is meant: to utilize the gradient descent method be the criterion backpropagation with the reconstruction error minimum and adjust weighting parameter between all limited Boltzmann machine visual layers nodes and the hidden layer node, and this step is carried out Pc time altogether.
Described step (6), be meant: for the facial image of the new pending gesture recognition of a width of cloth, earlier it is zoomed to the high h that is, wide is the image of w, and be the image transformation of this convergent-divergent gray level image, then the gradation of image value is normalized to [0,1], then the image after the normalization is pulled into the vectorial X that a length is h * w, and utilize b=P T(X-x) this vectorial dimension is dropped to s, at last the vector of this s dimension is sent into the network that has trained and carry out gesture recognition.
The human face posture method of estimation that the present invention proposes has quite high precision, does test with the face database of taking, and its identification error rate is 1.95%.The method of directly carrying out dimensionality reduction with limited Boltzmann machine nerve net with not handling through principal component analysis (PCA) is compared, and the error rate of this method further reduces.Simultaneously since the dimension of its data handle through principal component analysis (PCA) after dimensionality reduction greatly, the node number of each layer also reduces thereupon in the follow-up network, this has not only reduced the training time, test speed also greatly improves simultaneously.
Description of drawings
Fig. 1 is a present embodiment attitude synoptic diagram
Wherein: the attitude angle of 1 to 9 facial image of representing respectively is-90 ° ,-60 ° ,-45 ° ,-30 °, 0 °, 30 °, 45 °, 60 °, 90 °.
Fig. 2 is the result of present embodiment gesture recognition
Wherein: the attitude angle of facial image is 60 °.
Embodiment
Below in conjunction with accompanying drawing embodiments of the invention are elaborated: present embodiment has provided detailed embodiment and detailed process being to implement under the prerequisite with the technical solution of the present invention, but protection scope of the present invention is not limited to following embodiment.
Embodiment
1. (this face database includes the facial image of 9 different attitudes of 2270 people face database.As shown in Figure 1, the attitude of figure a, b, c, d, e, f, g, h, these 9 facial images of i is respectively-90 ° ,-60 ° ,-45 ° ,-30 °, 0 °, 30 °, 45 °, 60 °, 90 °.Image in the face database is divided into 9 classes according to its different attitude, and each class has 2270.) in all images to be scaled height be 30 pixels, wide is the image of 30 pixels, then the facial image behind this convergent-divergent is transformed to gray level image, and the grey scale pixel value of image is normalized to [0,1], at last gray level image being pulled into length is 900 vector.
2. all vector datas in the step (1) being carried out principal component analysis (PCA) handles, keep 98% information, finally the dimension of vector is dropped to 342 dimensions from 900 dimensions, obtain average vector x and proper vector P simultaneously, if the vector data of original 900 dimensions is expressed as X, 342 dimension data behind the dimensionality reduction are expressed as b, and X can be expressed as so: X=x+Pb, the data b behind the dimensionality reduction can be expressed as: b=P T(X-x).
3. setting this neural network haves three layers.The nodal point number of each layer is respectively 300,300, and 800.The classification number is 9, trains in advance and the number of times of adjusting parameter is respectively 50 and 100.Number according to the network number of plies and every layer of node is determined network structure, and the random number between the generation [0,1] is as connecting weights between the network node simultaneously.
4. the node that the visual layers of the limited Boltzmann machine of ground floor in the network is set is corresponding one by one with 342 values in the middle vector of step (2), the nodal point number that is the visual layers of the limited Boltzmann machine of ground floor is 342, train the weighting parameter between 300 nodes of 342 nodes of this limited Boltzmann machine visual layers and hidden layer, train altogether 50 times.And then with the hidden layer node of the limited Boltzmann machine of ground floor as the limited Boltzmann machine visual layers of second layer node, train the weighting parameter between 300 nodes of 300 nodes of this limited Boltzmann machine visual layers and hidden layer equally, also train 50 times.The rest may be inferred, i.e. the visual layers node of the limited Boltzmann machine of one deck is to train down the limited Boltzmann machine of one deck under the hidden layer node of the limited Boltzmann machine of the last layer conduct.So just finish the pre-training of whole network, also obtained simultaneously the parameter of the limited Boltzmann machine of each layer that train in advance.
5. to utilize the gradient descent method be the criterion backpropagation with the reconstruction error minimum and adjust weighting parameter between all limited Boltzmann machine visual layers nodes and the hidden layer node, and this step is carried out 100 times altogether.
6. for the facial image of the new pending gesture recognition of a width of cloth, earlier it being zoomed to height is 30, wide is 30 image, and be the image transformation of this convergent-divergent gray level image, then the gradation of image value is normalized to [0,1], then the image after the normalization being pulled into a length is 900 vector, and utilizes b=P T(X-x) this vectorial dimension is dropped to 342, at last the vector of these 342 dimensions is sent into the network that has trained and carry out gesture recognition.As shown in Figure 2, the image of this pending gesture recognition is one 60 ° a facial image, can correctly identify the attitude of this facial image with method of the present invention.
From as can be seen above, the human face posture recognition methods that present embodiment proposes can further be applied to three-dimensional face identification.

Claims (7)

1. one kind is utilized dimension reduction method to carry out the human face posture estimation approach, specifically comprises the steps:
(1) the facial image training sample to different attitudes carries out pretreatment operation;
(2) pretreated data being carried out principal component analysis (PCA) handles;
(3) the limited Boltzmann machine neural network of initialization;
(4) with the limited Boltzmann machine neural network of the pre-training of the data after the principal component analysis (PCA) processing;
(5) adjust limited Boltzmann machine neural network parameter;
(6) new facial image is carried out gesture recognition.
2. the dimension reduction method that utilizes according to claim 1 carries out the human face posture estimation approach, it is characterized in that, described step (1) specifically comprises:
1) for each facial image training sample, at first its be scaled high for h pixel, widely be the image of w pixel;
2) facial image behind this convergent-divergent is transformed to gray level image;
3) gray-scale value of all pixels is normalized to [0,1];
4) they are pulled into the vector that length is h * w, dimension that promptly should vector is h * w.
3. the dimension reduction method that utilizes according to claim 2 carries out the human face posture estimation approach, it is characterized in that, described step (2) specifically comprises:
1) pretreated data is carried out the principal component analysis (PCA) operation, remove redundant information and noise, keep 98% quantity of information;
2) all data are all dropped to the s dimension from h * w, obtain average vector x and proper vector P simultaneously, physical relationship is as follows: X=x+Pb, the data b behind the dimensionality reduction is expressed as: b=P T(X-x), wherein X is the vector data of original h * w dimension, and b is the s dimension data behind the dimensionality reduction.
4. the dimension reduction method that utilizes according to claim 1 carries out the human face posture estimation approach, it is characterized in that, described step (3) specifically comprises:
1) sets this neural network the L layer is arranged; The nodal point number of each layer is respectively N1, N2 ..., NL; The classification number is C; The number of times of pre-training and adjustment parameter is respectively Pt and Pc;
2) determine network structure according to the number of the network number of plies and every layer of node, the random number between the generation [0,1] is as connecting weights between the network node simultaneously.
5. the dimension reduction method that utilizes according to claim 3 carries out the human face posture estimation approach, it is characterized in that, described step (4) specifically comprises:
1) s value in the vector is corresponding one by one in the node that the visual layers of the limited Boltzmann machine of ground floor in the network is set and the step (2), the nodal point number that is the visual layers of the limited Boltzmann machine of ground floor is s, train the weighting parameter between this limited Boltzmann machine visual layers node and the hidden layer node, train Pt time altogether;
2) again with the hidden layer node of the limited Boltzmann machine of ground floor as the limited Boltzmann machine visual layers of second layer node, train the weighting parameter between this limited Boltzmann machine visual layers node and the hidden layer node equally, also train Pt time;
3) the rest may be inferred, the hidden layer node conduct that is the limited Boltzmann machine of last layer descends the visual layers node of the limited Boltzmann machine of one deck to train down the limited Boltzmann machine of one deck, finally finish the pre-training of whole network, also obtained simultaneously the parameter of the limited Boltzmann machine of each layer that train in advance.
6. the dimension reduction method that utilizes according to claim 1 carries out the human face posture estimation approach, it is characterized in that, described step (5) is meant: to utilize the gradient descent method be the criterion backpropagation with the reconstruction error minimum and adjust weighting parameter between all limited Boltzmann machine visual layers nodes and the hidden layer node, and this step is carried out Pc time altogether.
7. the dimension reduction method that utilizes according to claim 1 carries out the human face posture estimation approach, it is characterized in that, described step (6) is meant: for the facial image of the new pending gesture recognition of a width of cloth, earlier it is zoomed to the high h that is, wide is the image of w, and is the image transformation of this convergent-divergent gray level image, then the gradation of image value is normalized to [0,1], then the image after the normalization is pulled into the vectorial X that a length is h * w, and utilize b=P T(X-x) this vectorial dimension is dropped to s, at last the vector of this s dimension is sent into the network that has trained and carry out gesture recognition.
CNB2007100380763A 2007-03-15 2007-03-15 Method for making human face posture estimation utilizing dimension reduction method Expired - Fee Related CN100492399C (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CNB2007100380763A CN100492399C (en) 2007-03-15 2007-03-15 Method for making human face posture estimation utilizing dimension reduction method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CNB2007100380763A CN100492399C (en) 2007-03-15 2007-03-15 Method for making human face posture estimation utilizing dimension reduction method

Publications (2)

Publication Number Publication Date
CN101021900A CN101021900A (en) 2007-08-22
CN100492399C true CN100492399C (en) 2009-05-27

Family

ID=38709659

Family Applications (1)

Application Number Title Priority Date Filing Date
CNB2007100380763A Expired - Fee Related CN100492399C (en) 2007-03-15 2007-03-15 Method for making human face posture estimation utilizing dimension reduction method

Country Status (1)

Country Link
CN (1) CN100492399C (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107103266A (en) * 2016-02-23 2017-08-29 中国科学院声学研究所 The training of two-dimension human face fraud detection grader and face fraud detection method

Families Citing this family (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101908139A (en) * 2010-07-15 2010-12-08 华中科技大学 Method for supervising learning activities of learning machine user
CN103530651A (en) * 2013-10-22 2014-01-22 东南大学 Head attitude estimation method based on label distribution
CN104657776B (en) * 2013-11-22 2018-06-05 华为技术有限公司 Nerve network system, method for analyzing image and device based on nerve network system
CN103605972B (en) * 2013-12-10 2017-02-15 康江科技(北京)有限责任公司 Non-restricted environment face verification method based on block depth neural network
CN104063720A (en) * 2014-07-03 2014-09-24 浙江大学 Method for detecting images of prohibited commodities of e-commerce websites based on deep Boltzmann machine
CN104363981B (en) * 2014-07-14 2018-06-05 中国科学院自动化研究所 Face verification method and system
CN104899921B (en) * 2015-06-04 2017-12-22 杭州电子科技大学 Single-view videos human body attitude restoration methods based on multi-modal own coding model
CN104992417B (en) * 2015-06-30 2017-12-19 上海交通大学 Face video sight modification method and system based on Kinect
CN105206270B (en) * 2015-08-20 2019-04-02 长安大学 A kind of isolated digit speech recognition categorizing system and method combining PCA and RBM
CN105740688B (en) * 2016-02-01 2021-04-09 腾讯科技(深圳)有限公司 Unlocking method and device
CN106156727B (en) * 2016-06-24 2019-07-09 厦门中控智慧信息技术有限公司 A kind of recognition methods and terminal of biological characteristic
CN106295584A (en) * 2016-08-16 2017-01-04 深圳云天励飞技术有限公司 Depth migration study is in the recognition methods of crowd's attribute
CN108229246A (en) * 2016-12-14 2018-06-29 上海交通大学 Real-time three-dimensional human face posture method for tracing based on vehicle computing machine platform
CN106650825B (en) * 2016-12-31 2020-05-12 中国科学技术大学 Motor vehicle exhaust emission data fusion system
CN106611090B (en) * 2016-12-31 2018-04-10 中国科学技术大学 A kind of road side air pollutant concentration Forecasting Methodology based on reconstruct deep learning
CN108449295A (en) * 2018-02-05 2018-08-24 西安电子科技大学昆山创新研究院 Combined modulation recognition methods based on RBM networks and BP neural network
CN109858433B (en) * 2019-01-28 2020-06-30 四川大学 Method and device for identifying two-dimensional face picture based on three-dimensional face model
CN111404911B (en) * 2020-03-11 2022-10-14 国网新疆电力有限公司电力科学研究院 Network attack detection method and device and electronic equipment
CN111401230B (en) * 2020-03-13 2023-11-28 深圳市商汤科技有限公司 Gesture estimation method and device, electronic equipment and storage medium

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107103266A (en) * 2016-02-23 2017-08-29 中国科学院声学研究所 The training of two-dimension human face fraud detection grader and face fraud detection method
CN107103266B (en) * 2016-02-23 2019-08-20 中国科学院声学研究所 The training of two-dimension human face fraud detection classifier and face fraud detection method

Also Published As

Publication number Publication date
CN101021900A (en) 2007-08-22

Similar Documents

Publication Publication Date Title
CN100492399C (en) Method for making human face posture estimation utilizing dimension reduction method
CN108615010B (en) Facial expression recognition method based on parallel convolution neural network feature map fusion
CN107657279B (en) Remote sensing target detection method based on small amount of samples
CN111582059B (en) Face expression recognition method based on variational self-encoder
CN110321833B (en) Human body behavior identification method based on convolutional neural network and cyclic neural network
CN107122375B (en) Image subject identification method based on image features
CN109583322B (en) Face recognition deep network training method and system
Koetsier et al. Kernel maximum likelihood hebbian learning
CN100561500C (en) Human face posture recognition methods based on limited Boltzmann machine neural network
CN107784288B (en) Iterative positioning type face detection method based on deep neural network
CN112528928B (en) Commodity identification method based on self-attention depth network
CN107767416B (en) Method for identifying pedestrian orientation in low-resolution image
CN107169117B (en) Hand-drawn human motion retrieval method based on automatic encoder and DTW
CN104537647A (en) Target detection method and device
CN103020582A (en) Method for computer to identify vehicle type by video image
CN106709528A (en) Method and device of vehicle reidentification based on multiple objective function deep learning
CN108520213A (en) A kind of face beauty prediction technique based on multiple dimensioned depth
CN115966010A (en) Expression recognition method based on attention and multi-scale feature fusion
CN110135327A (en) A kind of driving behavior recognition methods based on multi-region feature learning model
CN108256473A (en) Methods, devices and systems, computer readable storage medium for data amplification
CN110210380A (en) The analysis method of personality is generated based on Expression Recognition and psychology test
CN114492634B (en) Fine granularity equipment picture classification and identification method and system
CN110598737B (en) Online learning method, device, equipment and medium of deep learning model
CN110334747A (en) Based on the image-recognizing method and application for improving convolutional neural networks
CN110991554A (en) Improved PCA (principal component analysis) -based deep network image classification method

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
C17 Cessation of patent right
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20090527

Termination date: 20120315