CN110298225A - A method of blocking the human face five-sense-organ positioning under environment - Google Patents
A method of blocking the human face five-sense-organ positioning under environment Download PDFInfo
- Publication number
- CN110298225A CN110298225A CN201910242013.2A CN201910242013A CN110298225A CN 110298225 A CN110298225 A CN 110298225A CN 201910242013 A CN201910242013 A CN 201910242013A CN 110298225 A CN110298225 A CN 110298225A
- Authority
- CN
- China
- Prior art keywords
- model
- face
- sense
- human face
- organ
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V40/00—Recognition of biometric, human-related or animal-related patterns in image or video data
- G06V40/10—Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
- G06V40/16—Human faces, e.g. facial parts, sketches or expressions
- G06V40/161—Detection; Localisation; Normalisation
Landscapes
- Engineering & Computer Science (AREA)
- Health & Medical Sciences (AREA)
- General Health & Medical Sciences (AREA)
- Oral & Maxillofacial Surgery (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Multimedia (AREA)
- Theoretical Computer Science (AREA)
- Image Analysis (AREA)
Abstract
The human face five-sense-organ location algorithm that the invention discloses a kind of in the case where blocking environment, belongs to field of image processing.Its overall step are as follows: S1: data set is simultaneously divided into training set and test set two parts by building data set;S2: being normalized data set and carries out data set expansion to it;S3: building human face five-sense-organ point occlusion detection model;S4: training face facial feature localization coarse localization model and the rough position for predicting human face five-sense-organ point;S5: the target position of the result of fusion coarse localization model and occlusion detection model creation finely positioning model;S6: training face facial feature localization finely positioning model simultaneously predicts finely positioning stage position.The present invention proposes a kind of human face five-sense-organ location algorithm, solves the problems, such as currently to can be applied to block under the human face five-sense-organ positioning scene in more serious situation to block environment human face facial feature localization algorithm inaccurate.
Description
Technical field
The present invention relates to technical field of image processing, and in particular to a kind of human face five-sense-organ location algorithm blocked under environment.
Background technique
Human face five-sense-organ positioning or face alignment (such as eyes, nose, mouth and chin) are for such as face recognition, face
The task of portion's tracking, FA Facial Animation and 3 dimension D facial modellings etc is essential.With current personal and network photo quick-fried
Fried formula increases, and needs a kind of full-automatic, highly efficient and robust facial alignment schemes.For working as front in free environment
Method, these requirements are still challenging, and due to facial appearance, illumination and partial occlusion will lead to many face points position very
Difficulty is fixed.
The target of human face five-sense-organ positioning is to accurately position some face points.Accurately human face five-sense-organ positioning is for very
It is all a very crucial step: such as face verification, facial Expression Analysis, motion analysis for more visual tasks.These tasks are all
Need facial feature localization as its pretreated step.From the perspective of entirety, human face five-sense-organ positioning can be expressed as in face
The problem of predefined face points (also referred to as face shape) is searched on portion's image, usually since rough original shape,
And gradually estimate that accurately shape is until convergence.In search process, usually using two different information sources: facial appearance
And shape information.The latter is intended to the spatial relationship between the clearly position of simulation of facial point, to ensure that the face points estimated can
To form effective face shape.Some methods do not use shape information clearly, but usually combine both information sources
Get up.Although many methods all achieve relatively good achievement in public data collection, for some rugged environments,
Such as block and compare under serious situation, for example wear dark sunglasses or in the case of face blocked by other barriers, this
The performance of a little methods is not good enough, is primarily due to local minimum problem.It thus must be to the circumstance of occlusion of each face point position
Differentiated, then carries out unrelated crucial point location using this some of supplemental information.
The more popular method of academic circles at present is building Gradual regression analysis model.Local binarization characteristic model is using at random
Forest extracts the binaryzation feature near picture face point, and each stage is then completed using support vector regression model
Recurrence task;The block message that conditional regression tree-model extracts around human face five-sense-organ point using random forest is put into support vector and returns
Return and obtains predicted value in model;Some think that the secondary attribute of face can help model prediction face point position, and propose
Dynamic is early to stop tactful (early stopping) to solve the overfitting problem of secondary attribute study, to improve the essence of model
Exactness;The all not explicit solution provided under circumstance of occlusion of the above method, thus under large area circumstance of occlusion
Human face five-sense-organ positioning performance is not good enough.
Summary of the invention
Present invention aims at at present in a natural environment since partial occlusion causes face key point location inaccurate
True problem provides a kind of human face five-sense-organ location algorithm for introducing occlusion detection module, mainly comprises the steps that
Step 1: data prediction: reading the accurate position of face of facial image and the face of correspondence image point in training set
It sets, facial image is converted into gray level image, and whitening processing is carried out to facial image, the facial image after being normalized,
Subtract mean value divided by standard deviation, as shown in formula one.
Wherein IiWhat is indicated is i-th image for inputting facial image, what μ and σ divided that table indicates be training set mean value and
Standard deviation, IRiWhat is indicated is i-th image after normalized.
Step 2: data set expands: carrying out data set expansion to the training set that public data is concentrated.By reversion, translate,
Facial image in training set is extended for original 20 times by rotation process, and data set, which expands in step, will include to face
Amplification and reduce, since it is considered that actual conditions human face it is shared in picture ratio it is different caused by result difference.
Step 3: seeking average shape: utilizing the face point position of the training set after the expansion got in step 2
Solve average value, the original shape S as face point initial alignment0, and by S0Be converted to S0,i, i.e., every face figure in training set
In the coordinate system of picture.
Step 4: building occlusion detection model: creation is based on occlusion detection more of mobile network's model (MobileNet)
Business regression model, the training set constructed using in step 2 export occlusion state vector, the often one-dimensional of vector is face as input
The state being blocked at some face point position in image, as shown in formula two.
Wherein j=0, what 1 ..., J-1, J were indicated is the number of human face five-sense-organ key point.
Step 5: building coarse localization model: being based on convolutional neural networks model (Convolutional Neural
Networks) building human face five-sense-organ positions regression model, and for i-th facial image, the target of model prediction is actual position
Information Sgt,iWith S0,iBetween discrepancy delta Sr,i, as shown in formula three.
ΔSr,i=Sgt,i-S0,iFormula three
Step 6: generate be accurately positioned simulated target position: the human face five-sense-organ point occlusion state that combining step four obtains to
Human face five-sense-organ point rough position obtained in amount and step 5, obtains modified finely positioning phase targets position, such as formula four
It is shown.
What wherein weight was indicated is the amplitude for blocking scaling at a position,What is indicated is the occlusion state of prediction, Δ
Sr,i,jThat indicate is the difference between accurate positioning and coarse localization, Sp,i,jWhat is indicated is the target position in finely positioning stage.
Step 7: building finely positioning model: being based on convolutional neural networks model (Convolutional Neural
Networks finely positioning stage regression model) is constructed, the target position in training is the target position solved in step 6,
In building model process in view of influence of the magnitude of inclination to positioning result of face, thus model additional studies face is rectified
Positive matrix coefficient.The facial image of input is can to further decrease the fitting of model in this way through the image after overcorrection
Difficulty obtains the output result in finely positioning stage by anti-corrective operations after models fitting.
Step 8: obtaining final result of the invention, formula to the anti-scaling of finely positioning prediction result that step 7 obtains
As shown in formula five.
WhereinIndicate be prediction finely positioning as a result, eps indicate be a very little positive number, in order to prevent
Except 0 phenomenon, and So,i,jWhat is indicated is the final prediction result of algorithm.
The beneficial effects of the invention are as follows the deep learning models used stage by stage, under occlusion on facial image
The specific location of face point is predicted.It is pre- for the face point position for blocking place in order to make up conventional face's facial feature localization model
Indeterminable disadvantage, algorithm predict whether each face point is blocked by occlusion detection, mend for face key point position prediction
Additional information has been filled, model accuracy is further increased.
Detailed description of the invention
Fig. 1 is a kind of flow chart of the method for human face five-sense-organ positioning blocked under environment provided by the invention
Fig. 2 is algorithm input gray level facial image
Fig. 3 is mean value picture (a) and standard deviation picture (b) in training set facial image
Fig. 4 is coarse localization result figure
Fig. 5 is the coarse localization result figure for adding block information
Fig. 6 is algorithm final effect figure
Specific embodiment
Here is that the present invention is further illustrated in conjunction with the accompanying drawings and embodiments.
The purpose of the present embodiment is that the position of each face key point of a width facial image is predicted under occlusion
Out, following steps are specifically included, general process is as shown in Figure 1:
Step 1: data prediction: reading the accurate position of face of facial image and the face of correspondence image point in training set
Set, facial image be converted into gray level image, as shown in Fig. 2, and to facial image carry out whitening processing, after being normalized
Facial image, the facial image after being normalized subtract mean value divided by standard deviation, the mean value image and standard difference image of image
As shown in Figure 3.
Step 2: data set expands: carrying out data set expansion to the training set that public data is concentrated.
Step 3: seeking average shape: utilizing the face point position of the training set after the expansion got in step 2
Solve average value, the original shape S as face point initial alignment0, and by S0Be converted to S0,i, i.e., every face figure in training set
In the coordinate system of picture.
Step 4: building occlusion detection model: creation is based on occlusion detection more of mobile network's model (MobileNet)
Business regression model, the training set constructed using in step 2 export occlusion state vector, the often one-dimensional of vector is face as input
The state being blocked at some face point position in image, as shown in formula one.
Wherein j=0, what 1 ..., J-1, J were indicated is the number of human face five-sense-organ key point.
Step 5: building coarse localization model: being based on convolutional neural networks model (Convolutional Neural
Networks) building human face five-sense-organ positions regression model, and for i-th facial image, the target of model prediction is actual position
Information Sgt,iWith S0,iBetween discrepancy delta Sr,i, as shown in formula two, the human face five-sense-organ position result of coarse localization model prediction
As shown in figure 4, the occlusion state information in additional step four, as shown in figure 5, figure orbicular spot is the unshielding point of prediction, and is pitched
Point prediction is to block a little.
ΔSr,i=Sgt,i-S0,iFormula two
Step 6: generate be accurately positioned simulated target position: the human face five-sense-organ point occlusion state that combining step four obtains to
Human face five-sense-organ point rough position obtained in amount and step 5, obtains modified finely positioning phase targets position, such as formula three
It is shown,
What wherein weight was indicated is the amplitude for blocking scaling at a position,What is indicated is the occlusion state of prediction, Δ
Sr,i,jThat indicate is the difference between accurate positioning and coarse localization, Sp,i,jWhat is indicated is the target position in finely positioning stage.
Step 7: building finely positioning model: being based on convolutional neural networks model (Convolutional Neural
Networks finely positioning stage regression model) is constructed, the target position in training is the target position solved in step 6,
In building model process in view of influence of the magnitude of inclination to positioning result of face, thus model additional studies face is rectified
Positive matrix coefficient.The facial image of input is can to further decrease the fitting of model in this way through the image after overcorrection
Difficulty obtains the output result in finely positioning stage by anti-corrective operations after models fitting.
Step 8: obtaining final result of the invention, formula to the anti-scaling of finely positioning prediction result that step 6 obtains
As shown in formula four.
WhereinIndicate be prediction finely positioning as a result, eps indicate be a very little positive number, in order to prevent
Except 0 phenomenon, and So,i,jWhat is indicated is the final prediction result of algorithm, as shown in Figure 6.
Above embodiment is not limitation of the present invention, and the present invention is also not limited to the example above, this technology neck
The variations, modifications, additions or substitutions that the technical staff in domain is made within the scope of technical solution of the present invention, also belong to this hair
Bright protection scope.
Claims (3)
1. a kind of method for the human face five-sense-organ positioning blocked under environment, it is characterised in that the overall step of the method are as follows:
Step 1: data prediction: the face exact position of facial image and the face of correspondence image point in training set is read, it will
Facial image is converted to gray level image, and carries out whitening processing to facial image, the facial image after being normalized;
Step 2: data set expands: carrying out data set expansion to the training set that public data is concentrated;
Step 3: seeking average shape: being solved using the face point position of the training set after the expansion got in step 2
Average value, the original shape S as face point initial alignment0, and by S0Be converted to S0,i, i.e., every facial image in training set
In coordinate system;
Step 4: building occlusion detection model: creation is returned based on the occlusion detection multitask of mobile network's model (MobileNet)
Return model, the training set constructed using in step 2 exports occlusion state vector, the often one-dimensional of vector is facial image as input
In the state that is blocked at some face point position, as shown in formula one;
Wherein j=0, what 1 ..., J-1, J were indicated is the number of human face five-sense-organ key point;
Step 5: building coarse localization model: being based on convolutional neural networks model (Convolutional Neural
Networks) building human face five-sense-organ positions regression model, and for i-th facial image, the target of model prediction is actual position
Information Sgt,iWith S0,iBetween discrepancy delta Sr,i, as shown in formula two;
ΔSr,i=Sgt,i-S0,iFormula two
Simulated target position: the human face five-sense-organ point occlusion state vector sum that combining step four obtains is accurately positioned Step 6: generating
The point rough position of human face five-sense-organ obtained in step 5, obtains modified finely positioning phase targets position, as shown in formula three;
What wherein weight was indicated is the amplitude for blocking scaling at a position,That indicate is the occlusion state of prediction, Δ Sr,i,j
That indicate is the difference between accurate positioning and coarse localization, Sp,i,jWhat is indicated is the target position in finely positioning stage;
Step 7: building finely positioning model: being based on convolutional neural networks model (Convolutional Neural
Networks finely positioning stage regression model) is constructed, the target position in training is the target position solved in step 6,
In building model process in view of influence of the magnitude of inclination to positioning result of face, thus model additional studies face is rectified
Positive matrix coefficient.The facial image of input is can to further decrease the fitting of model in this way through the image after overcorrection
Difficulty obtains the output result in finely positioning stage by anti-corrective operations after models fitting;
Step 8: obtaining final result of the invention to the anti-scaling of finely positioning prediction result that step 6 obtains, formula is such as public
Shown in formula four;
WhereinIndicate be prediction finely positioning as a result, eps indicate be a very little positive number, in order to prevent except 0 is existing
As, and So,i,jWhat is indicated is the final prediction result of algorithm.
2. method as claimed in claim 1, it is characterised in that: occlusion detection model is based on general tight in the step 4
Gather type convolutional neural networks model creation, its object is to the arithmetic speeds of acceleration model;The target of model is returned for multitask
Return model, is capable of the ability of abundant mining model extraction feature.
3. method as claimed in claim 1, it is characterised in that: the side of scaling and anti-scaling in the step 5 and step 7
Method can evade encountered in human face five-sense-organ location algorithm because of the local minimum problem caused by occlusion issue, thus further
Model is improved in the prediction accuracy for blocking place.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910242013.2A CN110298225A (en) | 2019-03-28 | 2019-03-28 | A method of blocking the human face five-sense-organ positioning under environment |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910242013.2A CN110298225A (en) | 2019-03-28 | 2019-03-28 | A method of blocking the human face five-sense-organ positioning under environment |
Publications (1)
Publication Number | Publication Date |
---|---|
CN110298225A true CN110298225A (en) | 2019-10-01 |
Family
ID=68026536
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201910242013.2A Pending CN110298225A (en) | 2019-03-28 | 2019-03-28 | A method of blocking the human face five-sense-organ positioning under environment |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN110298225A (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110807402A (en) * | 2019-10-29 | 2020-02-18 | 深圳市梦网百科信息技术有限公司 | Facial features positioning method, system and terminal equipment based on skin color detection |
CN112069993A (en) * | 2020-09-04 | 2020-12-11 | 西安西图之光智能科技有限公司 | Dense face detection method and system based on facial features mask constraint and storage medium |
Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20110075934A1 (en) * | 2006-03-12 | 2011-03-31 | Salih Burak Gokturk | Techniques for enabling or establishing the use of face recognition algorithms |
CN102270308A (en) * | 2011-07-21 | 2011-12-07 | 武汉大学 | Facial feature location method based on five sense organs related AAM (Active Appearance Model) |
CN108229442A (en) * | 2018-02-07 | 2018-06-29 | 西南科技大学 | Face fast and stable detection method in image sequence based on MS-KCF |
CN108805040A (en) * | 2018-05-24 | 2018-11-13 | 复旦大学 | It is a kind of that face recognition algorithms are blocked based on piecemeal |
CN109002758A (en) * | 2018-06-06 | 2018-12-14 | 武汉理工大学 | Man face characteristic point positioning method, device, equipment and storage medium |
CN109086711A (en) * | 2018-07-27 | 2018-12-25 | 华南理工大学 | Facial Feature Analysis method, apparatus, computer equipment and storage medium |
CN110287760A (en) * | 2019-03-28 | 2019-09-27 | 电子科技大学 | A kind of human face five-sense-organ point occlusion detection method based on deep learning |
-
2019
- 2019-03-28 CN CN201910242013.2A patent/CN110298225A/en active Pending
Patent Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20110075934A1 (en) * | 2006-03-12 | 2011-03-31 | Salih Burak Gokturk | Techniques for enabling or establishing the use of face recognition algorithms |
CN102270308A (en) * | 2011-07-21 | 2011-12-07 | 武汉大学 | Facial feature location method based on five sense organs related AAM (Active Appearance Model) |
CN108229442A (en) * | 2018-02-07 | 2018-06-29 | 西南科技大学 | Face fast and stable detection method in image sequence based on MS-KCF |
CN108805040A (en) * | 2018-05-24 | 2018-11-13 | 复旦大学 | It is a kind of that face recognition algorithms are blocked based on piecemeal |
CN109002758A (en) * | 2018-06-06 | 2018-12-14 | 武汉理工大学 | Man face characteristic point positioning method, device, equipment and storage medium |
CN109086711A (en) * | 2018-07-27 | 2018-12-25 | 华南理工大学 | Facial Feature Analysis method, apparatus, computer equipment and storage medium |
CN110287760A (en) * | 2019-03-28 | 2019-09-27 | 电子科技大学 | A kind of human face five-sense-organ point occlusion detection method based on deep learning |
Non-Patent Citations (6)
Title |
---|
JIE ZHANG 等: "Occlusion-Free Face Alignment: Deep Regression Networks Coupled with De-Corrupt AutoEncoders", 《2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION》 * |
张金焕: "人脸检测及特征点定位技术研究", 《中国优秀博硕士学位论文全文数据库(硕士) 信息科技辑》 * |
李阳: "基于深度学习的有遮挡人脸五官定位算法研究", 《中国优秀博硕士学位论文全文数据库(硕士) 信息科技辑》 * |
王科俊等: "一种快速眼睛定位与人脸平面旋转校正方法", 《计算机辅助设计与图形学学报》 * |
赵莉: "人脸遮挡的判别与分析", 《中国优秀博硕士学位论文全文数据库(硕士) 信息科技辑》 * |
陆洋: "人脸遮挡和活性检测若干关键技术研究", 《中国优秀博硕士学位论文全文数据库(硕士) 信息科技辑》 * |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110807402A (en) * | 2019-10-29 | 2020-02-18 | 深圳市梦网百科信息技术有限公司 | Facial features positioning method, system and terminal equipment based on skin color detection |
CN110807402B (en) * | 2019-10-29 | 2023-08-08 | 深圳市梦网视讯有限公司 | Facial feature positioning method, system and terminal equipment based on skin color detection |
CN112069993A (en) * | 2020-09-04 | 2020-12-11 | 西安西图之光智能科技有限公司 | Dense face detection method and system based on facial features mask constraint and storage medium |
CN112069993B (en) * | 2020-09-04 | 2024-02-13 | 西安西图之光智能科技有限公司 | Dense face detection method and system based on five-sense organ mask constraint and storage medium |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Luo et al. | Aircraft target detection in remote sensing images based on improved YOLOv5 | |
CN107895160A (en) | Human face detection and tracing device and method | |
CN110399809A (en) | The face critical point detection method and device of multiple features fusion | |
CN105426882B (en) | The method of human eye is quickly positioned in a kind of facial image | |
CN105608710A (en) | Non-rigid face detection and tracking positioning method | |
Jin et al. | DWCA‐YOLOv5: An Improve Single Shot Detector for Safety Helmet Detection | |
CN110298225A (en) | A method of blocking the human face five-sense-organ positioning under environment | |
Kim et al. | Real-time facial feature extraction scheme using cascaded networks | |
Qiu et al. | Outdoor semantic segmentation for UGVs based on CNN and fully connected CRFs | |
Feng et al. | [Retracted] Research and Application of Multifeature Gesture Recognition in Human‐Computer Interaction Based on Virtual Reality Technology | |
Li et al. | BCMNet: Cross-layer extraction structure and multiscale downsampling network with bidirectional transpose FPN for fast detection of wildfire smoke | |
Liu et al. | Double Mask R‐CNN for Pedestrian Detection in a Crowd | |
Chowdhury et al. | Human face detection and recognition protection system based on machine learning algorithms with proposed AR technology | |
Li | [Retracted] Human Skeleton Detection and Extraction in Dance Video Based on PSO‐Enabled LSTM Neural Network | |
Liu et al. | Estimation of sunlight direction using 3D object models | |
Bong et al. | Face recognition and detection using haars features with template matching algorithm | |
Mao et al. | SC-YOLO: Provide Application-Level Recognition and Perception Capabilities for Smart City Industrial Cyber-Physical System | |
Li et al. | [Retracted] Human Sports Action and Ideological and PoliticalEvaluation by Lightweight Deep Learning Model | |
Sun et al. | Intelligent Site Detection Based on Improved YOLO Algorithm | |
Chen et al. | Image Illumination Enhancement for Construction Worker Pose Estimation in Low-light Conditions | |
Yu et al. | MKIoU loss: toward accurate oriented object detection in aerial images | |
Cai et al. | Realization of interactive animation creation based on artificial intelligence technology | |
Jiang et al. | Machine Vision and Big Data‐Driven Sports Athletes Action Training Intervention Model | |
Zhang et al. | A real-time personalized face modeling method for peking opera with depth vision device | |
Wang et al. | Earf‐YOLO: An Efficient Attention Receptive Field Model for Recognizing Symbols of Zhuang Minority Patterns |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
AD01 | Patent right deemed abandoned | ||
AD01 | Patent right deemed abandoned |
Effective date of abandoning: 20221101 |