CN105335756B

CN105335756B - A kind of image classification method and image classification system based on Robust Learning model

Info

Publication number: CN105335756B
Application number: CN201510726581.1A
Authority: CN
Inventors: 张召; 江威明; 李凡长; 张莉
Original assignee: Suzhou University
Current assignee: Suzhou University
Priority date: 2015-10-30
Filing date: 2015-10-30
Publication date: 2019-06-11
Anticipated expiration: 2035-10-30
Also published as: CN105335756A

Abstract

The invention discloses a kind of image classification method and image classification system based on Robust Learning model, it include: to be initialized to training set, initial category label matrix is obtained, the training sample in training set includes its known classification and demarcates the sample for having class label corresponding with its classification and its unknown classification and the sample for not demarcating class label；The building method that power is defined and reconstructed based on neighbour handles training sample, according to the similitude between sample, constructs reconstruction coefficients matrix, and carry out symmetrization, normalized；The soft label without calibration sample is determined using reconstruction coefficients matrix and initial category label matrix, and l is carried out to the soft label of training sample by the way of iteration_2,1Norm regularization obtains projection matrix and soft label matrix；Sample to be tested is mapped using projection matrix, obtains its soft label；Sample to be tested is its unknown classification and the sample for not demarcating classification.Improve classification accuracy.

Description

Image classification method and image classification system based on robust learning model

Technical Field

The invention relates to the technical field of pattern recognition and data mining, in particular to an image classification method and an image classification system based on a robust learning model.

Background

With the continuous development of computer technology and intelligence, image classification technology has developed into one of the most important research subjects in the fields of data mining, machine learning, and the like. The image classification technology is mainly used for judging the classification of unknown data, and has great significance in the fields of medical data analysis, text, webpage, credit card rating and the like, so that great social and economic benefits can be brought by putting the accurate classification technology into use. Multiple studies prove that the performance of the supervised learning method is obviously superior to that of the unsupervised learning method, but in the real world, the supervised data used for the supervised learning method is often difficult to obtain, and the class information of the supervised data obtained in a manual calibration mode consumes a large amount of time and manpower, so that the practicability of the supervised learning method is greatly reduced. Therefore, semi-supervised learning based on similarity graph construction has been developed as one of practical and popular classification tools due to its practicality and classification accuracy, and semi-supervised learning is mainly performed by calibrating the class of a small amount of data in each class of a large amount of data, and then spreading the class of data to unknown classes through similarity graphs, thereby predicting the class of data of the unknown classes.

In recent years, a label propagation method based on a label propagation theory in semi-supervised learning is one of typical representatives of semi-supervised learning due to the advantages of simplicity, effectiveness and rapidness. The label propagation method is used for propagating the category information of supervised labeled samples (i.e. samples with known categories) to unlabeled samples (i.e. samples with unknown categories) by learning the similarity between the samples, and then estimating the category information of the unlabeled samples. At present, most label propagation methods adopt soft labels as classification results of unlabeled samples, however, the inventor finds that in the label propagation method adopting the soft labels as the classification results of the unlabeled samples, original spaces corresponding to the labeled samples often contain mixed signals, and the mixed signals can cause adverse effects on the classification of the unlabeled samples, thereby causing the inaccurate classification results of the unlabeled samples.

In summary, there is a problem in the prior art that the accuracy of the classification result of the unlabeled sample is low due to the influence of the mixed signal of the original space.

Disclosure of Invention

The invention aims to provide an image classification method and an image classification system based on a robust learning model, and aims to solve the problem that the classification result accuracy of a label-free sample is low due to the influence of a mixed signal of an original space in the prior art.

In order to achieve the above purpose, the invention provides the following technical scheme:

an image classification method based on a robust learning model comprises the following steps:

initializing a pre-acquired training set to obtain an initial class label matrix, wherein the training set comprises a preset amount of training samples, and the training samples comprise samples of which the classes are known and are marked with class labels corresponding to the classes, and samples of which the classes are unknown and are not marked with class labels;

processing the training sample based on a construction method of neighbor definition and reconstruction weight to obtain a similarity measurement matrix corresponding to the training sample, and presetting the similarity measurement matrix to obtain a reconstruction coefficient matrix;

determining the soft label of the training sample without calibrating the class label through an effective balance manifold smooth item and a label fitting item based on the reconstruction coefficient matrix and the initial class label matrix, and carrying out l on the soft label of the training sample in an iteration mode_2，1Normalizing the norm to obtain a projection matrix and a soft label matrix corresponding to the soft label of the training sample;

mapping a sample to be detected by using the projection matrix to obtain a soft label of the sample to be detected; the sample to be detected is a sample of which the category is unknown and the category label is not calibrated.

Preferably, the constructing method based on neighbor definition and reconstruction weight processes the training sample to obtain a similarity metric matrix corresponding to the training sample, and includes:

processing each training sample by using a K nearest neighbor algorithm to obtain K nearest neighbor samples of each training sample, wherein K is a positive integer;

and acquiring a similarity measurement matrix corresponding to the training sample by using the training sample and K nearest neighbor samples of each training sample by adopting an LLE-reconstruction weight construction method.

Preferably, the presetting the similarity metric matrix to obtain a reconstruction coefficient matrix includes:

and carrying out normalization processing and symmetry processing on the similarity measurement matrix to obtain the reconstruction coefficient matrix.

Preferably, the mapping the sample to be tested by using the projection matrix to obtain the soft label of the sample to be tested includes:

obtaining a target to be testedSample x_new；

By P^Tx_newThe sample x to be tested is processed_newEmbedding the prediction vector into a projection matrix P to obtain a prediction vector;

determining that the soft label corresponding to the element with the maximum probability in the prediction vector is the sample x to be detected_newThe soft tag of (1).

Preferably, the initializing a pre-acquired training set to obtain an initial class label matrix includes:

obtaining an initialization matrix with Y₀＝[y₁,y₂,…,y_l+u]Representing the training setIt is shown that, among others,representing the matrix space of n times l + u, X_LFor samples labeled with class labels, X_L＝[x₁,x₂,...,x_l]，X_UFor samples not labeled with class labels, X_U＝[x_l+1,x_l+2,...,x_l+u]N, l and u are positive integers;

determining any y in the initialization matrix_iIs a column vector, any column vector corresponds to the ith training sample x in the training set_i，i＝1,2,...,l+u；

Training sample x with any calibration class label_jIf the training sample x_jBelongs to the ith category, then y is determined_i，jIf the training sample x is 1_jIf it does not belong to the i-th category, y is determined_i，j0; for any training sample x not marked with class label_jDetermining y_i，jAn initial class label matrix Y is obtained, where j is 1, 2.

obtaining the similarity metric matrix using the following formula:

wherein,representing a training sample x_iK nearest neighbor samples, q corresponds to K,for the current training sample x_iThe reconstructed coefficient vector of (2) characterizing its neighborsOn reconstruction of training sample x_iThe degree of contribution of the time-varying,representing the corresponding similarity reconstruction coefficient matrix.

Preferably, the soft label of the training sample is subjected to l in an iterative manner_2，1And normalizing the norm to obtain a projection matrix and a soft label matrix corresponding to the soft label of the training sample, wherein the soft label matrix comprises the following steps:

the projection matrix and the soft label matrix are obtained by the following formulas:

subj f_i≥0,e^Tf_i＝1 for i＝1,2,...,l+u

wherein F represents a soft label matrix, P represents a projection matrix, W represents a reconstruction coefficient matrix, D represents a diagonal matrix, and D represents_ii＝∑_jW_i,j，fⁱTo representThe ith row vector of (2), X^TP-F^TRepresenting a regression residue term that is used to measure X^TP and Soft tag F^TThe degree of difference of (a); mu.s_iα, γ is the corresponding trade-off parameter;

or, the projection matrix and the soft label matrix are obtained by the following formulas:

subjF≥0,e^TF＝e^T

wherein, omega ═ I-W^T-W+W^TW and I are unit matrixes, and only diagonal elements of the unit matrixes are nonzero and are 1; u denotes a diagonal matrix with the ith diagonal element μ_iFor labeled training samples, μ_iSet to + ∞, for training samples not labeled with class labels, μ_iIs set to 0.

An image classification system comprising:

the system comprises a preprocessing module, a classification module and a classification module, wherein the preprocessing module is used for initializing a training set acquired in advance to obtain an initial classification label matrix, the training set comprises a preset amount of training samples, and the training samples comprise samples of which the classification is known and the classification labels corresponding to the classification are calibrated and samples of which the classification is unknown and the classification labels are not calibrated; the construction method is used for processing the training sample based on neighbor definition and reconstruction weight to obtain a similar measurement matrix corresponding to the training sample, and the similar measurement matrix is subjected to preset processing to obtain a reconstruction coefficient matrix;

a training module, configured to determine a soft label of a training sample without a class label by effectively balancing a manifold smoothing term and a label fitting term based on the reconstruction coefficient matrix and the initial class label matrix, and perform l on the soft label of the training sample in an iterative manner_2，1Normalizing the norm to obtain a projection matrix and a soft label matrix corresponding to the soft label of the training sample;

the prediction module is used for mapping a sample to be detected by using the projection matrix to obtain a soft label of the sample to be detected; and the sample to be detected is a sample without a calibrated class label.

Preferably, the preprocessing module comprises:

the K nearest neighbor unit is used for processing each training sample by utilizing a K nearest neighbor algorithm to obtain K nearest neighbor samples of each training sample, and K is a positive integer;

and the LLE-reconstruction weight unit is used for acquiring a similarity measurement matrix corresponding to the training samples by using the K nearest neighbor samples of each training sample by adopting an LLE-reconstruction weight construction method.

Preferably, the preprocessing module comprises:

and the preset processing unit is used for carrying out normalization processing and symmetry processing on the similarity measurement matrix to obtain the reconstruction coefficient matrix.

The invention provides an image classification method and an image classification system based on a robust learning model, which comprise the following steps: initializing a pre-acquired training set to obtain an initial class label matrix, wherein the training set comprises a preset amount of training samples, and the training samples comprise samples of which the classes are known and are marked with class labels corresponding to the classes, and samples of which the classes are unknown and are not marked with class labels; processing the training sample based on a construction method of neighbor definition and reconstruction weight to obtain a similarity measurement matrix corresponding to the training sample, and presetting the similarity measurement matrix to obtain a reconstruction coefficient matrix; determining the soft label of the training sample without calibrating the class label through an effective balance manifold smooth item and a label fitting item based on the reconstruction coefficient matrix and the initial class label matrix, and carrying out l on the soft label of the training sample in an iteration mode_2，1Normalizing the norm to obtain a projection matrix and a soft label matrix corresponding to the soft label of the training sample; mapping a sample to be detected by using the projection matrix to obtain a soft label of the sample to be detected; the sample to be detected is a sample of which the category is unknown and the category label is not calibrated.

Compared with the prior art, the method comprises the steps of firstly utilizing training samples marked with class labels and training samples not marked with class labels to jointly construct a similarity measurement matrix, further obtaining a reconstruction coefficient matrix, initializing to obtain an initial class label matrix, utilizing the initial class label matrix and the reconstruction coefficient matrix to determine soft labels of the training samples not marked with the class labels, and carrying out l on the soft labels of the training samples_2,1Norm regularization effectively reduces the influence of mixed signals in an original space formed by the soft labels on the accuracy of the classification result; by being based on l_2,1The robustness of regression residual measurement between the embedded features and the soft labels is effectively enhanced by the norm regularization measurement; by calculating on the basis of l_2,1And the mapping of the metric of norm regularization enhances the descriptive property of the extracted features. In summary, by introducing l_2,1Norm regularization technology effectively promotes the system to train samplesThe robustness of mixed signals such as medium noise, heterogeneous data and the like avoids the occurrence of the condition that the accuracy of the classification result of the label-free sample is lower due to the influence of the mixed signal of the original space in the background technology, namely, the influence of the mixed signal of the original space is reduced, the accuracy of the classification result of the sample to be detected is improved, and the classification performance is enhanced. In addition, this application is the model of promoting outside the sample, and the conclusion and the prediction of the outer data of sample need not to introduce extra reconstruction process, and the expansibility can be good for the high efficiency completion sample of embedding.

Drawings

In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, it is obvious that the drawings in the following description are only embodiments of the present invention, and for those skilled in the art, other drawings can be obtained according to the provided drawings without creative efforts.

Fig. 1 is a flowchart of an image classification method based on a robust learning model according to an embodiment of the present invention;

fig. 2 is a schematic structural diagram of an image classification system according to an embodiment of the present invention;

fig. 3 is a schematic diagram of label prediction for obtaining classes of human face image test samples in an image classification method and an image classification system based on a robust learning model according to an embodiment of the present invention.

Detailed Description

The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.

Referring to fig. 1, a flowchart of an image classification method based on a robust learning model according to an embodiment of the present invention is shown, which may include the following steps:

s11: initializing a pre-acquired training set to obtain an initial class label matrix, wherein the training set comprises a preset amount of training samples, and the training samples comprise samples of which the classes are known and are marked with class labels corresponding to the classes and samples of which the classes are unknown and are not marked with the class labels.

The number of training samples in the training set can be determined according to actual needs. The class label of the training sample corresponds to the class of the training sample, the class label of the training sample with unknown class is not calibrated, and the class label of the training sample with known class is calibrated by using the class label corresponding to the class.

S12: processing the training sample based on the construction method of neighbor definition and reconstruction weight to obtain a similar measurement matrix corresponding to the training sample, and presetting the similar measurement matrix to obtain a reconstruction coefficient matrix.

It should be noted that, step S11 and step S12 may be completed simultaneously, or S11 may be completed before S12 is completed, or S12 may be completed before S11 is completed, which may be determined according to actual needs.

S13: determining the soft label of the training sample without calibrating the class label through effectively balancing the manifold smooth item and the label fitting item based on the reconstruction coefficient matrix and the initial class label matrix, and carrying out l on the soft label of the training sample in an iterative mode_2，1And normalizing the norm to obtain a projection matrix and a soft label matrix corresponding to the soft label of the training sample.

Specifically, the manifold smoothing item is determined based on the reconstruction coefficient matrix, the label fitting item is determined based on the initial category label matrix, and then the soft label corresponding to the manifold smoothing item and the label fitting item is determined.

The soft label of each training sample is also associated with its class, and the soft label matrix is determined based on the soft labels of each training sample. The classes of all training samples can be obtained through the obtained soft label matrix.

And l_2，1Norm regularization is an important means in machine learning, and is actually an optimal process for solving a cost function in the learning process of a support vector machine, and the norm is added into the cost function, so that the learning result meets sparseness (sparsity), and the characteristics can be conveniently extracted by a human.

S14: mapping a sample to be detected by using a projection matrix to obtain a soft label of the sample to be detected; the sample to be detected is a sample of which the category is unknown and the category label is not calibrated.

And mapping the sample to be detected by using the projection matrix to obtain a soft label of the sample to be detected, and further determining the category of the sample to be detected.

The method comprises the steps of firstly, utilizing training samples marked with class labels and training samples not marked with class labels to jointly construct a similarity measurement matrix, further obtaining a reconstruction coefficient matrix, initializing to obtain an initial class label matrix, utilizing the initial class label matrix and the reconstruction coefficient matrix to determine soft labels of the training samples not marked with the class labels, and carrying out l on the soft labels of the training samples_2,1Norm regularization effectively reduces the influence of mixed signals in an original space formed by the soft labels on the accuracy of the classification result; by being based on l_2,1The robustness of regression residual measurement between the embedded features and the soft labels is effectively enhanced by the norm regularization measurement; by calculating on the basis of l_2,1And the mapping of the metric of norm regularization enhances the descriptive property of the extracted features. In summary, by introducing l_2,1Norm regularization technology effectively promotes system pairThe robustness of mixed signals such as noise, heterogeneous data and the like in the training samples avoids the occurrence of the condition that the accuracy of the classification result of the label-free sample is lower due to the influence of the mixed signal of the original space in the background technology, namely, the influence of the mixed signal of the original space is reduced, the accuracy of the classification result of the sample to be detected is improved, and the classification performance is enhanced. In addition, this application is the model of promoting outside the sample, and the conclusion and the prediction of the outer data of sample need not to introduce extra reconstruction process, and the expansibility can be good for the high efficiency completion sample of embedding.

In the image classification method based on the robust learning model provided in the above embodiment, initializing a training set acquired in advance to obtain an initial class label matrix may include:

obtaining an initialization matrix with Y₀＝[y₁,y₂,…,y_l+u]For presentation and training setIt is shown that, among others,representing the matrix space of n times l + u, X_LFor samples labeled with class labels, X_L＝[x₁,x₂,...,x_l]，X_UFor samples not labeled with class labels, X_U＝[x_l+1,x_l+2,...,x_l+u]N, l and u are positive integers;

Training sample x with any calibration class label_jIf the training sample x_jBelongs to the ith category, then y is determined_i，jIf the training sample x is 1_jIf it does not belong to the i-th category, y is determined_i，j0; for any training sample not marked with class labelx_jDetermining y_i，jAn initial class label matrix Y is obtained, where j is 1, 2.

Therefore, through the steps, the training samples are converted into the initial class label matrix capable of expressing the class of each training sample based on the class labels of each training sample, so that smooth implementation of the subsequent steps is guaranteed.

In the image classification method based on the robust learning model provided in the foregoing embodiment, the processing of the training sample based on the neighbor definition and the reconstruction weight construction method to obtain the similarity metric matrix corresponding to the training sample may include:

processing each training sample by using a K nearest neighbor algorithm to a training set containing c training samples to obtain K nearest neighbor samples of each training sample, wherein K is a positive integer;

and acquiring a similarity measurement matrix corresponding to the training samples by using the training samples and the K nearest neighbor samples of each training sample by adopting an LLE-reconstruction weight construction method.

The idea of the K nearest neighbor algorithm is as follows: if a sample belongs to a certain class in the K most similar samples in the feature space (i.e., the nearest neighbors in the feature space), then the sample also belongs to this class.

The construction method of LLE-reconstruction weight is an optimization method aiming at the nonlinear signal feature vector dimension, and the dimension optimization is not only simple reduction in quantity, but also mapping the signal of high-dimensional space to low-dimensional space under the condition of keeping the original data property unchanged, namely secondary extraction of feature value.

Specifically, processing the training sample based on the neighbor definition and reconstruction weight construction method to obtain a similarity metric matrix corresponding to the training sample may include:

the similarity metric matrix is obtained using the following formula:

wherein,representing a training sample x_iK nearest neighbor samples, q corresponds to K,for the current training sample x_iThe reconstructed coefficient vector of (2) characterizing its neighborsOn reconstruction of training sample x_iThe degree of contribution of the time-varying,a similarity metric matrix is represented. And a similarity metric matrixObtained by repeating the above steps related to the K-nearest neighbor algorithm and the construction method of LLE-reconstruction weights for all training samples.

It should be noted that, the construction method of LLE-reconstruction weights is used to obtain the similarity metric matrix corresponding to the training samples by using the training samples and the K nearest neighbor samples of each training sample, specifically, the construction method of LLE-reconstruction weights is used to calculate and measure the similarity between vertices to construct the similarity metric matrix of the similar neighbor graph.

In the image classification method based on the robust learning model provided in the above embodiment, the pre-setting processing is performed on the similarity metric matrix to obtain a reconstruction coefficient matrix, which may include:

and carrying out normalization processing and symmetry processing on the similarity measurement matrix to obtain a reconstruction coefficient matrix.

And mapping the sample to be detected by using the projection matrix to obtain the soft label of the sample to be detected, which may include:

obtaining a sample x to be tested_new；

By P^Tx_newTo-be-detected sample x_newEmbedding the prediction vector into a projection matrix P to obtain a prediction vector;

determining the soft label corresponding to the element with the maximum probability in the prediction vector as the sample x to be detected_newThe soft tag of (1).

The projection matrix is a mapping classifier, the sample to be detected is embedded into the mapping classifier, the prediction vector can be obtained, the prediction vector is a soft label matrix corresponding to the sample to be detected, the soft label of the position corresponding to the element with the maximum probability in the soft label matrix is determined to be the soft label of the sample to be detected, and the hard class label of the sample to be detected can be classified as argmax_i≤c(f_new)_iWhereina prediction vector f representing a prediction_newThe ith element position.

Through the steps, all the possible classes to which the sample to be detected belongs can be obtained according to the projection matrix, and the class with the highest probability is used as the class of the sample to be detected, so that the accuracy of the classification result is ensured.

Assume a predicted soft label matrix, i.e., an initialization matrix of F ═ F₁,f₂,…f_l+u]Wherein the training sample x_jClass and each column f_jMiddle maximum termf_i，jIs associated with the location of (a); in the image classification method based on the robust learning model provided in the above embodiment, an iteration mode is adopted to perform l on the soft label_2，1Normalizing the norm to obtain a projection matrix and a soft label matrix corresponding to the soft label of the training sample, which may include:

subj f_i≥0,e^Tf_i＝1for i＝1,2,...,l+u

wherein F represents a soft label matrix, P represents a projection matrix, W represents a reconstruction coefficient matrix, D represents a diagonal matrix, and D represents_ii＝∑_jW_i,j，fⁱTo representThe ith row vector of (2), X^TP-F^TRepresenting regression residue, which is used to measure X^TP and Soft tag F^TThe degree of difference of (a); mu.s_iα, gamma is a trade-off parameter corresponding to each of the above data, and a constraint of 1 for column sum, e, can be added^Tf_i1 and a non-negative constraint, i.e. f_iAnd the distribution probability of each training sample in the output soft label category matrix F is ensured to meet the logic that the probability sum is 1 and the distribution probability is a positive value (e is a column vector with elements of 1). And the projection matrix P, i.e. the mapping classifier, can be expressed as:

in the actual iterative solution process, the projection matrix and the soft label matrix are usually obtained by using the following formulas:

subjF≥0,e^TF＝e^T

wherein, omega ═ I-W^T-W+W^TW and I are unit matrixes, and only diagonal elements of the unit matrixes are nonzero and are 1; u denotes a diagonal matrix with the ith diagonal element μ_iFor labeled training samples, μ_iSet to + ∞, for training samples not labeled with class labels, μ_iIs set to 0. While projecting l of the matrix P_2,1Norm, i.e. P_2,1Many rows of the projection matrix P can be changed into zero, thereby reducing the influence of mixed signals in the original space on the soft label and ensuring that the projection matrix P is sparse.

It should be noted that, in the actual iterative solution process, on the basis of the above formula, the formula may be further rewritten as:

wherein G, H, V are diagonal matrices, and:

wherein r isⁱIs R ═ X^TP-F^TThe ith row vector of (1).

By passingFor a partial derivative of P set to 0, the following equation can be obtained:

wherein,

by passingFor a partial derivative of F set to 0, the following equation can be obtained:

wherein,

after each iteration update, the initial label matrix Y, the diagonal matrix D, the soft label matrix F, and the diagonal matrix H, V are updated as:

wherein,due to the fact thatAnd Y_lRespectively a prediction soft label and a label matrix of an original training sample marked with a class label,and Y_uFor the predicted soft label and label matrix of the original training sample not labeled with class label, the above formula can be rewritten as:

wherein, U can be divided into 4 parts according to the original training samples with the class labels and without the class labels in the initial training samples. The above formula can be converted into the following formula:

after the (k + 1) th iteration is estimated by the above two equations, the soft labels of the training samples with class labels and without class labels in the original space are updated as follows:

setting upBy substituting the above second equation into the first equation:

suppose thatThe soft label of the original training sample in the iteration process can be obtained by the following formula:

f can be updated by the above iteration steps_l ^k+1Andtherefore, when the original input signal contains noise or even contains error marks, the image classification method based on the robust learning model provided by the invention can update F through iterative steps_l ^k+1Thereby making the discrimination stronger. By F after update_l ^k+1Andcan train to obtain P^k+1And G^k+1，Andthe updating is as follows:

wherein,is (F)_l ^k+1)^TOrJth row vector, (r)^j)^k+1J is 1,2, …, l isThe jth row vector, and (r)^j)^k+1J is l +1, …, l + u isThe row vector of (1).

In the actual iteration process, in order to facilitate solving, the image classification method based on the robust learning model provided by the invention sets F in each iteration process_l ^k+1＝Y_lTo simplify the optimization, it is therefore possible to solve the following problems

Wherein,let the marked data x in the original input signal_jIs normalized by the parameter psi_jIs set to psi_lUnmarked data x_jIs normalized by the parameter psi_jIs set to psi_uWhen psi_lWhen 0, it indicates that the label prediction result for the labeled data is the same as the original label. I is_ψ＝(1+U)^-1，I_ξ＝I-I_ψRespectively mixing I_ψAnd I_ξThe method is divided into four parts as follows:

wherein,andrespectively diagonal element is psi_lAnd 1-psi_lThe diagonal matrix of (a) is,andrespectively diagonal element is psi_uAnd 1-psi_uThe diagonal matrix of (a). Is provided withThe k +1 th iteration yields:

the specific algorithm is as follows:

inputting:

matrix of raw dataInitial label matrix Y, control parameters ψ, α, γ, and K.

And (3) outputting:

1) initialization G⁰，H⁰，V⁰Is an identity matrix, F⁰＝Y，P⁰＝0，k＝0；

2) Computing a weight matrix W, constructing a diagonal matrix D such that D_ii＝∑_jW_i,j；

3) Define matrix Ω ═ I-W^T-W+W^TW＝[Ω_ll,Ω_lu,Ω_ul,Ω_uu]。

When not converging:

the soft label matrix F is updated by fixing other values to solve the following problem:

updating the projection matrix P by fixing other values solves the following problem:

update matrix G is:

computingUpdating by the method for updating H and VAnd

if | | | F^k+1-F^k||_F≤ε,||P^k+1-P^k||_FStopping circulation when the epsilon is less than or equal to epsilon; otherwise, K is K +1, and the circulation is continued.

And (3) outputting:

optimal solutionAnd an optimal projection matrix P ═ P^k+1。

Corresponding to the foregoing method embodiment, an embodiment of the present invention further provides an image classification system, as shown in fig. 2, which may include:

the preprocessing module 21 is configured to initialize a training set obtained in advance to obtain an initial class label matrix, where the training set includes a preset number of training samples, and the training samples include samples of which the classes are known and are labeled with class labels corresponding to the classes, and samples of which the classes are unknown and are not labeled with class labels; the construction method is used for processing the training samples based on the neighbor definition and the reconstruction weight to obtain a similarity measurement matrix corresponding to the training samples, and the similarity measurement matrix is subjected to preset processing to obtain a reconstruction coefficient matrix;

a training module 22, configured to determine a soft label of a training sample without a class label by effectively balancing a manifold smoothing term and a label fitting term based on the reconstruction coefficient matrix and the initial class label matrix, and perform l on the soft label of the training sample in an iterative manner_2，1Normalizing the norm to obtain a projection matrix and a soft label matrix corresponding to the soft label of the training sample;

the prediction module 23 is configured to map the sample to be detected by using the projection matrix to obtain a soft label of the sample to be detected; wherein, the sample to be detected is a sample without a calibrated class label.

The method comprises the steps of firstly, utilizing training samples marked with class labels and training samples not marked with class labels to jointly construct a similarity measurement matrix, further obtaining a reconstruction coefficient matrix, initializing to obtain an initial class label matrix, utilizing the initial class label matrix and the reconstruction coefficient matrix to determine soft labels of the training samples not marked with the class labels, and carrying out l on the soft labels_2,1Norm of normThe influence of the mixed signal in the original space formed by the soft label on the accuracy of the classification result is effectively reduced; by being based on l_2,1The robustness of regression residual measurement between the embedded features and the soft labels is effectively enhanced by the norm regularization measurement; by calculating on the basis of l_2,1And the mapping of the metric of norm regularization enhances the descriptive property of the extracted features. In summary, by introducing l_2,1The norm regularization technology effectively improves the robustness of the system to mixed signals such as noise, heterogeneous data and the like in training samples, avoids the occurrence of the condition that the classification result accuracy of a label-free sample is lower due to the influence of the mixed signals of an original space in the background technology, namely, reduces the influence of the mixed signals of the original space, improves the accuracy of the classification result of a sample to be detected, and enhances the classification performance. In addition, this application is the model of promoting outside the sample, and the conclusion and the prediction of the outer data of sample need not to introduce extra reconstruction process, and the expansibility can be good for the high efficiency completion sample of embedding.

In the image classification system provided by the above embodiment, the preprocessing module may include:

and the K neighbor unit is used for processing each training sample by utilizing a K neighbor algorithm to obtain K nearest neighbor samples of each training sample, wherein K is a positive integer.

And the LLE-reconstruction weight unit is used for acquiring a similarity measurement matrix corresponding to the training samples by using the K nearest neighbor samples of each training sample by adopting a LLE-reconstruction weight construction method.

And the preset processing unit is used for carrying out normalization processing and symmetry processing on the similarity measurement matrix to obtain a reconstruction coefficient matrix.

An initialization unit to: obtaining an initialization matrix with Y₀＝[y₁,y₂,…,y_l+u]For presentation and training setIt is shown that, among others,representing the matrix space of n times l + u, X_LFor samples labeled with class labels, X_L＝[x₁,x₂,...,x_l]，X_UFor samples not labeled with class labels, X_U＝[x_l+1,x_l+2,...,x_l+u]N, l and u are positive integers; determining any y in the initialization matrix_iIs a column vector, any column vector corresponds to the ith training sample x in the training set_i1, 2., l + u; training sample x with any calibration class label_jIf the training sample x_jBelongs to the ith category, then y is determined_i，jIf the training sample x is 1_jIf it does not belong to the i-th category, y is determined_i，j0; for any training sample x not marked with class label_jDetermining y_i，jAn initial class label matrix Y is obtained, where j is 1, 2.

The LLE-reconstructing weight unit may include:

an LLE-reconfiguration weight section for: the similarity metric matrix is obtained using the following formula:

wherein,representing a training sample x_iK nearest neighbor samples ofIn this case, q corresponds to K,for the current training sample x_iThe reconstructed coefficient vector of (2) characterizing its neighborsOn reconstruction of training sample x_iThe degree of contribution of the time-varying,representing the corresponding similarity reconstruction coefficient matrix.

In the image classification system provided by the above embodiment, the prediction module may include:

a prediction unit to: obtaining a sample x to be tested_new(ii) a By P^Tx_newTo-be-detected sample x_newEmbedding the prediction vector into a projection matrix P to obtain a prediction vector; determining the soft label corresponding to the element with the maximum probability in the prediction vector as the sample x to be detected_newThe soft tag of (1).

In the image classification system provided by the above embodiment, the training module may include:

a training unit to: the projection matrix and the soft label matrix are obtained by the following formulas:

subj f_i≥0,e^Tf_i＝1for i＝1,2,...,l+u

wherein F represents a soft label matrix, P represents a projection matrix, W represents a reconstruction coefficient matrix, D represents a diagonal matrix, and D represents_ii＝∑_jW_i,j，fⁱTo representThe ith row vector of (2), X^TP-F^TRepresenting regression residue, which is used to measure X^TP and Soft tag F^TThe degree of difference of (a); mu.s_iα, γ is the corresponding trade-off parameter;

subjF≥0,e^TF＝e^T

For an image classification system disclosed in the embodiment of the present invention, since it corresponds to an image classification method based on a robust learning model disclosed in the embodiment of the present invention, for a specific description thereof, reference may be made to relevant contents of the above method embodiment.

In addition, in the embodiment of the present invention, the image classification method based on the robust learning model provided in the embodiment of the present invention is tested on 4 real data sets, including Extended Yale-B, AR face, CMU peer face, and CMU place face. Based on computational efficiency considerations, all real images are compressed to 32x32 in size, so in the experiment one 1024-dimensional vector is for each picture. In order to test the performance of the method, a certain number of samples are selected from each class to form training samples with labeled class labels in a training set to serve as labeled training sample subsets, and then the same number of training samples without labeled class labels are selected from each class to serve as unlabeled training sample subsets. These data sets are collected from multiple aspects, and thus the test results are generally illustrative.

Please refer to table 1, which shows the average recognition rate and standard deviation of each method experiment for the image classification method based on the robust learning model and the comparison table of the testing recognition results of the GFHF, LLGC, SLP, LNP, SDA, Lap-LDA, FME and ELP methods in the extended Yale-B, AR face, CMU PIE and CMU pos face data set. In the experiment, a certain amount of samples are selected from each type of samples to form a training set, and in order to increase the experiment fairness, experiment parameters participating in a comparison method are also carefully selected, wherein the number of training samples marked with class labels for each type in the training set is in parentheses in a table, and the units of identification results are percentages.

TABLE 1 comparison table of test identification results

In addition, when the image is a face image, the face image test sample is equivalent to a sample to be tested in the present application, and the face image training sample is equivalent to a training sample in the present application, and a label prediction schematic diagram for obtaining the category of the face image test sample is shown in fig. 3.

The experimental result shows that the image classification method based on the robust learning model provided by the invention has an effect obviously superior to that of the traditional label propagation algorithm and has higher applicability and robustness. Therefore, the image classification system provided by the invention has the same effect.

The previous description of the disclosed embodiments is provided to enable any person skilled in the art to make or use the present invention. Various modifications to these embodiments will be readily apparent to those skilled in the art, and the generic principles defined herein may be applied to other embodiments without departing from the spirit or scope of the invention. Thus, the present invention is not intended to be limited to the embodiments shown herein but is to be accorded the widest scope consistent with the principles and novel features disclosed herein.

Claims

1. An image classification method based on a robust learning model is characterized by comprising the following steps:

mapping a sample to be detected by using the projection matrix to obtain a soft label of the sample to be detected; the sample to be detected is a sample of which the category is unknown and the category label is not calibrated;

the training sample and the sample to be detected are both face images.

2. The method according to claim 1, wherein the constructing method based on neighbor definition and reconstruction weight processes the training samples to obtain a similarity metric matrix corresponding to the training samples, and comprises:

3. The method according to claim 2, wherein the pre-processing the similarity metric matrix to obtain a reconstruction coefficient matrix comprises:

4. The method of claim 3, wherein mapping a sample to be tested with the projection matrix to obtain a soft label of the sample to be tested comprises:

obtaining a sample x to be tested_new；

5. The method of claim 2, wherein initializing a pre-acquired training set to obtain an initial class label matrix comprises:

Training sample x with any calibration class label_jIf the training sample x_jBelongs to the ith category, then y is determined_i，jIf the training sample x is 1_jIf it does not belong to the i-th category, y is determined_i，j0; for any training sample x not marked with class label_jDetermining y_i，jThe initial class label matrix Y is obtained, where j is 1,2,...,l+u。

6. The method according to claim 5, wherein the constructing method based on neighbor definition and reconstruction weight processes the training samples to obtain a similarity metric matrix corresponding to the training samples, and comprises:

obtaining the similarity metric matrix using the following formula:

7. The method of claim 6, wherein said iteratively applying is performed on said sampleTraining sample soft label_2，1And normalizing the norm to obtain a projection matrix and a soft label matrix corresponding to the soft label of the training sample, wherein the soft label matrix comprises the following steps:

subj f_i≥0,e^Tf_i＝1for i＝1,2,...,l+u

subjF≥0,e^TF＝e^T

8. An image classification system, comprising:

the prediction module is used for mapping a sample to be detected by using the projection matrix to obtain a soft label of the sample to be detected; the sample to be detected is a sample without a calibrated class label;

the training sample and the sample to be detected are both face images.

9. The system of claim 8, wherein the pre-processing module comprises:

10. The system of claim 9, wherein the pre-processing module comprises: