CN110378366B

CN110378366B - Cross-domain image classification method based on coupling knowledge migration

Info

Publication number: CN110378366B
Application number: CN201910482559.5A
Authority: CN
Inventors: 孟敏; 兰孟城; 武继刚
Original assignee: Guangdong University of Technology
Current assignee: Guangdong University of Technology
Priority date: 2019-06-04
Filing date: 2019-06-04
Publication date: 2023-01-17
Anticipated expiration: 2039-06-04
Also published as: CN110378366A

Abstract

The invention discloses a cross-domain image classification method based on coupling knowledge migration, which is characterized in that a common low-dimensional subspace of a source domain and a target domain is searched based on a maximum mean difference criterion, and the difference between the edge distribution and the class condition distribution of data of the source domain and the target domain is eliminated; constructing respective adjacent maps according to label information of source domain data and pseudo label information of target domain data, keeping structural consistency of the data from an original space to a low-dimensional subspace, and simultaneously dynamically adjusting the structure of the adjacent maps to promote the forward migration of domain knowledge; training a nearest neighbor classifier by using source domain data with label information in a low-dimensional subspace, and continuously iterating and optimizing pseudo label information of target domain data to obtain final label information of the target domain data, namely finishing cross-domain image classification; in addition, the method of the invention endows different confidence degrees of the pseudo label of the target domain image by designing a sample reweighting strategy, effectively reduces the negative transfer of the knowledge in the domain, and improves the precision of cross-domain image classification.

Description

Cross-domain image classification method based on coupling knowledge migration

Technical Field

The invention relates to the technical field of computer vision image classification, in particular to a cross-domain image classification method based on coupling knowledge migration.

Background

Conventional machine learning algorithms typically require a large number of labeled data samples and require that the training samples and the test samples follow separate and identical distributions. A straightforward approach is to migrate an existing tagged data set onto a new unknown data set, i.e. from the source domain to the target domain. However, the distribution must be different between different data sets, and the assumption that "independent equal distribution is obeyed" is often not true. Therefore, the conventional image classification method has difficulty in obtaining good classification performance.

In order to overcome the difference of sample distribution between a source domain and a target domain and improve the generalization capability of a classification algorithm, currently, domain adaptation is mainly used as a means for processing the problem of knowledge migration in cross-domain image classification, and the method can be generally divided into a method based on feature transformation, a method based on an example and a method based on neural network migration. However, there are many problems to be solved and improved in the current cross-domain learning, for example, in the case of a large difference between the source domain and the target domain, the classification performance of the domain adaptation method based on feature transformation is far from reaching the original expectation of people; the method based on the example needs to calculate additional sample weight, has high calculation complexity, and cannot carry out rapid cross-field learning on a large data set; the neural network-based approach requires a large number of labeled source domain samples for long training and has a large limitation in practical applications.

Disclosure of Invention

The invention provides a cross-domain image classification method based on coupling knowledge migration, which aims to solve the problems of long calculation time, insufficient classification accuracy and the like of the existing cross-domain image classification method.

In order to realize the purpose of the invention, the technical means adopted is as follows:

a cross-domain image classification method based on coupling knowledge migration comprises the following steps:

s1, obtaining a source domain image and a target domain image, extracting characteristics, respectively obtaining a source domain characteristic matrix as source domain data, and obtaining a target domain characteristic matrix as target domain data;

s2, respectively constructing an adjacency graph of source domain data and an adjacency graph of target domain data according to label information of the source domain data, and respectively obtaining a weight matrix of the source domain and a weight matrix of the target domain by calculating edge weights of the adjacency graphs;

s3, performing inter-domain data distribution alignment in a common low-dimensional subspace of the source domain and the target domain, and performing knowledge migration of intra-domain data by using a weight matrix of the source domain and a weight matrix of the target domain;

s4, in the low-dimensional subspace, utilizing source domain data with label information to train a nearest neighbor classifier and predicting the category information of target domain data;

s5, reweighing the label information of the predicted target domain data;

and S6, judging whether the maximum iteration number is reached, if not, returning to the step S2, and if so, outputting the label information of the predicted target domain image.

In the scheme, the common low-dimensional subspace of the source domain and the target source is searched, the difference of sample distribution is reduced, meanwhile, the adjacent map is used for carrying out the in-domain knowledge positive transfer, the structural consistency from the original space to the common low-dimensional subspace is kept, the data distribution is aligned by eliminating the difference of the edge distribution and class condition distribution of the data of the source domain and the target domain, therefore, the source domain data with label information is used for training a nearest neighbor classifier in the low-dimensional subspace to predict the label information of the data of the target domain, and finally, the predicted label information of the data of the target domain is subjected to weighting to inhibit the knowledge negative transfer, so that the accurate classification of the cross-domain image is realized.

Preferably, the specific steps of step S1 are:

s1.1. Input n _s A source domain image with label and n _t A label-free target domain image; wherein n is _s 、n _t Is a positive integer;

s1.2, respectively extracting the features of the source domain image and the target domain image, and arranging the feature vectors obtained by extracting the source domain image in columns to obtain a source domain data feature matrix

Where m is the dimension of the feature vector, n _s The number of the feature vectors;

arranging the characteristic vectors extracted from the target domain image according to columns to obtain a target domain data characteristic matrix

Where m is the dimension of the feature vector, n _t The number of the feature vectors;

the feature vector of the source domain image represents source domain data of the source domain image, and the feature vector of the target domain image represents target domain data of the target domain image.

Preferably, the feature extraction in step S1 is to extract feature vectors of the source domain image and the target domain image respectively through a pre-trained neural network. In the preferred embodiment, the neural network may use VGG16, *** lenet, resNet50, or the like.

Preferably, the feature extraction in step S1 is specifically to arrange the gray values of the source domain image into feature vectors and the gray values of the target domain image into feature vectors after graying the source domain image and the target domain image respectively.

Preferably, the specific step S2 is:

s2.1, judging whether the iteration is the first iteration, if not, directly carrying out the step S2.2, if so, constructing a nearest neighbor classifier and training the nearest neighbor classifier by utilizing the source domain data for predicting pseudo label information of the target domain data

S2.2, constructing an adjacency graph of the source domain data according to the label information of the source domain data, namely establishing adjacent edges among the same type data in the source domain data, and calculating the weights of the adjacent edges to obtain a weight matrix

Wherein W ^s A weight matrix for the source domain, calculated by:

wherein

Is the feature vector of the ith source domain image,

the label information corresponding to the ith source domain image is represented by the bandwidth of a Gaussian kernel, and | · | | | represents a vector two-norm;

constructing an adjacency graph of the target domain data according to the pseudo label information of the target domain data, establishing an adjacency edge between the same type data in the target domain data, and calculating the weight of the adjacency edge to obtain a weight matrix

Wherein W ^t Is the weight matrix of the target domain and is calculated by:

wherein

Is the feature vector of the ith target domain image,

for the pseudo label corresponding to the ith target field image, beta _i Is composed of

The initial value of (1) is set to be 1.

In the preferred scheme, by means of the method for constructing the adjacent maps in the source domain and the target domain respectively, the structural consistency of the data from an original space to an invariant subspace can be maintained, the semantic information of the source domain data label information is utilized to construct the adjacent maps, the structure of the adjacent maps can be dynamically adjusted, so that the more essential local structure of the data is mined, and the forward migration of the knowledge in the domain is promoted.

Preferably, the specific step S3 is:

s3.1, in a common low-dimensional subspace of the source domain and the target domain, performing inter-domain data distribution alignment including edge distribution and conditional probability distribution by using a maximum mean difference criterion, specifically, minimizing the following first objective function:

wherein C represents the number of categories of label information, x ^s,c And x ^t,c Respectively indicating that the category of the source domain data label information belongs to the set of categories c and the category of the target domain data label information belongs to the set of categories c,

and

respectively indicating the number of data of which the source domain data belongs to the category c of the tag information and the number of data of which the destination domain data belongs to the category c of the tag information,

is a projection matrix, where d < m, for projecting data from m dimensions to d-dimensional subspace;

s3.2. Using weight matrix W ^s And W ^t Performing knowledge migration of data within the domain, specifically by minimizing the following second objective function:

s3.3, combining the first objective function and the second objective function and adding a regularization

The following third objective function is obtained:

s.t.P ^T XHX ^T P＝I

wherein λ and α are balance factors, λ > 0 and α > 0, X = [ X ] = _s ,X _t ]H is the centering matrix:

i is a unit matrix, and the unit matrix is,

is a full 1 column vector of value 1;

solving a projection matrix P through generalized eigen decomposition, and obtaining the expressions of the source domain data and the target domain data in the low-dimensional subspace as follows:

and

in the preferred scheme, the difference between the edge distribution and the class condition distribution of the data of the source domain and the target domain is eliminated by searching the common low-dimensional subspace of the source domain and the target domain based on the maximum mean difference criterion.

Preferably, the specific steps of step S4 are:

obtaining the source domain data with label information in the low-dimensional subspace according to the representation of the source domain data in the low-dimensional subspace

For training the nearest neighbor classifier and predicting target domain data in a low-dimensional space

Class information of

In the preferred scheme, a nearest neighbor classifier is trained by using source domain data with label information in a low-dimensional subspace, and the pseudo label information of the target domain data is continuously optimized in an iterative manner, so that the final label information of the target domain data is obtained.

Preferably, the specific step of step S5 includes:

s5.1. for target domain data in a low-dimensional space

Constructing a new adjacency graph, i.e. each target domain data point in the low-dimensional space

Respectively establishing adjacent edges with k target domain data points with the shortest Euclidean distances;

s5.2, based on the adjacency graph in the step S5.1, for the ith target domain data point on the granted adjacency graph

If and with

The connected k target domain data points are all connected with

If they belong to one category, the target domain data point

Is marked with a label

Confidence of (beta) _i =1, otherwise β _i =0, wherein k is a positive integer.

In the preferred scheme, different confidences are given to the target domain samples based on a sample re-weighting strategy, so that the negative migration of the knowledge in the domain is effectively inhibited.

Compared with the prior art, the technical scheme of the invention has the beneficial effects that:

the method of the invention carries out inter-domain data distribution alignment comprising edge distribution and conditional probability distribution by utilizing the maximum mean difference criterion, searches a common low-dimensional subspace of a source domain and a target domain, eliminates the difference between the edge distribution and the class conditional distribution of the data of the source domain and the target domain, and leads the data distribution to be aligned;

constructing respective adjacency graphs according to the label information of the source domain data and the pseudo label information of the target domain data, keeping the structural consistency of the data from an original space to a low-dimensional subspace, and simultaneously dynamically adjusting the structure of the adjacency graphs to promote the forward migration of the knowledge in the domain; training a nearest neighbor classifier by using source domain data with label information in a low-dimensional subspace, and continuously iterating and optimizing pseudo label information of target domain data to obtain final label information of the target domain data, namely finishing cross-domain image classification;

in addition, the method of the invention endows different confidence coefficients of the pseudo labels of the target domain images by designing a sample re-weighting strategy, effectively reduces the negative transfer of the knowledge in the domain, and improves the accuracy of cross-domain image classification.

The method is based on an iterative method, and can effectively combine data distribution alignment to promote the intra-domain knowledge positive migration and inhibit the intra-domain knowledge negative migration, thereby achieving the purpose of coupling knowledge migration and greatly improving the performance of cross-domain image classification. The method solves the problems of long calculation time, insufficient classification accuracy and the like of the conventional cross-domain image classification method.

Drawings

FIG. 1 is a flow chart of the method of the present invention.

Fig. 2 is an effect diagram after performing inter-domain data distribution alignment in embodiment 2.

Fig. 3 is a diagram illustrating the distribution effect of the label information types in the low-dimensional space for the target domain in this embodiment 2.

Detailed Description

The drawings are for illustrative purposes only and are not to be construed as limiting the patent;

for the purpose of better illustrating the present embodiments, certain elements of the drawings may be omitted, enlarged or reduced, and do not represent the size of an actual product;

it will be understood by those skilled in the art that certain well-known structures in the drawings and descriptions thereof may be omitted.

The technical solution of the present invention is further described with reference to the drawings and the embodiments.

Example 1

A cross-domain image classification method based on coupled knowledge migration, as shown in FIG. 1, includes the following steps:

s1, obtaining a source domain image and a target domain image, extracting features, respectively obtaining a source domain feature matrix as source domain data, and obtaining a target domain feature matrix as target domain data:

s1.1. Input n _s A tagged source domain image and n _t Opening an image of the target domain without a label; wherein n is _s 、n _t Is a positive integer;

s1.2, respectively extracting the features of the source domain image and the target domain image, respectively extracting the feature vectors of the source domain image and the target domain image through any one of pre-trained neural networks of VGG16, ***Ne and ResNet50, and arranging the feature vectors extracted from the source domain image in columns to obtain a source domain data feature matrix

arranging the feature vectors extracted from the target domain image according to columns to obtain a target domain data feature matrix

S2, respectively constructing an adjacency graph of source domain data and an adjacency graph of target domain data according to label information of the source domain data, and respectively obtaining a weight matrix of the source domain and a weight matrix of the target domain by calculating edge weights of the adjacency graphs:

s2.1, judging whether the iteration is the first iteration or not, if not, directly carrying out the step S2.2, if so, constructing a nearest neighbor classifier and training the nearest neighbor classifier by utilizing the source domain data for predicting pseudo label information of target domain data

S2.2, constructing an adjacency graph of the source domain data according to the label information of the source domain data, namely establishing adjacent edges among the same-class data in the source domain data, and calculating the weights of the adjacent edges to obtain a weight matrix

Wherein W ^s A weight matrix for the source domain, calculated by:

wherein

Is the feature vector of the ith source domain image,

Wherein W ^t Is the weight matrix of the target domain and is calculated by:

wherein

Is the feature vector of the ith target domain image,

The initial value of (1) is set.

S3, performing inter-domain data distribution alignment in a common low-dimensional subspace of the source domain and the target domain, and simultaneously performing knowledge migration of intra-domain data by using a weight matrix of the source domain and a weight matrix of the target domain:

and

s3.2. Using weight matrix W ^s And W ^t Performing knowledge migration of data in the domain, specifically by minimizing the following second objective function:

The following third objective function is obtained:

s.t.P ^T XHX ^T P＝I

i is a unit matrix, and the unit matrix is,

is a full 1 column vector of value 1;

solving a projection matrix P through generalized eigen decomposition to obtain the expressions of the source domain data and the target domain data in the low-dimensional subspace, wherein the expressions are respectively as follows:

and

s4, in the low-dimensional subspace, utilizing the source domain data with the label information to train a nearest neighbor classifier, and predicting the category information of the target domain data: obtaining the source domain data with label information in the low-dimensional subspace according to the representation of the source domain data in the low-dimensional subspace

Class information of

S5, reweighing the label information of the predicted target domain data:

s5.1. for target domain data in a low-dimensional space

Respectively establishing adjacent edges with k target domain data points with the shortest Euclidean distance;

If and with

The connected k target domain data points are all connected with

If they belong to one category, the target domain data point

Of (2)

S6, judging whether the maximum iteration times is reached or not, and if not, returning to the step S2, if yes, outputting the label information of the predicted target domain image

Example 2

In this embodiment 2, 980 face images each of which is composed of face images of 10 persons randomly selected from the CMU PIE data set were selected, and each image was cut to 32 × 32 resolution. Further, 490 images in the C05 subset are used as a source domain data set, and 490 images in the C27 subset are used as a target domain data set to be classified.

S1, obtaining a source domain image in a source domain data set and a target domain image in a target domain data set, after graying the source domain image and the target domain image respectively, arranging gray values of the source domain image into a feature vector and arranging gray values of the target domain image into a feature vector, in this embodiment 2, gray values of each image are arranged into a feature vector, that is, each image is represented as a 1024-dimensional column vector, and correspondingly, source domain data feature matrix matrixes can be obtained respectively by arranging the images in columns

And a target domain feature matrix

s2.1, judging whether the iteration is the first iteration or not, if not, directly carrying out the step S2.2, if so, constructing a nearest neighbor classifier and utilizing the source domain data with the labeled information

Training nearest neighbor classifiers for predicting target domain data

Pseudo category information of

S2.2. Tag information y according to source domain data ^s Constructing an adjacency graph of the source domain data, namely establishing an adjacency edge between homogeneous data in the source domain data, and calculating the weight of the adjacency edge to obtain a weight matrix

Wherein W ^s A weight matrix for the source domain, calculated by:

wherein

Is the feature vector of the ith source domain image,

for the label information corresponding to the ith source domain image, σ is the bandwidth of the gaussian kernel, which is set to 1 in this embodiment 1, and | · | | | represents a vector two-norm;

pseudo label information according to the target domain data

Constructing an adjacent graph of target domain data, establishing adjacent edges among the same-class data in the target domain data, and calculating the weights of the adjacent edges to obtain a weight matrix

Wherein W ^t A weight matrix for the target domain and calculated by:

wherein

Is the feature vector of the ith target domain image,

The initial value of (1) is set to be 1.

S3, performing inter-domain data distribution alignment in a common low-dimensional subspace of the source domain and the target domain, and simultaneously performing knowledge migration of intra-domain data by using a weight matrix of the source domain and a weight matrix of the target domain;

by minimizing the following objective function:

s.t.P ^T XHX ^T P＝I

where λ =0.1 and α =0.01 are balance factors. X = [ X = _s ,X _t ]H is the centering matrix:

I ₉₈₀ is a unit matrix, 1 ₉₈₀ Is a full 1 column vector. Solving the above formula through generalized eigen decomposition to obtain a projection matrix

The dimension of the low-dimensional subspace is set to d =100 in this embodiment 2. And finally, the representation of the source domain data and the target domain data in the low-dimensional subspace is obtained:

and

s4, according to the representation of the source domain data in the low-dimensional subspace, obtaining the source domain data with label information in the low-dimensional subspace

Class information of

S5, reweighing the label information of the predicted target domain data;

s5.1. For target domain data in a low-dimensional space

Respectively establishing adjacent edges with k =5 target domain data points with the shortest Euclidean distance;

If and with

The 5 connected target domain data points are all connected with

If they belong to one category, the target domain data point

Is markedLabel (Bao)

Confidence of (beta) _i =1, otherwise beta _i ＝0。

S6, judging whether the maximum iteration number is reached to 10, if not, returning to the step S2, and if so, outputting the label information of the predicted target domain image

Thereby obtaining a classification result.

In this embodiment 2, the experimental platform is MATLAB R2017a software on WIN10 system, and the CPU model is Intel i7-6700k @4.00ghz. The experimental result diagram of this embodiment 2 is shown in fig. 2 and 3, where in fig. 2, a circle represents source domain data, a pentagon represents target domain data, and a left diagram of fig. 2 is an original distribution of the source domain data and the target domain data, and it can be obviously seen that the distribution of the source domain data and the target domain data has a great difference; the right diagram of fig. 2 is data distribution after inter-domain data distribution alignment is performed on source domain data and data target domain data in a low-dimensional subspace in the method of embodiment 2, and it can be obviously seen that the difference of the source domain data and the target domain data distribution is greatly reduced. In fig. 3, labels of different shapes represent different label information categories, for a total of ten label information categories. The left sub-graph in fig. 3 is the distribution of each label information of the original target domain data, and the category distribution is relatively disordered; fig. 3 is a right sub-graph of the distribution of the target domain data in the low-dimensional subspace obtained by the method of this embodiment 2, which shows that the category distribution of the target domain data is clear, and each data with the same tag information forms a cluster. This demonstrates the high efficiency of the process of the invention.

The terms describing positional relationships in the drawings are for illustrative purposes only and are not to be construed as limiting the patent;

it should be understood that the above-described embodiments of the present invention are merely examples for clearly illustrating the present invention, and are not intended to limit the embodiments of the present invention. Other variations and modifications will be apparent to persons skilled in the art in light of the above description. And are neither required nor exhaustive of all embodiments. Any modification, equivalent replacement, and improvement made within the spirit and principle of the present invention should be included in the protection scope of the claims of the present invention.

Claims

1. A cross-domain image classification method based on coupling knowledge migration is characterized by comprising the following steps:

s4, in the low-dimensional subspace, training a nearest neighbor classifier by using source domain data with label information, and predicting the label information of target domain data;

s5, reweighing the label information of the predicted target domain data;

s6, judging whether the maximum iteration number is reached, if not, returning to the step S2, and if so, outputting label information of the predicted target domain image so as to obtain a classification result;

the S3 comprises the following specific steps:

and

is a projection matrix, where d < m, for projecting data from m dimensions into a d-dimensional subspace, m representing the dimensions of the eigenvectors;

the feature vector of the ith source domain image is obtained; n is _s Number of source domain images for which a label is applied; n is _t Representing the number of unlabeled target domain images;

the feature vector of the ith target domain image is obtained;

The following third objective function is obtained:

s.t.P ^T XHX ^T P＝I

i is an identity matrix and is a matrix of the identity,

is a full 1-column vector of value 1; x _s Representing a source domain data feature matrix, X _t Representing a target domain data feature matrix;

and

2. the cross-domain image classification method based on coupling knowledge migration according to claim 1, wherein the specific steps of the step S1 are:

s1.1. Input n _s A source domain image with label and n _t Opening an image of the target domain without a label; wherein n is _s 、n _t Is a positive integer;

Wherein m is the dimension of the feature vector;

Wherein m is the dimension of the feature vector;

3. The method for cross-domain image classification based on coupling knowledge migration according to claim 2, wherein the feature extraction in step S1 is to extract feature vectors of the source domain image and the target domain image respectively through a pre-trained neural network.

4. The cross-domain image classification method based on coupling knowledge migration according to claim 2, wherein the feature extraction in step S1 specifically is to arrange gray values of the source domain image into feature vectors and gray values of the target domain image into feature vectors after graying the source domain image and the target domain image respectively.

5. The cross-domain image classification method based on coupling knowledge migration according to claim 3 or 4, wherein the specific step S2 is:

S2.2. According to the source domain data markLabeling information, constructing an adjacency graph of source domain data, establishing an adjacency edge between the same type data in the source domain data, and calculating the weight of the adjacency edge to obtain a weight matrix

Wherein W ^s A weight matrix for the source domain, calculated by:

wherein

Is the feature vector of the ith source domain image,

Wherein W ^t Is the weight matrix of the target domain and is calculated by:

wherein

Is the feature vector of the ith target domain image,

for the pseudo label information, beta, corresponding to the ith target field image _i Is composed of

The initial value of (1) is set.

6. The cross-domain image classification method based on coupling knowledge migration according to claim 5, wherein the specific steps of the step S4 are as follows:

Pseudo tag information of

7. The cross-domain image classification method based on coupling knowledge migration according to claim 6, wherein the specific steps of the step S5 comprise:

s5.1. For target domain data in a low-dimensional space

Constructing a new adjacency graph, and setting each target domain data point in a low-dimensional space

Respectively establishing adjacent edges with k target domain data points with the shortest Euclidean distance; i =1,2, \8230;, n _t ；

S5.2. Based onFor the adjacency graph stated in step S5.1, for the ith target domain data point on the granted adjacency graph

If and with

The k target domain data points connected with

If the same belongs to a category, the target domain data point

Is marked with a label

Weight of beta _i =1, otherwise β _i =0, wherein k is a positive integer.