CN113420786A - Semi-supervised classification method for feature mixed image - Google Patents

Semi-supervised classification method for feature mixed image

Info

Publication number
CN113420786A
CN113420786A (application CN202110599001.2A)
Authority
CN
China
Prior art keywords
image
mixed
feature
semi
network
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202110599001.2A
Other languages
Chinese (zh)
Inventor
姚博
文成林
徐晓滨
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hangzhou Dianzi University
Original Assignee
Hangzhou Dianzi University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hangzhou Dianzi University filed Critical Hangzhou Dianzi University
Priority to CN202110599001.2A priority Critical patent/CN113420786A/en
Publication of CN113420786A publication Critical patent/CN113420786A/en
Pending legal-status Critical Current

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F 18/00 Pattern recognition
    • G06F 18/20 Analysing
    • G06F 18/24 Classification techniques
    • G06F 18/241 Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • G06F 18/2415 Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on parametric or probabilistic models, e.g. based on likelihood ratio or false acceptance rate versus a false rejection rate
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F 18/00 Pattern recognition
    • G06F 18/20 Analysing
    • G06F 18/21 Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F 18/214 Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F 18/00 Pattern recognition
    • G06F 18/20 Analysing
    • G06F 18/25 Fusion techniques
    • G06F 18/253 Fusion techniques of extracted features
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N 20/00 Machine learning

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • Evolutionary Computation (AREA)
  • Artificial Intelligence (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Evolutionary Biology (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Software Systems (AREA)
  • Probability & Statistics with Applications (AREA)
  • Medical Informatics (AREA)
  • Computing Systems (AREA)
  • Mathematical Physics (AREA)
  • Image Analysis (AREA)

Abstract

The invention provides a semi-supervised classification method for feature-mixed images, relating to the field of computer image processing. The method comprises the following steps: constructing a semi-supervised data set; applying feature-fusion augmentation to the unlabeled data; adding a consistency measure to the network and designing a new objective function; and finally training the network and testing its classification performance on feature-mixed images. By integrating a multi-image mixed-sample augmentation technique into the augmentation framework of a semi-supervised model, the method increases the diversity of the training data, introduces several types of noise interference, improves the generalization ability of the model, enhances its robustness, and achieves accurate classification of feature-mixed images.

Description

Semi-supervised classification method for feature mixed image
Technical Field
The invention belongs to the field of computer image processing, and particularly relates to a semi-supervised classification method for feature-mixed images.
Background
Semi-supervised learning is an important research direction in current machine learning. In semi-supervised learning, the training data usually contain a large amount of unlabeled data, and augmenting the labeled data alone cannot meet the training requirements of a model; in particular, when the labeled data are few, the model easily overfits. To address this problem, mainstream semi-supervised learning models rely on a consistency regularization term in the objective function to measure the consistency of the data distribution before and after augmentation, extracting effective information from the unlabeled data; thanks to the diversity and complexity of the augmentation types, this effectively improves the generalization performance and robustness of the model.
In the field of semi-supervised classification, consistency regularization mainly uses the RandAugment framework, which defines a policy space containing only single-image augmentations; for each image an augmentation is selected at random to generate the corresponding augmented data. Beyond single-image augmentation, the Mixup mixed-data augmentation method augments data by blending samples of different classes, and has achieved notable performance gains in supervised image classification. As a mixing augmentation, Mixup typically produces a mixed image containing several target features, of a kind often found in artificially edited images or images of hybrid species. Because such a mixed image does not belong to any single known class, it cannot simply be added to the RandAugment policy space for consistency measurement, which makes classifying mixed images under semi-supervision difficult.
Disclosure of Invention
The invention aims to provide a semi-supervised classification method for feature-mixed images that achieves fast and accurate classification of feature-mixed images when only a small amount of labeled data is available.
The method comprises the following specific steps:
step 1, collecting and sorting images of objects to be classified as a data set, and only carrying out partial labeling on the images of each category.
step 2, performing multiple feature-mixing augmentations on each unlabeled image in the data set.
step 3, designing a consistency measurement method relating the unlabeled and augmented images for the convolutional network, and improving the network's objective function.
step 4, inputting the data set containing the labeled, unlabeled and augmented images into the improved convolutional network, and training until the network converges.
step 5, using the mixed images with multi-target features as the test set to obtain the image classification results.
Further, in step 1, to ensure equal class priors during network training, the number of labeled images is the same for every class, that is, the class labeling is balanced.
Further, in step 2, besides the common single-image augmentations such as stretching, flipping and cropping, the Mixup mixed-data augmentation method is applied to the unlabeled images, generating mixed images carrying several kinds of features.
Further, step 3 includes the following sub-steps:
3.1 select KL divergence as the consistency measure representing the deviation of the model prediction output before and after input augmentation.
3.2 the improved network objective function not only includes the cross entropy of the traditional marked image, but also includes the consistency measurement of the unmarked image.
Further, in step 4, at each training step the labeled, unlabeled and augmented images are fed into the network simultaneously; features are extracted from the labeled images and the cross-entropy is computed against their labels, while the consistency of the network outputs is measured between the unlabeled images and their augmented versions; both terms participate in the gradient computation.
Further, in step 5, the network outputs class probability to the multi-feature mixed image which is difficult to classify, which indicates the class source of the mixed feature, and obtains the classification result of the mixed image.
The invention has the beneficial effects that:
(1) Improved generalization ability of the model. The multi-image mixed-sample augmentation technique is integrated into the augmentation framework of the semi-supervised model, increasing the diversity of the training data and improving the generalization ability of the model.
(2) Enhanced robustness of the model. The mixed images introduce several types of noise interference on top of the various augmented data samples, reducing the model's sensitivity to perturbed sample data.
(3) Improved classification of mixed images. An evaluation method based on the consistency measure between the feature-mixed samples, generated in several ways, and the original images is incorporated into the semi-supervised learning model, further improving the generalization ability of the model and its ability to classify mixed images.
Drawings
FIG. 1 is a flow chart of classification of feature-mixed images under semi-supervised learning;
fig. 2 is a schematic diagram of information storage of a Mixup mixed image in the present invention.
Detailed Description
The invention provides a semi-supervised classification method of feature mixed images, which is further explained by combining the attached drawings.
As shown in fig. 1, the present invention provides a semi-supervised classification method for feature-mixed images, comprising the following steps:
step 1, manufacturing a semi-supervised data set:
The CIFAR-10 data set is selected as the classification object; its 50000 training images are divided into 10 classes, with 5000 labeled images per class. In each class, N_ul images are randomly selected to keep their labels; the labels of all remaining training data are deleted and only the images are kept, forming the semi-supervised training set.
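The per-class split described above can be sketched as follows. This is a minimal NumPy sketch; the function name and the `seed` parameter are our own, and `n_labeled_per_class` plays the role of the per-class labeled count written N_ul in the text:

```python
import numpy as np

def make_semi_supervised_split(labels, n_labeled_per_class, num_classes=10, seed=0):
    """Keep a fixed number of labeled images per class; every remaining
    index goes into the unlabeled pool (its label is then discarded)."""
    rng = np.random.default_rng(seed)
    labeled_idx, unlabeled_idx = [], []
    for c in range(num_classes):
        idx = np.flatnonzero(labels == c)   # all indices of class c
        rng.shuffle(idx)                    # random selection within the class
        labeled_idx.extend(idx[:n_labeled_per_class])
        unlabeled_idx.extend(idx[n_labeled_per_class:])
    return np.array(labeled_idx), np.array(unlabeled_idx)
```

With CIFAR-10 one would call this on the 50000 training labels; the balanced per-class selection also satisfies the label-balance requirement of step 1.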
Step 2, feature mixing and augmentation:
and performing multiple feature mixing augmentation on each unmarked image in the data set. The Mixup mixed data augmentation method achieves the purpose of data augmentation by interpolating images and labels, wherein x represents the images, and y represents the labels, and the method is specifically as follows:
mixed_x = λ·x1 + (1 − λ)·x2
mixed_y = λ·y1 + (1 − λ)·y2    (1)
where λ is the mixing coefficient, sampled from a Beta distribution with parameters α, β, and (x1, y1), (x2, y2) are two randomly drawn samples.
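A minimal sketch of equation (1), assuming one-hot labels; the default Beta parameters α = β = 0.75 are only an illustrative choice, since the patent does not fix them:

```python
import numpy as np

def mixup_pair(x1, x2, y1=None, y2=None, alpha=0.75, beta=0.75, rng=None):
    """Equation (1): lam ~ Beta(alpha, beta); mixed = lam*a + (1-lam)*b.
    For unlabeled data y1/y2 are None and only the images are interpolated."""
    rng = rng or np.random.default_rng()
    lam = rng.beta(alpha, beta)
    mixed_x = lam * x1 + (1.0 - lam) * x2
    mixed_y = None if y1 is None else lam * y1 + (1.0 - lam) * y2
    return mixed_x, mixed_y, lam
```

The `y1=None` path matches the next paragraph: unlabeled data have no labels to mix, so only `mixed_x` is produced.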
The unlabeled data in this method have no labels to mix, so only the images can be interpolated, yielding the mixed tensor mixed_x; the labeling of the feature-mixed image is therefore deferred to the consistency regularization of step 3.
Another difference between Mixup multi-image mixing and single-image augmentation is that the two or more source images must be recorded for the subsequent fusion and consistency measurement. To preserve the randomness of the method, a double-ended queue (deque) is used to record the data participating in multi-image mixing.
As shown in fig. 2, the queue internally stores the original images used for mixing. If at the previous moment the original image x1 was mixed with the original image x2 to generate mixed image A, this is stored as mixed information A; at the current moment x1 is popped from the queue and x2 is added, generating mixed image B, stored as mixed information B; at the next moment x2 is popped and x3 is enqueued, and the mixture is generated and stored. This continues until all input images requiring mixed augmentation have been traversed. Each piece of mixed information records the mixed image together with the two original images before mixing.
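The queue bookkeeping of fig. 2 can be sketched as follows, simplified to a queue of length one; `pair_fn` is a placeholder for whatever mixing operation is applied to a pair (e.g. the Mixup interpolation above):

```python
from collections import deque

def mix_stream(images, pair_fn):
    """Walk the unlabeled images with a deque: each arriving image is mixed
    with the one currently waiting, the 'mixed information' (mixed result
    plus both originals) is recorded, and the queue rotates."""
    queue = deque(maxlen=1)
    mixed_info = []
    for img in images:
        if queue:
            partner = queue.popleft()      # pop the previous original
            mixed_info.append((pair_fn(partner, img), partner, img))
        queue.append(img)                  # current image waits for the next one
    return mixed_info
```

Traversing images x1, x2, x3 thus records the pairs (x1, x2) and (x2, x3), matching the rotation described in the text.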
Step 3 consistency measurement:
the consistency measurement method aims to add a regularization term to an objective function of the network and prevent the occurrence of network overfitting under the condition of a small amount of labeled data. The main idea of consistency regularization is that for an input, even if slightly disturbed, its prediction should be consistent, i.e. the network prediction output is unchanged before and after the same object is augmented. This is consistent with the goal of most objective functions, i.e., it is desirable that predictions be consistent with labels. Consistency measurement and KL divergence measurement mode D are carried out on data before and after augmentationKLMay be represented by the following formula:
D_KL( f_θ(x) || f_θ(x̂) )    (2)

where x̂ is the augmented version of sample x and f_θ(·) denotes the model under training, with parameters θ. Passing the mixed tensor mixed_x through f_θ(·) yields the output tensor mixed_out. During training, mixed_out is compared by the consistency measure with each of the original images before mixing; the measures are weighted by the mixing coefficient λ of the image and summed, and the sum is added to the objective function as a regularization term, concretely:
d_kl = λ·D_KL(x1 || mixed_out) + (1 − λ)·D_KL(x2 || mixed_out)    (3)
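Equations (2) and (3) can be sketched as below. Here `p1` and `p2` stand for the model predictions f_θ(x1) and f_θ(x2); reading the patent's shorthand D_KL(x1 || mixed_out) as a divergence between prediction distributions is our assumption:

```python
import numpy as np

def kl_div(p, q, eps=1e-8):
    """D_KL(p || q) between two categorical distributions, as in equation (2)."""
    p = np.clip(p, eps, 1.0)
    q = np.clip(q, eps, 1.0)
    return float(np.sum(p * np.log(p / q)))

def mixed_consistency(p1, p2, mixed_out, lam):
    """Equation (3): the mixed prediction is pulled toward the predictions
    for both source images, weighted by the mixing coefficient lam."""
    return lam * kl_div(p1, mixed_out) + (1.0 - lam) * kl_div(p2, mixed_out)
```

The term vanishes exactly when the mixed output agrees with both source predictions, which is the consistency the regularizer enforces.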
the objective function L of the final network is as follows:
L = H(N_l) + d_kl(N_ul)    (4)
where N_l denotes the labeled data, H(N_l) the cross-entropy over the labeled data, N_ul the unlabeled data, and d_kl(N_ul) the consistency regularization term over the unlabeled data.
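A minimal sketch of the objective in equation (4): the mean softmax cross-entropy over a labeled batch plus the summed consistency terms. Batch handling and the absence of any weighting schedule are our simplifications:

```python
import numpy as np

def objective(logits_labeled, onehot_labels, d_kl_terms):
    """Equation (4): L = H(N_l) + d_kl(N_ul)."""
    # numerically stable log-softmax for the cross-entropy H(N_l)
    z = logits_labeled - logits_labeled.max(axis=1, keepdims=True)
    log_p = z - np.log(np.exp(z).sum(axis=1, keepdims=True))
    h = -float(np.mean(np.sum(onehot_labels * log_p, axis=1)))
    # d_kl(N_ul): summed consistency regularizers of the unlabeled/mixed data
    return h + float(np.sum(d_kl_terms))
```

Both terms are differentiable, so in training the labeled and unlabeled parts participate in the gradient computation together, as step 4 requires.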
And 4, step 4: network training
The network is WRN-28-10, implemented in TensorFlow. Compared with a deep residual network (ResNet), WRN widens the convolutions in each residual block (ResBlock), i.e. increases the number of feature channels per convolutional layer, so that at the same depth it outperforms a conventional ResNet; at the same time the reduced network depth speeds up convergence.
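For reference, the block layout of a WRN-depth-width network can be computed as below; the 6n + 4 depth rule and the base channel counts follow the standard WRN design rather than anything specific to this patent:

```python
def wrn_config(depth=28, width=10):
    """Block plan of WRN-depth-width: depth = 6n + 4, i.e. three groups of n
    residual blocks, with channel counts widened by the factor `width`."""
    assert (depth - 4) % 6 == 0, "WRN depth must be of the form 6n + 4"
    n = (depth - 4) // 6
    widths = [16 * width * (2 ** i) for i in range(3)]  # one entry per group
    return n, widths
```

For WRN-28-10 this gives 4 blocks per group with 160, 320 and 640 channels, which is where the widening over a plain ResNet comes from.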
And 5: classifying feature-blended images
The mixed images with multi-target features are used as the test set; for these hard-to-classify multi-feature images the network outputs class probabilities that indicate the class sources of the mixed features, yielding the classification result of the mixed image.
By integrating the multi-image mixing augmentation technique into the semi-supervised setting, the method increases the diversity of the training data and improves the generalization ability of the model, while the various kinds of noise introduced during training enhance its robustness. An evaluation method based on the consistency measure between the feature-mixed samples, generated in several ways, and the original images is built into the semi-supervised learning model, further improving generalization and the ability to classify mixed images. The method thus achieves fast and accurate classification of feature-mixed images when only a small amount of labeled data is available.

Claims (7)

1. A semi-supervised classification method of feature-mixed images is characterized by comprising the following steps:
step 1, collecting and sorting images of objects to be classified as a data set, and only carrying out partial labeling on the images of each category;
step 2, performing multiple feature mixing augmentation on each unmarked image in the data set;
step 3, designing a consistency measurement method related to the unmarked image and the augmented image for the convolution network, and improving the objective function of the network;
step 4, inputting a data set containing an annotated image, a non-annotated image and an augmented image into the improved convolution network, and training until the network is converged;
step 5, taking the mixed images with multi-target features as a test set to obtain the image classification results.
2. The semi-supervised classification method for feature mixed images as claimed in claim 1, wherein in step 1, the number of labeled images in each category is consistent, i.e. category labeling is balanced.
3. The semi-supervised classification method for feature-mixed images as claimed in claim 1, wherein in step 2, in addition to the commonly used single-image augmentations of stretching, flipping and cropping, the Mixup mixed-data augmentation method is used to generate mixed images with several kinds of features.
4. The semi-supervised classification method for feature-mixed images according to claim 1, wherein in step 3, KL divergence is selected as a consistency measure representing a deviation of model prediction outputs before and after input augmentation.
5. The semi-supervised classification method for feature mixed images as claimed in claim 1, wherein in step 3, the improved network objective function includes not only cross entropy of traditional labeled images but also consistency measure of unlabeled images.
6. The semi-supervised classification method for feature-mixed images as claimed in claim 1, wherein in step 4, at each step of network training the labeled, unlabeled and augmented images are simultaneously input into the network; features are extracted from the labeled images and the cross-entropy is computed against their labels, the consistency of the network outputs is measured between the unlabeled and augmented images, and both terms participate in the gradient computation.
7. The semi-supervised classification method for the feature-mixed image as claimed in claim 1, wherein in step 5, the network outputs class probability indicating the class source of the mixed features to the multi-feature mixed image which is difficult to classify, and obtains the classification result of the mixed image.
CN202110599001.2A 2021-05-31 2021-05-31 Semi-supervised classification method for feature mixed image Pending CN113420786A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110599001.2A CN113420786A (en) 2021-05-31 2021-05-31 Semi-supervised classification method for feature mixed image


Publications (1)

Publication Number Publication Date
CN113420786A true CN113420786A (en) 2021-09-21

Family

ID=77713336

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110599001.2A Pending CN113420786A (en) 2021-05-31 2021-05-31 Semi-supervised classification method for feature mixed image

Country Status (1)

Country Link
CN (1) CN113420786A (en)



Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111222648A (en) * 2020-01-15 2020-06-02 深圳前海微众银行股份有限公司 Semi-supervised machine learning optimization method, device, equipment and storage medium
CN111695466A (en) * 2020-06-01 2020-09-22 西安电子科技大学 Semi-supervised polarization SAR terrain classification method based on feature mixup
CN111723209A (en) * 2020-06-28 2020-09-29 上海携旅信息技术有限公司 Semi-supervised text classification model training method, text classification method, system, device and medium
CN112381148A (en) * 2020-11-17 2021-02-19 华南理工大学 Semi-supervised image classification method based on random regional interpolation

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
VIKAS VERMA,ALEX LAMB,ET AL.: "Interpolation Consistency Training for Semi-Supervised Learning", 《ARXIV》 *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115272777A (en) * 2022-09-26 2022-11-01 山东大学 Semi-supervised image analysis method for power transmission scene
CN115272777B (en) * 2022-09-26 2022-12-23 山东大学 Semi-supervised image analysis method for power transmission scene


Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20210921