CN111160474B

CN111160474B - Image recognition method based on deep course learning

Info

Publication number: CN111160474B
Application number: CN201911401914.8A
Authority: CN
Inventors: 胡珍珍; 秦伟; 刘祥龙; 洪日昌; 汪萌
Original assignee: Beihang University; Hefei University of Technology
Current assignee: Beihang University; Hefei University of Technology
Priority date: 2019-12-30
Filing date: 2019-12-30
Publication date: 2023-08-29
Anticipated expiration: 2039-12-30
Also published as: CN111160474A

Abstract

The invention discloses an image recognition method based on deep course learning, which belongs to the field of image recognition and comprises the following steps: constructing teacher and student networks based on the deep convolutional neural network; training the teacher network by using the training samples to carry out image classification training, and predicting the probability that the training samples belong to each category; calculating the difference between the prediction and the label of the teacher network to update the parameters; transmitting the forecast information to a student network; training a student network; guiding the student network training by using the prediction information result of the teacher network; calculating a difference update parameter between a student network prediction result and a label; completing student network classification training; the training student network realizes the identification and classification of the images. The invention simulates the process of easy-to-get-difficult learning of human beings, has reasonable training process, greatly reduced workload and quick network parameter updating, and has higher prediction precision and more reliable and stable performance due to the influence of gradient difference unbalance samples generated by different samples.

Description

Image recognition method based on deep course learning

Technical Field

The invention relates to the technical field of image recognition, in particular to an image recognition method based on deep course learning.

Background

Due to the large increase in computational power and the increase in training data, the training conditions of the deep neural network are satisfied, and due to the strong fitting capability, the deep neural network achieves the best effect in many tasks. However, since parameters of the deep neural network are updated based on a large number of samples, the samples play a critical role in the training process. Therefore, the training work intensity is high, the training process is complex, and the parameter updating of the deep neural network is slow.

Meanwhile, the deep convolutional neural network calculates the gradient of each parameter according to the prediction result of the training sample and the correctly classified labels by the deep convolutional neural network, and then the generated gradient is used for updating the network parameters. In the prior art, the difference between the predicted result and the correct labeling is large, so that the deep convolutional neural network is punished, and the true labeling result cannot be fitted.

Therefore, an image recognition method which is simple in training process, fast in parameter updating of the deep neural network and capable of fitting the real labels of the images is urgently needed.

Disclosure of Invention

The invention aims to provide an image recognition method which has simple training process, fast parameter updating of a deep neural network and capability of fitting the real annotation of an image, and the invention comprises the following steps:

an image recognition method based on deep course learning comprises the following steps:

s10, constructing a teacher network and a student network based on a deep convolutional neural network;

s20, performing image recognition and classification training on the teacher network by using a training sample; the teacher network learns the training samples and predicts the probability that the training samples belong to each category;

s30, calculating the difference between the prediction information result of the teacher network and the training sample label by using a cross entropy loss function, and updating the parameters of the teacher network;

s40, transmitting a predicted information result of the training sample to the student network by the teacher network;

s50, performing image classification training on the student network by using the same training sample;

s60, guiding the student network to train by taking a predicted information result of the teacher network on the training sample as a judging basis of difficulty level;

s70, calculating the difference between the student network classification prediction result and the training sample label by using a BLCL loss function, and updating parameters of the student network;

s80, completing the student network classification training;

and S90, based on the training-completed student network, the identification and classification of the images are realized.

Further, the step S20 of predicting the probability of belonging to each category by the teacher network is: the deep convolutional neural network of the teacher network processes the pictures of the training samples into visual features, generates scoring of corresponding classifications through global pooling and a full-connection network, synthesizes the scoring of different classifications by using a softmax function, converts the scoring into the probability of classification, and predicts the probability that the pictures belong to each classification; the softmax function is

Wherein x represents the classifying target picture, Z _k A score representing the picture being classified into the kth category.

Further, the step S30 specifically includes: after the deep convolutional neural network is classified, calculating the difference between the prediction probability and the real label by using a cross entropy loss function, and calculating the gradient size and the gradient direction which are needed to be updated for each parameter in the deep convolutional neural network from the difference by using a back propagation algorithm; calculating the loss of a whole batch of pictures and the corresponding gradient each time by adopting a batch gradient random descent algorithm, then updating the loss of the whole batch of pictures and the corresponding gradient, and updating parameters; the loss function is

Where N is the number of samples of the batch data, x _i To attempt to classify the sample, y _i For the corresponding label, θ _t For the teacher's network to be in the training process,is a cross entropy loss function.

Further, the BLCL loss function in step S70 is

Wherein x is _i In order to train the sample,the prediction result of the teacher network is that the student network judges the difficulty of the sample by using the prediction information of the teacher network and carries out adaptive weight on the sample, and L is a cross entropy loss function or an L2 loss function.

The invention has the beneficial effects that:

1. the invention simulates the process of easy-to-get-difficult human learning, firstly learns easy-to-classify samples, then learns difficult samples, has reasonable training process and greatly reduces training workload.

2. The training of the deep convolutional neural network is based on a random gradient descent method, and in each training round, gradients are calculated according to loss generated by the prediction result and the distance of the correct label, and then the generated gradients are used for updating network parameters, so that the network parameter updating speed is high.

3. In the training process of the deep neural convolutional network, loss and gradient sizes generated by different samples are monitored in real time, and then the influence of the samples is balanced according to gradient differences generated by the different samples. Compared with the original unbalanced training mode, the method has higher prediction precision and more reliable and stable performance.

Drawings

FIG. 1 is a flow chart of an image recognition method based on deep course learning

Detailed Description

In this embodiment, given an image dataset such as CIFAR10, CIFAR100 uses different deep neural network models to verify the effectiveness of BLCL learning.

Each data set is divided into a training set and a testing set, and training data is trained in batches.

The method comprises the following specific steps:

s10, constructing a teacher network and a student network based on a deep convolutional neural network; convolutional neural network this embodiment employs ResNet-8, resNet-20, resNet-32 and DensNet-BC.

S20, performing image recognition and classification training on a teacher network by using training samples; the teacher network learns the training samples and predicts the probability that the training samples belong to each category;

the training sample is CIFAR10 and CIFAR100 data set, the training set comprises 50,000 pictures, the test set comprises 10,000 pictures, and each training picture is subjected to data disturbance enhancement.

The teacher network predicts the probability of belonging to each category as follows: the deep convolutional neural network of the teacher network processes the pictures of the training samples into visual features, generates scoring corresponding to the classification through global pooling and a fully connected network, synthesizes the scoring of different classes by using a softmax function, converts the scoring into the probability of classification, and predicts the probability that the pictures belong to each class; the softmax function is

Wherein x represents the classifying target picture, Z _k Scoring representing the pictures being classified into a kth category

S30, calculating the difference between the prediction information result of the teacher network and the training sample label by using the cross entropy loss function, and updating the parameters of the teacher network;

after the deep convolutional neural network is classified, calculating the difference between the prediction probability and the real label by using a cross entropy loss function, and calculating the gradient size and the gradient direction which are needed to be updated for each parameter in the deep convolutional neural network from the difference by using a back propagation algorithm; in order to increase the efficiency and stability of network parameter updating, a batch gradient random descent algorithm is adopted, the loss and the corresponding gradient of a whole batch of pictures are calculated each time, then the loss and the corresponding gradient of the whole batch of pictures in updating are carried out, and parameter updating is carried out; the loss function is

Where N is the number of samples of the batch data, x _i To attempt to classify the sample, y _i For the corresponding label, θ _t For the teacher's network to be in the training process,is a cross entropy loss function;

the batch size of each batch of data was 128, optimized by random gradient descent.

S40, transmitting a predicted information result of the training sample to a student network by the teacher network;

s60, guiding the student to train by taking the prediction information result of the teacher network on the training sample as a judgment basis of the difficulty level;

s70, calculating the difference between the classification prediction result of the student network and the training sample label by using the BLCL loss function, and updating parameters of the student network;

BLCL loss function of

S80, finishing student network classification training;

The implementation process of the embodiment is as follows: firstly, training set data are sent into a teacher network model, the difference between a prediction result and a label is calculated by using a cross entropy loss function, and parameters of the teacher network model are updated. And then, taking the result of the teacher network model prediction as a judgment basis of the sample difficulty level to guide the training of the student network model. And the same batch of training data is sent into the student network model to calculate the difference between the prediction result and the label according to the BLCL loss function to update the parameters of the student model according to the judgment of the teacher network model, so that the weight of the sample easy to classify is increased, and the weight of the sample difficult to classify is reduced. And the student classification model is used as a training result, and finally the student classification network model can realize image recognition.

The foregoing description is only a preferred embodiment of the present invention, and is not intended to limit the technical scope of the present invention, so any minor modifications, equivalent changes and modifications made to the above embodiments according to the technical principles of the present invention still fall within the scope of the technical solutions of the present invention.

Claims

1. An image recognition method based on deep course learning is characterized by comprising the following steps:

s80, completing the student network classification training;

s90, based on the training-completed student network, identifying and classifying images;

the step S20 of predicting the probability of belonging to each category by the teacher network includes: the deep convolutional neural network of the teacher network processes the pictures of the training samples into visual features, generates scoring of corresponding classifications through global pooling and a full-connection network, synthesizes the scoring of different classifications by using a softmax function, converts the scoring into the probability of classification, and predicts the probability that the pictures belong to each classification; the softmax function is

Wherein x represents the classifying target picture, Z _k A score representing the picture being classified into a kth category;

the training samples are CIFAR10 and CIFAR100 data sets, wherein the training set comprises 50,000 pictures, the test set comprises 10,000 pictures, and each training picture is subjected to data disturbance enhancement;

the BLCL loss function in step S70 is

2. The image recognition method according to claim 1, wherein the step S30 specifically comprises the steps of: after the deep convolutional neural network is classified, calculating the difference between the prediction probability and the real label by using a cross entropy loss function, and calculating the gradient size and the gradient direction which are needed to be updated for each parameter in the deep convolutional neural network from the difference by using a back propagation algorithm; calculating the loss of a whole batch of pictures and the corresponding gradient each time by adopting a batch gradient random descent algorithm, then updating the loss of the whole batch of pictures and the corresponding gradient, and updating parameters; the loss function is