CN113919491A - Method for assisting in training classification network by means of heatmap - Google Patents
- Publication number
- CN113919491A (application CN202010662452.1A)
- Authority
- CN
- China
- Prior art keywords
- network
- heatmap
- training
- classification
- picture
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
Abstract
The invention provides a method for assisting in training a classification network by means of a heatmap, comprising the following steps: step 1, the network splits into two branches at its midpoint, where the left branch uses a fully connected network to predict the classification result; step 2, the right branch is designed as a network used only during training; step 3, a heatmap is created for the training-only right branch by marking the positions of common feature regions in the sample picture and mapping them onto the heatmap; step 4, output-layer labels are created for the left-branch network; step 5, the network is trained after the heatmap loss and the output-layer loss are weighted and summed; because the first-half network before the branch point is shared by the heatmap network and the classification network, the image features learned through the heatmap benefit the classification network; and step 6, when the trained network is used for inference, the training-only network is removed. The invention provides a novel deep-learning training method that improves the generalization ability of a classification network when training samples are limited.
Description
Technical Field
The invention relates to the technical field of convolutional neural networks, and in particular to a method for assisting in training a classification network by means of a heatmap.
Background
Today, deep learning with convolutional neural networks is applied ever more widely. Existing deep-learning convolutional neural networks include classification networks, which contain fully connected layers, and fully convolutional networks for image segmentation, which do not.
However, training existing deep neural networks depends heavily on the training samples; when samples are scarce, overfitting occurs easily and the model generalizes poorly.
Common terminology in the prior art is as follows:
1. Convolutional neural network: a feature extractor composed of convolutional layers and pooling layers.
2. Heatmap: a heat map; here, a manually calibrated picture that corresponds to a training sample.
3. Fully connected layer: a layer in which each neuron is connected to all neurons of the previous layer.
Disclosure of Invention
To solve the above problems, the present invention aims to provide a new deep-learning training method that improves the generalization ability of a classification network when training samples are limited.
Specifically, the invention provides a method for assisting in training a classification network by means of a heatmap, comprising the following steps:
step 1, the network splits into two branches at its midpoint, where the left branch uses a fully connected network to predict the classification result;
step 2, the right branch is designed as a network used only during training;
step 3, a heatmap is created for the training-only right branch by marking the positions of common feature regions in the sample picture and mapping them onto the heatmap;
step 4, output-layer labels are created for the left-branch network;
step 5, the network is trained after the heatmap loss and the output-layer loss are weighted and summed; because the first-half network before the branch point is shared by the heatmap network and the classification network, the image features learned through the heatmap benefit the classification network;
and step 6, when the trained network is used for inference, the training-only network is removed, saving computation.
The heatmap network is the right branch after the split and the classification network is the left branch. The output of the first-half network serves as the input to both of these later networks. The two branches are parallel, independent of each other, and do not affect each other directly; during learning, however, they jointly influence the weight data of the shared first-half network.
The method is characterized in that, by manually judging and calibrating the image features to be learned, a heatmap corresponding to each training sample is generated and used to assist training.
The fully connected network of the left branch in step 1 further comprises: an input layer, a C1 convolutional layer, an S2 pooling layer, a C3 convolutional layer, an S4 pooling layer, a C5 convolutional layer, an F6 fully connected layer, and an output layer.
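The layer chain above follows the classic LeNet-5 pattern. As a rough illustration of how the spatial size shrinks through that chain (the 32x32 input, 5x5 kernels, and 2x2 pooling windows are assumptions; the patent does not specify hyperparameters):

```python
def conv_out(size, kernel=5, stride=1):
    # Valid convolution output size: (size - kernel) / stride + 1
    return (size - kernel) // stride + 1

def pool_out(size, window=2):
    # Non-overlapping pooling divides the spatial size by the window
    return size // window

s = 32            # input resolution (LeNet-style 32x32, assumed)
s = conv_out(s)   # C1 convolution -> 28
s = pool_out(s)   # S2 pooling    -> 14
s = conv_out(s)   # C3 convolution -> 10
s = pool_out(s)   # S4 pooling    -> 5
s = conv_out(s)   # C5 convolution -> 1 (a 1x1 map, i.e. a feature vector for F6 and the output layer)
print(s)
```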
In step 2, the right branch predicts the heatmap of the picture using upsampling and convolutional layers; the heatmap corresponds to the training sample picture.
The heatmap corresponds to the training sample picture, and its size is a 1:1 mapping or a proportional mapping at whatever ratio is actually required.
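A minimal sketch of the right branch's upsampling stage, using nearest-neighbour repetition to restore the 1:1 correspondence with the input (the patent does not fix the upsampling method; the 4x4 feature map and factor 8 are illustrative):

```python
import numpy as np

def upsample_nearest(fmap, factor):
    # Repeat rows and columns: a minimal stand-in for the
    # "upsampling" stage of the training-only right branch.
    return fmap.repeat(factor, axis=0).repeat(factor, axis=1)

feat = np.arange(16, dtype=np.float32).reshape(4, 4)  # low-resolution feature map
pred_heatmap = upsample_nearest(feat, 8)              # back to 32x32, matching an assumed input size
print(pred_heatmap.shape)
```

In practice a learned upsampling (e.g. transposed convolutions) followed by convolutions would be used, but the shape bookkeeping is the same.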
Step 3 further comprises:
The classification network detects whether a specific region exists in the picture. The edge information of that region is the important information, while the background and all content other than the region's edge is ignorable. The region along the edge is therefore defined as the common feature region, the pattern feature the neural network needs to learn. Accordingly, the edge region of the specific region in the sample picture is mapped onto the heatmap, highlighting the area that contains the important information; if the picture contains no such region, the heatmap is an all-black picture.
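The labeling rule above can be sketched as follows. `make_heatmap_label` is a hypothetical helper (the name, the rectangular region, and the band width are assumptions): it whitens a band along the region's edge and returns an all-black map when the region is absent.

```python
import numpy as np

def make_heatmap_label(size, edge_box=None, band=2):
    # Hypothetical labeling helper: highlight a band along the edge of a
    # rectangular region, or return an all-black map when no region exists.
    hm = np.zeros((size, size), dtype=np.float32)
    if edge_box is None:
        return hm                      # no specific region -> all-black heatmap
    top, left, bottom, right = edge_box
    hm[top:bottom, left:right] = 1.0                       # fill the region...
    hm[top+band:bottom-band, left+band:right-band] = 0.0   # ...then hollow the interior,
    return hm                                              # leaving only the edge band lit

hm = make_heatmap_label(32, edge_box=(4, 4, 28, 28))   # picture with a region
empty = make_heatmap_label(32)                         # picture without one
print(hm.sum(), empty.sum())
```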
Making the output-layer labels in step 4 further comprises:
judging whether each picture contains the specific region; pictures that contain it are classified into one class, and pictures that do not are classified into another.
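The rule above reduces to a binary label per picture; a trivial sketch (the helper name is hypothetical):

```python
def make_class_label(contains_region):
    # Pictures with the specific region form one class (1),
    # pictures without it form the other class (0).
    return 1 if contains_region else 0

labels = [make_class_label(c) for c in (True, False, True)]
print(labels)
```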
The advantages of the present application are therefore: by manually judging and calibrating the image features to be learned, a heatmap corresponding to each training sample is generated and used to assist training, effectively improving the generalization ability of the classification network when training samples are scarce. An auxiliary feature-learning network is added during training to help the network learn image features, and the corresponding computation is removed during inference, saving computation and improving efficiency.
Drawings
The accompanying drawings, which are included to provide a further understanding of the invention and are incorporated in and constitute a part of this application, illustrate embodiment(s) of the invention and together with the description serve to explain the principles of the invention.
FIG. 1 is a schematic flow diagram of the process of the present invention.
Fig. 2 is a schematic diagram of a network structure to which the method of the present invention is applied.
Fig. 3 is a picture example of one case in step 3 of the specific embodiment.
Fig. 4 is a picture example of another case in step 3 of the embodiment.
Detailed Description
So that the technical content and advantages of the present invention can be understood more clearly, the invention is now described in further detail with reference to the accompanying drawings.
As shown in fig. 1, the present invention relates to a method for assisting in training a classification network by means of a heatmap, the method comprising the following steps:
step 1, the network splits into two branches at its midpoint, where the left branch uses a fully connected network to predict the classification result;
step 2, the right branch is designed as a network used only during training;
step 3, a heatmap is created for the training-only right branch by marking the positions of common feature regions in the sample picture and mapping them onto the heatmap;
step 4, output-layer labels are created for the left-branch network;
step 5, the network is trained after the heatmap loss and the output-layer loss are weighted and summed; because the first-half network before the branch point is shared by the heatmap network and the classification network, the image features learned through the heatmap benefit the classification network;
and step 6, when the trained network is used for inference, the training-only network is removed.
As shown in fig. 2, in a network structure applying the method, the network branches in the middle: the left branch uses a fully connected network to predict the classification result, while the right branch uses upsampling and convolutional layers to predict the heatmap of the picture. The heatmap corresponds to the training sample picture, and its size can be a 1:1 mapping or a proportional mapping at another ratio.
Further, in step 3 a heatmap is created by marking the positions of the common feature regions in the sample picture and mapping them onto the heatmap. As shown in fig. 3, suppose the classification network detects whether there is a sheet of paper in the picture. The edge information of the paper is the important information, while the background and the person within the paper are ignorable. The region along the paper's edge is defined as the common feature region, the pattern feature the neural network needs to learn, so the paper-edge region in the left sample picture of fig. 3 is mapped onto the heatmap shown in the right half of fig. 3, highlighting the area that contains the important information. If there is no paper in the picture, the heatmap is an all-black picture, as shown in fig. 4.
Further, in step 4 the output-layer labels are made: for example, when judging whether a picture contains a sheet of paper, pictures containing paper form one class and pictures without paper form another.
Further, in step 5 the network is trained after the heatmap loss and the output-layer loss are weighted and summed. Because the first half, the heatmap network, and the classification network share weights, the image features learned through the heatmap benefit the classification network, keep the network from learning irrelevant features, and strengthen its generalization ability.
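Step 5's weighted loss addition can be sketched as below. MSE for the heatmap branch and binary cross-entropy for the classification branch are assumed loss choices, as are the 0.5/0.5 weights; the patent only states that the two losses are weighted and summed.

```python
import numpy as np

def combined_loss(hm_pred, hm_label, cls_pred, cls_label, w_hm=0.5, w_cls=0.5):
    # Heatmap branch: pixel-wise mean squared error (assumed choice)
    hm_loss = np.mean((hm_pred - hm_label) ** 2)
    # Classification branch: binary cross-entropy (assumed choice)
    eps = 1e-7
    cls_loss = -np.mean(cls_label * np.log(cls_pred + eps)
                        + (1 - cls_label) * np.log(1 - cls_pred + eps))
    # Weighted sum: gradients from both terms reach the shared first-half weights
    return w_hm * hm_loss + w_cls * cls_loss

hm = np.zeros((8, 8), dtype=np.float32)
perfect = combined_loss(hm, hm, np.array([0.999]), np.array([1.0]))   # near-perfect predictions
noisy = combined_loss(hm + 0.5, hm, np.array([0.6]), np.array([1.0])) # poor predictions
print(perfect, noisy)
```

In a real training loop the returned scalar would be backpropagated through both branches, which is how the heatmap supervision shapes the shared weights.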
Finally, in step 6, when the trained network is used for inference, the training-only network in fig. 2 is removed, saving computation.
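Structurally, the removal in step 6 amounts to calling only the shared first half and the left branch at inference time; a toy sketch with trivial lambdas standing in for the three sub-networks (all names and arithmetic are illustrative):

```python
def forward(x, backbone, classifier, heatmap_head=None, training=False):
    # The shared first-half features feed both branches during training;
    # at inference the heatmap head is dropped entirely, saving compute.
    feat = backbone(x)
    if training and heatmap_head is not None:
        return classifier(feat), heatmap_head(feat)
    return classifier(feat)

backbone = lambda x: x * 2      # stand-in for the shared first-half network
classifier = lambda f: f + 1    # stand-in for the left (classification) branch
heatmap_head = lambda f: f - 1  # stand-in for the right (training-only) branch

print(forward(3, backbone, classifier, heatmap_head, training=True))  # both outputs
print(forward(3, backbone, classifier))                               # left branch only
```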
The above description is only a preferred embodiment of the present invention and is not intended to limit it; those skilled in the art may make various modifications and changes to the embodiment. Any modification, equivalent replacement, or improvement made within the spirit and principles of the present invention shall fall within its scope of protection.
Claims (8)
1. A method for assisting in training a classification network with a heatmap, the method comprising the steps of:
step 1, the network splits into two branches at its midpoint, where the left branch uses a fully connected network to predict the classification result;
step 2, the right branch is designed as a network used only during training;
step 3, a heatmap is created for the training-only right branch by marking the positions of common feature regions in the sample picture and mapping them onto the heatmap;
step 4, output-layer labels are created for the left-branch network;
step 5, the network is trained after the heatmap loss and the output-layer loss are weighted and summed; because the first-half network before the branch point is shared by the heatmap network and the classification network, the image features learned through the heatmap benefit the classification network;
and step 6, when the trained network is used for inference, the training-only network is removed.
2. The method for assisting in training a classification network with a heatmap as claimed in claim 1, wherein, by manually judging and calibrating the image features to be learned, a heatmap corresponding to each training sample is generated and used to assist training.
3. The method of claim 1, wherein the fully connected network of the left branch in step 1 further comprises: an input layer, a C1 convolutional layer, an S2 pooling layer, a C3 convolutional layer, an S4 pooling layer, a C5 convolutional layer, an F6 fully connected layer, and an output layer.
4. The method for assisting in training a classification network with a heatmap as claimed in claim 1, wherein in step 2 the right branch predicts the heatmap of the picture using upsampling and convolutional layers, and the heatmap corresponds to the training sample picture.
5. The method as claimed in claim 4, wherein the heatmap corresponds to the training sample picture, and its size is a 1:1 mapping or a proportional mapping at the ratio actually required.
6. The method for assisting in training a classification network with a heatmap as claimed in claim 1, wherein said step 3 further comprises:
the classification network detects whether a specific region exists in the picture; the edge information of the specific region is the important information, while the background and all content other than the region's edge is ignorable; the region along the edge of the specific region is therefore defined as the common feature region, the pattern feature the neural network needs to learn; accordingly, the edge region of the specific region in the sample picture is mapped onto the heatmap, highlighting the area containing the important information, and if the picture contains no specific region, the heatmap is an all-black picture.
7. The method of claim 1, wherein the step 4 of creating the output-layer labels further comprises:
judging whether a picture contains the specific region; pictures that contain the specific region are classified into one class, and pictures that do not are classified into another.
8. The method as claimed in claim 1, wherein the heatmap network is the right branch after the split and the classification network is the left branch after the split; the output of the first-half network serves as the input to both the heatmap network and the classification network; the two branches are parallel, independent of each other, and do not affect each other directly; and during learning they jointly influence the weight data of the first-half network.
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202010662452.1A (CN113919491A) | 2020-07-10 | 2020-07-10 | Method for assisting in training classification network by means of heatmap |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| CN113919491A | 2022-01-11 |
Family
ID=79232237
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN202010662452.1A (CN113919491A, pending) | Method for assisting in training classification network by means of heatmap | 2020-07-10 | 2020-07-10 |
Country Status (1)
| Country | Link |
|---|---|
| CN | CN113919491A (en) |
Citations (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN107610141A * | 2017-09-05 | 2018-01-19 | South China University of Technology | A remote sensing image semantic segmentation method based on deep learning |
| CN110245721A * | 2019-06-25 | 2019-09-17 | Shenzhen Tencent Computer *** Co., Ltd. | Training method and device for a neural network model, and electronic equipment |
| EP3547211A1 * | 2018-03-30 | 2019-10-02 | Naver Corporation | Methods for training a CNN and classifying an action performed by a subject in an inputted video using said CNN |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| | PB01 | Publication | |
| | SE01 | Entry into force of request for substantive examination | |