CN111091160A - Image classification method - Google Patents

Image classification method Download PDF

Info

Publication number
CN111091160A
CN111091160A CN201911376892.4A
Authority
CN
China
Prior art keywords
visibility
image
prediction algorithm
algorithm
key point
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201911376892.4A
Other languages
Chinese (zh)
Other versions
CN111091160B (en
Inventor
刘洪淼
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Milaiwu Network Technology Co ltd
Original Assignee
Beijing Milaiwu Network Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Milaiwu Network Technology Co ltd filed Critical Beijing Milaiwu Network Technology Co ltd
Priority to CN201911376892.4A priority Critical patent/CN111091160B/en
Publication of CN111091160A publication Critical patent/CN111091160A/en
Application granted granted Critical
Publication of CN111091160B publication Critical patent/CN111091160B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/22Matching criteria, e.g. proximity measures
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/40Extraction of image or video features
    • G06V10/44Local feature extraction by analysis of parts of the pattern, e.g. by detecting edges, contours, loops, corners, strokes or intersections; Connectivity analysis, e.g. of connected components
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • General Physics & Mathematics (AREA)
  • Physics & Mathematics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • General Engineering & Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Multimedia (AREA)
  • Human Computer Interaction (AREA)
  • Image Analysis (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention provides an image classification method comprising the following steps: marking the positions and visibility of preset key points in a training image; determining a key point position prediction algorithm according to the marked positions; determining a visibility prediction algorithm according to the marked visibility; acquiring global features of an image to be classified; predicting the positions of the preset key points of the image to be classified according to the key point position prediction algorithm; predicting the visibility of the preset key points of the image to be classified according to the visibility prediction algorithm; extracting local features according to the predicted positions and visibility of the preset key points; and predicting a corresponding classification result according to the result of splicing the global features and the local features. The image classification method can classify pictures according to the overall impression of the people they depict.

Description

Image classification method
Technical Field
The embodiment of the invention relates to the technical field of image classification, in particular to an image classification method.
Background
Image classification is an image processing method that distinguishes objects of different categories according to the different characteristics each reflects in the image information. It uses a computer to analyze the image quantitatively and assigns the whole image to one of several categories, replacing human visual interpretation.
Existing image classification can be realized with a neural network, but such a method may use the background region of the picture as its main distinguishing information, which reduces accuracy. Alternatively, classification can be realized through the geometric features of the human face, but such a method mainly targets facial attractiveness and does not consider overall impressions such as clothing style and temperament characteristics; moreover, because of differences in shooting angle, human posture and the like, it involves a large amount of computation and requires hand-designed features to be introduced.
Disclosure of Invention
The embodiment of the invention aims to provide an image classification method which can classify pictures according to the overall impression of people in the pictures.
To this end, the embodiment of the invention adopts the following technical scheme:
the embodiment of the invention provides an image classification method, which comprises the following steps:
marking the position and visibility of a preset key point of a training image;
determining a key point position prediction algorithm according to the position of a preset key point;
determining a visibility prediction algorithm according to the visibility of the key points;
acquiring global features of an image to be classified;
predicting the position of the preset key point of the image to be classified according to the key point position prediction algorithm;
predicting the visibility of the preset key points of the image to be classified according to a visibility prediction algorithm;
extracting local features according to the position and the visibility of the preset key points;
and predicting a corresponding classification result according to the result of splicing the global feature and the local feature.
Further, before determining the key point position prediction algorithm according to the positions of the marked preset key points, the method further includes:
marking the class to which the training image belongs;
determining an image classification algorithm according to the category and the similarity of the training images;
correspondingly, predicting the corresponding classification result according to the result obtained by splicing the global feature and the local feature comprises the following steps:
and predicting a corresponding classification result according to the result of splicing the global feature and the local feature and the image classification algorithm.
Further, before obtaining the global features of the image to be classified, the method further includes:
and processing the images to be classified through a full convolution network algorithm.
Further, the preset key points include the center of the line connecting the two eyes, the left shoulder, the right shoulder and the center of the chest of the person in the image.
Further, after the corresponding classification result is predicted according to the spliced global and local features and the image classification algorithm:
if the training loss of the positions and visibility of the preset key points of the image to be classified is obtained, then in a first step, the coefficient used to train the key point position prediction algorithm and the visibility prediction algorithm is increased while the coefficient used to train the image classification algorithm is reduced or kept unchanged;
in a second step, the coefficient used to train the image classification algorithm is increased while the coefficient used to train the key point position prediction algorithm and the visibility prediction algorithm is reduced or kept unchanged;
the first step and the second step may be alternately repeated.
Further, after the corresponding classification result is predicted according to the spliced global and local features and the image classification algorithm:
if the position of the preset key point of the image to be classified and the training loss of the visibility are obtained, increasing the types of the loss to train the key point position prediction algorithm, the visibility prediction algorithm and the image classification algorithm;
or reducing the loss category to train the keypoint location prediction algorithm, the visibility prediction algorithm and the image classification algorithm;
or changing the loss category to train the keypoint location prediction algorithm, the visibility prediction algorithm and the image classification algorithm.
Further, determining the key point position prediction algorithm according to the positions of the marked preset key points comprises:
determining the key point position prediction algorithm through a mean square error loss function according to the positions of the marked preset key points.
Further, determining a visibility prediction algorithm according to the visibility of the key point includes:
and determining a visibility prediction algorithm through a cross entropy loss function according to the visibility of the key points.
Further, the global feature and the local feature stitching includes:
converting the global features and the local features into one-dimensional data respectively;
and directly splicing the one-dimensional data.
The embodiment of the invention has the beneficial effects that:
according to the embodiment of the invention, the prediction algorithm is obtained by directly regressing the positions of the key points, local features are extracted in a certain range of the predicted positions of the key points, the local features comprise clothing style and gas quality features, and are combined with the global features of the whole image for classification, so that the classification accuracy can be improved, and the calculation cost can be reduced.
Drawings
Fig. 1 is a flowchart illustrating an image classification method according to an embodiment of the present invention.
Fig. 2 is a flowchart illustrating an image classification method according to a second embodiment of the present invention.
Detailed Description
To make the technical problems solved, the technical solutions adopted and the technical effects achieved by the present invention clearer, the technical solutions of the embodiments of the present invention are described in further detail below with reference to the accompanying drawings. Obviously, the described embodiments are only a part of the embodiments of the present invention, not all of them.
Example one
This embodiment provides an image classification method that takes clothing style and temperament characteristics into consideration, improving classification accuracy, and uses features at key points as the basis for judgment, reducing the amount and cost of computation.
Fig. 1 is a flowchart illustrating an image classification method according to an embodiment of the present invention. As shown in fig. 1, the image classification method includes the steps of:
and S11, marking the position and the visibility of the preset key point of the training image.
Specifically, the person in the training image includes at least a body above the abdomen.
In this embodiment, the preset key points include the center of the line connecting the two eyes, the left shoulder, the right shoulder and the center of the chest of the person in the image. These preset key points are marked in the training image, and visibility is marked according to whether each preset key point is occluded or not.
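As a concrete illustration, a single training-image annotation for step S11 might be stored as below; the key point names, coordinate values and dictionary layout are assumptions of this sketch, not specified by the patent:

```python
# Hypothetical annotation for one training image: each of the four
# preset key points gets an (x, y) position and a visibility flag
# (False when the key point is occluded).
annotation = {
    "eye_line_center": {"pos": (101, 49), "visible": True},
    "left_shoulder":   {"pos": (80, 121), "visible": True},
    "right_shoulder":  {"pos": (119, 120), "visible": True},
    "chest_center":    {"pos": (100, 152), "visible": False},  # occluded
}
```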
And S12, determining a key point position prediction algorithm according to the position of the preset key point.
Specifically, a key point position prediction algorithm is determined through a mean square error loss function according to the position of a preset key point corresponding mark.
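A minimal sketch of the mean squared error over the four key point coordinates follows; the coordinate values are illustrative, and the network producing the predictions is omitted:

```python
import numpy as np

def keypoint_mse_loss(pred, target):
    """Mean squared error between predicted and marked keypoint
    coordinates; pred and target are (K, 2) arrays of (x, y)."""
    pred = np.asarray(pred, dtype=float)
    target = np.asarray(target, dtype=float)
    return float(np.mean((pred - target) ** 2))

# Four keypoints: eye-line center, left shoulder, right shoulder, chest center.
pred = [[100.0, 50.0], [80.0, 120.0], [120.0, 120.0], [100.0, 150.0]]
true = [[101.0, 49.0], [80.0, 121.0], [119.0, 120.0], [100.0, 152.0]]
loss = keypoint_mse_loss(pred, true)
```

Minimizing this loss over the training set yields the key point position prediction algorithm.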
S13, determining a visibility prediction algorithm according to the visibility of the key points.
Specifically, a visibility prediction algorithm is determined by a cross entropy loss function according to the visibility of the key points.
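The visibility branch can likewise be sketched as a binary cross-entropy between predicted visibility probabilities and the 0/1 visibility labels; the probability values below are illustrative:

```python
import numpy as np

def visibility_ce_loss(probs, labels, eps=1e-12):
    """Binary cross-entropy between predicted visibility probabilities
    and 0/1 visibility labels for the K preset keypoints; eps guards
    against log(0)."""
    p = np.clip(np.asarray(probs, dtype=float), eps, 1.0 - eps)
    y = np.asarray(labels, dtype=float)
    return float(np.mean(-(y * np.log(p) + (1 - y) * np.log(1 - p))))

# e.g. the chest center is occluded (label 0), the rest visible (label 1).
probs = [0.9, 0.8, 0.95, 0.2]
labels = [1, 1, 1, 0]
ce = visibility_ce_loss(probs, labels)
```

More confident correct predictions drive this loss toward zero.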
And S14, processing the image to be classified through a full convolution network algorithm. The processed feature map is provided for extracting global features, local features and predicting the positions and visibility of key points, and the feature extraction from the original image is avoided, so that the calculation amount is reduced.
And S15, acquiring the global features of the image to be classified. And acquiring the global features of the image from the image processed by the full convolution network algorithm, and using the global features as the basis of input data of subsequent image classification.
S16, predicting the positions of the preset key points of the image to be classified according to the key point position prediction algorithm. The key point position prediction algorithm locates the center of the line connecting the two eyes, the left shoulder, the right shoulder and the center of the chest of the person in the image to be classified, as a precondition for subsequently extracting local features.
S17, predicting the visibility of the preset key points of the image to be classified according to a visibility prediction algorithm. After the positions of the center of the line of the two eyes, the left shoulder, the right shoulder and the center of the chest in the image to be classified are predicted, whether the key points are shielded or not is predicted through a visibility prediction algorithm. Increasing visibility as a consideration can improve the accuracy of classification.
And S18, extracting local features according to the positions and the visibility of the preset key points. The local features of the image are acquired from the feature map produced by the full convolution network algorithm, according to the positions and visibility of the preset key points; the local features cover clothing style factors and temperament feature factors.
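The extraction in S18 can be sketched as cropping a small window around each predicted key point on the feature map and zeroing windows whose key point is predicted invisible; the window size and the zeroing strategy are assumptions of this sketch, not specified by the patent:

```python
import numpy as np

def extract_local_features(feat_map, keypoints, visible, half=1):
    """Crop a (2*half+1) x (2*half+1) window around each keypoint on a
    (H, W, C) feature map; windows at keypoints predicted invisible are
    zeroed so occluded parts do not contribute. Returns a (K, window*C)
    array of flattened local features."""
    H, W, C = feat_map.shape
    out = []
    for (x, y), vis in zip(keypoints, visible):
        x, y = int(round(x)), int(round(y))
        x0, x1 = max(0, x - half), min(W, x + half + 1)
        y0, y1 = max(0, y - half), min(H, y + half + 1)
        patch = np.zeros((2 * half + 1, 2 * half + 1, C))
        if vis:
            patch[: y1 - y0, : x1 - x0] = feat_map[y0:y1, x0:x1]
        out.append(patch.ravel())
    return np.stack(out)

feat = np.random.rand(16, 16, 8)  # stand-in for the FCN feature map
locals_ = extract_local_features(
    feat, [(4, 3), (2, 10), (13, 10), (8, 12)], [True, True, True, False])
```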
And S19, predicting a corresponding classification result according to the result of splicing the global feature and the local feature.
Specifically, the global features and the local features are respectively converted into one-dimensional data, and then the one-dimensional data are directly spliced to be used as input of classification.
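The splicing in S19 can be sketched directly; the feature shapes below are illustrative assumptions:

```python
import numpy as np

# Hypothetical shapes: a pooled global feature map and four local patches.
global_feat = np.random.rand(4, 4, 8)     # (H', W', C) global features
local_feats = np.random.rand(4, 3, 3, 8)  # (K, h, w, C) local features

# Convert each to one-dimensional data, then splice directly:
# the result is the input vector of the classifier.
classifier_input = np.concatenate([global_feat.ravel(), local_feats.ravel()])
```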
According to this embodiment, the prediction algorithms are obtained by directly regressing the positions of the key points, and local features are extracted within a certain range around the predicted key point positions. The local features cover clothing style and temperament characteristics and are combined with the global features of the whole image for classification, which improves classification accuracy and reduces computation cost.
Example two
In this embodiment, based on the above embodiments, the classification method and the method for processing prediction loss are refined, and fig. 2 is a schematic flow chart of the image classification method according to the second embodiment of the present invention. As shown in fig. 2, the image classification method includes the steps of:
and S21, marking the position and the visibility of the preset key point of the training image.
Specifically, the person in the training image includes at least the body above the abdomen. Each key point is given its own distinct label, and its visibility state likewise receives a distinct label; no two labels coincide.
And S22, determining a key point position prediction algorithm according to the position of the preset key point.
S23, determining a visibility prediction algorithm according to the visibility of the key points.
In other embodiments, other functions may be selected to determine the keypoint location prediction algorithm and the visibility prediction algorithm depending on the particular use case.
S24, marking the category to which the training image belongs;
and classifying the images by manually marking labels according to actual requirements, wherein each label has a corresponding category and has corresponding global features and local features.
And S25, determining an image classification algorithm according to the category and the similarity of the training images.
Combining the categories, the local features of the unclassified image are compared with the local features of each labeled image to obtain a local similarity, and the global features of the unclassified image are compared with the global features of each labeled image to obtain a global similarity; the image classification algorithm is then determined by the final similarity, obtained as a weighted sum of the local similarity and the global similarity. Further, the similarity is computed by a similarity calculation function.
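One possible reading of this weighted-sum similarity is sketched below, using cosine similarity and an equal 0.5/0.5 weighting as illustrative assumptions (the patent does not fix either choice):

```python
import numpy as np

def cosine(a, b):
    """Cosine similarity between two feature vectors."""
    a, b = np.asarray(a, dtype=float), np.asarray(b, dtype=float)
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

def final_similarity(g_query, l_query, g_labeled, l_labeled, w_global=0.5):
    """Weighted sum of global and local similarity between an
    unclassified image and a labeled image, per step S25."""
    return (w_global * cosine(g_query, g_labeled)
            + (1 - w_global) * cosine(l_query, l_labeled))

# Identical global and local features give the maximal similarity of 1.
s = final_similarity([1, 0, 1], [0, 1, 1], [1, 0, 1], [0, 1, 1])
```

The unclassified image would then be assigned the category of the labeled image with the highest final similarity.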
And S26, processing the image to be classified through a full convolution network algorithm.
And S27, acquiring the global features of the image to be classified.
S28, predicting the position of the preset key point of the image to be classified according to the key point position prediction algorithm.
S29, predicting the visibility of the preset key points of the image to be classified according to a visibility prediction algorithm.
And S30, extracting local features according to the positions and the visibility of the preset key points.
And S31, predicting a corresponding classification result according to the result of splicing the global feature and the local feature.
Specifically, the corresponding classification result is predicted according to the result of splicing the global feature and the local feature and the image classification algorithm.
In this embodiment, if the training loss of the positions and visibility of the preset key points of the image to be classified is obtained, then in a first step the coefficient used to train the key point position prediction algorithm and the visibility prediction algorithm is increased while the coefficient used to train the image classification algorithm is reduced or kept unchanged;
secondly, increasing coefficients to train the image classification algorithm, and reducing or keeping the coefficients unchanged to train the key point position prediction algorithm and the visibility prediction algorithm;
the first step and the second step may be alternately repeated.
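The alternating two-step scheme above can be sketched as a phase-dependent weighting of the three losses; the concrete coefficients 2.0 and 0.5 are assumptions of this sketch, not taken from the patent:

```python
def total_loss(kp_loss, vis_loss, cls_loss, phase):
    """Illustrative alternating scheme: in phase 1 the keypoint and
    visibility losses get a larger coefficient and the classification
    loss a smaller one; in phase 2 the weighting is reversed."""
    if phase == 1:
        return 2.0 * (kp_loss + vis_loss) + 0.5 * cls_loss
    return 0.5 * (kp_loss + vis_loss) + 2.0 * cls_loss

# Alternate phase 1 and phase 2 across successive training rounds.
losses = [total_loss(1.0, 0.5, 2.0, phase=p) for p in (1, 2, 1, 2)]
```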
In other embodiments, if the training loss of the positions and visibility of the preset key points of the image to be classified is obtained, the key point position prediction algorithm, the visibility prediction algorithm and the image classification algorithm can be trained by increasing the types of loss; or by reducing the loss categories; or by changing the loss categories.
The trained key point position prediction algorithm, visibility prediction algorithm and image classification algorithm are then fed back into steps S28, S29 and S31 respectively for prediction.
In this embodiment, local features are extracted at the predicted key point positions according to their visibility and combined with the global features for classification; the loss of the whole network is then reduced by acquiring and processing the prediction loss, further improving prediction accuracy.
The technical principle of the present invention is described above in connection with specific embodiments. The description is made for the purpose of illustrating the principles of the invention and should not be construed in any way as limiting the scope of the invention. Based on the explanations herein, those skilled in the art will be able to conceive of other embodiments of the present invention without inventive effort, which would fall within the scope of the present invention.

Claims (9)

1. An image classification method, comprising:
marking the position and visibility of a preset key point of a training image;
determining a key point position prediction algorithm according to the position of a preset key point;
determining a visibility prediction algorithm according to the visibility of the key points;
acquiring global features of an image to be classified;
predicting the position of the preset key point of the image to be classified according to the key point position prediction algorithm;
predicting the visibility of the preset key points of the image to be classified according to a visibility prediction algorithm;
extracting local features according to the position and the visibility of the preset key points;
and predicting a corresponding classification result according to the result of splicing the global feature and the local feature.
2. The image classification method according to claim 1, wherein, before determining the key point position prediction algorithm according to the positions of the marked preset key points, the method further comprises:
marking the class to which the training image belongs;
determining an image classification algorithm according to the category and the similarity of the training images;
correspondingly, predicting the corresponding classification result according to the result obtained by splicing the global feature and the local feature comprises the following steps:
and predicting a corresponding classification result according to the result of splicing the global feature and the local feature and the image classification algorithm.
3. The image classification method according to claim 1, wherein, before obtaining the global features of the image to be classified, the method further comprises:
and processing the images to be classified through a full convolution network algorithm.
4. The image classification method according to claim 1, wherein the preset key points include a center of a line connecting both eyes, a left shoulder, a right shoulder and a center of a chest of a person in the image.
5. The image classification method according to claim 2, wherein, after the corresponding classification result is predicted according to the spliced global and local features and the image classification algorithm:
if the training loss of the positions and visibility of the preset key points of the image to be classified is obtained, then in a first step, the coefficient used to train the key point position prediction algorithm and the visibility prediction algorithm is increased while the coefficient used to train the image classification algorithm is reduced or kept unchanged;
in a second step, the coefficient used to train the image classification algorithm is increased while the coefficient used to train the key point position prediction algorithm and the visibility prediction algorithm is reduced or kept unchanged;
the first step and the second step may be alternately repeated.
6. The image classification method according to claim 2, wherein, after the corresponding classification result is predicted according to the spliced global and local features and the image classification algorithm:
if the position of the preset key point of the image to be classified and the training loss of the visibility are obtained, increasing the types of the loss to train the key point position prediction algorithm, the visibility prediction algorithm and the image classification algorithm;
or reducing the loss category to train the keypoint location prediction algorithm, the visibility prediction algorithm and the image classification algorithm;
or changing the loss category to train the keypoint location prediction algorithm, the visibility prediction algorithm and the image classification algorithm.
7. The image classification method according to claim 1, wherein determining a key point position prediction algorithm according to the positions of the marked preset key points comprises:
determining a key point position prediction algorithm through a mean square error loss function according to the positions of the marked preset key points.
8. The image classification method according to claim 1, wherein determining a visibility prediction algorithm based on the visibility of the keypoints comprises:
and determining a visibility prediction algorithm through a cross entropy loss function according to the visibility of the key points.
9. The image classification method according to claim 1, wherein the global feature and the local feature stitching includes:
converting the global features and the local features into one-dimensional data respectively;
and directly splicing the one-dimensional data.
CN201911376892.4A 2019-12-27 2019-12-27 Image classification method Active CN111091160B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201911376892.4A CN111091160B (en) 2019-12-27 2019-12-27 Image classification method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201911376892.4A CN111091160B (en) 2019-12-27 2019-12-27 Image classification method

Publications (2)

Publication Number Publication Date
CN111091160A true CN111091160A (en) 2020-05-01
CN111091160B CN111091160B (en) 2024-05-03

Family

ID=70398147

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201911376892.4A Active CN111091160B (en) 2019-12-27 2019-12-27 Image classification method

Country Status (1)

Country Link
CN (1) CN111091160B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114494730A (en) * 2022-04-15 2022-05-13 深圳安牌信息技术有限公司 Trademark automatic classification processing system based on image recognition

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2154631A2 (en) * 2008-08-14 2010-02-17 Xerox Corporation System and method for object class localization and semantic class based image segmentation
CN108229353A (en) * 2017-12-21 2018-06-29 深圳市商汤科技有限公司 Sorting technique and device, electronic equipment, storage medium, the program of human body image
CN108304847A (en) * 2017-11-30 2018-07-20 腾讯科技(深圳)有限公司 Image classification method and device, personalized recommendation method and device
CN109325952A (en) * 2018-09-17 2019-02-12 上海宝尊电子商务有限公司 Fashion clothing image partition method based on deep learning
CN109344841A (en) * 2018-08-10 2019-02-15 北京华捷艾米科技有限公司 A kind of clothes recognition methods and device

Also Published As

Publication number Publication date
CN111091160B (en) 2024-05-03

Similar Documents

Publication Publication Date Title
CN110298291B (en) Mask-RCNN-based cow face and cow face key point detection method
CN106570486B (en) Filtered target tracking is closed based on the nuclear phase of Fusion Features and Bayes's classification
CN109325952B (en) Fashionable garment image segmentation method based on deep learning
CN108629306B (en) Human body posture recognition method and device, electronic equipment and storage medium
KR100974293B1 (en) METHOD AND SYSTEM FOR AUTOMATED FACE DETECTION and Recognition
CN110532970B (en) Age and gender attribute analysis method, system, equipment and medium for 2D images of human faces
US10769499B2 (en) Method and apparatus for training face recognition model
CN103902962B (en) One kind is blocked or the adaptive face identification method of light source and device
CN111695392B (en) Face recognition method and system based on cascade deep convolutional neural network
CN111126240B (en) Three-channel feature fusion face recognition method
CN105894655B (en) Paper currency detection and recognition methods under complex environment based on RGB-D cameras
KR101835333B1 (en) Method for providing face recognition service in order to find out aging point
CN108898125A (en) One kind being based on embedded human face identification and management system
CN106485253B (en) A kind of pedestrian of maximum particle size structured descriptor discrimination method again
CN109711309B (en) Method for automatically identifying whether portrait picture is eye-closed
US20230041943A1 (en) Method for automatically producing map data, and related apparatus
CN101996308A (en) Human face identification method and system and human face model training method and system
CN112732071A (en) Calibration-free eye movement tracking system and application
CN112036284B (en) Image processing method, device, equipment and storage medium
CN111275058A (en) Safety helmet wearing and color identification method and device based on pedestrian re-identification
CN111091160B (en) Image classification method
CN117496532A (en) Intelligent recognition tool based on 0CR
CN110910497B (en) Method and system for realizing augmented reality map
CN111753618A (en) Image recognition method and device, computer equipment and computer readable storage medium
CN107220612B (en) Fuzzy face discrimination method taking high-frequency analysis of local neighborhood of key points as core

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant