CN111738367A

CN111738367A - Part classification method based on image recognition

Info

Publication number: CN111738367A
Application number: CN202010825217.1A
Authority: CN
Inventors: 廖峪; 林仁辉; 苏茂才; 唐泰可
Original assignee: Chengdu Zhonggui Track Equipment Co ltd
Current assignee: Chengdu Zhonggui Track Equipment Co ltd
Priority date: 2020-08-17
Filing date: 2020-08-17
Publication date: 2020-10-02
Anticipated expiration: 2040-08-17
Also published as: CN111738367B

Abstract

The invention discloses a part classification method based on image recognition, which belongs to the field of image processing and comprises the following steps: collecting part images, preprocessing the collected part images, and taking the preprocessed part images as a training set; constructing a part recognition neural network, and initializing network parameters of the part recognition neural network to obtain a primary part recognition neural network; constructing a loss function, taking the minimum loss function as a target, and training the primary part recognition neural network through a training set until the loss function is less than a to obtain a trained part recognition neural network; and acquiring an image to be recognized, preprocessing the image to be recognized, and inputting the preprocessed image to be recognized into the trained part recognition neural network to obtain a part classification result. The invention can assist maintenance personnel or field workers to maintain equipment by realizing part classification, thereby avoiding the problem that the part identification is time-consuming and labor-consuming in the maintenance process.

Description

Part classification method based on image recognition

Technical Field

The invention belongs to the field of image processing, and particularly relates to a part classification method based on image recognition.

Background

The image classification research task mainly comprises three main links of preprocessing, feature extraction and classification, and each link has important influence on the classification effect of the image. With the rapid development of computer software and hardware and internet technology, the amount of multimedia data is also increasing at an incredible speed, and more information is expressed in the form of images in various industries, which undoubtedly brings huge challenges to each link of the task of image classification. In the industrial manufacturing, various parts exist, wherein various types of similar parts are not lacked, the time for distinguishing the parts to maintain is long, and the manual work sometimes generates errors, so that the loss is caused, and when a maintenance worker is not on the spot, other workers cannot replace the parts independently.

Disclosure of Invention

Aiming at the defects in the prior art, the part classification method based on image recognition solves the problem that manual part recognition in the prior art is time-consuming and labor-consuming.

In order to achieve the purpose of the invention, the invention adopts the technical scheme that: a part classification method based on image recognition comprises the following steps:

s1, collecting K part images, and collecting N parts of each part image;

s2, preprocessing the collected part images, and taking the preprocessed part images as a training set;

s3, constructing a part recognition neural network, and initializing network parameters of the part recognition neural network to obtain a primary part recognition neural network;

s4, constructing a loss function, taking the minimum loss function as a target, and training the primary part recognition neural network through a training set until the loss function is smaller than a set training threshold value a to obtain a trained part recognition neural network;

and S5, collecting the image to be recognized, preprocessing the image to be recognized, and inputting the preprocessed image to be recognized into the trained part recognition neural network to obtain a part classification result.

Further, the specific method for preprocessing the part image in step S2 is as follows:

a1, sequentially carrying out Gaussian filtering, mean filtering, minimum mean square error filtering and Gabor filtering on the part image to obtain a first-stage processing part image;

a2, carrying out gray processing on the primary processing part image to obtain a secondary processing part image;

a3, obtaining the gradient of pixel points in the secondary processing part image, and performing gray level representation on the secondary processing part image according to the gradient to obtain a tertiary processing part image;

a4, carrying out contour vertical coordinate reconstruction on the three-level processing part image to obtain a four-level processing part image;

a5, extracting an outline region in the four-level processed part image, and acquiring a preprocessed part image;

further, the specific steps of step a3 are as follows:

a31, sequentially solving the gradient of each pixel point in the secondary processing part image function f (x, y)

Comprises the following steps:

wherein X represents an abscissa of the pixel point, Y represents an ordinate of the pixel point, X =0,1,., X, Y =0,1,.., Y, X represents a maximum abscissa, Y represents a maximum ordinate,

；

a32, setting a gray threshold T, and determining the gradient of each pixel point according to the gray threshold T

Performing gray scale on the image of the secondary processing part

Representing to obtain a three-level processing part image; the gray scale

Comprises the following steps:

wherein M represents a pixel point located on the contour, and N represents a pixel point on the non-contour line.

Further, the specific steps of step a4 are as follows:

a41, randomly searching a gray scale in the image of the three-level processed part

The pixel point is recorded as

；

A42, pixel point

Centering, extracting pixel points

The gray level of the pixel point with M in all the adjacent pixel points;

a43, selecting the pixel with the maximum gradient from the pixels with the gray scale of M in the step A42, and taking the pixel with the maximum gradient as the center to extract the pixel with the gray scale of M from all adjacent pixels;

and A44, analogizing according to the method in the step A43, obtaining contour pixel points in the three-level processing part image, and obtaining a four-level processing part image.

Further, the step a5 of extracting the contour region in the four-level processed part image, and the specific step of obtaining the preprocessed part image includes: and extracting a square area containing all contour pixel points in the four-level processed part image, and modifying the size of the square area to 224 x 224 to obtain the preprocessed part image.

Further, the specific structure of the part recognition neural network in step S3 includes an input layer, a first convolution layer, a first maximum pooling layer, a first normalization layer, a second convolution layer, a third convolution layer, a second normalization layer, a second maximum pooling layer, a first image processing module, a second image processing module, a third maximum pooling layer, a third image processing module, a fourth image processing module, a fifth image processing module, a sixth image processing module, a seventh image processing module, a fourth maximum pooling layer, an eighth image processing module, a ninth image processing module, a first average pooling layer, a first full-connection layer, a first softmaxation activation layer, and an output layer, which are connected in sequence.

Furthermore, the first image processing module, the second image processing module, the third image processing module, the fourth image processing module, the fifth image processing module, the sixth image processing module, the seventh image processing module, the eighth image processing module and the ninth image processing module have the same structure, and each comprises a fourth convolution layer, a fifth convolution layer, a sixth convolution layer and a fifth maximum pooling layer, wherein an input end of the fourth convolution layer, an input end of the fifth convolution layer, an input end of the sixth convolution layer and an input end of the fifth maximum pooling layer jointly form an input end of the image processing module, the output end of the fourth convolution layer is connected with the input end of the polymerization layer, the fifth convolution layer is connected with the input end of the polymerization layer through the seventh convolution layer, the sixth convolutional layer is connected with the input end of the polymerization layer through the eighth convolutional layer, and the output end of the fifth largest pooling layer is connected with the input end of the polymerization layer through the ninth convolutional layer; the output end of the aggregation layer is the output end of the image processing module, and the output end of the aggregation layer is used for aggregation in the dimension of the output channel.

Further, the output end of the third image processing module is connected with a first auxiliary classification module, the output end of the sixth image processing module is connected with a second auxiliary classification module, the first auxiliary classification module and the second auxiliary classification module have the same structure and respectively comprise a second average pooling layer, a tenth convolution layer, a second full-connection layer, a third full-connection layer, a second SoftmaxAction activation layer and an auxiliary classification output layer which are sequentially connected.

Further, the loss function L in step S4 is specifically:

wherein N =1, 2.. N, N denotes the total number of samples of each class, K =1, 2.. K, K denotes the number of sample classes,

the output result of the nth sample calculated by the part recognition neural network is represented as the activation function value under the k-th condition,

indicating the probability that the nth sample is of class k,

a value representing a first loss calculation parameter,

represents a second loss calculation parameter value, R () represents regularization, W represents a network parameter of the first part identification neural network,

network parameters representing a second part identification neural network;

the above-mentioned

The method specifically comprises the following steps:

wherein the content of the first and second substances,

indicates that in the case of the part identification neural network parameters W and b, the input sample is

The resulting input signal abstract features; b represents the network parameters of the third part identification neural network;

expressing the neural network parameters in part recognition as

In the case of (2), inputting a feature

The corresponding label obtained;

the network parameters W, b and

the update formula of (2) is:

wherein the content of the first and second substances,

network parameters representing the first part recognition neural network when trained using class k samples,

network parameters representing a second part recognition neural network when trained using class k samples,

network parameters representing a third part recognition neural network when trained using class k samples,

、

and

each of which represents a differential term that is,

representing the network update learning rate.

Further, the step S5 of inputting the preprocessed image to be recognized into the trained part recognition neural network to obtain the part classification result includes the specific steps of:

b1, inputting the preprocessed image to be recognized into the trained part recognition neural network;

b2, the classification result of the acquisition output layer is

The classification result of the first auxiliary classification module is

And the second auxiliary classification module has the classification result of

；

B3, setting the weight values of the output layer, the first auxiliary classification module and the second auxiliary classification module as

、

And

；

b4, mixing

、

And

and adding the weights of the results of the same type, and taking the classification result with the maximum weight as a part classification result.

The invention has the beneficial effects that:

(1) the invention is provided with the image processing module, thereby increasing the depth and the width of the network, improving the performance of the deep neural network, accelerating the training process and ensuring the accuracy of the network in the later period.

(2) The invention provides a part classification method based on image recognition, which is used for performing auxiliary classification by arranging a plurality of classifiers and realizing accurate part classification.

(3) The part recognition neural network of the present invention avoids the problem of gradient disappearance for deeper widths and depths.

(4) The method is simple and quick, and can be used for assisting maintenance personnel or field workers to maintain equipment by realizing part classification, so that time and labor are saved in part identification in the maintenance process.

Drawings

FIG. 1 is a flow chart of a part classification method based on image recognition according to the present invention;

FIG. 2 is a schematic diagram of a part recognition neural network according to the present invention;

FIG. 3 is a schematic diagram of an image processing module according to the present invention;

FIG. 4 is a schematic diagram of an auxiliary classification module according to the present invention.

Detailed Description

The following description of the embodiments of the present invention is provided to facilitate the understanding of the present invention by those skilled in the art, but it should be understood that the present invention is not limited to the scope of the embodiments, and it will be apparent to those skilled in the art that various changes may be made without departing from the spirit and scope of the invention as defined and defined in the appended claims, and all matters produced by the invention using the inventive concept are protected.

Embodiments of the present invention will be described in detail below with reference to the accompanying drawings.

As shown in fig. 1, a part classification method based on image recognition includes the following steps:

s1, collecting K part images, and collecting N parts of each part image;

The specific method for preprocessing the part image in step S2 is as follows:

the specific steps of the step A3 are as follows:

Comprises the following steps:

，

；

Performing gray scale on the image of the secondary processing part

Representing to obtain a three-level processing part image; the gray scale

Comprises the following steps:

The specific steps of the step A4 are as follows:

The pixel point is recorded as

；

A42, pixel point

Centering, extracting pixel points

The gray level of the pixel point with M in all the adjacent pixel points;

The step a5 of extracting the contour region in the four-level processed part image, and the specific steps of obtaining the preprocessed part image are as follows: and extracting a square area containing all contour pixel points in the four-level processed part image, and modifying the size of the square area to 224 x 224 to obtain the preprocessed part image.

As shown in fig. 2, the specific structure of the part recognition neural network in step S3 includes an input layer, a first convolution layer, a first maximum pooling layer, a first normalization layer, a second convolution layer, a third convolution layer, a second normalization layer LocalRespNorm, a second maximum pooling layer, a first image processing module, a second image processing module, a third maximum pooling layer, a third image processing module, a fourth image processing module, a fifth image processing module, a sixth image processing module, a seventh image processing module, a fourth maximum pooling layer, an eighth image processing module, a ninth image processing module, a first average pooling layer, a first full-connection layer, a first full-resolution activation layer, and an output layer, which are connected in sequence.

As shown in fig. 3, the first image processing module, the second image processing module, the third image processing module, the fourth image processing module, the fifth image processing module, the sixth image processing module, the seventh image processing module, the eighth image processing module and the ninth image processing module have the same structure, and each comprises a fourth convolution layer, a fifth convolution layer, a sixth convolution layer and a fifth maximum pooling layer, wherein an input end of the fourth convolution layer, an input end of the fifth convolution layer, an input end of the sixth convolution layer and an input end of the fifth maximum pooling layer jointly form an input end of the image processing module, the output end of the fourth convolution layer is connected with the input end of the aggregation layer DepthCocat, the fifth convolution layer is connected with the input end of the polymerization layer through a seventh convolution layer, the sixth convolution layer is connected with the input end of the polymerization layer through an eighth convolution layer, the output end of the fifth maximum pooling layer is connected with the input end of the aggregation layer through a ninth convolution layer; the output end of the aggregation layer is the output end of the image processing module, and the output end of the aggregation layer is used for aggregation in the dimension of the output channel.

The output end of the third image processing module is further connected with the first auxiliary classification module, and the output end of the sixth image processing module is further connected with the second auxiliary classification module.

As shown in fig. 4, the first auxiliary classification module and the second auxiliary classification module have the same structure, and each of the first auxiliary classification module and the second auxiliary classification module includes a second average pooling layer, a tenth convolution layer, a second full-connected layer, a third full-connected layer, a second softmaxaction activation layer, and an auxiliary classification output layer, which are sequentially connected.

In this embodiment, the output results of the first max pooling layer, the second max pooling layer, and each of the first convolution layers are all subjected to the ReLU calculation and then transmitted to the next layer.

The loss function L in step S4 is specifically:

indicating the probability that the nth sample is of class k,

a value representing a first loss calculation parameter,

network parameters representing a second part identification neural network;

the above-mentioned

The method specifically comprises the following steps:

wherein the content of the first and second substances,

The resulting input signal abstract features; b represents a third part recognition neural networkThe network parameter of (2);

expressing the neural network parameters in part recognition as

In the case of (2), inputting a feature

The corresponding label obtained;

the network parameters W, b and

the update formula of (2) is:

wherein the content of the first and second substances,

、

and

each of which represents a differential term that is,

representing the network update learning rate.

In step S5, the specific steps of inputting the preprocessed image to be recognized into the trained part recognition neural network to obtain a part classification result are as follows:

b2, the classification result of the acquisition output layer is

The classification result of the first auxiliary classification module is

And the second auxiliary classification module has the classification result of

；

、

And

；

b4, mixing

、

And

Claims

1. A part classification method based on image recognition is characterized by comprising the following steps:

s1, collecting K part images, and collecting N parts of each part image;

2. The part classification method based on image recognition according to claim 1, wherein the specific method for preprocessing the part image in step S2 is as follows:

and A5, extracting the outline region in the four-level processed part image, and acquiring the preprocessed part image.

3. The part classification method based on image recognition according to claim 2, wherein the specific steps of the step A3 are as follows:

Comprises the following steps:

；

Performing gray scale on the image of the secondary processing part

Representing to obtain a three-level processing part image; the gray scale

Comprises the following steps:

4. The part classification method based on image recognition according to claim 2, wherein the specific steps of the step A4 are as follows:

The pixel point is recorded as

；

A42, pixel point

Centering, extracting pixel points

The gray level of the pixel point with M in all the adjacent pixel points;

5. The method for classifying parts based on image recognition according to claim 4, wherein the step A5 is to extract the contour region in the four-level processed part image, and the specific step of obtaining the pre-processed part image is to: and extracting a square area containing all contour pixel points in the four-level processed part image, and modifying the size of the square area to 224 x 224 to obtain the preprocessed part image.

6. The part classification method based on image recognition according to claim 1, wherein the specific structure of the part recognition neural network in step S3 includes an input layer, a first convolution layer, a first maximum pooling layer, a first normalization layer, a second convolution layer, a third convolution layer, a second normalization layer, a second maximum pooling layer, a first image processing module, a second image processing module, a third maximum pooling layer, a third image processing module, a fourth image processing module, a fifth image processing module, a sixth image processing module, a seventh image processing module, a fourth maximum pooling layer, an eighth image processing module, a ninth image processing module, a first average pooling layer, a first full-connection layer, a first softmaxation activation layer, and an output layer, which are connected in sequence.

7. The image recognition-based part sorting method according to claim 6, wherein the first image processing module, the second image processing module, the third image processing module, the fourth image processing module, the fifth image processing module, the sixth image processing module, the seventh image processing module, the eighth image processing module and the ninth image processing module are identical in structure and each include a fourth convolution layer, a fifth convolution layer, a sixth convolution layer and a fifth maximum pooling layer, an input end of the fourth convolution layer, an input end of the fifth convolution layer, an input end of the sixth convolution layer and an input end of the fifth maximum pooling layer together constitute an input end of the image processing module, an output end of the fourth convolution layer is connected with an input end of the polymerization layer, the fifth convolution layer is connected with an input end of the polymerization layer through the seventh convolution layer, the sixth convolution layer is connected with an input end of the polymerization layer through the eighth convolution layer, the output end of the fifth maximum pooling layer is connected with the input end of the aggregation layer through a ninth convolution layer; the output end of the aggregation layer is the output end of the image processing module, and the output end of the aggregation layer is used for aggregation in the dimension of the output channel.

8. The part classification method based on image recognition according to claim 6, wherein an output end of the third image processing module is further connected with a first auxiliary classification module, an output end of the sixth image processing module is further connected with a second auxiliary classification module, and the first auxiliary classification module and the second auxiliary classification module have the same structure and each include a second average pooling layer, a tenth convolution layer, a second full-connection layer, a third full-connection layer, a second Softmaxation activation layer and an auxiliary classification output layer which are sequentially connected.

9. The method for classifying parts based on image recognition according to claim 1, wherein the loss function L in step S4 is specifically:

indicating the probability that the nth sample is of class k,

a value representing a first loss calculation parameter,

network parameters representing a second part identification neural network;

the above-mentioned

The method specifically comprises the following steps:

wherein the content of the first and second substances,

expressing the neural network parameters in part recognition as

In the case of (2), inputting a feature

The corresponding label obtained;

the network parameters W, b and

the update formula of (2) is:

wherein the content of the first and second substances,

、

and

each of which represents a differential term that is,

representing the network update learning rate.

10. The method for classifying parts based on image recognition according to claim 8, wherein the step S5 of inputting the pre-processed image to be recognized into the trained part recognition neural network to obtain the part classification result comprises the specific steps of:

b2, the classification result of the acquisition output layer is