CN113947763A

CN113947763A - Pavement arrow recognition method and device based on template self-supervision

Info

Publication number: CN113947763A
Application number: CN202111220342.0A
Authority: CN
Inventors: 王连涛; 王力民
Original assignee: Changzhou Campus of Hohai University
Current assignee: Changzhou Campus of Hohai University
Priority date: 2021-10-20
Filing date: 2021-10-20
Publication date: 2022-01-18

Abstract

The invention discloses a pavement arrow recognition method and a device based on template self-supervision, which comprises the steps of training on a pre-constructed expansion data set by using a pre-constructed self-supervision template comparison learning model containing an encoder and a projection layer to obtain a trained network, wherein the expansion data set is an arrow template and an expansion data set of a corresponding example sample; identifying the category of the arrow image by using the trained network; the invention can finish the training of the arrow recognition model only by utilizing the template without any manual marking, obtains the arrow pictures simulating various visual angles through the automatic transformation of the template, then inputs the model in pairs to train a depth model which can judge whether the instance is matched with the template, namely can be used for the recognition of the arrow, can effectively avoid the dependence of the manual marking through the self-supervision contrast learning, and finishes the reliable recognition model through the training without the manual supervision.

Description

Pavement arrow recognition method and device based on template self-supervision

Technical Field

The invention relates to a pavement arrow recognition method and device based on template self-supervision, and belongs to the technical field of image processing.

Background

The invention discloses a more advanced road surface guiding arrow recognition method which is applied to the fields of intelligent driving, intelligent transportation facilities and the like and is required to automatically recognize guiding arrows of lanes. The existing method for recognizing the road surface arrow needs to perform perspective transformation on an image acquired by a sensor to obtain a top view without deformation, and then recognize the arrow by adopting a template matching mode; or training a classifier to recognize after manually collecting a large number of arrow image labels. The template self-supervision recognition method can finish the training of the arrow recognition model only by using the template without any manual marking. Arrow pictures simulating various visual angles are obtained through automatic transformation of the template, and then the images are input into the model in pairs to train a depth model capable of judging whether the instance is matched with the template, so that the method can be used for identifying the arrow.

Disclosure of Invention

The invention aims to overcome the defects in the prior art and provides a pavement arrow recognition method and device based on template self-supervision.

In order to achieve the purpose, the invention is realized by adopting the following technical scheme:

in a first aspect, the invention provides a pavement arrow recognition method based on template self-supervision, which comprises the following steps:

training on a pre-constructed extended data set by using a pre-constructed self-supervision template contrast learning model containing an encoder and a projection layer to obtain a trained network, wherein the extended data set is an arrow template and an extended data set of a corresponding example sample;

and identifying the category of the arrow image by using the trained network.

Further, the specific data expansion method adopted during the construction of the expansion data set is as follows:

each template image is transformed by perspective to obtain 1200 arrow images with different angles, and the transformation comprises the following steps: azimuth angle is 0 to 360 degrees, step interval is 45 degrees, 8 types in total; the pitch angle is 0 to-70 degrees, the stepping interval is-5 degrees, and 15 kinds are adopted; the visual angle is 42 degrees to 132 degrees, the stepping interval is 10 degrees, and the total number is 10;

and forming a positive pair with the corresponding arrow template and forming a negative pair with the other arrow templates.

Further, the specific network structure of the self-supervision template comparison learning model is as follows:

selecting any neural network with feature extraction capability as an encoder to extract features, and calculating an image through the encoder to obtain a feature vector:

h＝f(x)；

using a two-layer fully-connected network as a projection layer, projecting the feature map into a space for contrast loss, wherein a projection vector z is the output of a feature vector h in the projection layer:

z＝g(h)＝W⁽²⁾σ(W⁽¹⁾h)；

wherein σ represents a ReLu activation function;

given a pair of images { x, T }, the similarity between them is defined by cosine similarity:

constructing a loss function to be optimized:

wherein tau is a hyper parameter and is interpreted as a temperature parameter, so that the repulsion effect between negative pairs can be enhanced, the optimization capability of a loss function is improved, and the convergence speed is accelerated.

Further, the training on the pre-constructed augmented data set by using the pre-constructed self-supervised template contrast learning model comprising the encoder and the projection layer comprises the following steps:

inputting the positive pair and the negative pair obtained by combination into a pre-constructed self-supervision template comparison learning model, and respectively obtaining a feature vector h by calculating the image and the template through an encoder_i、h_T:

h_i＝f(x_i)；h_T＝f(x_T)；

Projecting the obtained feature vectors into a mapping space for contrast loss, z_i、z_TAre respectively image feature vectors h_iTemplate feature h_TOutput at projection layer:

z_i＝g(h_i)＝W⁽²⁾σ(W⁽¹⁾h_i)；z_T＝g(h_T)＝W⁽²⁾σ(W⁽¹⁾h_T)；

the network automatically adjusts the parameters based on the back propagation of the loss function.

Further, the identifying the category of the arrow image by using the trained network includes:

sending the arrow image to be recognized into a trained feature extraction network, and obtaining a feature vector of the arrow image;

calculating the similarity between the feature vector of the image and each template vector by using a similarity calculation formula;

the arrow image is identified as the arrow category having the greatest similarity.

In a second aspect, the present invention provides a pavement arrow recognition apparatus based on template self-supervision, including:

the training unit is used for training on a pre-constructed extended data set by using a pre-constructed self-supervision template comparison learning model containing an encoder and a projection layer to obtain a trained network, wherein the extended data set is an arrow template and an extended data set of a corresponding example sample;

and the identification unit is used for identifying the category of the arrow image by using the trained network.

In a third aspect, the invention provides a road surface arrow recognition device based on template self-supervision, comprising a processor and a storage medium;

the storage medium is used for storing instructions;

the processor is configured to operate in accordance with the instructions to perform the steps of the method according to any of the above.

In a fourth aspect, the invention provides a computer-readable storage medium having stored thereon a computer program which, when executed by a processor, performs the steps of any of the methods described above.

Compared with the prior art, the invention has the following beneficial effects:

the method realizes the classification of the arrow images by learning the images of the template arrows in the self-supervision contrast learning and national standards, can finish the training of the arrow recognition model only by utilizing the templates without any manual marking, obtains the arrow images simulating various visual angles through the automatic transformation of the templates, then inputs the models in pairs to train a depth model which can judge whether the examples are matched with the templates, can be used for recognizing the arrows, can effectively avoid the dependence of the manual marking through the self-supervision contrast learning, and trains and finishes the reliable recognition model without the manual supervision.

Drawings

Fig. 1 is a flowchart of a method for identifying a road surface arrow based on template self-supervision according to an embodiment of the present invention;

FIG. 2 is 11 arrow template images in national standard of China according to an embodiment of the present invention;

FIG. 3 is a partially expanded data presentation diagram provided by an embodiment of the present invention;

FIG. 4 is a diagram of a neural network training process for template self-supervised contrast learning provided by an embodiment of the present invention;

fig. 5 is a left-turn and u-turn composite direction arrow of actual shooting and a template image successfully matched by the method provided by the embodiment of the invention.

Detailed Description

The invention is further described below with reference to the accompanying drawings. The following examples are only for illustrating the technical solutions of the present invention more clearly, and the protection scope of the present invention is not limited thereby.

Example 1

The embodiment introduces a pavement arrow recognition method based on template self-supervision, which comprises the following steps:

and identifying the category of the arrow image by using the trained network.

The following drawings illustrate the technical scheme of the present invention in further detail by taking 11 arrows in national standard of China as examples.

The application process of the pavement arrow recognition method based on template self-supervision provided by the embodiment specifically relates to the following steps:

step S1: constructing an arrow template and an expansion data set of a corresponding example sample;

step S2: constructing an automatic supervision template comparison learning model comprising an encoder and a projection layer;

step S3: training on the augmented data set of step S2 using the model of step S1;

step S4: the category of the arrow image is identified using the network trained in step S3.

The method for recognizing a road surface arrow based on template self-supervision according to claim 1, as shown in fig. 1, wherein the specific data expansion method in step S1 is as follows:

s1.1, subjecting each template image to perspective transformation to obtain 1200 arrow images at different angles, as shown in fig. 2, 11 template images in the national standard, and fig. 3 is a sample of a selected partial transformation image, where the transformation includes: azimuth angle is 0 to 360 degrees, step interval is 45 degrees, 8 types in total; the pitch angle is 0 to-70 degrees, the stepping interval is-5 degrees, and 15 kinds are adopted; the visual angle is 42 degrees to 132 degrees, the step interval is 10 degrees, and the total number is 10.

S1.2: and forming a positive pair with the corresponding arrow template and forming a negative pair with the other arrow templates.

The method for recognizing the road surface arrow based on the template self-supervision as claimed in claim 2, wherein the self-supervision template comparison learning specific network structure in the step S2 is as follows:

s2.1: any neural network having a feature extraction capability is selected as the encoder extraction feature, and any network in which a convolution layer and a pooling layer are combined, including VGGNet, *** lenet, ResNet, densneet, and the like, can be used. The image is calculated by an encoder to obtain a feature vector:

h＝f(x)；

s2.2: the feature vectors are projected into the space for contrast loss using two layers of nonlinear MLPs as projection layers. The projection vector z is the output of the feature vector h in the projection layer:

z＝g(h)＝W⁽²⁾σ(W⁽¹⁾h)；

wherein σ represents a ReLu activation function;

s2.3: given a pair of images { x, T }, the similarity between them is defined by cosine similarity:

s2.4: and (3) performing back propagation by using a loss function, and optimizing network parameters:

wherein tau is a temperature parameter and takes a value of 0.1.

The method for recognizing the road surface arrow based on the template self-supervision as claimed in claim 3, as shown in fig. 4, wherein the concrete steps of training the network in step S3 are as follows:

s3.1: inputting the positive and negative pairs obtained by combining the step S1 into the network structure constructed in the step S2, and respectively obtaining a feature vector h by calculating the image and the template through an encoder_i、h_T:

h_i＝f(x_i)；h_T＝f(x_T)；

S3.2: projecting the obtained feature vectors into a mapping space for contrast loss, z_i、z_TAre respectively image feature vectors h_iTemplate feature h_TOutput at projection layer:

s3.3: the network automatically adjusts parameters according to the back propagation of the loss function, and deep learning tool boxes such as PyTorch and Tensorflow can be used.

The method for recognizing the road surface arrow based on the template self-supervision as claimed in claim 4, wherein the trained network in the step S3 is applied to a specific classification task, and the specific steps are as follows:

s4.1: sending the arrow image to be recognized into the feature extraction network trained in the step S3, and obtaining a feature vector of the arrow image;

s4.2: calculating the similarity between the feature vector of the image and each template vector by using the similarity calculation formula in the step S2.3;

s4.3: the arrow image is identified as the arrow category having the greatest similarity.

The left side of fig. 5 is a road surface left-turn and turn-around composite arrow image which is actually shot, and after the arrow image is compared with the arrow in the template library by using the method of the present invention, the left-turn and turn-around composite arrow in the template library is successfully paired.

Example 2

The embodiment provides a road surface arrow recognition device based on template self-supervision, includes:

Example 3

The embodiment provides a pavement arrow recognition device based on template self-supervision, which comprises a processor and a storage medium;

the storage medium is used for storing instructions;

the processor is configured to operate in accordance with the instructions to perform the steps of the method according to any one of:

and identifying the category of the arrow image by using the trained network.

Example 4

The present embodiment provides a computer-readable storage medium, on which a computer program is stored, which program, when being executed by a processor, carries out the steps of the method of any one of the following;

and identifying the category of the arrow image by using the trained network.

The above description is only a preferred embodiment of the present invention, and it should be noted that, for those skilled in the art, several modifications and variations can be made without departing from the technical principle of the present invention, and these modifications and variations should also be regarded as the protection scope of the present invention.

Claims

1. A pavement arrow recognition method based on template self-supervision is characterized by comprising the following steps:

and identifying the category of the arrow image by using the trained network.

2. The template-based self-supervision pavement arrow recognition method according to claim 1, characterized in that: the specific data expansion method adopted during the construction of the expansion data set comprises the following steps:

3. The template-based self-supervision pavement arrow recognition method according to claim 2, characterized in that: the specific network structure of the self-supervision template comparison learning model is as follows:

h＝f(x)；

z＝g(h)＝W⁽²⁾σ(W⁽¹⁾h)；

wherein σ represents a ReLu activation function;

constructing a loss function to be optimized:

4. The template-based self-supervision pavement arrow recognition method of claim 3, characterized in that: the training on the pre-constructed augmented data set using a pre-constructed self-supervised template contrast learning model comprising an encoder and a projection layer comprises:

inputting the positive pair and the negative pair obtained by combination into a pre-constructed self-supervision template comparison learning model, and respectively obtaining a feature vector h by calculating the image and the template through an encoder_i、h_T：

h_i＝f(x_i)；h_T＝f(x_T)；

5. The template-based self-supervision pavement arrow recognition method of claim 4, characterized in that: the identifying the category of the arrow image by using the trained network comprises:

6. A road surface arrow head recognition device based on template self-supervision, characterized by that includes:

7. The utility model provides a road surface arrow point recognition device based on template is from supervision which characterized in that: comprising a processor and a storage medium;

the storage medium is used for storing instructions;

the processor is configured to operate in accordance with the instructions to perform the steps of the method according to any one of claims 1 to 5.

8. A computer-readable storage medium having stored thereon a computer program, characterized in that: the program when executed by a processor implements the steps of the method of any one of claims 1 to 5.