CN110378167B

CN110378167B - Bar code image correction method based on deep learning

Info

Publication number: CN110378167B
Application number: CN201910614176.9A
Authority: CN
Inventors: 罗拥军; 李珉; 朱云山; 杨宏彬; 李泽郁
Original assignee: Jiangsu Anfang Electric Power Technology Co ltd
Current assignee: Jiangsu Anfang Electric Power Technology Co ltd
Priority date: 2019-07-09
Filing date: 2019-07-09
Publication date: 2022-09-16
Anticipated expiration: 2039-07-09
Also published as: CN110378167A

Abstract

The invention relates to a bar code image correction algorithm based on deep learning, which comprises the following steps: the image is to be parsed into sample points in a probability distribution; the method comprises the steps of analyzing how to generate a pseudo image by deep learning, modeling a fuzzy incomplete bar code image by using a semantic segmentation related model, realizing correction of the fuzzy incomplete bar code image, correcting the fuzzy or incomplete image in a complex scene, then finding out the optimal pseudo image required by complete restoration, and correcting and restoring the fuzzy incomplete bar code image based on U-net, wherein the neural network is of an Encoder-Decoder structure, the Encoder is a contraction path and consists of a convolution layer and a pooling layer, the model is optimized from three angles in order to realize accurate positioning, the correction of the fuzzy incomplete bar code image is realized, and the fuzzy incomplete bar code image is enabled to be complete and clear through an algorithm, so that the identification accuracy of the bar code image is improved, and the protection of an electric power warehousing full-automatic unmanned operation system is guaranteed.

Description

Bar code image correction method based on deep learning

Technical Field

The invention relates to the technical field of barcode image processing, in particular to a barcode image correction method based on deep learning.

Background

At present, in a full-automatic unmanned operation system of power storage, a bar code on a material nameplate needs to be automatically identified. The cabinet type material nameplate material has aluminium and stainless steel, has problems such as reflection of light, surface corrosion, surface mar. The image recognition camera is sensitive to light, the larger the light reflection degree is, the larger the error probability is, the removing algorithm is self, the image quality is the largest factor influencing the bar code recognition accuracy, and the quality of the image is measured from three aspects: the original bar code image of the material nameplate needs to be corrected due to the defects of inclination, definition and blur.

The current correction algorithms for blurred and incomplete images mainly include the following three types:

in the method, special hardware equipment is generally used for scanning three-dimensional shape information of a material nameplate bar code. For example, the structured light source is used to scan an image to obtain three-dimensional information, i.e., depth information, of the image, and then the image is corrected according to the depth information.

The image correction algorithm based on 3D model reconstruction is mainly based on factors causing bar code image blurring and deformity, and comprises bar codes and placing angles thereof, light source directions, image acquisition equipment characteristics and the like. 3D modeling is carried out on the bar code image, and the existing mathematical knowledge is utilized to correct the fuzzy deformity.

The algorithm discards geometric simulation and 3D modeling of a fuzzy incomplete image, directly analyzes the barcode image, and then designs the fuzzy incomplete image correction algorithm which is not influenced by factors except the barcode image.

The three algorithms have respective limitations, which can be summarized as: most of the traditional methods are used for modeling specific scenes, and the models cannot function in a plurality of large scenes.

Disclosure of Invention

The invention aims to solve the technical problem of providing a bar code image correction method based on deep learning, which is used for improving the identification accuracy of bar code images and is a full-automatic unmanned operation system for power warehousing and is used for enabling fuzzy and incomplete bar code images to be complete and clear through an algorithm.

In order to solve the technical problem, the invention provides a bar code image correction algorithm based on deep learning, which comprises the following steps:

s1: analyzing the image into sample points in probability distribution, wherein the sample points are small sample images;

s2: analyzing a pseudo image generated by deep learning, modeling a fuzzy incomplete bar code image by using a semantic segmentation related model, converting a pixel-level classification problem into a pixel-level regression problem, correcting the fuzzy incomplete bar code image, correcting a fuzzy or incomplete image in a complex scene, further deeply learning related contents in the semantic segmentation field, correcting the fuzzy incomplete bar code image, automatically generating a data set of the fuzzy incomplete bar code image by the data set, and finally constructing and optimizing the model;

s3: and then finding an optimal pseudo image required by complete restoration, and correcting and restoring the fuzzy incomplete bar code image based on U-net, wherein the neural network is an Encoder-Decoder structure, the Encoder is a contraction path which consists of a convolution layer and a Pooling layer, so that the extraction of characteristics or the capture of semantics is realized, the Decoder is an expansion path and is realized by transposition convolution and jump connection, in order to realize upsampling, the dimensionality of the image is reduced due to downsampling performed by Pooling operation, and the dimensionality of a feature map is increased by transposition convolution, so that the size of the original image is restored, pixel level regression is realized, and the model is optimized from three angles in order to realize accurate positioning:

changing the structure of the model: changing from U-net to Stacked U-net to improve resolution;

modifying the loss function: in the optimization process of the model, the difference between the original distance between adjacent pixel points and the prediction result is not too large, so that the phenomenon of fuzzy deformation of the bar code image is improved;

post-processing the prediction result to improve the noise phenomenon;

in the final model evaluation, the structural similarity indexes are adopted to evaluate the image similarity before and after correction, and the multi-level structural similarity is summarized.

In S3, in the final model evaluation, the structural similarity index is used to evaluate the similarity of the images before and after correction, and the structural similarity is summarized in multiple scales, and the calculation formula is:

after adopting the structure, the invention has the advantages that:

the fuzzy incomplete bar code image is corrected, and the fuzzy incomplete bar code image becomes complete and clear through an algorithm, so that the identification accuracy of the bar code image is improved, and the driving protection navigation of the electric power warehousing full-automatic unmanned operation system is realized.

Drawings

FIG. 1 is a schematic diagram of the present invention for self-generating a data set of a blurred and incomplete bar code image.

Detailed Description

The invention is described in detail below with reference to the drawings and the detailed description, which should not be construed as limiting the invention.

As shown in fig. 1, an algorithm of a barcode image correction method based on deep learning includes the following steps:

s1: the image is to be parsed into sample points in a probability distribution;

post-processing the prediction result to improve the noise phenomenon;

in the final model evaluation, the structural similarity index is adopted to evaluate the similarity of the images before and after correction, the multi-level structural similarity summarization builds a model for the fuzzy incomplete bar code image by using a semantic segmentation related model along with the rise of deep learning in recent years, the pixel-level classification problem is converted into the pixel-level regression problem, the correction of the fuzzy incomplete bar code image is realized, the model has certain generalization capability, and the correction can be carried out on the fuzzy or incomplete image in a complex scene.

In consideration of the complexity of actual services, the traditional method cannot be sufficient, so that the application is combined with relevant knowledge in the deep learning semantic segmentation field, an optimization scheme is provided aiming at the defects of the existing method, and the correction of the fuzzy incomplete bar code image is realized.

And (4) automatically generating a data set of the fuzzy incomplete bar code image by referring to a deep learning method.

And (4) model construction and optimization, and correction and restoration of the fuzzy incomplete bar code image based on U-net. The neural network can be understood as an Encoder-Decoder structure, wherein the Encoder is a contraction path and mainly comprises a convolution layer and a Pooling layer, the main purpose is to realize extraction of features or capture of semantics, the Decoder is an expansion path and is mainly realized by transposition convolution and jump connection, the main purpose is to realize up-sampling, the dimensionality of an image is reduced due to down-sampling performed by Pooling operation, and the transposition convolution can enlarge the dimensionality of a feature map so as to recover the size of an original image, thereby realizing pixel-level regression. The results obtained are however very crude, and the optimization of the model is carried out from three angles in order to achieve a precise positioning:

modifying the loss function: the model enables the difference between the distance between the original adjacent pixel points and the prediction result not to be too large in the optimization process, so that the phenomenon of fuzzy deformation of the bar code image is improved;

and post-processing the prediction result to improve the noise phenomenon.

And (3) evaluating the model, namely evaluating the Similarity of the images before and after correction by adopting an MS-SSIM index in the final model evaluation, wherein the MS-SSIM is called Multi-Scale structured Similarity and is the summary of SSIM (structural Similarity) under Multi-scale as the name suggests. The calculation formula is as follows:

Claims

1. a bar code image correction method based on deep learning is characterized by comprising the following steps:

s1: analyzing the image into sample points in the probability distribution, wherein the sample points are small sample images;

s3: finding an optimal pseudo image required by complete restoration, and restoring and correcting a fuzzy incomplete bar code image based on U-net, wherein the neural network is of an Encoder-Decoder structure, the Encoder is a contraction path and consists of a convolution layer and a pooling layer, so that the extraction of characteristics or the capture of semantics is realized, the Decoder is realized by transposition convolution and jump connection, and the transposition convolution enables the dimensionality of a feature map to be increased, so that the size of an original image is restored, and pixel-level regression is realized; the optimization of the model is performed from three perspectives:

modifying the loss function: the distance between the original adjacent pixel points is different from the prediction result in the optimization process of the model, so that the phenomenon of fuzzy deformation of the bar code image is improved;

post-processing the prediction result to improve the noise phenomenon;

in S3, in the final model evaluation, the structural similarity index is used to evaluate the similarity of the images before and after correction, and the multi-level structural similarity is summarized by the following calculation formula:

。