CN114841977B - Defect detection method based on Swin Transformer structure combined with SSIM and GMSD - Google Patents

Defect detection method based on Swin Transformer structure combined with SSIM and GMSD

Info

Publication number
CN114841977B
CN114841977B (application CN202210535974.4A)
Authority
CN
China
Prior art keywords
picture
feature map
defect
swin
learning
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202210535974.4A
Other languages
Chinese (zh)
Other versions
CN114841977A (en)
Inventor
Jiang Jielin
Zhu Jiale
Guo Haoran
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nanjing University of Information Science and Technology
Original Assignee
Nanjing University of Information Science and Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nanjing University of Information Science and Technology filed Critical Nanjing University of Information Science and Technology
Priority to CN202210535974.4A priority Critical patent/CN114841977B/en
Publication of CN114841977A publication Critical patent/CN114841977A/en
Application granted granted Critical
Publication of CN114841977B publication Critical patent/CN114841977B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical


Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00 Image analysis
    • G06T7/0002 Inspection of images, e.g. flaw detection
    • G06T7/0004 Industrial image inspection
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00 Arrangements for image or video recognition or understanding
    • G06V10/40 Extraction of image or video features
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00 Arrangements for image or video recognition or understanding
    • G06V10/70 Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/74 Image or video pattern matching; Proximity measures in feature spaces
    • G06V10/761 Proximity, similarity or dissimilarity measures
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00 Arrangements for image or video recognition or understanding
    • G06V10/70 Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/77 Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
    • G06V10/80 Fusion, i.e. combining data from various sources at the sensor level, preprocessing level, feature extraction level or classification level
    • G06V10/806 Fusion, i.e. combining data from various sources at the sensor level, preprocessing level, feature extraction level or classification level of extracted features
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00 Indexing scheme for image analysis or image enhancement
    • G06T2207/20 Special algorithmic details
    • G06T2207/20081 Training; Learning
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00 Indexing scheme for image analysis or image enhancement
    • G06T2207/20 Special algorithmic details
    • G06T2207/20084 Artificial neural networks [ANN]
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00 Indexing scheme for image analysis or image enhancement
    • G06T2207/30 Subject of image; Context of image processing
    • G06T2207/30108 Industrial image inspection
    • G06T2207/30124 Fabrics; Textile; Paper
    • Y GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02 TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02P CLIMATE CHANGE MITIGATION TECHNOLOGIES IN THE PRODUCTION OR PROCESSING OF GOODS
    • Y02P90/00 Enabling technologies with a potential contribution to greenhouse gas [GHG] emissions mitigation
    • Y02P90/30 Computing systems specially adapted for manufacturing

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • General Physics & Mathematics (AREA)
  • Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Computing Systems (AREA)
  • General Health & Medical Sciences (AREA)
  • Medical Informatics (AREA)
  • Software Systems (AREA)
  • Evolutionary Computation (AREA)
  • Databases & Information Systems (AREA)
  • Artificial Intelligence (AREA)
  • Health & Medical Sciences (AREA)
  • Quality & Reliability (AREA)
  • Image Analysis (AREA)
  • Image Processing (AREA)

Abstract

The invention discloses a defect detection method based on a Swin Transformer structure combined with SSIM and GMSD. The method comprises: obtaining an industrial defect picture, inputting it into a pre-constructed Swin Transformer-based feature extraction network for feature learning, and reconstructing the defect-free region information of the learned feature map to obtain a reconstructed picture; calculating the structural similarity SSIM between the industrial defect picture and the reconstructed picture to obtain a defect abnormal feature map based on the structural similarity; calculating the gradient amplitude similarity deviation GMSD between the industrial defect picture and the reconstructed picture to obtain a defect abnormal feature map based on the gradient amplitude similarity; and fusing the two defect abnormal feature maps to obtain the final defect detection effect map.

Description

Defect detection method based on Swin Transformer structure combined with SSIM and GMSD
Technical Field
The invention relates to a defect detection method based on a Swin Transformer structure combined with SSIM and GMSD, and belongs to the technical field of industrial image defect detection.
Background
The textile industry has accompanied the development of human civilization and is considered one of the most fundamental industries in human life. Fabric defect detection (also known as flaw detection) is an essential step of quality control in textile production. Defects on the fabric surface degrade product quality, directly affect the grading of the fabric, and lower the price of defective fabric. Defect detection of fabrics is therefore critical to textile quality control.
The focus of fabric defect detection is to identify whether a given fabric image is abnormal relative to a set of normal samples and to locate the corresponding abnormal region in the image. Both tasks have a great influence on quality control in the textile industry. In actual textile production, fabric samples with defects rarely appear, and defects come in a wide variety of unpredictable shapes and textures. The task of detecting fabric defects is therefore difficult to handle with supervised methods, and generative methods are preferred.
In recent years, thanks to the rapid development of deep learning, computer vision techniques have been widely used for detecting fabric defects. Many researchers have turned to convolutional neural networks. Building on the classical fully convolutional network (FCN), the UNet model was proposed: it is optimized from the FCN, inherits the advantages of auto-encoding models, and consists of a symmetric encoder-decoder with skip connections. In the encoder, a series of convolutional layers and successive downsampling layers extract deep features with a large receptive field. The decoder then upsamples the extracted deep features back to the input resolution for pixel-level semantic prediction, and fuses them via skip connections with the high-resolution features of different scales output by the encoder, so as to reduce the spatial information loss caused by downsampling. Based on this elegant U-shaped design, a number of defect detection algorithms, such as MSCDAE and CASAE, have been developed, and these U-shaped convolutional neural network models show good performance in a variety of industrial inspection applications.
Current reconstruction-based defect detection algorithms built on U-shaped convolutional neural networks follow an unsupervised approach: only the distribution of normal data is modeled during training. At test time, an anomaly score for the fabric is computed from the difference between the input image and the image reconstructed by the model. The whole detection process assumes that the model is insensitive to the reconstruction of abnormal regions; in practice, however, the generalization ability of convolutional networks is strong and anomalies are often reconstructed well too, so such generative networks suffer a high miss rate in defect detection.
The main shortcomings of current defect detection are: (1) the miss rate of cloth defect detection is too high; (2) defect localization is inaccurate.
Disclosure of Invention
The invention aims to overcome the defects in the prior art and provides a defect detection method based on a Swin Transformer structure combined with SSIM and GMSD, which not only effectively improves the accuracy of defect detection, but also improves the accuracy of defect localization and reduces the false detection rate.
In order to achieve the above purpose, the invention is realized by adopting the following technical scheme:
In a first aspect, the present invention provides a defect detection method based on a Swin Transformer structure combined with SSIM and GMSD, comprising:
acquiring an industrial defect picture, inputting it into a pre-constructed Swin Transformer-based feature extraction network for feature learning, and reconstructing the defect-free region information of the learned feature map to obtain a reconstructed picture;
calculating the structural similarity SSIM of the industrial defect picture and the reconstructed picture to obtain a defect abnormal feature map based on the structural similarity;
calculating gradient amplitude similarity deviation GMSD between the industrial defect picture and the reconstructed picture to obtain a defect abnormal feature map based on gradient amplitude similarity;
and fusing the defect abnormal characteristic map based on the structural similarity with the defect abnormal characteristic map based on the gradient amplitude similarity to obtain a final defect detection effect map.
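To make the overall flow concrete, the following is a minimal, hedged sketch of how the four steps above chain together. The functions reconstruct, ssim_anomaly_map, gms_anomaly_map and fuse are hypothetical placeholders for the operations detailed in the sections below, not APIs defined by the invention.

```python
# High-level sketch of the four-step pipeline; the injected callables are hypothetical
# stand-ins for the Swin Transformer reconstruction, SSIM map, GMS map and fusion steps.
import numpy as np

def detect_defects(x, reconstruct, ssim_anomaly_map, gms_anomaly_map, fuse):
    y = reconstruct(x)              # Swin Transformer-based reconstruction (step 1)
    m_s = ssim_anomaly_map(x, y)    # structural-similarity anomaly map (step 2)
    m_g = gms_anomaly_map(x, y)     # gradient amplitude similarity anomaly map (step 3)
    return fuse(m_s, m_g)           # fused defect detection effect map (step 4)

# usage with trivial stand-ins, only to show the call pattern:
x = np.random.rand(256, 256)
m = detect_defects(x, reconstruct=lambda a: a,
                   ssim_anomaly_map=lambda a, b: np.abs(a - b),
                   gms_anomaly_map=lambda a, b: np.abs(a - b),
                   fuse=lambda a, b: 0.5 * (a + b))
print(m.shape)
```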
Further, the step of obtaining the industrial defect picture, inputting a pre-constructed Swin Transformer-based feature extraction network to perform feature learning, and reconstructing the non-defect area information of the learned feature to obtain a reconstructed picture, including:
acquiring an industrial defect picture and inputting it into a pre-constructed Swin Transformer-based feature extraction network, wherein the Swin Transformer-based feature extraction network comprises a Patch Partition module, a Swin Transformer Block module, a Patch Merging module and a PixelShuffle module, and the size of the industrial defect picture is H×W×3;
the input industrial defect picture enters the Patch Partition module, which converts the spatial information into channel information so that the input meets the input format of the Swin Transformer Block module;
inputting the converted industrial defect picture into a Swin Transformer Block module for feature learning to obtain a learned feature map;
and inputting the learned feature map into the Patch Merging module or the PixelShuffle module to perform downsampling or upsampling operations, respectively, to obtain a reconstructed picture, wherein the reconstructed picture only contains normal image regions, and the information of defect regions is not reconstructed.
Further, the input industrial defect picture enters the Patch Partition module, which converts the spatial information into channel information, comprising:
performing a 1×1 convolution on the input industrial defect picture to obtain a first feature map of size H×W×C, and downsampling the first feature map by a factor of 4 to obtain a second feature map of size (H/4)×(W/4)×C, where (H/4)×(W/4) represents the number of patches and C represents the information dimension of each patch.
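The following is a minimal sketch of this spatial-to-channel conversion in PyTorch. It assumes the 4× downsampling is realized with nn.PixelUnshuffle followed by a linear projection back to C; the patent does not fix that implementation detail, so this is only an illustration of the 1×1 convolution plus 4× patchification described above.

```python
# Minimal sketch of the Patch Partition step (assumption: PixelUnshuffle + linear
# projection implement the 4x spatial-to-channel conversion back to dimension C).
import torch
import torch.nn as nn

class PatchPartition(nn.Module):
    def __init__(self, in_ch=3, embed_dim=96):
        super().__init__()
        self.conv1x1 = nn.Conv2d(in_ch, embed_dim, kernel_size=1)  # H x W x 3 -> H x W x C
        self.unshuffle = nn.PixelUnshuffle(4)                      # -> H/4 x W/4 x 16C
        self.proj = nn.Linear(16 * embed_dim, embed_dim)           # back to patch dimension C

    def forward(self, x):                      # x: (B, 3, H, W)
        x = self.conv1x1(x)                    # (B, C, H, W)
        x = self.unshuffle(x)                  # (B, 16C, H/4, W/4)
        b, c, h, w = x.shape
        x = x.flatten(2).transpose(1, 2)       # (B, H/4*W/4, 16C) -- one row per patch
        return self.proj(x), (h, w)            # (B, num_patches, C)

patches, (h, w) = PatchPartition()(torch.randn(1, 3, 256, 256))
print(patches.shape)                           # torch.Size([1, 4096, 96])
```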
Further, inputting the transformed industrial defect picture to the Swin Transformer Block module for feature learning to obtain a learned feature map, including:
sending the second feature map to the Swin Transformer Block module for learning, and performing a layer normalization (LN) operation on the input;
performing a multi-head self-attention operation to learn the relational representation between local patches;
adding the patch representation output by the multi-head self-attention operation to the original input to obtain a new patch representation;
performing an LN operation on the new patch representation and then a multi-layer perceptron (MLP) operation to further fit the patch representation;
adding the fitted patch representation to the new patch representation to form the final patch representation obtained through the multi-head self-attention learning stage;
and sending the obtained final patch representation to a multi-head self-attention learning stage with a shifted window to obtain a patch representation containing both local relational information and global relational information, thereby obtaining the learned third feature map. A simplified code sketch of this block is given below.
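The sketch keeps the LN, window attention, residual, LN, MLP, residual structure and the shifted-window variant described above, but omits the relative position bias and the attention mask of the original Swin Transformer Block, so it is an illustration rather than a faithful reimplementation.

```python
# Simplified sketch of a W-MSA block followed by a SW-MSA block (no relative
# position bias, no shifted-window attention mask).
import torch
import torch.nn as nn

class WindowAttentionBlock(nn.Module):
    def __init__(self, dim=96, heads=3, window=8, shift=0):
        super().__init__()
        self.window, self.shift = window, shift
        self.norm1 = nn.LayerNorm(dim)
        self.attn = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.norm2 = nn.LayerNorm(dim)
        self.mlp = nn.Sequential(nn.Linear(dim, 4 * dim), nn.GELU(), nn.Linear(4 * dim, dim))

    def forward(self, x):                               # x: (B, H, W, C)
        b, h, w, c = x.shape
        shortcut = x
        x = self.norm1(x)                               # layer normalization
        if self.shift:                                  # shifted-window variant
            x = torch.roll(x, (-self.shift, -self.shift), dims=(1, 2))
        win = self.window                               # partition into non-overlapping windows
        x = x.view(b, h // win, win, w // win, win, c).permute(0, 1, 3, 2, 4, 5)
        x = x.reshape(-1, win * win, c)
        x, _ = self.attn(x, x, x)                       # self-attention inside each window
        x = x.view(b, h // win, w // win, win, win, c).permute(0, 1, 3, 2, 4, 5).reshape(b, h, w, c)
        if self.shift:
            x = torch.roll(x, (self.shift, self.shift), dims=(1, 2))
        x = shortcut + x                                # residual after (S)W-MSA
        return x + self.mlp(self.norm2(x))              # residual after LN + MLP

block = nn.Sequential(WindowAttentionBlock(shift=0), WindowAttentionBlock(shift=4))
print(block(torch.randn(1, 64, 64, 96)).shape)          # torch.Size([1, 64, 64, 96])
```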
Further, the inputting of the learned feature map into the Patch Merging module or the PixelShuffle module to perform downsampling or upsampling operations, respectively, to obtain a reconstructed picture, comprises:
downsampling the third feature map learned by the Swin Transformer Block module by a factor of 2 to obtain a fourth feature map of size (H/8)×(W/8)×2C;
passing the fourth feature map through a Swin Transformer Block module and then downsampling by a factor of 2 to obtain a fifth feature map of size (H/16)×(W/16)×4C;
passing the fifth feature map through a Swin Transformer Block module and then downsampling by a factor of 2 to obtain a sixth feature map of size (H/32)×(W/32)×8C;
passing the sixth feature map through a Swin Transformer Block module and then upsampling by a factor of 2 to obtain a seventh feature map of size (H/16)×(W/16)×4C;
adding the seventh feature map to the fifth feature map, passing the sum through a Swin Transformer Block module and then upsampling by a factor of 2 to obtain an eighth feature map of size (H/8)×(W/8)×2C;
adding the eighth feature map to the fourth feature map, passing the sum through a Swin Transformer Block module and then upsampling by a factor of 2 to obtain a ninth feature map of size (H/4)×(W/4)×C;
and adding the ninth feature map to the third feature map, passing the sum through a Swin Transformer Block module and then upsampling by a factor of 4 to obtain a reconstructed picture of size H×W×3. A minimal code sketch of this U-shaped data flow is given below.
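In the sketch below, plain convolutional stages stand in for the Swin Transformer Block stages to keep the example short; only the downsampling/upsampling schedule, the additive skip connections and the final 4× upsampling back to H×W×3 follow the description, and the channel widths are the assumed values given above.

```python
# Minimal sketch of the U-shaped reconstruction network's data flow. Conv stages are
# stand-ins for Swin Transformer Block stages; channel widths are assumptions.
import torch
import torch.nn as nn

def stage(c):                                   # stand-in for a Swin Transformer Block stage
    return nn.Sequential(nn.Conv2d(c, c, 3, padding=1), nn.GELU())

def down(c):                                    # Patch Merging-style 2x downsample, 2x channels
    return nn.Conv2d(c, 2 * c, kernel_size=2, stride=2)

def up(c):                                      # PixelShuffle 2x upsample, halves channels
    return nn.Sequential(nn.Conv2d(c, 2 * c, 1), nn.PixelShuffle(2))

class ReconstructionNet(nn.Module):
    def __init__(self, c=96):
        super().__init__()
        self.stem = nn.Conv2d(3, c, kernel_size=4, stride=4)          # H x W x 3 -> H/4 x W/4 x C
        self.enc1, self.down1 = stage(c), down(c)                     # -> H/8  x W/8  x 2C
        self.enc2, self.down2 = stage(2 * c), down(2 * c)             # -> H/16 x W/16 x 4C
        self.enc3, self.down3 = stage(4 * c), down(4 * c)             # -> H/32 x W/32 x 8C
        self.bottom, self.up3 = stage(8 * c), up(8 * c)               # -> H/16 x W/16 x 4C
        self.dec2, self.up2 = stage(4 * c), up(4 * c)                 # -> H/8  x W/8  x 2C
        self.dec1, self.up1 = stage(2 * c), up(2 * c)                 # -> H/4  x W/4  x C
        self.head = nn.Sequential(stage(c), nn.Conv2d(c, 3 * 16, 1), nn.PixelShuffle(4))

    def forward(self, x):
        f3 = self.enc1(self.stem(x))            # third feature map
        f4 = self.enc2(self.down1(f3))          # fourth
        f5 = self.enc3(self.down2(f4))          # fifth
        f6 = self.bottom(self.down3(f5))        # sixth
        f7 = self.up3(f6)                       # seventh, added to fifth
        f8 = self.up2(self.dec2(f7 + f5))       # eighth, added to fourth
        f9 = self.up1(self.dec1(f8 + f4))       # ninth, added to third
        return self.head(f9 + f3)               # reconstructed H x W x 3 picture

print(ReconstructionNet()(torch.randn(1, 3, 256, 256)).shape)   # torch.Size([1, 3, 256, 256])
```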
Further, calculating the structural similarity SSIM of the industrial defect picture and the reconstructed picture to obtain a defect abnormal feature map based on the structural similarity, wherein the formula is as follows:
SSIM(X, Y) = [(2·u_X·u_Y + c_1)(2·δ_XY + c_2)] / [(u_X² + u_Y² + c_1)(δ_X² + δ_Y² + c_2)]
wherein X is the original input picture, Y is the reconstructed picture, u_X and u_Y represent the means of images X and Y, δ_XY represents the covariance of images X and Y, δ_X² and δ_Y² represent the variances of images X and Y, and c_1 and c_2 are constants.
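A minimal sketch of computing the per-pixel SSIM-based anomaly map on grayscale images is shown below. The local window size, the constants c_1 and c_2, and the choice of 1 − SSIM as the anomaly score are assumptions not fixed by the text.

```python
# Minimal sketch of the structural-similarity anomaly map M_S (grayscale, local
# uniform window; window size and constants are assumptions).
import numpy as np
from scipy.ndimage import uniform_filter

def ssim_anomaly_map(X, Y, win=11, c1=(0.01 * 255) ** 2, c2=(0.03 * 255) ** 2):
    X, Y = X.astype(np.float64), Y.astype(np.float64)
    mu_x, mu_y = uniform_filter(X, win), uniform_filter(Y, win)      # local means u_X, u_Y
    var_x = uniform_filter(X * X, win) - mu_x ** 2                   # local variance of X
    var_y = uniform_filter(Y * Y, win) - mu_y ** 2                   # local variance of Y
    cov_xy = uniform_filter(X * Y, win) - mu_x * mu_y                # local covariance
    ssim = ((2 * mu_x * mu_y + c1) * (2 * cov_xy + c2)) / \
           ((mu_x ** 2 + mu_y ** 2 + c1) * (var_x + var_y + c2))
    return 1.0 - ssim        # low similarity -> high anomaly score (assumed convention)

X = np.random.randint(0, 256, (256, 256))
Y = np.random.randint(0, 256, (256, 256))
M_S = ssim_anomaly_map(X, Y)
print(M_S.shape, float(M_S.min()), float(M_S.max()))
```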
Further, calculating a gradient amplitude similarity deviation GMSD between the industrial defect picture and the reconstructed picture to obtain a defect abnormal feature map based on gradient amplitude similarity, wherein the formula is as follows:
Figure BDA0003648249370000055
wherein g (X) and g (Y) represent gradient amplitude graphs of X and Y, and the calculation formula is as follows:
Figure BDA0003648249370000056
/>
Figure BDA0003648249370000057
wherein h is x And h y Is a 3 x 3 Prewitt filter along the x and y dimensions and is a convolution operation.
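A minimal sketch of the gradient amplitude similarity map with 3×3 Prewitt filters is shown below. The stability constant c and the use of 1 − GMS as the anomaly score are assumptions.

```python
# Minimal sketch of the gradient amplitude similarity anomaly map M_g using
# 3x3 Prewitt filters; the constant c is an assumed value.
import numpy as np
from scipy.ndimage import convolve

def gms_anomaly_map(X, Y, c=170.0):
    h_x = np.array([[1, 0, -1], [1, 0, -1], [1, 0, -1]]) / 3.0   # Prewitt filter along x
    h_y = h_x.T                                                  # Prewitt filter along y
    def grad_mag(img):
        img = img.astype(np.float64)
        return np.sqrt(convolve(img, h_x) ** 2 + convolve(img, h_y) ** 2)
    gX, gY = grad_mag(X), grad_mag(Y)
    gms = (2 * gX * gY + c) / (gX ** 2 + gY ** 2 + c)            # similarity in [0, 1]
    return 1.0 - gms          # dissimilar gradients -> high anomaly score (assumed convention)

X = np.random.randint(0, 256, (256, 256))
Y = np.random.randint(0, 256, (256, 256))
M_g = gms_anomaly_map(X, Y)
print(M_g.shape)                               # (256, 256)
print(np.std(1.0 - M_g))                       # scalar GMSD score: std of the GMS map
```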
Further, fusing the defect abnormal feature map based on the structural similarity with the defect abnormal feature map based on the gradient amplitude similarity to obtain a final defect detection effect map, comprises:
applying Gaussian smoothing to M_S and M_g to remove the outliers in M_S and M_g, and then adding the smoothed M_S and M_g and averaging to obtain the final detection result M.
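A minimal sketch of this fusion step follows; the Gaussian smoothing sigma is an assumption.

```python
# Minimal sketch of the fusion step: Gaussian-smooth M_S and M_g to suppress
# isolated outliers, then average the two maps (sigma is an assumed value).
import numpy as np
from scipy.ndimage import gaussian_filter

def fuse(M_S, M_g, sigma=4.0):
    return 0.5 * (gaussian_filter(M_S, sigma) + gaussian_filter(M_g, sigma))

M_S = np.random.rand(256, 256)
M_g = np.random.rand(256, 256)
M = fuse(M_S, M_g)
print(M.shape)        # final defect detection map; thresholding M localizes the defect
```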
In a second aspect, the present invention provides a defect detection device based on a Swin Transformer structure combined with SSIM and GMSD, comprising:
the feature learning and reconstruction unit, which is used for acquiring an industrial defect picture, inputting it into a pre-constructed Swin Transformer-based feature extraction network for feature learning, and reconstructing the defect-free region information of the learned feature map to obtain a reconstructed picture;
the structural similarity calculating unit is used for calculating the structural similarity SSIM of the industrial defect picture and the reconstructed picture to obtain a defect abnormal characteristic diagram based on the structural similarity;
the gradient amplitude similarity deviation calculation unit is used for calculating the gradient amplitude similarity deviation GMSD between the industrial defect picture and the reconstructed picture to obtain a defect abnormal characteristic diagram based on gradient amplitude similarity;
and the feature fusion unit is used for fusing the defect abnormal feature map based on the structural similarity with the defect abnormal feature map based on the gradient amplitude similarity to obtain a final defect detection effect map.
In a third aspect, the present invention provides a computer-readable storage medium having stored thereon a computer program which, when executed by a processor, performs the steps of any one of the methods described above.
Compared with the prior art, the invention has the beneficial effects that:
the invention provides a defect detection method based on a SwinTransformer structure and combined with SSIM and GMSD, which is characterized in that a novel cloth defect detection method based on deep learning is established through a characteristic extraction network based on the SwinTransformer structure, the aim of intelligent detection of industrial materials is fulfilled, compared with the existing defect detection algorithm, the characteristic extraction network based on the SwinTransformer structure is used for extracting and reconstructing the image by utilizing the SwinTransformer structure, global information in the image can be fully utilized, the reconstruction of the image is facilitated, the difference value between an input image and a reconstructed image is calculated in a model by utilizing Structural Similarity (SSIM) and gradient amplitude similarity deviation (GMSD), the integrity and global gradient information of the image can be fully considered, and the final defect detection result is obtained by fusing the defect abnormal characteristic image calculated by the SSIM and the defect positioning accuracy of the algorithm.
Drawings
FIG. 1 is a flow chart of a defect detection method based on a Swin Transformer structure combined with SSIM and GMSD according to an embodiment of the present invention;
FIG. 2 is a flowchart of a defect detection algorithm provided by an embodiment of the present invention;
FIG. 3 is a diagram of a Swin TransformerNet architecture provided by an embodiment of the invention;
FIG. 4 is a schematic illustration of an industrial defect picture provided by an embodiment of the present invention;
FIG. 5 is a schematic view of a real label of an industrial defect picture provided by an embodiment of the present invention;
FIG. 6 is a graph of the resulting defect detection effect provided by embodiments of the present invention.
Detailed Description
The invention is further described below with reference to the accompanying drawings. The following examples are only for more clearly illustrating the technical aspects of the present invention, and are not intended to limit the scope of the present invention.
Example 1
This embodiment describes a defect detection method based on a Swin Transformer structure combined with SSIM and GMSD, comprising:
acquiring an industrial defect picture, inputting it into a pre-constructed Swin Transformer-based feature extraction network for feature learning, and reconstructing the defect-free region information of the learned feature map to obtain a reconstructed picture;
calculating the structural similarity SSIM of the industrial defect picture and the reconstructed picture to obtain a defect abnormal feature map based on the structural similarity;
calculating gradient amplitude similarity deviation GMSD between the industrial defect picture and the reconstructed picture to obtain a defect abnormal feature map based on gradient amplitude similarity;
and fusing the defect abnormal characteristic map based on the structural similarity with the defect abnormal characteristic map based on the gradient amplitude similarity to obtain a final defect detection effect map.
The defect detection method based on the Swin Transformer structure combined with SSIM and GMSD provided by this embodiment specifically comprises the following steps:
step 10: and inputting the input original image (X) into the model to learn the characteristics, so as to realize the reconstruction of the information of the non-defect area, and finally obtaining the network reconstruction picture (Y). Specific network model as in fig. 3, the model is explained in detail below:
step 20: calculating the Structural Similarity (SSIM) between the input original picture (X) and the reconstructed picture (Y) obtained in the step 10 to obtain a defect anomaly characteristic map (M) based on the structural similarity S ) The SSIM calculation formula is as follows:
Figure BDA0003648249370000102
wherein X is the original input picture, Y is the reconstructed picture, u X 、u Y Representing the mean, delta, of images X and Y XY Representing the covariance of images X and Y, delta X 2 、δ Y 2 Representation ofRepresenting the variance of images X and Y, c 1 、c 2 Is constant.
Step 30: calculating the gradient amplitude similarity deviation (GMSD) between the input original picture and the reconstructed picture to obtain a defect abnormal feature map based on the gradient amplitude similarity (M_g). The GMS calculation formula is as follows:
GMS(X, Y) = (2·g(X)·g(Y) + c) / (g(X)² + g(Y)² + c)
wherein g(X) and g(Y) represent the gradient amplitude maps of X and Y, c is a constant, and the calculation formulas are as follows:
g(X) = √((X ⊗ h_x)² + (X ⊗ h_y)²)
g(Y) = √((Y ⊗ h_x)² + (Y ⊗ h_y)²)
wherein h_x and h_y are 3×3 Prewitt filters along the x and y directions, and ⊗ denotes the convolution operation.
Step 40: m obtained by step 20 and step 30 S And M g And fusing to obtain a final defect detection effect diagram. The specific operation is as follows: will M S And M g Removing M by Gaussian smoothing S And M g Is an outlier in (a). Then M after smoothing S And M g And adding and averaging to obtain a final detection result M.
Example 2
This embodiment provides a defect detection device based on a Swin Transformer structure combined with SSIM and GMSD, comprising:
the feature learning and reconstruction unit, which is used for acquiring an industrial defect picture, inputting it into a pre-constructed Swin Transformer-based feature extraction network for feature learning, and reconstructing the defect-free region information of the learned feature map to obtain a reconstructed picture;
the structural similarity calculating unit is used for calculating the structural similarity SSIM of the industrial defect picture and the reconstructed picture to obtain a defect abnormal characteristic diagram based on the structural similarity;
the gradient amplitude similarity deviation calculation unit is used for calculating the gradient amplitude similarity deviation GMSD between the industrial defect picture and the reconstructed picture to obtain a defect abnormal characteristic diagram based on gradient amplitude similarity;
and the feature fusion unit is used for fusing the defect abnormal feature map based on the structural similarity with the defect abnormal feature map based on the gradient amplitude similarity to obtain a final defect detection effect map.
Example 3
This embodiment provides a computer-readable storage medium having stored thereon a computer program which, when executed by a processor, performs the steps of any one of the methods described above.
The foregoing is merely a preferred embodiment of the present invention, and it should be noted that modifications and variations could be made by those skilled in the art without departing from the technical principles of the present invention, and such modifications and variations should also be regarded as being within the scope of the invention.

Claims (4)

1. A defect detection method based on a Swin Transformer structure combined with SSIM and GMSD, comprising:
acquiring an industrial defect picture, inputting it into a pre-constructed Swin Transformer-based feature extraction network for feature learning, and reconstructing the defect-free region information of the learned feature map to obtain a reconstructed picture;
calculating the structural similarity SSIM of the industrial defect picture and the reconstructed picture to obtain a defect abnormal feature map based on the structural similarity;
calculating gradient amplitude similarity deviation GMSD between the industrial defect picture and the reconstructed picture to obtain a defect abnormal feature map based on gradient amplitude similarity;
fusing the defect abnormal feature map based on the structural similarity with the defect abnormal feature map based on the gradient amplitude similarity to obtain a final defect detection effect map;
the method for obtaining the industrial defect picture, inputting a pre-constructed Swin transform-based feature extraction network for feature learning, reconstructing the learned feature with no defect region information, and obtaining a reconstructed picture, comprises the following steps:
acquiring an industrial defect picture and inputting it into a pre-constructed Swin Transformer-based feature extraction network, wherein the Swin Transformer-based feature extraction network comprises a Patch Partition module, a Swin Transformer Block module, a Patch Merging module and a PixelShuffle module, and the size of the industrial defect picture is H×W×3;
the input industrial defect picture enters the Patch Partition module, which converts the spatial information into channel information so that the input meets the input format of the Swin Transformer Block module;
inputting the converted industrial defect picture into a Swin Transformer Block module for feature learning to obtain a learned feature map;
inputting the learned feature map into the Patch Merging module and the PixelShuffle module to perform downsampling and upsampling operations, respectively, to obtain a reconstructed picture, wherein the reconstructed picture only contains normal image regions, and the information of defect regions is not reconstructed;
the input industrial defect picture entering the Patch Partition module, which converts the spatial information into channel information, comprises the following steps:
performing a 1×1 convolution on the input industrial defect picture to obtain a first feature map of size H×W×C, and downsampling the first feature map by a factor of 4 to obtain a second feature map of size (H/4)×(W/4)×C, where (H/4)×(W/4) represents the number of patches and C represents the information dimension of each patch;
the inputting of the converted industrial defect picture into the Swin Transformer Block module for feature learning to obtain a learned feature map comprises:
sending the second feature map to the Swin Transformer Block module for learning, and performing a layer normalization (LN) operation on the input;
performing a multi-head self-attention operation to learn the relational representation between local patches;
adding the patch representation output by the multi-head self-attention operation to the original input to obtain a new patch representation;
performing an LN operation on the new patch representation and then a multi-layer perceptron (MLP) operation to further fit the patch representation;
adding the fitted patch representation to the new patch representation to form the final patch representation obtained through the multi-head self-attention learning stage;
sending the obtained final patch representation to a multi-head self-attention learning stage with a shifted window to obtain a patch representation containing both local relational information and global relational information, thereby obtaining the learned third feature map;
the step of inputting the learned feature map into the Patch Merging module and the PixelShuffle module for downsampling and upsampling, respectively, to obtain a reconstructed picture comprises the following steps:
downsampling the third feature map learned by the Swin Transformer Block module by a factor of 2 to obtain a fourth feature map of size (H/8)×(W/8)×2C;
passing the fourth feature map through a Swin Transformer Block module and then downsampling by a factor of 2 to obtain a fifth feature map of size (H/16)×(W/16)×4C;
passing the fifth feature map through a Swin Transformer Block module and then downsampling by a factor of 2 to obtain a sixth feature map of size (H/32)×(W/32)×8C;
passing the sixth feature map through a Swin Transformer Block module and then upsampling by a factor of 2 to obtain a seventh feature map of size (H/16)×(W/16)×4C;
adding the seventh feature map to the fifth feature map, passing the sum through a Swin Transformer Block module and then upsampling by a factor of 2 to obtain an eighth feature map of size (H/8)×(W/8)×2C;
adding the eighth feature map to the fourth feature map, passing the sum through a Swin Transformer Block module and then upsampling by a factor of 2 to obtain a ninth feature map of size (H/4)×(W/4)×C;
adding the ninth feature map to the third feature map, passing the sum through a Swin Transformer Block module and then upsampling by a factor of 4 to obtain a reconstructed picture of size H×W×3;
calculating the structural similarity SSIM of the industrial defect picture and the reconstructed picture to obtain a defect abnormal feature map M_S based on the structural similarity, the formula being as follows:
SSIM(X, Y) = [(2·u_X·u_Y + c_1)(2·δ_XY + c_2)] / [(u_X² + u_Y² + c_1)(δ_X² + δ_Y² + c_2)]
wherein X is the original input picture, Y is the reconstructed picture, u_X and u_Y represent the means of images X and Y, δ_XY represents the covariance of images X and Y, δ_X² and δ_Y² represent the variances of images X and Y, and c_1 and c_2 are constants;
calculating the gradient amplitude similarity deviation GMSD between the industrial defect picture and the reconstructed picture to obtain a defect abnormal feature map M_g based on the gradient amplitude similarity, the formula being as follows:
GMS(X, Y) = (2·g(X)·g(Y) + c) / (g(X)² + g(Y)² + c)
wherein g(X) and g(Y) represent the gradient amplitude maps of X and Y, c is a constant, and the calculation formulas are as follows:
g(X) = √((X ⊗ h_x)² + (X ⊗ h_y)²)
g(Y) = √((Y ⊗ h_x)² + (Y ⊗ h_y)²)
wherein h_x and h_y are 3×3 Prewitt filters along the x and y directions, and ⊗ denotes the convolution operation.
2. The defect detection method based on a Swin Transformer structure combined with SSIM and GMSD according to claim 1, wherein fusing the defect abnormal feature map based on the structural similarity with the defect abnormal feature map based on the gradient amplitude similarity to obtain a final defect detection effect map comprises the following steps:
applying Gaussian smoothing to M_S and M_g to remove the outliers in M_S and M_g, and then adding the smoothed M_S and M_g and averaging to obtain the final detection result M.
3. A defect detection device based on a Swin Transformer structure combined with SSIM and GMSD, comprising:
the feature learning and reconstruction unit, which is used for acquiring an industrial defect picture, inputting it into a pre-constructed Swin Transformer-based feature extraction network for feature learning, and reconstructing the defect-free region information of the learned feature map to obtain a reconstructed picture;
the structural similarity calculating unit is used for calculating the structural similarity SSIM of the industrial defect picture and the reconstructed picture to obtain a defect abnormal characteristic diagram based on the structural similarity;
the gradient amplitude similarity deviation calculation unit is used for calculating the gradient amplitude similarity deviation GMSD between the industrial defect picture and the reconstructed picture to obtain a defect abnormal characteristic diagram based on gradient amplitude similarity;
the feature fusion unit is used for fusing the defect abnormal feature map based on the structural similarity with the defect abnormal feature map based on the gradient amplitude similarity to obtain a final defect detection effect map;
wherein the obtaining of the industrial defect picture, the inputting of it into a pre-constructed Swin Transformer-based feature extraction network for feature learning, and the reconstructing of the defect-free region information of the learned feature map to obtain a reconstructed picture comprise the following steps:
acquiring an industrial defect picture and inputting it into a pre-constructed Swin Transformer-based feature extraction network, wherein the Swin Transformer-based feature extraction network comprises a Patch Partition module, a Swin Transformer Block module, a Patch Merging module and a PixelShuffle module, and the size of the industrial defect picture is H×W×3;
the input industrial defect picture enters the Patch Partition module, which converts the spatial information into channel information so that the input meets the input format of the Swin Transformer Block module;
inputting the converted industrial defect picture into a Swin Transformer Block module for feature learning to obtain a learned feature map;
inputting the learned feature map into the Patch Merging module and the PixelShuffle module to perform downsampling and upsampling operations, respectively, to obtain a reconstructed picture, wherein the reconstructed picture only contains normal image regions, and the information of defect regions is not reconstructed;
the input industrial defect picture entering the Patch Partition module, which converts the spatial information into channel information, comprises the following steps:
performing a 1×1 convolution on the input industrial defect picture to obtain a first feature map of size H×W×C, and downsampling the first feature map by a factor of 4 to obtain a second feature map of size (H/4)×(W/4)×C, where (H/4)×(W/4) represents the number of patches and C represents the information dimension of each patch;
the inputting of the converted industrial defect picture into the Swin Transformer Block module for feature learning to obtain a learned feature map comprises:
sending the second feature map to the Swin Transformer Block module for learning, and performing a layer normalization (LN) operation on the input;
performing a multi-head self-attention operation to learn the relational representation between local patches;
adding the patch representation output by the multi-head self-attention operation to the original input to obtain a new patch representation;
performing an LN operation on the new patch representation and then a multi-layer perceptron (MLP) operation to further fit the patch representation;
adding the fitted patch representation to the new patch representation to form the final patch representation obtained through the multi-head self-attention learning stage;
sending the obtained final patch representation to a multi-head self-attention learning stage with a shifted window to obtain a patch representation containing both local relational information and global relational information, thereby obtaining the learned third feature map;
the step of inputting the learned feature map into the Patch Merging module and the PixelShuffle module for downsampling and upsampling, respectively, to obtain a reconstructed picture comprises the following steps:
downsampling the third feature map learned by the Swin Transformer Block module by a factor of 2 to obtain a fourth feature map of size (H/8)×(W/8)×2C;
passing the fourth feature map through a Swin Transformer Block module and then downsampling by a factor of 2 to obtain a fifth feature map of size (H/16)×(W/16)×4C;
passing the fifth feature map through a Swin Transformer Block module and then downsampling by a factor of 2 to obtain a sixth feature map of size (H/32)×(W/32)×8C;
passing the sixth feature map through a Swin Transformer Block module and then upsampling by a factor of 2 to obtain a seventh feature map of size (H/16)×(W/16)×4C;
adding the seventh feature map to the fifth feature map, passing the sum through a Swin Transformer Block module and then upsampling by a factor of 2 to obtain an eighth feature map of size (H/8)×(W/8)×2C;
adding the eighth feature map to the fourth feature map, passing the sum through a Swin Transformer Block module and then upsampling by a factor of 2 to obtain a ninth feature map of size (H/4)×(W/4)×C;
adding the ninth feature map to the third feature map, passing the sum through a Swin Transformer Block module and then upsampling by a factor of 4 to obtain a reconstructed picture of size H×W×3;
calculating the structural similarity SSIM of the industrial defect picture and the reconstructed picture to obtain a defect abnormal feature map M_S based on the structural similarity, the formula being as follows:
SSIM(X, Y) = [(2·u_X·u_Y + c_1)(2·δ_XY + c_2)] / [(u_X² + u_Y² + c_1)(δ_X² + δ_Y² + c_2)]
wherein X is the original input picture, Y is the reconstructed picture, u_X and u_Y represent the means of images X and Y, δ_XY represents the covariance of images X and Y, δ_X² and δ_Y² represent the variances of images X and Y, and c_1 and c_2 are constants;
calculating the gradient amplitude similarity deviation GMSD between the industrial defect picture and the reconstructed picture to obtain a defect abnormal feature map M_g based on the gradient amplitude similarity, the formula being as follows:
GMS(X, Y) = (2·g(X)·g(Y) + c) / (g(X)² + g(Y)² + c)
wherein g(X) and g(Y) represent the gradient amplitude maps of X and Y, c is a constant, and the calculation formulas are as follows:
g(X) = √((X ⊗ h_x)² + (X ⊗ h_y)²)
g(Y) = √((Y ⊗ h_x)² + (Y ⊗ h_y)²)
wherein h_x and h_y are 3×3 Prewitt filters along the x and y directions, and ⊗ denotes the convolution operation.
4. A computer-readable storage medium having stored thereon a computer program, characterized in that, when executed by a processor, the program performs the steps of the method according to any one of claims 1 to 2.
CN202210535974.4A 2022-05-17 2022-05-17 Defect detection method based on Swin Transformer structure combined with SSIM and GMSD Active CN114841977B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210535974.4A CN114841977B (en) 2022-05-17 2022-05-17 Defect detection method based on Swin Transformer structure combined with SSIM and GMSD

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210535974.4A CN114841977B (en) 2022-05-17 2022-05-17 Defect detection method based on Swin Transformer structure combined with SSIM and GMSD

Publications (2)

Publication Number Publication Date
CN114841977A CN114841977A (en) 2022-08-02
CN114841977B true CN114841977B (en) 2023-04-25

Family

ID=82568804

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210535974.4A Active CN114841977B (en) Defect detection method based on Swin Transformer structure combined with SSIM and GMSD

Country Status (1)

Country Link
CN (1) CN114841977B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115330898B (en) * 2022-08-24 2023-06-06 晋城市大锐金马工程设计咨询有限公司 Magazine advertisement embedding method based on improved Swin Transformer

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111932501A (en) * 2020-07-13 2020-11-13 太仓中科信息技术研究院 Seal ring surface defect detection method based on semantic segmentation
CN112116543A (en) * 2020-09-25 2020-12-22 中国科学院自动化研究所 Image restoration method, system and device based on detection type generation framework
CN112802074A (en) * 2021-01-06 2021-05-14 江南大学 Textile flaw detection method based on illumination correction and visual saliency characteristics
CN113902695A (en) * 2021-09-29 2022-01-07 西安工程大学 Detection method for yarn-dyed fabric cut piece defect area
CN114387207A (en) * 2021-12-02 2022-04-22 浙江大学 Tire flaw detection method and model based on self-attention mechanism and dual-field self-adaptation

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8620092B2 (en) * 2010-03-04 2013-12-31 Hewlett-Packard Development Company, L.P. Determining similarity of two images
US10970856B2 (en) * 2018-12-27 2021-04-06 Baidu Usa Llc Joint learning of geometry and motion with three-dimensional holistic understanding
AU2019201467A1 (en) * 2019-03-04 2020-09-24 Canon Kabushiki Kaisha System and method of reducing effects of turbulence in an image
US20200380290A1 (en) * 2019-05-31 2020-12-03 Apple Inc. Machine learning-based prediction of precise perceptual video quality
CN113989228A (en) * 2021-10-27 2022-01-28 西安工程大学 Method for detecting defect area of color texture fabric based on self-attention
CN114022442B (en) * 2021-11-03 2022-11-29 武汉智目智能技术合伙企业(有限合伙) Unsupervised learning-based fabric defect detection algorithm
CN114037684B (en) * 2021-11-08 2024-06-14 南京信息工程大学 Defect detection method based on yolov and attention mechanism model
CN114140353B (en) * 2021-11-25 2023-04-07 苏州大学 Swin-Transformer image denoising method and system based on channel attention

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111932501A (en) * 2020-07-13 2020-11-13 太仓中科信息技术研究院 Seal ring surface defect detection method based on semantic segmentation
CN112116543A (en) * 2020-09-25 2020-12-22 中国科学院自动化研究所 Image restoration method, system and device based on detection type generation framework
CN112802074A (en) * 2021-01-06 2021-05-14 江南大学 Textile flaw detection method based on illumination correction and visual saliency characteristics
CN113902695A (en) * 2021-09-29 2022-01-07 西安工程大学 Detection method for yarn-dyed fabric cut piece defect area
CN114387207A (en) * 2021-12-02 2022-04-22 浙江大学 Tire flaw detection method and model based on self-attention mechanism and dual-field self-adaptation

Also Published As

Publication number Publication date
CN114841977A (en) 2022-08-02

Similar Documents

Publication Publication Date Title
WO2023070911A1 (en) Self-attention-based method for detecting defective area of color-textured fabric
CN111626994A (en) Equipment fault defect diagnosis method based on improved U-Net neural network
CN114782694B (en) Unsupervised anomaly detection method, system, device and storage medium
Shao et al. Uncertainty guided multi-scale attention network for raindrop removal from a single image
CN114742799B (en) Industrial scene unknown type defect segmentation method based on self-supervision heterogeneous network
CN114841977B (en) Defect detection method based on Swin Transformer structure combined with SSIM and GMSD
CN116051549B (en) Method, system, medium and equipment for dividing defects of solar cell
CN114170144A (en) Power transmission line pin defect detection method, equipment and medium
CN114170475A (en) Model training method and device for image classification and storage medium
CN115239642A (en) Detection method, detection device and equipment for hardware defects in power transmission line
CN115546046A (en) Single image defogging method fusing frequency and content characteristics
CN112070712A (en) Printing defect detection method based on self-encoder network
Li et al. Fabric defect segmentation system based on a lightweight GAN for industrial Internet of Things
Shit et al. An encoder‐decoder based CNN architecture using end to end dehaze and detection network for proper image visualization and detection
CN117253034A (en) Image semantic segmentation method and system based on differentiated context
CN114926679B (en) Image classification system and method for performing countermeasure defense
CN116721057A (en) Defect detection method and device based on transducer
CN115512108A (en) Semi-supervised OCT image retina segmentation method based on uncertainty
Guo et al. A water-level measurement method using sparse representation
CN115761606A (en) Box electric energy meter identification method and device based on image processing
CN115619707A (en) Method and system for detecting surface defects of sealing element of automobile door
CN112529881B (en) Power control cabinet cable anomaly identification method and device
CN114758133A (en) Image flaw segmentation method based on super-pixel active learning and semi-supervised learning strategies
CN111985625B (en) Infrared-visible light fused deep neural network and modeling method thereof
JP2023038060A (en) Label generation method, model generation method, label generation device, label generation program, model generation device, and model generation program

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
CB03 Change of inventor or designer information

Inventor after: Jiang Jielin

Inventor after: Zhu Jiale

Inventor after: Guo Haoran

Inventor before: Zhu Jiale

Inventor before: Jiang Jielin

Inventor before: Guo Haoran

CB03 Change of inventor or designer information
GR01 Patent grant
GR01 Patent grant