CN111242955B - Road surface crack image segmentation method based on full convolution neural network - Google Patents

Road surface crack image segmentation method based on full convolution neural network Download PDF

Info

Publication number
CN111242955B
CN111242955B CN202010070397.7A CN202010070397A CN111242955B CN 111242955 B CN111242955 B CN 111242955B CN 202010070397 A CN202010070397 A CN 202010070397A CN 111242955 B CN111242955 B CN 111242955B
Authority
CN
China
Prior art keywords
neural network
convolution neural
full convolution
crack
image segmentation
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202010070397.7A
Other languages
Chinese (zh)
Other versions
CN111242955A (en
Inventor
苏超
董义佳
王文君
袁荣耀
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hohai University HHU
Original Assignee
Hohai University HHU
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hohai University HHU filed Critical Hohai University HHU
Priority to CN202010070397.7A priority Critical patent/CN111242955B/en
Publication of CN111242955A publication Critical patent/CN111242955A/en
Application granted granted Critical
Publication of CN111242955B publication Critical patent/CN111242955B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/10Segmentation; Edge detection
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/10Image acquisition modality
    • G06T2207/10004Still image; Photographic image
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20081Training; Learning
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20084Artificial neural networks [ANN]
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02TCLIMATE CHANGE MITIGATION TECHNOLOGIES RELATED TO TRANSPORTATION
    • Y02T10/00Road transport of goods or passengers
    • Y02T10/10Internal combustion engine [ICE] based vehicles
    • Y02T10/40Engine management systems

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Image Analysis (AREA)

Abstract

The invention discloses a road surface crack image segmentation method based on a full convolution neural network, which is characterized by comprising the following steps of: the method comprises the following steps: photographing the pavement cracks by a camera, constructing a picture data set, and dividing images in the data set into two types of cracking and cracking-free; dividing and labeling the image with the crack in the data set by using Lableme software; dividing the constructed data set into a training set and a testing set; constructing a full convolution neural network for image segmentation; training the full convolution neural network by using the divided training set, and optimizing relevant parameters until a global optimal solution is obtained; and submitting the crack images in the test set to a full convolution neural network, and outputting an image segmentation result.

Description

Road surface crack image segmentation method based on full convolution neural network
Technical Field
The invention belongs to the field of road quality and safety monitoring, and relates to a road surface crack image segmentation method based on a full convolution neural network.
Background
The detection of the pavement cracks is an important traffic maintenance work for ensuring the driving safety, is a crucial step for road management, and aims to obtain pavement maintenance state information. Cracks are the most common type of disease of the road surface. Because the automatic crack detection system has the advantages of high safety, low cost, high efficiency, objectivity and the like, the research of the automatic crack detection system is widely focused.
Image-based crack detection algorithms have been widely discussed and applied over the past few decades. In early studies, methods were based on more traditional digital combination or improved image processing techniques such as thresholding, mathematical morphology, edge detection, and the like. These methods are generally based on photometric and geometric assumptions about the nature of the fracture image. However, these methods are very sensitive to noise because they are performed on a single pixel. This remains a challenging task due to crack non-uniformity and background complexity, such as low contrast with surrounding pavement, and shadows of similar intensity that may occur.
In recent years, deep Convolutional Neural Networks (CNNs) have found widespread use in computer image processing.
Deep neural networks have an impressive manifestation in many computer vision tasks, demonstrating the effectiveness of learned depth features. And constructing a deep Full Convolution Network (FCN) model, performing semantic segmentation on the pavement crack image, realizing automatic detection of pavement cracks, and having important significance for pavement condition monitoring.
Disclosure of Invention
The invention aims to: aiming at the defects of the prior art, the invention provides a road surface crack image segmentation method based on a full convolution neural network, so as to realize crack detection, further improve crack detection efficiency and reduce detection cost.
The technical scheme is as follows: the invention provides a road surface crack image segmentation method based on a full convolution neural network, which comprises the following steps:
s1, photographing a pavement crack by a camera, constructing a picture data set, and dividing an image in the data set into two types of crack-free and crack-free;
s2, carrying out segmentation data labeling on the image with the crack in the data set by using Lableme software;
s3, dividing the constructed data set into a training set (3/5 of the data set) and a test set (2/5 of the data set);
s4, constructing a full convolution neural network for image segmentation;
s5, training the full convolution neural network by utilizing the divided training set, and optimizing relevant parameters until a global optimal solution is obtained;
s6, submitting the crack images in the test set to a full convolution neural network, and outputting an image segmentation result.
And S2, labeling the image with the crack, wherein the labeling is pixel-level labeling.
The built full convolution neural network specifically comprises:
the full convolutional neural network comprises 1 convolutional layer, 16 extended convolutional layers, 1 pyramid sampling layer, 2 decoder layers and 1 semantic output layer.
The step S4 is to make the structure and parameters of the full convolution neural network model specifically as follows:
Figure BDA0002377147810000021
the structure of the full convolution neural network expansion module is specifically as follows:
a. when the span is 1: the input picture is overlapped with the input through the results of point convolution 1x1 (the activation function is Relu 6), depth convolution 3x3 (the activation function is Relu 6) and point convolution 1x1 (the activation function is linear), and then the output is obtained.
b. Span is 2: the input picture is directly output after point convolution 1x1 (the activation function is Relu 6), depth convolution 3x3 (the activation function is Relu 6) and point convolution 1x1 (the activation function is linear).
The pyramid sampling module of the full convolution neural network has 4 paths, which are respectively used for sampling the average value with the side length of 10, and averaging the whole characteristic diagram; three holes with hole ratios of 1,2 and 4 are convolved, and different receptive fields are used for extracting information from the feature map; the output results of the 4 paths are combined in a stacked manner after being scaled to the same size.
The full convolution neural network is characterized in that a decoder 0 is connected with an extended convolution 3#, 16 feature graphs are input, and 32 feature graphs are output; the decoder 1 is connected to the convolutional layer, inputs 8 feature maps, and outputs 32 feature maps.
The full convolution neural network parameter optimization uses a movement optimizer, the dynamic value of the optimizer is 0.9, the initial learning rate is 0.0001, a decreasing learning rate updating strategy is adopted, and each time of 2000 steps of attenuation is 0.9 times of the previous learning rate; and during training, calculating the gradient of the error pair network parameters by taking two pictures, and further updating the network parameters.
The objective function of the full convolution neural network for image segmentation is the average number of cross entropy of a given label and a prediction result prediction of the neural network at each pixel point:
Figure BDA0002377147810000031
/>
the beneficial effects are that: compared with the prior art, the road surface crack image segmentation method based on the full convolution neural network has the following beneficial effects:
1. the recognition efficiency is high: the invention constructs a picture data set and divides the image in the data set into two categories of cracking and cracking-free; dividing and labeling the image with the crack in the data set by using Lableme software; dividing the constructed data set into a training set and a testing set; constructing a full convolution neural network for image segmentation; and training the full convolution neural network by utilizing the divided training set, and optimizing relevant parameters until a global optimal solution is obtained, so that the recognition efficiency of the picture is effectively improved.
2. The crack detection model is applicable to detection tasks of road surface images of the same material after training is completed, and has wide applicability.
3. According to the crack identification method, the detection vehicle with the camera is used for driving on a road to obtain an original image, and then the crack condition is evaluated through neural network processing, so that the requirements of manpower, material resources and financial resources are reduced.
Drawings
FIG. 1 is an overall flow chart designed based on the method of the present invention;
FIG. 2 is a block diagram of a full convolutional neural network according to the present invention;
FIG. 3 shows the image segmentation result of the road surface crack by the method of the invention.
Detailed Description
The present invention is further illustrated in the accompanying drawings and detailed description which are to be understood as being merely illustrative of the invention and not limiting of its scope, and various modifications of the invention, which are equivalent to those skilled in the art upon reading the invention, will fall within the scope of the invention as defined in the appended claims.
In the present embodiment, an image of a road surface crack image is subjected to image segmentation to detect cracks. Firstly, carrying out preparation work of a data set, photographing a campus road by using a camera, acquiring an original crack image, and constructing the data set. Then, the image acquired by the camera is cut on the computer, the size of the cut image is 320X320, and the data set comprises 6000 pictures after the operation of enhancing the data set. The number of images with and without cracks is 3000, and the number of images for model training and testing is 1800 and 1200 respectively. And then, the Lableme software is used for marking the split data of the cracked image in the data set. The model construction and training work is completed under the Tensorflow framework, the batch size is set to 2, the movement optimizer is used for optimizing the cost function, the initial learning rate is set to 0.0001, the descending learning rate updating strategy is adopted, and each time the learning rate is attenuated to be 0.9 times of the previous learning rate after 2000 steps. During training, two pictures are taken to calculate the gradient of the error to the network parameters, so that the network parameters are updated. The neural network inputs pictures with the image size unified to 320×320. The fracture image segmentation results are shown in fig. 3.
The experimental platform environment is configured as follows:
intel (R) Core (TM) [email protected], NVIDIA GTX1080Ti video card, 16G DDR4 memory (main frequency 2400 MHz), 11G video memory (frequency 1.4 GHz). The whole training process is completed on the GPU in an acceleration way.
Referring to fig. 1, the method for dividing the road surface crack image according to the present invention comprises the following specific steps:
s1, photographing pavement cracks by using a camera, constructing a data set, and dividing images in the data set into two types of cracks and crack-free images;
s2, carrying out segmentation data labeling on the image with the crack in the data set by using Lableme software;
s3, dividing the constructed data set into a training set (3/5 of the data set) and a test set (2/5 of the data set);
s4, constructing a full convolution neural network for image segmentation;
the full convolution neural network consists of 1 convolution layer, 16 expansion convolution layers, 1 pyramid sampling layer, 2 decoder layers and 1 semantic output layer.
The step S4 is a full-product neural network model structure and parameters specifically as follows:
Figure BDA0002377147810000041
Figure BDA0002377147810000051
s5, training the full convolution neural network by using the divided training set, and optimizing relevant parameters until a global optimal solution is obtained;
s6, submitting the crack images in the test set to a full convolution neural network, and outputting an image segmentation result;
the structure of the full convolution neural network expansion module is specifically as follows:
a. when the span is 1: the input picture is overlapped with the input through the results of point convolution 1x1 (the activation function is Relu 6), depth convolution 3x3 (the activation function is Relu 6) and point convolution 1x1 (the activation function is linear), and then the output is obtained.
b. Span is 2: the input picture is directly output after point convolution 1x1 (the activation function is Relu 6), depth convolution 3x3 (the activation function is Relu 6) and point convolution 1x1 (the activation function is linear).
The pyramid sampling module of the full convolution neural network has 4 paths, which are respectively used for sampling the average value with the side length of 10, and averaging the whole characteristic diagram; three holes with hole ratios of 1,2 and 4 are convolved, and different receptive fields are used to extract information from the feature map. The output results of the 4 paths are combined in a stacked manner after being scaled to the same size.
The full convolution neural network is characterized in that a decoder 0 is connected with an extended convolution 3#, 16 feature graphs are input, and 32 feature graphs are output; the decoder 1 is connected to the convolutional layer, inputs 8 feature maps, and outputs 32 feature maps.
The full convolution neural network parameter optimization uses a movement optimizer, the dynamic value of the optimizer is 0.9, the initial learning rate is 0.0001, a decreasing learning rate updating strategy is adopted, and each time of 2000 steps of attenuation is 0.9 times of the previous learning rate. During training, two pictures are taken to calculate the gradient of the error to the network parameters, so that the network parameters are updated.
The objective function of the full convolution neural network for image segmentation is the average number of cross entropy of a given label and a prediction result prediction of the neural network at each pixel point:
Figure BDA0002377147810000061
it should be noted that modifications and adaptations to the present invention may occur to one skilled in the art without departing from the principles of the present invention and are intended to be comprehended within the scope of the present invention. The components not explicitly described in this embodiment can be implemented by using the prior art.

Claims (4)

1. A road surface crack image segmentation method based on a full convolution neural network is characterized by comprising the following steps of: the method comprises the following steps:
s1, photographing a pavement crack by a camera, constructing a picture data set, and dividing an image in the data set into two types of crack-free and crack-free;
s2, carrying out segmentation data labeling on the image with the crack in the data set by using Lableme software;
s2, labeling the image with the crack, wherein the labeling is pixel level labeling;
s3, dividing the constructed data set into a training set and a testing set;
s4, constructing a full convolution neural network for image segmentation;
the built full convolution neural network specifically comprises:
the full convolution neural network comprises 1 convolution layer, 16 expansion convolution layers, 1 pyramid sampling layer, 2 decoder layers and 1 semantic output layer;
the step S4 is to make the structure and parameters of the full convolution neural network model specifically as follows:
Figure QLYQS_1
the structure of the full convolution neural network expansion module is specifically as follows:
a. when the span is 1: the input picture is overlapped with the input through the results of point convolution 1x1 (the activation function is Relu 6), depth convolution 3x3 (the activation function is Relu 6) and point convolution 1x1 (the activation function is linear), and then output is obtained;
b. span is 2: the input picture is directly output after point convolution 1x1 (the activation function is Relu 6), depth convolution 3x3 (the activation function is Relu 6) and point convolution 1x1 (the activation function is linear);
the pyramid sampling module of the full convolution neural network has 4 paths, which are respectively used for sampling the average value with the side length of 10, and averaging the whole characteristic diagram; three holes with hole ratios of 1,2 and 4 are convolved, and different receptive fields are used for extracting information from the feature map; the output results of the 4 paths are combined in a stacking mode after being scaled to the same size;
s5, training the full convolution neural network by utilizing the divided training set, and optimizing relevant parameters until a global optimal solution is obtained;
s6, submitting the crack images in the test set to a full convolution neural network, and outputting an image segmentation result.
2. The road surface crack image segmentation method based on the full convolution neural network as set forth in claim 1, wherein the method comprises the following steps: the full convolution neural network is characterized in that a decoder 0 is connected with an extended convolution 3#, 16 feature graphs are input, and 32 feature graphs are output; the decoder 1 is connected to the convolutional layer, inputs 8 feature maps, and outputs 32 feature maps.
3. The road surface crack image segmentation method based on the full convolution neural network as set forth in claim 1, wherein the method comprises the following steps: the full convolution neural network parameter optimization uses a movement optimizer, the dynamic value of the optimizer is 0.9, the initial learning rate is 0.0001, a decreasing learning rate updating strategy is adopted, and each time of 2000 steps of attenuation is 0.9 times of the previous learning rate; and during training, calculating the gradient of the error pair network parameters by taking two pictures, and further updating the network parameters.
4. The road surface crack image segmentation method based on the full convolution neural network as set forth in claim 1, wherein the method comprises the following steps: the objective function of the full convolution neural network for image segmentation is the average number of cross entropy of a given label and a prediction result prediction of the neural network at each pixel point:
Figure QLYQS_2
/>
CN202010070397.7A 2020-01-21 2020-01-21 Road surface crack image segmentation method based on full convolution neural network Active CN111242955B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010070397.7A CN111242955B (en) 2020-01-21 2020-01-21 Road surface crack image segmentation method based on full convolution neural network

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010070397.7A CN111242955B (en) 2020-01-21 2020-01-21 Road surface crack image segmentation method based on full convolution neural network

Publications (2)

Publication Number Publication Date
CN111242955A CN111242955A (en) 2020-06-05
CN111242955B true CN111242955B (en) 2023-04-21

Family

ID=70872858

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010070397.7A Active CN111242955B (en) 2020-01-21 2020-01-21 Road surface crack image segmentation method based on full convolution neural network

Country Status (1)

Country Link
CN (1) CN111242955B (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111539494A (en) * 2020-07-08 2020-08-14 浙江浙能天然气运行有限公司 Hydraulic protection damage detection method based on U-Net and SVM
CN112233105A (en) * 2020-10-27 2021-01-15 江苏科博空间信息科技有限公司 Road crack detection method based on improved FCN
CN113486721B (en) * 2021-06-09 2023-08-29 北京科技大学 Pavement crack detection method based on rolling optimization crack classification recognition model
CN114331869B (en) * 2021-12-01 2022-08-19 西南科技大学 Dam face crack semantic segmentation method

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107766794A (en) * 2017-09-22 2018-03-06 天津大学 The image, semantic dividing method that a kind of Fusion Features coefficient can learn
CN109360206A (en) * 2018-09-08 2019-02-19 华中农业大学 Crop field spike of rice dividing method based on deep learning
CN110502972A (en) * 2019-07-05 2019-11-26 广东工业大学 A kind of pavement crack segmentation and recognition methods based on deep learning
CN110533629A (en) * 2019-07-10 2019-12-03 湖南交工智能技术有限公司 A kind of detection method and detection device of Bridge Crack

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110838124B (en) * 2017-09-12 2021-06-18 深圳科亚医疗科技有限公司 Method, system, and medium for segmenting images of objects having sparse distribution

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107766794A (en) * 2017-09-22 2018-03-06 天津大学 The image, semantic dividing method that a kind of Fusion Features coefficient can learn
CN109360206A (en) * 2018-09-08 2019-02-19 华中农业大学 Crop field spike of rice dividing method based on deep learning
CN110502972A (en) * 2019-07-05 2019-11-26 广东工业大学 A kind of pavement crack segmentation and recognition methods based on deep learning
CN110533629A (en) * 2019-07-10 2019-12-03 湖南交工智能技术有限公司 A kind of detection method and detection device of Bridge Crack

Also Published As

Publication number Publication date
CN111242955A (en) 2020-06-05

Similar Documents

Publication Publication Date Title
CN111242955B (en) Road surface crack image segmentation method based on full convolution neural network
CN107507173B (en) No-reference definition evaluation method and system for full-slice image
CN109583345B (en) Road recognition method, device, computer device and computer readable storage medium
CN111767882A (en) Multi-mode pedestrian detection method based on improved YOLO model
CN112884064A (en) Target detection and identification method based on neural network
CN109711407B (en) License plate recognition method and related device
CN108596338A (en) A kind of acquisition methods and its system of neural metwork training collection
WO2021238420A1 (en) Image defogging method, terminal, and computer storage medium
CN108109133B (en) Silkworm egg automatic counting method based on digital image processing technology
CN112329881B (en) License plate recognition model training method, license plate recognition method and device
CN113469997B (en) Method, device, equipment and medium for detecting plane glass
CN115272204A (en) Bearing surface scratch detection method based on machine vision
CN110599453A (en) Panel defect detection method and device based on image fusion and equipment terminal
CN115019279A (en) Context feature fusion method based on MobileNet lightweight network
CN115239644A (en) Concrete defect identification method and device, computer equipment and storage medium
CN116486246A (en) Intelligent recognition method for bridge underwater image diseases based on convolutional neural network
CN113807237B (en) Training of in vivo detection model, in vivo detection method, computer device, and medium
Lashkov et al. Edge-computing-facilitated nighttime vehicle detection investigations with CLAHE-enhanced images
Chen et al. Image quality assessment guided deep neural networks training
CN112686247A (en) Identification card number detection method and device, readable storage medium and terminal
CN116468625A (en) Single image defogging method and system based on pyramid efficient channel attention mechanism
CN110175622A (en) The vehicle part recognition methods of convolutional neural networks based on symbiosis and system
CN116453086A (en) Method and device for identifying traffic sign and electronic equipment
CN112801102B (en) Method for detecting pavement blocky diseases
CN112465817B (en) Pavement crack detection method based on directional filter

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant