CN116052006B - Building edge optimization method based on multitask learning and dual lottery hypothesis - Google Patents

Building edge optimization method based on multitask learning and dual lottery hypothesis Download PDF

Info

Publication number
CN116052006B
CN116052006B CN202310314668.2A CN202310314668A CN116052006B CN 116052006 B CN116052006 B CN 116052006B CN 202310314668 A CN202310314668 A CN 202310314668A CN 116052006 B CN116052006 B CN 116052006B
Authority
CN
China
Prior art keywords
building
task
semantic segmentation
edge detection
remote sensing
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202310314668.2A
Other languages
Chinese (zh)
Other versions
CN116052006A (en
Inventor
邢华桥
项俊武
温奇
孙雨生
王海航
侯东阳
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shandong Jianzhu University
Original Assignee
Shandong Jianzhu University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shandong Jianzhu University filed Critical Shandong Jianzhu University
Priority to CN202310314668.2A priority Critical patent/CN116052006B/en
Publication of CN116052006A publication Critical patent/CN116052006A/en
Application granted granted Critical
Publication of CN116052006B publication Critical patent/CN116052006B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02TCLIMATE CHANGE MITIGATION TECHNOLOGIES RELATED TO TRANSPORTATION
    • Y02T10/00Road transport of goods or passengers
    • Y02T10/10Internal combustion engine [ICE] based vehicles
    • Y02T10/40Engine management systems

Landscapes

  • Image Analysis (AREA)

Abstract

The invention provides a building edge optimization method based on multitask learning and dual lottery hypothesis, belonging to the technical field of remote sensing science. The method is realized by the following technical scheme: building semantic segmentation and edge detection data set of high-resolution remote sensing image
Figure ZY_1
The method comprises the steps of carrying out a first treatment on the surface of the Constructing a multi-task model of a semantic segmentation task and an edge detection task of a building by taking the CNN segmentation model as a framework; setting the sparseness of a multi-task model and the common parameter ratio of two task parameters, randomly generating a remote sensing image building semantic segmentation and edge detection task parameter mask matrix, and generating a sub-network of the two tasks; adopting a combination of norm regularization and a parameter mask matrix to respectively improve the loss functions of the semantic segmentation task and the edge detection task of the building; training the model to obtain a trained remote sensing image building multitasking model; reconstructing and extracting a semantic segmentation task model in the multi-task.

Description

Building edge optimization method based on multitask learning and dual lottery hypothesis
Technical Field
The invention relates to a building edge optimization method based on multitask learning and dual lottery hypothesis, belonging to the technical field of remote sensing science.
Background
Buildings are the most prominent man-made structures and geographical features of urban areas. Accurate extraction of building information from high resolution remote sensing images plays a vital role in urban planning, change monitoring, environmental monitoring, real estate management, population estimation and disaster risk assessment. Because a great deal of manpower and material resources are consumed for manually and visually interpreting building information in the high-resolution remote sensing image and the time cost is high, the deep learning technology is used for realizing automatic extraction of the building, and the method is a more economical and efficient extraction mode. However, due to the fact that the existing mainstream remote sensing image semantic segmentation method based on the convolutional neural network loses a large amount of building edge information when downsampling is carried out, semantic segmentation results are poor in edge performance, and automatic vectorization of high-resolution remote sensing image buildings is difficult to achieve on the basis. The remote sensing image building edge detection task has rich edge information, so that the building edge information of the edge detection task is effectively utilized to optimize the semantic segmentation edges of the high-resolution remote sensing image building, and the automatic vectorization precision, quality and application value are improved.
The existing remote sensing image building semantic segmentation edge optimization method based on CNN can be mainly divided into an edge optimization method based on traditional modeling and an edge optimization method based on CNN edge feature enhancement, but the method does not consider rich edge information contained in remote sensing image building edge detection tasks. Based on the conventional multitask CNN edge optimization method, negative influences caused by mutual drag among different tasks are not considered, effective information among different tasks is difficult to transfer effectively, and particularly, on the premise that the negative influences caused by the mutual drag among a plurality of tasks are effectively weakened and the existing model is compatible, an optimal remote sensing image building semantic segmentation result for building edge positions accurately perceived by the edge detection tasks does not have a better solution at present by fully utilizing information transfer during multitask training and constructing a general multitask frame, aiming at the fact that the remote sensing image building semantic segmentation and edge detection tasks lack an effective multitask information sharing method.
Disclosure of Invention
The invention aims to provide a building edge optimization method based on multi-task learning and dual lottery hypothesis, which reduces the negative influence caused by difficulty in avoiding mutual restriction among different tasks by adopting remote sensing image building multi-task CNN, constructs a general multi-task information transmission frame and effectively transmits edge information of an edge detection task to a semantic segmentation task in the multi-task training process.
The invention aims to achieve the aim, and the aim is achieved by the following technical scheme:
s1, building semantic segmentation and edge detection data set of high-resolution remote sensing image building
Figure SMS_1
S2, constructing a multi-task model of a semantic segmentation task and an edge detection task of the building by taking the CNN segmentation model as a framework, wherein the semantic segmentation task in the model is a main task, and the edge detection task is an auxiliary task.
S3, setting the sparsity of the multi-task model and the common parameter ratio of two task parameters, and randomly generating a remote sensing image building semantic segmentation and edge detection task parameter mask matrix
Figure SMS_2
A sub-network of two tasks is generated.
S4, adopting combination
Figure SMS_3
The norms regularization and the parameter mask matrix improve the loss functions of the building semantic segmentation task and the edge detection task, respectively.
S5, adopting a cosine unequal proportion alternating training strategy, periodically enhancing and weakening transmission of the building edge detection task to semantic segmentation task edge information, adopting a back propagation and random gradient descent algorithm, and utilizing the data set constructed in the step S1
Figure SMS_4
Training the constructed multi-task model, and combining the limiting condition of the step S3 and the method of the step S4 at the same time of trainingMask matrix randomly generated for semantic segmentation task of remote sensing image building>
Figure SMS_5
And performing iterative transformation for multiple times, and obtaining the trained remote sensing image building multitasking model after reaching the preset precision.
S6, dividing parameter mask matrix according to the last converted building semantics
Figure SMS_6
The parameter weight value is used for reconstructing and extracting a semantic segmentation task model in the multi-task; and obtaining a semantic segmentation model which is subjected to edge detection task auxiliary training, inputting a remote sensing image of an actual building into the trained semantic segmentation model, and extracting a reconstructed semantic segmentation model to obtain a semantic segmentation result of the building with more optimized edges.
Preferably, the high-resolution remote sensing image building semantic segmentation and edge detection dataset
Figure SMS_7
The construction steps of (a) are as follows:
s11, converting the original building vector data file into a binary building semantic segmentation label according to the boundary range vector file and taking the size of the corresponding remote sensing image pel as a reference.
S12, cutting the remote sensing image into corresponding ranges according to the boundary range vector file.
And S13, performing morphological corrosion treatment on the binary building semantic segmentation label generated in the step S11 to generate a corresponding building edge detection label.
S14, calculating whether each remote sensing image needs to be expanded or not according to the set size and the overlapping rate, if the remote sensing images need to be expanded, calculating the pixels of the expanded images, and expanding the original remote sensing images, the corresponding building semantic segmentation labels and the building edge detection labels to the specified size.
S15, sliding clipping is carried out on the remote sensing image, the building semantic segmentation tag image and the building edge detection tag image at the same time according to the set size and the set overlapping rate.
S16, after cutting is completed, a high-resolution remote sensing image building semantic segmentation and edge detection data set is obtained
Figure SMS_8
Taking an original remote sensing image as a reference according to 8: the scale of 2 divides the building dataset into: training and test sets, 80% of the data were used for training and 20% of the data were used for testing the performance after training.
Preferably, the multi-task model of the building semantic segmentation task and the edge detection task is input into a high-resolution remote sensing image, and output into a building semantic segmentation task building segmentation graph and a building edge detection task building edge extraction probability graph; the building semantic segmentation task and the edge detection task initialization stages share all convolution modules of the encoder section.
Preferably, the mask matrix
Figure SMS_9
The following conditions are satisfied:
Figure SMS_10
wherein:
Figure SMS_11
mask matrix randomly generated for remote sensing image building semantic segmentation task, < >>
Figure SMS_12
Mask matrix randomly generated for remote sensing image building semantic segmentation task, < >>
Figure SMS_13
Is->
Figure SMS_14
And->
Figure SMS_15
,/>
Figure SMS_16
For a set sparsity ratio->
Figure SMS_17
The parameter ratio is shared for the two task parameters set.
The multitasking model is:
Figure SMS_18
for each task its subnetwork is represented as:
Figure SMS_19
wherein:
Figure SMS_20
for building partition map, < >>
Figure SMS_21
Extracting probability map for building edge, +.>
Figure SMS_22
For multitasking model, ++>
Figure SMS_23
For high-resolution remote sensing image building semantic segmentation and edge detection data set,/for the remote sensing image building semantic segmentation and edge detection data set>
Figure SMS_24
Is a multitasking model parameter.
Preferably, the loss function in S4 is:
Figure SMS_25
in the method, in the process of the invention,
Figure SMS_26
loss function for semantic segmentation tasks of a building, < ->
Figure SMS_27
Loss function for building edge detection task, < ->
Figure SMS_28
Representing a certain loss function conventionally used, +.>
Figure SMS_29
Parameter representing a value of 1 in the parameter mask matrix, < >>
Figure SMS_30
Parameter representing 0 in the parameter mask matrix, < ->
Figure SMS_31
For weighing the parameters. />
Figure SMS_32
Starting from the set minimum value, each iteration gradually increases its value in the form of an arithmetic progression until it increases to the set maximum value.
Preferably, the concrete steps of the cosine unequal proportion alternating training strategy for further weakening the transmission of negative influence of the drag of the semantic segmentation task and the edge detection task of the building are as follows:
in each cycle, the ratio of the number of alternating iterations of the building semantic segmentation task to edge detection
Figure SMS_33
The following conditions are satisfied:
Figure SMS_34
in the method, in the process of the invention,
Figure SMS_35
for the segmentation period +.>
Figure SMS_36
Denoted as +.>
Figure SMS_37
Iterative cycle->
Figure SMS_38
Is the maximum number of iteration cycles.
Figure SMS_39
From the accuracy evaluation index in training process +.>
Figure SMS_40
Score determination and satisfies the formula:
Figure SMS_41
in the method, in the process of the invention,
Figure SMS_42
representing semantic segmentation task->
Figure SMS_43
Score, ->
Figure SMS_44
Representing edge detection task->
Figure SMS_45
Score, namely: building edge detection task->
Figure SMS_46
Score and building edge detection task->
Figure SMS_47
The fraction is simultaneously greater than the minimum period of 0.6.
Building semantic segmentation and edge detection dataset using high resolution remote sensing images by using back propagation and random gradient descent algorithms
Figure SMS_48
Performing multitask training, wherein mask matrix generated randomly for remote sensing image building semantic segmentation task in training process>
Figure SMS_49
Performing iterative transformation for a plurality of times to enable the parameter mask matrix to meet the following formula:
Figure SMS_50
in the method, in the process of the invention,
Figure SMS_51
parameter mask matrix representing a building semantic segmentation network initializing constraint generation +.>
Figure SMS_52
Indicate->
Figure SMS_53
Iterative transformation of->
Figure SMS_54
First->
Figure SMS_55
A constrained generation of a parameter mask matrix for a building semantic segmentation network for a number of iterations, wherein
Figure SMS_56
Is a multiple of M; after each iterative transformation, a new generated parameter mask matrix is used->
Figure SMS_57
Reconstructing the semantic segmentation sub-network of the building.
Preferably, the said
Figure SMS_58
Preferably, the parameter mask matrix is divided according to the semantic of the building after the last conversion
Figure SMS_59
And reconstructing and extracting a semantic segmentation task model in the multi-task according to the parameter weight value, wherein the method comprises the following specific steps of: last iteration transformation according to semantic segmentation networkIs>
Figure SMS_60
Will->
Figure SMS_61
All parameters of (2) are removed and reserved
Figure SMS_62
Reconstructing the semantic segmentation network.
The invention has the advantages that: the invention establishes the connection between two tasks through the constrained parameter mask matrix of the two subtasks of the semantic segmentation and the edge detection of the building by randomly generating the remote sensing image with constraint
Figure SMS_63
The regularization and mask matrix improved loss function is used for realizing the transformation from the randomly selected sub-network to the optimal sub-network, and the overall parameter information of the model can be fully utilized, so that the method is more flexible and can effectively weaken the negative influence caused by mutual drag among different tasks compared with the traditional multitasking sharing mode, and has stronger feasibility. A cosine unequal proportion alternating training strategy is adopted, and the transmission of the building edge detection task to semantic segmentation task edge information is periodically strengthened and weakened. The whole process can be effectively applied to various existing deep learning models. The method has the advantages that the result of the complete building can be better segmented in the aspect of semantic segmentation research of the remote sensing image building, the extracted result is more regular and is close to the real segmentation result, meanwhile, the phenomena of over-detection and omission can be better avoided, and the precision and the application value of the automatic extraction result of the building are further improved.
Drawings
The accompanying drawings are included to provide a further understanding of the invention and are incorporated in and constitute a part of this specification, illustrate the invention and together with the embodiments of the invention, serve to explain the invention.
Fig. 1 is a diagram of a multi-task training framework in a building edge optimization method based on multi-task learning and dual lottery hypotheses in the practice of the present invention.
Fig. 2 is a flow chart of building semantic segmentation and edge detection dataset construction in a building edge optimization method based on a multi-task learning and dual lottery hypothesis in an example of the invention.
Fig. 3 is a graph comparing the experimental results of the building edge optimization method based on the multi-task learning and dual lottery hypothesis with the corresponding single-task model in the example of the present invention.
Detailed Description
The following description of the embodiments of the present invention will be made clearly and completely with reference to the accompanying drawings, in which it is apparent that the embodiments described are only some embodiments of the present invention, but not all embodiments. All other embodiments, which can be made by those skilled in the art based on the embodiments of the invention without making any inventive effort, are intended to be within the scope of the invention.
The implementation of the invention provides a building edge optimization method based on multitask learning and dual lottery hypothesis.
Referring to fig. 1, fig. 1 is a diagram of a multi-task training frame in a building edge optimization method based on multi-task learning and dual lottery hypothesis in the implementation of the present invention, which specifically includes the following steps:
s1, building semantic segmentation and edge detection data set of high-resolution remote sensing image building
Figure SMS_64
Referring to fig. 2, a high-resolution remote sensing image building semantic segmentation and edge detection dataset is constructed, which specifically includes the following steps:
s11, converting the original building vector data file into a binary building semantic segmentation label according to the boundary range vector file and taking the size of the corresponding remote sensing image pel as a reference.
S12, cutting the remote sensing image into corresponding ranges according to the boundary range vector file.
And S13, performing morphological corrosion treatment on the binary building semantic segmentation label generated in the step S11 to generate a corresponding building edge detection label.
S14, calculating whether each remote sensing image needs to be expanded or not according to the set size and the overlapping rate, if the remote sensing images need to be expanded, calculating the pixels of the expanded images, and expanding the original remote sensing images, the corresponding building semantic segmentation labels and the building edge detection labels to the specified size.
And S15, performing sliding clipping on the remote sensing image, the building semantic segmentation label image and the building edge detection label image at the same time according to the size of 256 x 256 and the overlapping rate of 0.5.
S16, after the cutting work is completed, obtaining a high-resolution remote sensing image building semantic segmentation and edge detection data set
Figure SMS_65
Taking an original remote sensing image as a reference according to 8: the scale of 2 divides the building dataset into: training and test sets, 80% of the data were used for training and 20% of the data were used for testing the performance after training.
S2, utilizing the high-resolution remote sensing image building semantic segmentation and edge detection dataset described in S1
Figure SMS_66
In the example, a multi-task model for constructing a semantic segmentation task and an edge detection task of a building by taking a U-Net segmentation model as a framework is set, wherein the semantic segmentation task is a main task, and the edge detection task is an auxiliary task; the multi-task model extraction to obtain the high-level semantic features of the building semantic segmentation and the high-level semantic features of the edge detection are realized through a series of operations such as convolution, pooling, up-sampling and the like. The multi-task model is input into a high-resolution remote sensing image, and the model output is output from a building semantic segmentation task building segmentation graph and a building edge detection task building edge extraction probability graph. Furthermore, on the model, the two task initialization phases need to share some parameters, namely: some modules such as convolution are shared on the model.
The multitasking model is:
Figure SMS_67
in the formula->
Figure SMS_68
For building partition map, < >>
Figure SMS_69
Extracting probability map for building edge>
Figure SMS_70
For multitasking model, ++>
Figure SMS_71
For high-resolution remote sensing image building semantic segmentation and edge detection data set,/for the remote sensing image building semantic segmentation and edge detection data set>
Figure SMS_72
Is a multitasking model parameter.
S3, according to the set sparsity and the shared parameter ratio of the two task parameters, randomly generating a remote sensing image building semantic segmentation and edge detection task parameter mask matrix under the condition of the constraint
Figure SMS_73
A sub-network of two tasks is generated.
In the example, the sparsity is set to be 0.95, the shared parameter ratio of two task parameters is set to be 0.95, and the remote sensing image building semantic segmentation and edge detection task parameter mask matrix is randomly generated
Figure SMS_74
The mask matrix for both tasks should satisfy the mathematical formula:
Figure SMS_75
in the method, in the process of the invention,
Figure SMS_76
mask matrix randomly generated for remote sensing image building semantic segmentation task, < >>
Figure SMS_77
For remote sensingMask matrix randomly generated by semantic segmentation task of image building,/-for>
Figure SMS_78
Is->
Figure SMS_79
And->
Figure SMS_80
,/>
Figure SMS_81
For a set sparsity ratio->
Figure SMS_82
The parameter ratio is shared for the two task parameters set. The multitasking model may be expressed as:
Figure SMS_83
for each task its subnetwork is represented as:
Figure SMS_84
s4, according to the dual lottery hypothesis, a sub-network randomly selected from the randomly initialized dense networks can be converted into a trainable condition. Under the condition of multitasking, adopting combination
Figure SMS_85
The norm regularization can enable the randomly selected sub-network to be converted into the sub-network with the best effect, and meanwhile, the sub-network of the building edge detection task can be utilized to achieve constraint and information sharing of the semantic segmentation task during training.
In this example, according to the model output item, the loss function of the original semantic segmentation task is set as:
Figure SMS_86
in the formula->
Figure SMS_87
The result of the semantic segmentation is represented,
Figure SMS_88
the result of the semantic segmentation is represented as a probability of a building. The loss function of the original edge detection is set as follows:
Figure SMS_89
in the formula->
Figure SMS_90
Representing the result of the edge detection segmentation, < >>
Figure SMS_91
Indicating the probability of whether the edge detection result is a building edge.
In order to transform the sub-network with constraint random selection into the sub-network with best effect, we use the combination of
Figure SMS_92
And->
Figure SMS_93
Advantage of the norm regularization method ++>
Figure SMS_94
Norm regularization improves the loss function of both tasks.
Figure SMS_95
In the method, in the process of the invention,
Figure SMS_96
loss function for semantic segmentation tasks of a building, < ->
Figure SMS_101
Loss function for building edge detection task, < ->
Figure SMS_103
Representing a certain loss function conventionally used, +.>
Figure SMS_97
Parameter representing a value of 1 in the parameter mask matrix, < >>
Figure SMS_100
Parameter representing 0 in the parameter mask matrix, < ->
Figure SMS_102
For weighing the parameters. Set->
Figure SMS_104
Is 0, the growth step is +.>
Figure SMS_98
,/>
Figure SMS_99
Is 1.
S5, adopting a cosine unequal proportion alternating training strategy, and periodically strengthening and weakening the transmission of the building edge detection task to semantic segmentation task edge information. Data set constructed by step S1 by adopting back propagation and random gradient descent algorithm
Figure SMS_105
Training the constructed multi-task model, and simultaneously combining the limiting condition of the step S3 and the method of the step S4 to randomly generate a mask matrix for the semantic segmentation task of the remote sensing image building>
Figure SMS_106
And performing iterative transformation for multiple times, and obtaining the trained remote sensing image building multitasking model after reaching the preset precision.
In this example, the Epoch is set to 32, and a cosine unequal proportion alternate training strategy is adopted, so that the negative influence caused by the mutual constraint of the semantic segmentation task and the edge detection task of the building is further weakened. In each cycle, the ratio of the number of alternating iterations of the building semantic segmentation task to edge detection
Figure SMS_107
The mathematical formula is satisfied:
Figure SMS_108
in the method, in the process of the invention,
Figure SMS_109
for the segmentation period +.>
Figure SMS_110
Denoted as +.>
Figure SMS_111
Iterative cycle->
Figure SMS_112
Is the maximum number of iteration cycles.
Figure SMS_113
From the accuracy evaluation index in training process +.>
Figure SMS_114
Score determination, and satisfies the equation:
Figure SMS_115
in the method, in the process of the invention,
Figure SMS_116
representing semantic segmentation task->
Figure SMS_117
Score, ->
Figure SMS_118
Representing edge detection task->
Figure SMS_119
Score, namely: building edge detection task->
Figure SMS_120
Score and building edge detection task->
Figure SMS_121
The fraction is simultaneously greater than the minimum period of 0.6.
Using data sets with back-propagation and random gradient descent algorithms
Figure SMS_122
Performing multitask training, wherein mask matrix generated randomly for remote sensing image building semantic segmentation task in training process>
Figure SMS_127
And performing repeated iterative transformation, wherein the formula is as follows:
Figure SMS_128
,/>
Figure SMS_123
parameter mask matrix representing a building semantic segmentation network initializing constraint generation +.>
Figure SMS_126
Indicate->
Figure SMS_129
Iterative transformation of->
Figure SMS_132
First->
Figure SMS_124
A constrained generation of a parameter mask matrix for a building semantic segmentation network for a number of iterations, wherein
Figure SMS_125
Is a multiple of M, and->
Figure SMS_130
. After each iterative transformation, a new generated parameter mask matrix is used->
Figure SMS_131
Reconstructing the semantic segmentation sub-network of the building.
In addition, in the training process, the super parameters such as positive and negative sample proportion, learning rate, batch size, weight attenuation coefficient and the like are debugged according to the test condition. And obtaining the trained remote sensing image building multi-task model after reaching the preset precision.
S6, dividing parameter mask matrix according to the last converted building semantics
Figure SMS_133
And the parameter weight value is used for reconstructing and extracting the semantic segmentation task model in the multi-task. And obtaining a semantic segmentation model which is subjected to edge detection task auxiliary training, inputting a remote sensing image of an actual building into the trained semantic segmentation model, and extracting a reconstructed semantic segmentation model to obtain a semantic segmentation result of the building with more optimized edges.
Parameter mask matrix based on last iteration transformation of semantic segmentation network
Figure SMS_134
Will->
Figure SMS_135
All parameters of (2) are removed and remain->
Figure SMS_136
Reconstructing the semantic segmentation network to obtain a semantic segmentation model which is trained in an auxiliary mode through an edge detection task, inputting a remote sensing image of an actual building into the trained semantic segmentation model, and extracting the reconstructed semantic segmentation model to obtain a semantic segmentation result of the building with more optimized edges.
As shown in fig. 3, experiments prove that the invention can better segment complete building results in the aspect of remote sensing image building semantic segmentation research, the extracted results are more regular and approximate to real segmentation results, and meanwhile, the phenomena of over-detection and omission can be better avoided.
Using the public dataset: the training and testing dataset was constructed by the procedure shown in figure 2, university of armed forces Aerial imagery dataset. The U-Net semantic segmentation single task is experimentally compared with the U-Net skeleton semantic segmentation and edge detection multi-task model constructed by the building edge optimization method based on the multi-task learning and dual lottery hypothesis. The results for U-Net were: accuracy 0.9377, precision 0.7443, F1Score 0.8017, iou 0.6982, kappa 0.747, the results of the present invention are: accuracy 0.9475, precision 0.8230, F1Score 0.8098, iou 0.7153, kappa 0.7613. Experiments show that compared with the corresponding single task, the method has a certain improvement on each evaluation index. In the actual extraction results, as shown in fig. 3, the invention improves the missing detection and the false detection to a certain extent, and the extraction results at the edges of the building are more regular and approximate to the actual segmentation results.
Finally, it should be noted that: the foregoing description is only a preferred embodiment of the present invention, and the present invention is not limited thereto, but it is to be understood that modifications and equivalents of some of the technical features described in the foregoing embodiments may be made by those skilled in the art, although the present invention has been described in detail with reference to the foregoing embodiments. Any modification, equivalent replacement, improvement, etc. made within the spirit and principle of the present invention should be included in the protection scope of the present invention.

Claims (5)

1. The building edge optimization method based on the multitask learning and dual lottery hypothesis is characterized by comprising the following steps of:
s1, building semantic segmentation and edge detection data set of high-resolution remote sensing image building
Figure QLYQS_1
S2, constructing a multi-task model of a semantic segmentation task and an edge detection task of the building by taking the CNN segmentation model as a framework, wherein the semantic segmentation task in the model is a main task, and the edge detection task is an auxiliary task; the multi-task models of the building semantic segmentation task and the edge detection task are input into high-resolution remote sensing images, and are output into building semantic segmentation task building segmentation graphs and building edge detection task building edge extraction probability graphs; all convolution modules of the encoder part are shared in the initialization stage of the semantic segmentation task and the edge detection task of the building;
s3, setting the sparsity of the multi-task model and the common parameter ratio of two task parameters, and randomly generating a remote sensing image building semantic segmentation and edge detection task parameter mask matrix
Figure QLYQS_2
Generating a sub-network of two tasks;
the mask matrix
Figure QLYQS_3
Satisfies the following conditions
Figure QLYQS_4
Wherein:
Figure QLYQS_5
mask matrix randomly generated for remote sensing image building semantic segmentation task, < >>
Figure QLYQS_6
Mask matrix randomly generated for remote sensing image building semantic segmentation task, < >>
Figure QLYQS_7
Is->
Figure QLYQS_8
And->
Figure QLYQS_9
,/>
Figure QLYQS_10
For a set sparsity ratio->
Figure QLYQS_11
Sharing a parameter ratio for the set two task parameters;
the multitasking model is:
Figure QLYQS_12
for each task its subnetwork is represented as:
Figure QLYQS_13
wherein:
Figure QLYQS_14
for building partition map, < >>
Figure QLYQS_15
Extracting probability map for building edge, +.>
Figure QLYQS_16
For multitasking model, ++>
Figure QLYQS_17
For high-resolution remote sensing image building semantic segmentation and edge detection data set,/for the remote sensing image building semantic segmentation and edge detection data set>
Figure QLYQS_18
Is a multitasking model parameter;
s4, adopting combination
Figure QLYQS_19
The norms regularization and the parameter mask matrix respectively improve the loss functions of the semantic segmentation task and the edge detection task of the building;
the loss function is:
Figure QLYQS_20
in the middle of,
Figure QLYQS_21
Loss function for semantic segmentation tasks of a building, < ->
Figure QLYQS_22
Loss function for building edge detection task, < ->
Figure QLYQS_23
Representing a certain loss function conventionally used, +.>
Figure QLYQS_24
Parameter representing a value of 1 in the parameter mask matrix, < >>
Figure QLYQS_25
Parameter representing 0 in the parameter mask matrix, < ->
Figure QLYQS_26
Is a trade-off parameter; />
Figure QLYQS_27
Starting from the set minimum value, gradually adding the value of the value in the form of an arithmetic progression for each iteration until the value is increased to the set maximum value;
s5, adopting a cosine unequal proportion alternating training strategy, periodically reinforcing and weakening transmission of negative influence of building edge detection tasks on semantic segmentation task restriction, adopting back propagation and random gradient descent algorithm, and utilizing the high-resolution remote sensing image building semantic segmentation and edge detection data set constructed in the step S1
Figure QLYQS_28
Training the constructed multi-task model, and simultaneously combining the limiting condition of the step S3 and the method of the step S4 to randomly generate a mask matrix for the semantic segmentation task of the remote sensing image building>
Figure QLYQS_29
Performing iterative transformation for multiple times, and obtaining a trained remote sensing image building multitasking model after reaching a preset precision;
the concrete steps for transmitting the adverse effect of the constraint of the semantic segmentation task and the edge detection task of the building are further weakened by the cosine unequal proportion alternating training strategy are as follows:
in each cycle, the ratio of the number of alternating iterations of the building semantic segmentation task to edge detection
Figure QLYQS_30
The following conditions are satisfied:
Figure QLYQS_31
in the method, in the process of the invention,
Figure QLYQS_32
for the segmentation period +.>
Figure QLYQS_33
Denoted as +.>
Figure QLYQS_34
Iterative cycle->
Figure QLYQS_35
Is the maximum iteration cycle number;
Figure QLYQS_36
from the accuracy evaluation index in training process +.>
Figure QLYQS_37
Score determination and satisfies the formula:
Figure QLYQS_38
in the method, in the process of the invention,
Figure QLYQS_39
representing semantic segmentation task->
Figure QLYQS_40
Score, ->
Figure QLYQS_41
Representing edge detection task->
Figure QLYQS_42
Score, namely: building edge detection task->
Figure QLYQS_43
Score and building edge detection task->
Figure QLYQS_44
A fraction of minimum cycles simultaneously greater than 0.6;
building semantic segmentation and edge detection dataset using high resolution remote sensing images by using back propagation and random gradient descent algorithms
Figure QLYQS_45
Performing multitask training, wherein mask matrix generated randomly for remote sensing image building semantic segmentation task in training process>
Figure QLYQS_46
Performing iterative transformation for a plurality of times to enable the parameter mask matrix to meet the following formula:
Figure QLYQS_47
in the method, in the process of the invention,
Figure QLYQS_48
parameter mask matrix representing a building semantic segmentation network initializing constraint generation +.>
Figure QLYQS_49
Indicate->
Figure QLYQS_50
Iterative transformation of->
Figure QLYQS_51
First->
Figure QLYQS_52
A parameter mask matrix of a constraint-generated building semantic segmentation network for a number of iterations, wherein +.>
Figure QLYQS_53
Is a multiple of M; after each iterative transformation, a new generated parameter mask matrix is used->
Figure QLYQS_54
Reconstructing a semantic segmentation sub-network of the building;
s6, dividing parameter mask matrix according to the last converted building semantics
Figure QLYQS_55
The parameter weight value is used for reconstructing and extracting a semantic segmentation task model in the multi-task; and obtaining a semantic segmentation model which is subjected to edge detection task auxiliary training, inputting a remote sensing image of an actual building into the trained semantic segmentation model, and extracting a reconstructed semantic segmentation model to obtain a semantic segmentation result of the building with more optimized edges.
2. The method for optimizing building edges based on multi-task learning and dual lottery hypothesis according to claim 1, wherein the high resolution remote sensing image building semantic segmentation and edge detection data set
Figure QLYQS_56
The construction steps of (a) are as follows:
s11, converting an original building vector data file into a binary building semantic segmentation tag according to a boundary range vector file by taking the size of a corresponding remote sensing image pel as a reference;
s12, cutting the remote sensing image into a corresponding range according to the boundary range vector file;
s13, carrying out morphological corrosion treatment on the binary building semantic segmentation label generated in the step S11 to generate a corresponding building edge detection label;
s14, calculating whether each remote sensing image needs to be expanded or not according to the set size and the overlapping rate, if the remote sensing images need to be expanded, calculating the pixels of the expanded images at the same time, and expanding the original remote sensing images, the corresponding building semantic segmentation labels and the building edge detection labels to the specified size at the same time;
s15, sliding cutting is carried out on the remote sensing image, the building semantic segmentation tag image and the building edge detection tag image at the same time according to the set size and the set overlapping rate;
s16, after cutting is completed, a high-resolution remote sensing image building semantic segmentation and edge detection data set is obtained
Figure QLYQS_57
Taking an original remote sensing image as a reference according to 8: the scale of 2 divides the building dataset into: training and test sets, 80% of the data were used for training and 20% of the data were used for testing the performance after training.
3. The method for optimizing building edges based on the multiple learning and dual lottery hypothesis according to claim 1, wherein the following steps are performed
Figure QLYQS_58
4. The method for optimizing building edges based on multi-task learning and dual lottery hypothesis according to claim 1, wherein the parameter mask matrix is partitioned according to the last converted building semantics
Figure QLYQS_59
And reconstructing and extracting a semantic segmentation task model in the multi-task according to the parameter weight value, wherein the method comprises the following specific steps of: parameter mask matrix according to last iteration transformation of semantic segmentation network>
Figure QLYQS_60
Will->
Figure QLYQS_61
All parameters of (2) are removed and remain->
Figure QLYQS_62
Reconstructing the semantic segmentation network.
5. A building edge optimization system based on a multi-task learning and dual lottery hypothesis, wherein the system is capable of implementing a building edge optimization method based on a multi-task learning and dual lottery hypothesis as in any one of claims 1-4.
CN202310314668.2A 2023-03-29 2023-03-29 Building edge optimization method based on multitask learning and dual lottery hypothesis Active CN116052006B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202310314668.2A CN116052006B (en) 2023-03-29 2023-03-29 Building edge optimization method based on multitask learning and dual lottery hypothesis

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202310314668.2A CN116052006B (en) 2023-03-29 2023-03-29 Building edge optimization method based on multitask learning and dual lottery hypothesis

Publications (2)

Publication Number Publication Date
CN116052006A CN116052006A (en) 2023-05-02
CN116052006B true CN116052006B (en) 2023-06-16

Family

ID=86116629

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202310314668.2A Active CN116052006B (en) 2023-03-29 2023-03-29 Building edge optimization method based on multitask learning and dual lottery hypothesis

Country Status (1)

Country Link
CN (1) CN116052006B (en)

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114332071A (en) * 2022-01-04 2022-04-12 广东工业大学 Video anomaly detection method based on foreground information enhancement

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113614738A (en) * 2019-03-22 2021-11-05 国际商业机器公司 Unification of multiple models with individual target classes using distillation
CN113449640B (en) * 2021-06-29 2022-02-11 中国地质大学(武汉) Remote sensing image building semantic segmentation edge optimization method based on multitask CNN + GCN
CN113744185A (en) * 2021-07-28 2021-12-03 江苏东印智慧工程技术研究院有限公司 Concrete apparent crack segmentation method based on deep learning and image processing
CN114596502B (en) * 2022-03-03 2023-04-07 合肥工业大学 Arable land information extraction method based on edge-enhanced high-resolution remote sensing image
CN114821014B (en) * 2022-05-17 2024-06-21 湖南大学 Multi-mode and countermeasure learning-based multi-task target detection and identification method and device
CN115661246A (en) * 2022-10-25 2023-01-31 中山大学 Attitude estimation method based on self-supervision learning

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114332071A (en) * 2022-01-04 2022-04-12 广东工业大学 Video anomaly detection method based on foreground information enhancement

Also Published As

Publication number Publication date
CN116052006A (en) 2023-05-02

Similar Documents

Publication Publication Date Title
US20210295169A1 (en) Deep learning based identification of difficult to test nodes
US11205028B2 (en) Estimating physical parameters of a physical system based on a spatial-temporal emulator
CN113780296B (en) Remote sensing image semantic segmentation method and system based on multi-scale information fusion
US11720727B2 (en) Method and system for increasing the resolution of physical gridded data
CN111583276B (en) CGAN-based space target ISAR image component segmentation method
CN116206185A (en) Lightweight small target detection method based on improved YOLOv7
CN113011568B (en) Model training method, data processing method and equipment
CN116612288B (en) Multi-scale lightweight real-time semantic segmentation method and system
WO2022100607A1 (en) Method for determining neural network structure and apparatus thereof
Atik et al. Comparative research on different backbone architectures of DeepLabV3+ for building segmentation
CN111797833A (en) Automatic machine learning method and system oriented to remote sensing semantic segmentation
CN111626311A (en) Heterogeneous graph data processing method and device
CN111860465A (en) Remote sensing image extraction method, device, equipment and storage medium based on super pixels
Han et al. CoolVox: Advanced 3D convolutional neural network models for predicting solar radiation on building facades
CN114511785A (en) Remote sensing image cloud detection method and system based on bottleneck attention module
Patel et al. Smart adaptive mesh refinement with NEMoSys
CN114219701A (en) Dunhuang fresco artistic style conversion method, system, computer equipment and storage medium
CN116052006B (en) Building edge optimization method based on multitask learning and dual lottery hypothesis
Čomić et al. Computing a discrete Morse gradient from a watershed decomposition
CN113920311A (en) Remote sensing image segmentation method and system based on edge auxiliary information
Vasin et al. Geometric modeling of raster images of documents with weakly formalized description of objects
Kolibarov et al. Roof Segmentation Towards Digital Twin Generation in LoD2+ Using Deep Learning
CN112801294B (en) Method for accelerating neural network reasoning based on graph signal processing
Kwon et al. Deep learning-based point cloud upsampling: a review of recent trends
CN113449817B (en) Image classification implicit model acceleration training method based on phantom gradient

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant