CN115346137A - High-standard farmland land mass vectorization extraction method based on multi-task learning - Google Patents

High-standard farmland land mass vectorization extraction method based on multi-task learning

Info

Publication number
CN115346137A
CN115346137A (application CN202211257905.8A)
Authority
CN
China
Prior art keywords
boundary
module
land
parcel
result
Prior art date
Legal status
Granted
Application number
CN202211257905.8A
Other languages
Chinese (zh)
Other versions
CN115346137B (en)
Inventor
Wang Xinyu (王心宇)
Pan Yang (潘洋)
Zhong Yanfei (钟燕飞)
Zhang Liangpei (张良培)
Current Assignee
Wuhan University WHU
Original Assignee
Wuhan University WHU
Priority date
Filing date
Publication date
Application filed by Wuhan University WHU filed Critical Wuhan University WHU
Priority to CN202211257905.8A priority Critical patent/CN115346137B/en
Publication of CN115346137A publication Critical patent/CN115346137A/en
Application granted granted Critical
Publication of CN115346137B publication Critical patent/CN115346137B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V 20/00 Scenes; Scene-specific elements
    • G06V 20/10 Terrestrial scenes
    • G06V 20/13 Satellite images
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V 10/00 Arrangements for image or video recognition or understanding
    • G06V 10/20 Image preprocessing
    • G06V 10/26 Segmentation of patterns in the image field; Cutting or merging of image elements to establish the pattern region, e.g. clustering-based techniques; Detection of occlusion
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V 10/00 Arrangements for image or video recognition or understanding
    • G06V 10/70 Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V 10/764 Arrangements for image or video recognition or understanding using pattern recognition or machine learning using classification, e.g. of video objects
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V 10/00 Arrangements for image or video recognition or understanding
    • G06V 10/70 Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V 10/77 Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
    • G06V 10/80 Fusion, i.e. combining data from various sources at the sensor level, preprocessing level, feature extraction level or classification level
    • G06V 10/806 Fusion, i.e. combining data from various sources at the sensor level, preprocessing level, feature extraction level or classification level of extracted features
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V 10/00 Arrangements for image or video recognition or understanding
    • G06V 10/70 Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V 10/82 Arrangements for image or video recognition or understanding using pattern recognition or machine learning using neural networks

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Computation (AREA)
  • Computing Systems (AREA)
  • Medical Informatics (AREA)
  • Software Systems (AREA)
  • General Health & Medical Sciences (AREA)
  • Databases & Information Systems (AREA)
  • Artificial Intelligence (AREA)
  • Health & Medical Sciences (AREA)
  • Astronomy & Astrophysics (AREA)
  • Remote Sensing (AREA)
  • Image Analysis (AREA)

Abstract

The invention relates to a high-standard farmland plot vectorization extraction method based on multi-task learning, which mainly comprises the following steps: designing an ultra-complete high-resolution dense multi-scale module to jointly extract the shape-variability features of plot objects and the edge features related to plot boundary categories; designing a region-boundary-plot decoupled multi-task module to jointly optimize the plot object extraction and plot boundary extraction tasks; and designing a plot boundary-object interactive vectorization module to further alleviate adhesion between plot objects in the results. The method is applicable to optical remote sensing images at various spatial resolutions for the high-standard farmland plot vectorization extraction task. Compared with existing farmland plot extraction methods, the proposed method directly produces a vector result with higher plot object extraction accuracy and smoother plot boundaries, and can meet application requirements such as plot-scale crop classification and yield estimation.

Description

High-standard farmland plot vectorization extraction method based on multitask learning
Technical Field
The invention belongs to the technical field of remote sensing image processing, and particularly relates to a high-standard farmland land parcel vectorization extraction method based on multi-task learning.
Background
Accurate farmland plot boundary information is important for farmers and agricultural managers to monitor crop growth in precision agriculture services. However, fully automatic extraction of farmland plot boundaries from satellite imagery remains challenging. On the one hand, plots are small and dense and exhibit shape variability: different plots differ considerably in form and size. On the other hand, because crops are planted in most plots and phenology differs greatly between plots, plots exhibit spatio-temporal spectral variability. In addition, plot edges are often visually blurred or occluded, so most existing methods struggle to identify plot objects completely, and existing farmland plot datasets are limited to small areas and single satellite image sources.
Current farmland plot extraction methods include those based on edge detection, object segmentation, and deep learning. Edge detection-based methods mainly rely on manually defined edge detection operators, cannot determine algorithm parameters automatically, and produce very rough, jagged boundaries, making them hard to apply in regions with dense plots. Object segmentation-based methods perform edge linking or region segmentation with a region similarity criterion and then classify the resulting sub-regions, but such traditional methods can only segment plots and cannot extract farmland plots and their boundaries at the same time. Deep learning methods take the plot region or the plot boundary as the recognition object, treat it as a binary semantic segmentation task, and train two semantic segmentation networks to detect plot boundaries and plot regions respectively, finally outputting an instance raster result of the plots through post-processing. However, such results cannot meet the requirements of subsequent applications: the extraction results are rough, further post-processing is needed to obtain the final segmentation result, the post-processing methods require many parameters to be tuned, and the degree of automation is low.
Disclosure of Invention
The invention provides a high-standard farmland plot vectorization extraction method based on region-boundary-plot decoupled multi-task learning, which has the following advantages. First, an ultra-complete high-resolution dense multi-scale module is designed to extract the shape-variability features of plot objects and the edge features related to plot boundary categories. Second, a region-boundary-plot decoupled multi-task module is designed to jointly optimize the plot object extraction and plot boundary extraction tasks, realizing multi-task cooperative supervision of plot boundaries and plot objects and alleviating adhesion between plot objects. Third, a plot boundary-object interactive vectorization module is designed to further reduce adhesion in the plot object results and to generate a plot vectorization result that can be directly applied to subsequent agricultural remote sensing tasks. Fourth, the method has been applied to several research areas at home and abroad, achieved the best recognition accuracy, and is applicable to medium- and high-resolution optical satellite remote sensing images.
The invention provides a high-standard farmland plot vectorization extraction method based on multi-task learning, which comprises the following steps:
step 1, performing cropping and normalization preprocessing on the input image;
step 2, extracting multi-scale features and edge detail features of the preprocessed image by using a multi-scale module;
step 3, obtaining a plot region segmentation result, a plot boundary prediction result and a plot object segmentation result through a multi-task module according to the multi-scale features and the edge detail features;
step 4, inputting the results output in step 3 into a multi-task joint optimization loss function, outputting the loss value, and back-propagating the loss value to update the network model parameters, wherein the network model consists of the multi-scale module and the multi-task module;
step 5, inputting the remote sensing image into the network model, and outputting the plot boundary and plot object prediction results;
step 6, post-processing the prediction results to generate a vectorized plot result, realizing accurate extraction of plot objects and plot boundaries (an end-to-end sketch of this pipeline is given after this list).
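Before detailing each step, the overall flow of steps 1 to 6 can be summarized in a short training skeleton. The following is a minimal sketch only, not the patented implementation: the class and function names (PlotExtractionNet, train_step, joint_loss) and the exact interfaces of the two modules are illustrative assumptions.

    import torch
    import torch.nn as nn

    class PlotExtractionNet(nn.Module):
        """Skeleton of the network: a multi-scale feature module (step 2)
        feeding a region/boundary/object multi-task module (step 3)."""
        def __init__(self, multi_scale: nn.Module, multi_task: nn.Module):
            super().__init__()
            self.multi_scale = multi_scale
            self.multi_task = multi_task

        def forward(self, x):
            features = self.multi_scale(x)       # multi-scale and edge detail features
            return self.multi_task(features)     # (Y_region, Y_boundary, Y_ipo, Y_rp)

    def train_step(model, optimizer, image, labels, joint_loss):
        """Step 4: forward pass, multi-task joint loss, back-propagation."""
        optimizer.zero_grad()
        outputs = model(image)
        loss = joint_loss(outputs, labels)
        loss.backward()                          # back-propagate the loss value
        optimizer.step()                         # update network model parameters
        return loss.item()

In this reading, step 5 is simply a forward pass of the trained model, and step 6 post-processes its outputs offline.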
Further, the specific implementation of step 1 includes:
for original input image large pictureXCutting with uniform sliding window to generate sample setXIs normalized band by band for each small pattern, by
Figure 640743DEST_PATH_IMAGE001
Whereinb i ,mean i , std i Respectively representing the original remote-sensing imagesiSingle band raster data, firstiMean value of single band grid andithe variance of the grid of the individual single band,
Figure 823463DEST_PATH_IMAGE002
represents the normalized firstiNormalizing all the image pairs of the sample set by using the single-waveband raster data, and outputting the normalized sample set
Figure 432561DEST_PATH_IMAGE003
Further, the specific implementation of step 2 includes:
step 2.1, sample set after pretreatment
Figure 93350DEST_PATH_IMAGE004
Input the methodkExtracting input image from network structure composed of serially connected dense feature extraction modulesX 0 Each module respectively outputs the dense features
Figure 235618DEST_PATH_IMAGE005
The formula is as follows:
Figure 120397DEST_PATH_IMAGE006
Figure 13529DEST_PATH_IMAGE007
in the above formula, each dense feature extraction moduleDense i (. C) contains 3 dense feature convolution modulesConvdense 1 (·),Convdense 2 (·), Convdense 3 (·), X i-1 Represents the firstiIndividual dense feature extraction moduleDense i (ii) an input profile of the (v),Convdense(. Cndot.) represents a dense feature convolution module, consisting of two convolution-batch normalization-activation modules,C(. Cndot.) represents the superposition of input features in the channel dimension,x 1 , x 2 respectively representing the output characteristic diagram of a first intensive characteristic convolution module and the output characteristic diagram of a second intensive characteristic convolution module in each intensive characteristic extraction module;
step 2.2, outputting the multi-stage characteristic diagram of 2.1
Figure 478009DEST_PATH_IMAGE008
Input to a decoder comprisingkAn upsampling module, wherein the formula is as follows:
Figure 209204DEST_PATH_IMAGE009
in the above-mentioned formula, the compound has the following structure,
Figure 530464DEST_PATH_IMAGE010
represents the firstkThe up-sampling module outputs a characteristic map,Convblockrepresents a 3 x 3 convolution module, consisting of two convolution-batch normalization-activation modules,C(. Cndot.) represents the superposition of input features in the channel dimension,F interpolate (. Cndot.) represents a bilinear upsampling of the input feature map in a spatial dimension twice the original dimension
Figure 51838DEST_PATH_IMAGE011
When in use, will
Figure 851166DEST_PATH_IMAGE012
Bilinear upsampled sumX n-1 The input features are superposed in the channel dimension and then are obtained by a 3 multiplied by 3 convolution module
Figure 702448DEST_PATH_IMAGE013
When it comes to
Figure 397871DEST_PATH_IMAGE014
And so on to finally obtain
Figure 701814DEST_PATH_IMAGE015
Then, it is named as a feature mapF dense
Step 2.3, the image X_0 of the preprocessed sample set X̄ is input into a network structure composed of k serially connected high-resolution feature extraction modules to extract high-resolution detail features of the original input image; the modules respectively output the detail features X_h1, X_h2, …, X_hk, with the formula:

    X_hj = Convblock(F_interpolate(X_h(j-1))), j = 1, …, k, with X_h0 = X_0

In the above formula, Convblock denotes a 3 × 3 convolution module composed of two convolution-batch normalization-activation blocks; F_interpolate(·) denotes bilinear upsampling of the input feature map to twice its original spatial size; X_hk is the feature obtained by passing the original input X_0 through k high-resolution feature extraction modules;
step 2.4, the details of step 2.3 are characterized
Figure 169649DEST_PATH_IMAGE019
The data is input into a network formed by the down-sampling modules, and the formula is as follows:
Figure 514043DEST_PATH_IMAGE020
in the above formula, the first and second carbon atoms are,
Figure 841381DEST_PATH_IMAGE021
represents the firstkThe down-sampling module outputs a feature map,F interpolate_down (-) represents a bilinear down-sampling of the input feature map for the spatial dimension of 0.5 times the original size,Convblockrepresents a 3 x 3 convolution module, consists of 2 convolution-batch normalization-activation modules,Cdenotes the superposition of input features in the channel dimension when
Figure 878608DEST_PATH_IMAGE022
When in use, willX hn After bilinear down-sampling with the size of 0.5 timesX hn-1 The input features are superposed in the channel dimension and then are obtained by a 3 multiplied by 3 convolution module
Figure 157142DEST_PATH_IMAGE023
When it comes to
Figure 305227DEST_PATH_IMAGE024
And so on to finally obtain
Figure 720028DEST_PATH_IMAGE025
Then, it is named as a feature mapF hr
Further, the specific implementation of step 3 includes:
step 3.1, outputting steps 2.2 and 2.4F dense AndF hr stacking in channel dimension, outputtingF DH Characteristic diagram ofF DH Inputting the result to a land region segmentation module and outputting the resultY region The formula is as follows:
Figure 238076DEST_PATH_IMAGE026
whereinConvblock 1 Represents a 3 × 3 convolution batch normalization and activation module, finally through 1 × 1 convolution sumSigmoidActivating function, and finally outputting the result of block region segmentationY region
Step 3.2, the F_DH feature map of step 3.1 is input into the boundary plot generation module to obtain the plot boundary feature map F_pb and the plot object feature map F_po, with the formula:

    F_pf = Convblock(F_DH)
    F_pb = gridsample(F_DH, F_interpolate_size(F_pf, Size(F_DH)))
    F_po = F_DH - F_pb

where Convblock denotes a 3 × 3 convolution module composed of 2 convolution-batch normalization-activation blocks; F_DH passes through Convblock to output the location-aware field feature map F_pf; F_interpolate_size(·, Size(·)) denotes bilinear interpolation of the input feature map to a specified size, here interpolating the perceptual field feature map F_pf to the spatial size of the F_DH feature map; the grid sampling operation gridsample takes the pixel values of the perceptual field feature map as row-column coordinates of the feature map and samples the pixel values on the spatial grid; the sampled feature map is the plot boundary feature map F_pb; the plot boundary feature map F_pb is then subtracted from the F_DH feature map to obtain the plot object feature map F_po;
Step 3.3, the feature map F_pb of step 3.2 and the feature map F_hr of step 2.4 are input into the boundary keeping module to further learn plot edge features, with the formula:

    Y_boundary = Sigmoid(Conv_{1×1}(Convblock_1(C(F_pb, F_hr))))

where C denotes the concatenation of input features along the channel dimension; Convblock_1 denotes a 3 × 3 convolution-batch normalization-activation module; finally a 1 × 1 convolution integrates the class layers and a Sigmoid activation function outputs the plot boundary prediction result Y_boundary;
Step 3.4, the F_DH feature map of step 3.1, the boundary prediction result Y_boundary output in step 3.3 and the plot object feature map F_po of step 3.2 are input into the plot segmentation enhancement module, with the formula:

    Y_ipo = Sigmoid(Conv_{1×1}(F_DH))
    F_sp = (1 - Y_boundary) ⊙ F_po
    Y_rp = Sigmoid(Conv_{1×1}(C(F_sp, Y_ipo)))

where the input F_DH passes through a 1 × 1 convolution that integrates the class layers and a Sigmoid activation function to generate the initial plot object segmentation result Y_ipo; the plot boundary attention module then performs the (1 - Y_boundary) operation, i.e., the boundary prediction result Y_boundary output in step 3.3 is subtracted from an all-ones matrix and the result is multiplied element-wise with the plot object feature map F_po, suppressing plot boundary features and enhancing plot object features; the enhanced plot feature map F_sp is output, alleviating adhesion between plots; finally, the feature map F_sp and the initial plot segmentation result Y_ipo are stacked along the channel dimension and passed through a 1 × 1 convolution and a Sigmoid activation function to generate the final plot object segmentation result Y_rp.
Further, the specific implementation of step 4 includes:
the segmentation result of the region of the multi-task segmentation result output by the step 3Y region Block boundary prediction resultsY boundary Initial block object segmentation resultY ipo And enhancing land objectsSegmentation resultY rp And inputting the data into a multitask joint loss function, wherein the formula is as follows:
Figure 889769DEST_PATH_IMAGE031
Figure 441973DEST_PATH_IMAGE032
wherein,
Figure 565787DEST_PATH_IMAGE033
respectively representing a real parcel area tag, a parcel boundary tag and a parcel object tag,L region , L ip , L rp are both binary cross-entropy loss functions,L bound is a combination of a binary cross entropy loss function and a Dice loss function, wherein,
Figure 115717DEST_PATH_IMAGE034
respectively a binary cross entropy loss function and a Dice loss function, for relieving the foreground and background imbalance phenomenon of the land parcel boundary, wherein the binary cross entropy loss function and the Dice loss function are used according to experience in the patent
Figure 357604DEST_PATH_IMAGE035
For balancing the multitasking losses.
Further, the specific implementation of step 5 includes:
and storing the optimal precision result of the trained model on the verification set, predicting the test set, and outputting a final block object prediction result and a block boundary prediction result.
Further, the specific implementation of step 6 includes:
step 6.1, predicting the result of the block boundaryY Pb Inputting a morphological boundary enhancement module to enhance the edge connectivity in the process of extracting the land parcel boundary, wherein the formula is as follows:
Figure 651182DEST_PATH_IMAGE036
wherein
Figure 160661DEST_PATH_IMAGE037
Respectively, a block boundary prediction result and a structural element, wherein the structural elementSIs a predefined 3 x 3 size identity matrix, operated by morphological dilation
Figure 412651DEST_PATH_IMAGE038
Finally generating a land boundary enhancement resultY Db
Step 6.2, the plot boundary enhancement result Y_Db and the plot object prediction result Y_Pp are input into the boundary-object interaction module to alleviate adhesion between plots, with the formula:

    Y_Mp = P(Y_Db, Y_Pp)

where the boundary-object interaction operation P(·,·) generates the final plot interaction result Y_Mp;
Step 6.3, the result of step 6.2 is subjected to boundary smoothing and hole filling to generate the final plot vectorization result.
The method of the invention has the following notable effects: (1) an ultra-complete high-resolution dense multi-scale module is designed to jointly extract the shape-variability features of plot objects and the edge features related to plot boundary categories; (2) a region-boundary-plot decoupled multi-task module is designed to jointly optimize the plot object extraction and plot boundary extraction tasks, jointly constraining the predictions of different tasks by modeling the spatial relationship among the multi-task prediction results; (3) a plot boundary-object interactive vectorization module is designed to further alleviate adhesion in the plot object results.
Drawings
Fig. 1 is a remote sensing image input in step 1 according to the embodiment of the present invention.
Fig. 2 is a diagram of an ultra-complete high resolution dense multi-scale module network structure in step 2 according to an embodiment of the present invention.
Fig. 3 is a diagram of a zone-boundary-parcel decoupling multitask module network structure in step 3 according to the embodiment of the present invention.
Fig. 4 is a block boundary-object interactive vectorization module structure diagram in step 6 according to the embodiment of the present invention.
Fig. 5 shows the vectorized plot extraction result output in step 6 according to the embodiment of the present invention.
Detailed Description
The technical scheme of the invention is further specifically described by the following embodiments and the accompanying drawings.
The invention provides a high-standard farmland plot vectorization extraction method based on multitask learning, which comprises the following steps of:
step 1, inputting an original remote sensing image large image, and performing sliding window clipping and normalization pretreatment on the original image as shown in fig. 1. The method further comprises the following steps:
for original input image large pictureXPerforming uniform sliding window clipping, with window size of 256 × 256 and step size of 256 × 256, generating a sample setXIs normalized band by band for each small pattern, by
Figure 176787DEST_PATH_IMAGE040
Whereinb i ,mean i , std i Respectively representing the original remote-sensing imagesiSingle band raster data, secondiSingle band grid mean andithe variance of the grid of the individual single band,
Figure 9614DEST_PATH_IMAGE041
represents the normalizediNormalizing all image pairs of the sample set by using single-waveband raster data, and outputting the normalized sample set
Figure 698084DEST_PATH_IMAGE042
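A minimal sketch of this preprocessing in Python/NumPy follows. The function names and the use of NumPy are illustrative assumptions; whether the band statistics are computed per tile or over the whole image is not fully specified, so this sketch normalizes the whole image before cropping.

    import numpy as np

    def normalize_bands(image: np.ndarray, eps: float = 1e-6) -> np.ndarray:
        """Band-by-band z-score normalization: b̄_i = (b_i - mean_i) / std_i,
        with mean_i / std_i computed over the i-th band of the input image."""
        mean = image.mean(axis=(1, 2), keepdims=True)
        std = image.std(axis=(1, 2), keepdims=True)
        return (image - mean) / (std + eps)

    def sliding_window_crop(image: np.ndarray, size: int = 256, stride: int = 256):
        """Crop a (bands, H, W) array into size x size tiles with the given stride."""
        _, h, w = image.shape
        return [image[:, i:i + size, j:j + size]
                for i in range(0, h - size + 1, stride)
                for j in range(0, w - size + 1, stride)]

    # usage sketch (read_image is a placeholder for any raster reader):
    # X = read_image(...)                        # (bands, H, W) raster
    # samples = sliding_window_crop(normalize_bands(X))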
And 2, extracting the multi-scale features and the edge detail features by using an ultra-complete high-resolution dense multi-scale module, as shown in fig. 2. The method further comprises the following steps:
step 2.1, the preprocessed sample set
Figure 147520DEST_PATH_IMAGE043
Inputting the image into a network structure composed of 5 serially connected dense feature extraction modules to extract an input imageX 0 The 5 modules respectively output the dense features
Figure 782901DEST_PATH_IMAGE044
The formula is as follows:
Figure 237278DEST_PATH_IMAGE045
Figure 362229DEST_PATH_IMAGE046
in the above formula, each dense feature extraction moduleDense i (. 2) contains 3 dense feature convolution modulesConvdense 1 (·),Convdense 2 (·), Convdense 3 (·),X i-1 Represents the firstiIndividual dense feature extraction moduleDense i (ii) an input profile of the graph,Convdense(. Cndot.) represents a dense feature convolution module, consisting of two convolution-batch normalization-activation modules,Crepresents the superposition of input features in the channel dimension,x 1 , x 2 respectively representing the output characteristic diagram of the first dense characteristic convolution module and the output characteristic diagram of the second dense characteristic convolution module in each dense characteristic extraction module.
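One dense feature extraction module can be sketched in PyTorch as follows. The wiring (each Convdense block receiving the channel concatenation of all earlier feature maps) matches the formula above but is itself a reconstruction, and the channel counts are assumptions.

    import torch
    import torch.nn as nn

    def convdense(in_ch: int, out_ch: int) -> nn.Sequential:
        """Dense feature convolution module: two convolution-BN-activation blocks."""
        return nn.Sequential(
            nn.Conv2d(in_ch, out_ch, 3, padding=1), nn.BatchNorm2d(out_ch), nn.ReLU(inplace=True),
            nn.Conv2d(out_ch, out_ch, 3, padding=1), nn.BatchNorm2d(out_ch), nn.ReLU(inplace=True),
        )

    class DenseModule(nn.Module):
        """Dense feature extraction module: three Convdense blocks whose inputs
        are channel concatenations C(...) of all earlier feature maps."""
        def __init__(self, in_ch: int, growth: int):
            super().__init__()
            self.conv1 = convdense(in_ch, growth)
            self.conv2 = convdense(in_ch + growth, growth)
            self.conv3 = convdense(in_ch + 2 * growth, growth)

        def forward(self, x):
            x1 = self.conv1(x)
            x2 = self.conv2(torch.cat([x, x1], dim=1))
            return self.conv3(torch.cat([x, x1, x2], dim=1))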
Step 2.2, the multi-stage feature maps X_1, …, X_5 output in 2.1 are input to a decoder containing 5 upsampling modules, with the formula:

    F_up^1 = Convblock(C(F_interpolate(X_5), X_4))
    F_up^j = Convblock(C(F_interpolate(F_up^{j-1}), X_{5-j})), j = 2, …, 5

In the above formula, F_up^j denotes the feature map output by the j-th upsampling module; Convblock denotes a 3 × 3 convolution module composed of two convolution-batch normalization-activation blocks; C denotes the concatenation of input features along the channel dimension; F_interpolate(·) denotes bilinear upsampling of the input feature map to twice its original spatial size. When j = 1, X_5 is bilinearly upsampled, concatenated with X_4 along the channel dimension, and passed through a 3 × 3 convolution module to obtain F_up^1; the subsequent modules proceed by analogy until F_up^5 is obtained, which is named the feature map F_dense.
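A PyTorch sketch of this decoder follows; the channel counts and the factor-2 scale gap between successive dense features are assumptions.

    import torch
    import torch.nn as nn
    import torch.nn.functional as F

    def convblock(in_ch: int, out_ch: int) -> nn.Sequential:
        """3 x 3 convolution module: two convolution-BN-activation blocks."""
        return nn.Sequential(
            nn.Conv2d(in_ch, out_ch, 3, padding=1), nn.BatchNorm2d(out_ch), nn.ReLU(inplace=True),
            nn.Conv2d(out_ch, out_ch, 3, padding=1), nn.BatchNorm2d(out_ch), nn.ReLU(inplace=True),
        )

    class Decoder(nn.Module):
        """Step 2.2: upsample the deepest dense feature step by step, concatenating
        the skip feature of matching scale before each 3 x 3 convolution module."""
        def __init__(self, channels):          # channels: [c0, c1, ..., c5], shallow to deep
            super().__init__()
            self.blocks = nn.ModuleList(
                convblock(channels[i + 1] + channels[i], channels[i])
                for i in reversed(range(len(channels) - 1))
            )

        def forward(self, feats):              # feats: [X_0, X_1, ..., X_5]
            y = feats[-1]
            for block, skip in zip(self.blocks, reversed(feats[:-1])):
                y = F.interpolate(y, scale_factor=2, mode="bilinear", align_corners=False)
                y = block(torch.cat([y, skip], dim=1))
            return y                           # F_dense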
Step 2.3, the image X_0 of the preprocessed sample set X̄ is input into a network structure composed of 2 serially connected high-resolution feature extraction modules to extract high-resolution detail features of the original input image; the 2 modules respectively output the detail features X_h1 and X_h2, with the formula:

    X_hj = Convblock(F_interpolate(X_h(j-1))), j = 1, 2, with X_h0 = X_0

In the above formula, Convblock denotes a 3 × 3 convolution module composed of two convolution-batch normalization-activation blocks; F_interpolate(·) denotes bilinear upsampling of the input feature map to twice its original spatial size; X_h1 is obtained by passing the original input X_0 through a high-resolution feature extraction module, and X_h2 is obtained by passing X_h1 through a further high-resolution feature extraction module.
Step 2.4, the detail features X_h1 and X_h2 of step 2.3 are input into a network composed of downsampling modules, with the formula:

    F_down^1 = Convblock(C(F_interpolate_down(X_h2), X_h1))
    F_down^2 = Convblock(C(F_interpolate_down(F_down^1), X_h0))

In the above formula, F_down^j denotes the feature map output by the j-th downsampling module; F_interpolate_down(·) denotes bilinear downsampling of the input feature map to 0.5 times its original spatial size; Convblock denotes a 3 × 3 convolution module composed of 2 convolution-batch normalization-activation blocks; C(·) denotes the concatenation of input features along the channel dimension. When j = 1, X_h2 is bilinearly downsampled by a factor of 0.5, concatenated with X_h1 along the channel dimension, and passed through a 3 × 3 convolution module to obtain F_down^1; the second module proceeds by analogy, finally obtaining F_down^2, which is named the feature map F_hr.
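The high-resolution branch of steps 2.3 and 2.4 can be sketched as follows: two upsample-convolve stages produce X_h1 and X_h2, which are then downsampled and fused back to the original resolution. Channel counts are assumptions.

    import torch
    import torch.nn as nn
    import torch.nn.functional as F

    def convblock(in_ch: int, out_ch: int) -> nn.Sequential:
        """3 x 3 convolution module: two convolution-BN-activation blocks."""
        return nn.Sequential(
            nn.Conv2d(in_ch, out_ch, 3, padding=1), nn.BatchNorm2d(out_ch), nn.ReLU(inplace=True),
            nn.Conv2d(out_ch, out_ch, 3, padding=1), nn.BatchNorm2d(out_ch), nn.ReLU(inplace=True),
        )

    class HighResBranch(nn.Module):
        """Steps 2.3-2.4: extract detail features X_h1 (2x) and X_h2 (4x) from X_0,
        then downsample and fuse them back to the original resolution as F_hr."""
        def __init__(self, in_ch: int, ch: int):
            super().__init__()
            self.hr1 = convblock(in_ch, ch)          # produces X_h1
            self.hr2 = convblock(ch, ch)             # produces X_h2
            self.down1 = convblock(ch + ch, ch)      # fuses down(X_h2) with X_h1
            self.down2 = convblock(ch + in_ch, ch)   # fuses down(F_down^1) with X_0

        def forward(self, x0):
            up = lambda t: F.interpolate(t, scale_factor=2.0, mode="bilinear", align_corners=False)
            down = lambda t: F.interpolate(t, scale_factor=0.5, mode="bilinear", align_corners=False)
            xh1 = self.hr1(up(x0))                                 # X_h1
            xh2 = self.hr2(up(xh1))                                # X_h2
            f1 = self.down1(torch.cat([down(xh2), xh1], dim=1))    # F_down^1
            return self.down2(torch.cat([down(f1), x0], dim=1))   # F_hr = F_down^2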
And 3, jointly extracting the plot boundary and the plot object by using the region-boundary-plot decoupling multitask module and the spectrum attention module, as shown in fig. 3. The method further comprises the following steps:
step 3.1, outputting steps 2.2 and 2.4F dense AndF hr stacking in channel dimension, outputtingF DH Characteristic diagram ofF DH Inputting the result to a land region segmentation module and outputting the resultY region The formula is as follows:
Figure 570794DEST_PATH_IMAGE067
whereinConvblock 1 Represents a 3 × 3 convolution batch normalization and activation module, eventually by 1 × 1 convolution sumSigmoidActivating function, and finally outputting the result of block region segmentationY region
Step 3.2, the F_DH feature map of step 3.1 is input into the boundary plot generation module to obtain the plot boundary feature map F_pb and the plot object feature map F_po, with the formula:

    F_pf = Convblock(F_DH)
    F_pb = gridsample(F_DH, F_interpolate_size(F_pf, Size(F_DH)))
    F_po = F_DH - F_pb

where Convblock denotes a 3 × 3 convolution module composed of 2 convolution-batch normalization-activation blocks; F_DH passes through Convblock to output the location-aware field feature map F_pf; F_interpolate_size(·, Size(·)) denotes bilinear interpolation of the input feature map to a specified size, here interpolating the perceptual field feature map F_pf to the spatial size of the F_DH feature map; the grid sampling operation gridsample takes the pixel values of the perceptual field feature map as row-column coordinates of the feature map and samples the pixel values on the spatial grid; the sampled feature map is the plot boundary feature map F_pb; the plot boundary feature map F_pb is then subtracted from the F_DH feature map to obtain the plot object feature map F_po.
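A sketch of this boundary plot generation module using PyTorch's F.grid_sample follows. How the patent converts the perceptual-field values into sampling coordinates is only partly specified, so the 2-channel field head and the tanh normalization to the [-1, 1] range required by F.grid_sample are assumptions.

    import torch
    import torch.nn as nn
    import torch.nn.functional as F

    class BoundaryPlotGeneration(nn.Module):
        """Step 3.2: predict a 2-channel location-aware field F_pf, use it as
        sampling coordinates to resample F_DH into a boundary feature map F_pb,
        and take F_po = F_DH - F_pb as the plot object feature map."""
        def __init__(self, ch: int):
            super().__init__()
            self.field = nn.Sequential(
                nn.Conv2d(ch, ch, 3, padding=1), nn.BatchNorm2d(ch), nn.ReLU(inplace=True),
                nn.Conv2d(ch, 2, 3, padding=1),   # 2 channels: sampling coordinates
            )

        def forward(self, f_dh):
            f_pf = self.field(f_dh)                          # location-aware field F_pf
            grid = torch.tanh(f_pf).permute(0, 2, 3, 1)      # (N, H, W, 2) in [-1, 1]
            f_pb = F.grid_sample(f_dh, grid, mode="bilinear", align_corners=False)
            f_po = f_dh - f_pb                               # plot object features
            return f_pb, f_po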
Step 3.3, the feature map F_pb of step 3.2 and the feature map F_hr of step 2.4 are input into the boundary keeping module to further learn plot edge features, with the formula:

    Y_boundary = Sigmoid(Conv_{1×1}(Convblock_1(C(F_pb, F_hr))))

where C denotes the concatenation of input features along the channel dimension; Convblock_1 denotes a 3 × 3 convolution-batch normalization-activation module; finally a 1 × 1 convolution integrates the class layers and a Sigmoid activation function outputs the plot boundary prediction result Y_boundary.
Step 3.4, the F_DH feature map of step 3.1, the boundary prediction result Y_boundary output in step 3.3 and the plot object feature map F_po of step 3.2 are input into the plot segmentation enhancement module, with the formula:

    Y_ipo = Sigmoid(Conv_{1×1}(F_DH))
    F_sp = (1 - Y_boundary) ⊙ F_po
    Y_rp = Sigmoid(Conv_{1×1}(C(F_sp, Y_ipo)))

where the input F_DH passes through a 1 × 1 convolution that integrates the class layers and a Sigmoid activation function to generate the initial plot object segmentation result Y_ipo; the plot boundary attention module then performs the (1 - Y_boundary) operation, i.e., the boundary prediction result Y_boundary output in step 3.3 is subtracted from an all-ones matrix and the result is multiplied element-wise with the plot object feature map F_po, suppressing plot boundary features and enhancing plot object features; the enhanced plot feature map F_sp is output, alleviating adhesion between plots; finally, the feature map F_sp and the initial plot segmentation result Y_ipo are stacked along the channel dimension and passed through a 1 × 1 convolution and a Sigmoid activation function to generate the final plot object segmentation result Y_rp.
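The boundary attention of step 3.4 can be written compactly; a minimal sketch with assumed channel counts:

    import torch
    import torch.nn as nn

    class PlotSegmentationEnhancement(nn.Module):
        """Step 3.4: initial object map Y_ipo from F_DH; boundary attention
        F_sp = (1 - Y_boundary) * F_po; final object map Y_rp from C(F_sp, Y_ipo)."""
        def __init__(self, ch_dh: int, ch_po: int):
            super().__init__()
            self.initial_head = nn.Conv2d(ch_dh, 1, kernel_size=1)
            self.final_head = nn.Conv2d(ch_po + 1, 1, kernel_size=1)

        def forward(self, f_dh, f_po, y_boundary):
            y_ipo = torch.sigmoid(self.initial_head(f_dh))     # initial segmentation
            f_sp = (1.0 - y_boundary) * f_po                   # suppress boundary pixels
            y_rp = torch.sigmoid(self.final_head(torch.cat([f_sp, y_ipo], dim=1)))
            return y_ipo, y_rp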
And 4, inputting the feature maps finally output by the multi-task module into the multi-task joint optimization loss function, outputting a loss value, and back-propagating the loss value to update the network model parameters. The method further comprises the following steps:
the segmentation result of the region of the multi-task segmentation result output by the step 3Y region Block boundary prediction resultsY boundary Initial block object segmentation resultY ipo And enhancing the land object segmentation resultY rp And inputting the data into a multitask joint loss function, wherein the formula is as follows:
Figure 22767DEST_PATH_IMAGE072
Figure 753962DEST_PATH_IMAGE073
wherein,
Figure 75222DEST_PATH_IMAGE074
respectively representing a real parcel region label, a parcel boundary label and a parcel object label,L region , L ip , L rp are both binary cross-entropy loss functions that,L bound is a combination of a binary cross entropy loss function and a Dice loss function, wherein,
Figure 95131DEST_PATH_IMAGE075
respectively a binary cross entropy loss function and a Dice loss function, for relieving the foreground and background imbalance phenomenon of the land parcel boundary, wherein the binary cross entropy loss function and the Dice loss function are used according to experience in the patent
Figure 661504DEST_PATH_IMAGE076
For balancing the multitasking losses.
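A sketch of this multi-task joint loss follows; the default weight values are placeholders, since the patent only states that the weights are set empirically.

    import torch
    import torch.nn.functional as F

    def dice_loss(pred: torch.Tensor, target: torch.Tensor, eps: float = 1.0) -> torch.Tensor:
        """Soft Dice loss for the boundary task (eases foreground/background imbalance)."""
        inter = (pred * target).sum()
        return 1.0 - (2.0 * inter + eps) / (pred.sum() + target.sum() + eps)

    def joint_loss(y_region, y_boundary, y_ipo, y_rp, g_region, g_boundary, g_parcel,
                   lam=(1.0, 1.0, 1.0), lam_bce=1.0, lam_dice=1.0):
        """L = λ1·L_region + λ2·L_bound + λ3·(L_ip + L_rp); L_bound = λ_bce·BCE + λ_dice·Dice."""
        l_region = F.binary_cross_entropy(y_region, g_region)
        l_bound = (lam_bce * F.binary_cross_entropy(y_boundary, g_boundary)
                   + lam_dice * dice_loss(y_boundary, g_boundary))
        l_ip = F.binary_cross_entropy(y_ipo, g_parcel)
        l_rp = F.binary_cross_entropy(y_rp, g_parcel)
        return lam[0] * l_region + lam[1] * l_bound + lam[2] * (l_ip + l_rp)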
Further, the specific implementation of step 5 includes:
and storing the optimal precision result of the trained model on the verification set, predicting the test set, and outputting a final block object prediction result and a block boundary prediction result.
Further, the specific implementation of step 6 includes:
step 6.1, predicting the block boundaryY Pb Inputting a morphological boundary enhancement module to enhance the edge connectivity in the process of extracting the land parcel boundary, wherein the formula is as follows:
Figure 716047DEST_PATH_IMAGE077
wherein
Figure 411471DEST_PATH_IMAGE078
Respectively, a block boundary prediction result and a structural element, wherein the structural elementSIs a predefined 3 x 3 size identity matrix, operated by morphological dilation
Figure 918675DEST_PATH_IMAGE079
Finally generating the land parcel boundaryEnhancing resultsY Db
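Morphological boundary enhancement is a single dilation; a sketch with OpenCV, where the predefined 3 × 3 unit matrix is realized as a 3 × 3 matrix of ones (an interpretation, since the exact element is not shown in the extracted text):

    import cv2
    import numpy as np

    def enhance_boundary(y_pb: np.ndarray, thresh: float = 0.5) -> np.ndarray:
        """Step 6.1: binarize the boundary probability map Y_Pb and apply one
        morphological dilation Y_Db = Y_Pb ⊕ S to improve edge connectivity."""
        binary = (y_pb > thresh).astype(np.uint8)
        s = np.ones((3, 3), np.uint8)          # structuring element S
        return cv2.dilate(binary, s, iterations=1)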
Step 6.2, the plot boundary enhancement result Y_Db and the plot object prediction result Y_Pp are input into the boundary-object interaction module to alleviate adhesion between plots, with the formula:

    Y_Mp = P(Y_Db, Y_Pp)

where the boundary-object interaction operation P(·,·) generates the final plot interaction result Y_Mp.
And 6.3, the result of step 6.2 is subjected to boundary smoothing and hole filling to generate the final plot vectorization result.
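A sketch of steps 6.2 and 6.3 follows. The interaction operation P(·,·) is not fully specified in the text; here it is interpreted as removing dilated-boundary pixels from the object mask, labeling connected components, filling holes, and polygonizing with rasterio/shapely, all of which are assumptions.

    import numpy as np
    from scipy import ndimage
    from rasterio import features
    from shapely.geometry import shape

    def vectorize_plots(y_db: np.ndarray, y_pp: np.ndarray, thresh: float = 0.5,
                        smooth_tol: float = 1.0):
        """Steps 6.2-6.3 sketch: boundary-object interaction, hole filling,
        polygonization and boundary smoothing; returns shapely polygons."""
        objects = (y_pp > thresh) & ~y_db.astype(bool)   # P(Y_Db, Y_Pp): cut objects at boundaries
        labels, _ = ndimage.label(objects)               # split into individual plots
        filled = np.zeros_like(labels)
        for lab in range(1, labels.max() + 1):
            filled[ndimage.binary_fill_holes(labels == lab)] = lab   # hole filling
        polygons = [shape(geom).simplify(smooth_tol)     # boundary smoothing
                    for geom, value in features.shapes(filled.astype(np.int32))
                    if value != 0]
        return polygons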
The specific embodiments described herein are merely illustrative of the spirit of the invention. Various modifications or additions may be made to the described embodiments or alternatives may be employed by those skilled in the art without departing from the spirit or ambit of the invention as defined in the appended claims.

Claims (7)

1. A high-standard farmland plot vectorization extraction method based on multi-task learning, characterized by comprising the following steps:
step 1, performing cropping and normalization preprocessing on the input image;
step 2, extracting multi-scale features and edge detail features of the preprocessed image by using a multi-scale module;
step 3, obtaining a plot region segmentation result, a plot boundary prediction result and a plot object segmentation result through a multi-task module according to the multi-scale features and the edge detail features;
step 4, inputting the results output in step 3 into a multi-task joint optimization loss function, outputting the loss value, and back-propagating the loss value to update the network model parameters, wherein the network model consists of the multi-scale module and the multi-task module;
step 5, inputting the remote sensing image into the network model, and outputting the plot boundary and plot object prediction results;
step 6, post-processing the prediction results to generate a vectorized plot result, realizing accurate extraction of plot objects and plot boundaries.
2. The high-standard farmland plot vectorization extraction method based on multi-task learning according to claim 1, characterized in that: the implementation of said step 1 is as follows,
crop the original large input image X with a uniform sliding window to generate a sample set, and normalize each small image tile band by band:

    b̄_i = (b_i - mean_i) / std_i

where b_i, mean_i and std_i respectively denote the raster data, the mean and the standard deviation of the i-th band of the original remote sensing image, and b̄_i denotes the normalized raster data of the i-th band; all images of the sample set are normalized, and the normalized sample set X̄ is output.
3. The high-standard farmland plot vectorization extraction method based on multitask learning according to claim 1, characterized in that: the implementation of said step 2 is as follows,
step 2.1, the preprocessed sample set
Figure 395544DEST_PATH_IMAGE004
Input devicekExtracting input image from network structure composed of series dense feature extraction modulesX 0 Of (2) a dense meshTarget characteristics, each module respectively corresponding to the output dense characteristics
Figure 536676DEST_PATH_IMAGE005
The formula is as follows:
Figure 362549DEST_PATH_IMAGE006
Figure 432399DEST_PATH_IMAGE007
in the above formula, each dense feature extraction moduleDense i (. C) contains 3 dense feature convolution modulesConvdense 1 (·),Convdense 2 (·), Convdense 3 (·), X i-1 Represents the firstiDense feature extraction moduleDense i (ii) an input profile of the (v),Convdense(. Cndot.) represents a dense feature convolution module, consisting of two convolution-batch normalization-activation modules,Crepresents the superposition of input features in the channel dimension,x 1 , x 2 respectively representing the output characteristic diagram of a first intensive characteristic convolution module and the output characteristic diagram of a second intensive characteristic convolution module in each intensive characteristic extraction module;
step 2.2, outputting the 2.1 multi-stage characteristic diagram
Figure 914196DEST_PATH_IMAGE008
Input to a decoder comprisingkAn up-sampling module, the formula is as follows:
Figure 124597DEST_PATH_IMAGE009
in the above formula, the first and second carbon atoms are,
Figure 742660DEST_PATH_IMAGE010
represents the firstkThe individual up-sampling modules output a profile map,Convblockrepresents a 3 x 3 convolution module, consisting of two convolution-batch normalization-activation modules,C(. Cndot.) represents the superposition of input features in the channel dimension,F interpolate (. Cndot.) represents a bilinear upsampling of the input feature map in a spatial dimension twice the original dimensionk=At 1 time, will
Figure 13105DEST_PATH_IMAGE011
Bilinear upsampled sumX n-1 The input features are superposed in the channel dimension and then are obtained through a 3 multiplied by 3 convolution module
Figure 14821DEST_PATH_IMAGE012
When it comes to
Figure 701017DEST_PATH_IMAGE013
And so on to finally obtain
Figure 501483DEST_PATH_IMAGE014
Then, it is named as a feature mapF dense
Step 2.3, the image X_0 of the preprocessed sample set X̄ is input into a network structure composed of k serially connected high-resolution feature extraction modules to extract high-resolution detail features of the original input image; the modules respectively output the detail features X_h1, X_h2, …, X_hk, with the formula:

    X_hj = Convblock(F_interpolate(X_h(j-1))), j = 1, …, k, with X_h0 = X_0

In the above formula, Convblock denotes a 3 × 3 convolution module composed of two convolution-batch normalization-activation blocks; F_interpolate(·) denotes bilinear upsampling of the input feature map to twice its original spatial size; X_hk is the feature obtained by passing the original input X_0 through k high-resolution feature extraction modules;
step 2.4, the detail of step 2.3 is characterized
Figure 514383DEST_PATH_IMAGE018
The data is input into a network formed by the down-sampling modules, and the formula is as follows:
Figure 64314DEST_PATH_IMAGE019
in the above-mentioned formula, the compound has the following structure,
Figure 804736DEST_PATH_IMAGE020
represents the firstkThe down-sampling module outputs a feature map,F interpolate_down (. Cndot.) represents a bilinear down-sampling of the input feature map in a spatial dimension of 0.5 times the original size,Convblockrepresents a 3 x 3 convolution module, consists of 2 convolution-batch normalization-activation modules,Cdenotes the superposition of input features in the channel dimension when
Figure 98315DEST_PATH_IMAGE021
When in use, willX hn Bilinear downsampling with size of 0.5 times andX hn-1 the input features are superposed in the channel dimension and then are obtained through a 3 multiplied by 3 convolution module
Figure 843679DEST_PATH_IMAGE022
When it comes to
Figure 830089DEST_PATH_IMAGE023
And so on to finally obtain
Figure 792229DEST_PATH_IMAGE024
Then, it is named as a feature mapF hr
4. The high-standard farmland plot vectorization extraction method based on multitask learning according to claim 3, characterized in that: the implementation of said step 3 is as follows,
step 3.1, outputting steps 2.2 and 2.4F dense AndF hr stacking in channel dimension, outputtingF DH Characteristic diagram ofF DH Inputting the result to a land region segmentation module and outputting the resultY region The formula is as follows:
Figure 623919DEST_PATH_IMAGE025
whereinConvblock 1 Represents a 3 × 3 convolution batch normalization and activation module, eventually by 1 × 1 convolution sumSigmoidActivating function, and finally outputting the result of block region segmentationY region
Step 3.2, the F_DH feature map of step 3.1 is input into the boundary plot generation module to obtain the plot boundary feature map F_pb and the plot object feature map F_po, with the formula:

    F_pf = Convblock(F_DH)
    F_pb = gridsample(F_DH, F_interpolate_size(F_pf, Size(F_DH)))
    F_po = F_DH - F_pb

where Convblock denotes a 3 × 3 convolution module composed of 2 convolution-batch normalization-activation blocks; F_DH passes through Convblock to output the location-aware field feature map F_pf; F_interpolate_size(·, Size(·)) denotes bilinear interpolation of the input feature map to a specified size, here interpolating the perceptual field feature map F_pf to the spatial size of the F_DH feature map; the grid sampling operation gridsample takes the pixel values of the perceptual field feature map as row-column coordinates of the feature map and samples the pixel values on the spatial grid; the sampled feature map is the plot boundary feature map F_pb; the plot boundary feature map F_pb is then subtracted from the F_DH feature map to obtain the plot object feature map F_po;
Step 3.3, the feature map F_pb of step 3.2 and the feature map F_hr of step 2.4 are input into the boundary keeping module to further learn plot edge features, with the formula:

    Y_boundary = Sigmoid(Conv_{1×1}(Convblock_1(C(F_pb, F_hr))))

where C denotes the concatenation of input features along the channel dimension; Convblock_1 denotes a 3 × 3 convolution-batch normalization-activation module; finally a 1 × 1 convolution integrates the class layers and a Sigmoid activation function outputs the plot boundary prediction result Y_boundary;
Step 3.4, the F_DH feature map of step 3.1, the boundary prediction result Y_boundary output in step 3.3 and the plot object feature map F_po of step 3.2 are input into the plot segmentation enhancement module, with the formula:

    Y_ipo = Sigmoid(Conv_{1×1}(F_DH))
    F_sp = (1 - Y_boundary) ⊙ F_po
    Y_rp = Sigmoid(Conv_{1×1}(C(F_sp, Y_ipo)))

where the input F_DH passes through a 1 × 1 convolution that integrates the class layers and a Sigmoid activation function to generate the initial plot object segmentation result Y_ipo; the plot boundary attention module then performs the (1 - Y_boundary) operation, i.e., the boundary prediction result Y_boundary output in step 3.3 is subtracted from an all-ones matrix and the result is multiplied element-wise with the plot object feature map F_po, suppressing plot boundary features and enhancing plot object features; the enhanced plot feature map F_sp is output, alleviating adhesion between plots; finally, the feature map F_sp and the initial plot segmentation result Y_ipo are stacked along the channel dimension and passed through a 1 × 1 convolution and a Sigmoid activation function to generate the final plot object segmentation result Y_rp.
5. The high-standard farmland plot vectorization extraction method based on multi-task learning according to claim 4, characterized in that: the implementation of said step 4 is as follows,
the multi-task segmentation results output by step 3, namely the region segmentation result Y_region, the plot boundary prediction result Y_boundary, the initial plot object segmentation result Y_ipo and the enhanced plot object segmentation result Y_rp, are input into the multi-task joint loss function, with the formula:

    L = λ_1 · L_region(Y_region, G_region) + λ_2 · L_bound(Y_boundary, G_boundary) + λ_3 · (L_ip(Y_ipo, G_parcel) + L_rp(Y_rp, G_parcel))
    L_bound = λ_bce · L_bce(Y_boundary, G_boundary) + λ_dice · L_dice(Y_boundary, G_boundary)

where G_region, G_boundary and G_parcel respectively denote the real plot region label, plot boundary label and plot object label; L_region, L_ip and L_rp are all binary cross-entropy loss functions; L_bound is a combination of a binary cross-entropy loss function and a Dice loss function, where L_bce and L_dice respectively denote the binary cross-entropy loss and the Dice loss, used to alleviate the foreground-background imbalance of plot boundaries; λ_1, λ_2, λ_3, λ_bce and λ_dice are smoothing parameters used to balance the multi-task losses.
6. The high-standard farmland plot vectorization extraction method based on multi-task learning according to claim 1, characterized in that: the implementation of said step 5 is as follows,
the optimal-accuracy checkpoint of the trained network model on the validation set is saved, the test set is predicted, and the final plot object prediction result and plot boundary prediction result are output.
7. The high-standard farmland plot vectorization extraction method based on multi-task learning according to claim 1, characterized in that: the implementation of said step 6 is as follows,
step 6.1, the plot boundary prediction result Y_Pb is input into a morphological boundary enhancement module to enhance edge connectivity of the extracted plot boundaries, with the formula:

    Y_Db = Y_Pb ⊕ S

where Y_Pb and S respectively denote the plot boundary prediction result and the structuring element; the structuring element S is a predefined 3 × 3 unit matrix; the morphological dilation operation ⊕ finally generates the plot boundary enhancement result Y_Db;
step 6.2, the plot boundary enhancement result Y_Db and the plot object prediction result Y_Pp are input into a boundary-object interaction module to alleviate adhesion between plots, with the formula:

    Y_Mp = P(Y_Db, Y_Pp)

where the boundary-object interaction operation P(·,·) generates the final plot interaction result Y_Mp;
step 6.3, the result of step 6.2 is subjected to boundary smoothing and hole filling to generate the final plot vectorization result.
CN202211257905.8A 2022-10-14 2022-10-14 High-standard farmland land mass vectorization extraction method based on multi-task learning Active CN115346137B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202211257905.8A CN115346137B (en) 2022-10-14 2022-10-14 High-standard farmland land mass vectorization extraction method based on multi-task learning

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202211257905.8A CN115346137B (en) 2022-10-14 2022-10-14 High-standard farmland land mass vectorization extraction method based on multi-task learning

Publications (2)

Publication Number Publication Date
CN115346137A 2022-11-15
CN115346137B (en) 2023-03-21

Family

ID=83957086

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202211257905.8A Active CN115346137B (en) 2022-10-14 2022-10-14 High-standard farmland land mass vectorization extraction method based on multi-task learning

Country Status (1)

Country Link
CN (1) CN115346137B (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10482603B1 (en) * 2019-06-25 2019-11-19 Artificial Intelligence, Ltd. Medical image segmentation using an integrated edge guidance module and object segmentation network
CN112668421A (en) * 2020-12-18 2021-04-16 武汉大学 Attention mechanism-based rapid classification method for hyperspectral crops of unmanned aerial vehicle
WO2022099303A1 (en) * 2020-11-06 2022-05-12 The Regents Of The University Of California Machine learning techniques for tumor identification, classification, and grading
CN114821315A (en) * 2022-04-24 2022-07-29 福州大学 Remote sensing image cultivated land plot extraction method combining edge detection and multitask learning

Also Published As

Publication number Publication date
CN115346137B (en) 2023-03-21

Similar Documents

Publication Publication Date Title
CN112052886B (en) Intelligent human body action posture estimation method and device based on convolutional neural network
CN109584248B (en) Infrared target instance segmentation method based on feature fusion and dense connection network
CN110033002B (en) License plate detection method based on multitask cascade convolution neural network
CN109726627B (en) Neural network model training and universal ground wire detection method
CN110555458A (en) Multi-band image feature level fusion method for generating countermeasure network based on attention mechanism
CN111738344B (en) Rapid target detection method based on multi-scale fusion
CN111612017B (en) Target detection method based on information enhancement
CN113627472B (en) Intelligent garden leaf feeding pest identification method based on layered deep learning model
CN110458192B (en) Hyperspectral remote sensing image classification method and system based on visual saliency
CN112132844A (en) Recursive non-local self-attention image segmentation method based on lightweight
CN110210422B (en) Ship ISAR image identification method based on optical image assistance
CN111476133B (en) Unmanned driving-oriented foreground and background codec network target extraction method
CN111860537B (en) Deep learning-based green citrus identification method, equipment and device
CN111428758A (en) Improved remote sensing image scene classification method based on unsupervised characterization learning
CN115222946A (en) Single-stage example image segmentation method and device and computer equipment
Mohmmad et al. A survey machine learning based object detections in an image
CN110633640A (en) Method for identifying complex scene by optimizing PointNet
CN112396036A (en) Method for re-identifying blocked pedestrians by combining space transformation network and multi-scale feature extraction
CN115471675A (en) Disguised object detection method based on frequency domain enhancement
CN114581789A (en) Hyperspectral image classification method and system
CN112329647A (en) Land use type identification method based on U-Net neural network
CN116758415A (en) Lightweight pest identification method based on two-dimensional discrete wavelet transformation
CN115346137B (en) High-standard farmland land mass vectorization extraction method based on multi-task learning
CN115965905A (en) Crowd counting method and system based on multi-scale fusion convolutional network
Xie et al. Pedestrian detection and location algorithm based on deep learning

Legal Events

Code Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant