CN115272529B - Layout-first multi-scale decoupling ocean remote sensing image coloring method and system

Info

Publication number
CN115272529B
CN115272529B (application CN202211186781.9A)
Authority
CN
China
Prior art keywords
scale
decoupling
features
layout
module
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202211186781.9A
Other languages
Chinese (zh)
Other versions
CN115272529A (en)
Inventor
聂婕
王京禹
赵恩源
魏志强
刘安安
宋丹
李文辉
孙正雅
张文生
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Ocean University of China
Original Assignee
Ocean University of China
Application filed by Ocean University of China
Priority to CN202211186781.9A
Publication of CN115272529A
Application granted
Publication of CN115272529B
Legal status: Active

Classifications

    • G06T11/40 Filling a planar surface by adding surface attributes, e.g. colour or texture
    • G06T11/001 Texturing; Colouring; Generation of texture or colour
    • G06T7/11 Region-based segmentation
    • G06V10/26 Segmentation of patterns in the image field; Cutting or merging of image elements to establish the pattern region
    • G06V10/40 Extraction of image or video features
    • G06V10/56 Extraction of image or video features relating to colour
    • G06V10/761 Proximity, similarity or dissimilarity measures
    • G06V10/806 Fusion of extracted features at the feature extraction level
    • G06V10/82 Image or video recognition or understanding using neural networks
    • G06T2207/10024 Color image
    • G06T2207/20081 Training; Learning
    • G06T2207/20084 Artificial neural networks [ANN]
    • G06T2207/20221 Image fusion; Image merging

Abstract

The invention belongs to the technical field of image processing and discloses a layout-first multi-scale decoupling ocean remote sensing image coloring method and system. An input original gray-scale image is downsampled into several gray-scale images of different scales, which are fed to a multi-scale decoupling feature extraction module to extract multi-scale decoupled features. The multi-scale decoupled features are input into a layout-first multi-scale feature fusion module, which guides the semantic features with the enhanced layout-partition features, then guides the coloring features with the layout-constrained semantic features, and fuses the extracted multi-scale decoupled features. Finally a color image is generated; a discriminator distinguishes the generated color image from the original color image and outputs the discrimination result. The method solves the spatial-layout consistency problem of ocean remote sensing images, the problem that a large amount of noise is introduced when large-scale features are extracted after downsampling, and the problem that the large-scale constraint is weaker than the small-scale constraint during multi-scale information utilization.

Description

Layout-first multi-scale decoupling ocean remote sensing image coloring method and system
Technical Field
The invention belongs to the technical field of image processing, and particularly relates to a layout-first multi-scale decoupling ocean remote sensing image coloring method and system.
Background
Ocean remote sensing image coloring is the process of generating a color remote sensing image from a gray-scale remote sensing image; colorizing gray-scale remote sensing images improves their interpretability and analysis. Traditional coloring methods implement the coloring process with a single-scale generation network, so the results lack spatial consistency. Because ocean remote sensing images have an unbalanced spatial layout, state-of-the-art remote sensing image coloring methods take a generative adversarial network as the basic framework and realize coloring with a multi-scale U-net-based generator. Such methods use the large scales of the multi-scale generator to constrain the small scales, which ensures the consistency of the image spatial layout to some extent.
However, the conventional methods have the following problems:
First, the geospatial layout consistency constraint is ignored. Although existing coloring methods adopt multi-scale modeling and can improve the spatial consistency of large-scale targets, they lack a consistency constraint on the layout (sky, sea surface, land) and therefore cannot guarantee layout consistency for ocean remote sensing images, which contain continuous regions of very large area. For example, on a continuous sea-surface region, light spots caused by strong reflection from the sea-surface water body are easily misclassified as ships or ice; existing methods do not exploit the layout information to disambiguate such pixels, so coloring errors occur.
Second, no scale decoupling is performed, so the large scales contain a large amount of noise. The large scales extract coloring features directly from the downsampled gray-scale image, so the extracted features still contain a large amount of small-scale information. For the large scales this small-scale information is redundant noise and harms the spatial consistency of the ocean remote sensing image.
Third, during multi-scale information fusion, most existing methods consider several scale features simultaneously and ignore how different scales influence the coloring result in different scenes. For example, a ship pixel is resolved at the small scale but easily judged as sea at the large scale, so the small scale should dominate; conversely, an ambiguous pixel such as a light-spot pixel is misjudged as a ship at the small scale (high-resolution remote sensing image) but is smoothed away by downsampling and correctly judged as ocean at the large scale (low-resolution remote sensing image), so the large scale should dominate. Since large-scale scenes are more common in ocean remote sensing images, large-scale features deserve more attention. Existing methods fuse two scales simply by element-wise addition of their features and do not model the dominant role of large-scale features in ocean remote sensing image coloring.
Disclosure of Invention
Aiming at the defects of the prior art, the invention provides a layout-first multi-scale decoupling ocean remote sensing image coloring method and system. The layout partition module constrains the generation of semantic features, which in turn constrain the coloring and ensure coloring consistency; the multi-scale decoupling method avoids introducing noise into the layout partition module and the semantic constraint module and yields the initial features for layout partition and semantic segmentation, so that effective layout-partition and semantic features are generated to guide image coloring and guarantee the spatial consistency of coloring.
In order to solve the above technical problems, the invention adopts the following technical scheme:
The invention first provides a layout-first multi-scale decoupling ocean remote sensing image coloring method, which is based on a generative adversarial network architecture, generates a color image through a generator G, and distinguishes real images from generated images through a discriminator D; the method specifically comprises the following steps:
step 1, image input: the input image is the original gray-scale map $x_1$, which is downsampled into two gray-scale maps $x_2$ and $x_3$ of different scales, where the sizes of $x_1$, $x_2$ and $x_3$ decrease in turn;
step 2, designing a multi-scale decoupling feature extraction module to process the input images of step 1 and extract multi-scale decoupled features; the multi-scale decoupling feature extraction module comprises a multi-scale feature decoupling module, a layout partition module, a semantic constraint module and an image coloring module, wherein the multi-scale feature decoupling module comprises a multi-scale feature decoupling module I and a multi-scale feature decoupling module II of identical structure for performing multi-scale feature decoupling; the specific steps for extracting the multi-scale decoupled features are as follows:
step 2.1, inputting the input images with different scales in the step 1 into a multi-scale feature decoupling module for multi-scale feature decoupling, and respectively generating decoupling features aiming at scale tasks;
step 2.2, the decoupling features generated in the step 2.1 are subjected to layout division, semantic constraint and image coloring processing to generate layout division features, semantic features and coloring features, wherein the features obtained by the original gray level map are directly input into an image coloring module, the decoupling features output by the multi-scale feature decoupling module I are input into a semantic constraint module, and the decoupling features output by the multi-scale feature decoupling module II are input into a layout division module;
step 3, designing a layout-first multi-scale feature fusion module: guiding the semantic features with the enhanced layout-partition features, then guiding the coloring features with the layout-constrained semantic features, and fusing the multi-scale decoupled features extracted in step 2;
step 4, generating a color image;
and 5, distinguishing the color image generated in the step 4 from the original color image through a discriminator D, and outputting a distinguishing result.
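Read as a data flow, steps 1-5 compose as in the sketch below. This is a minimal PyTorch-style outline under the assumption that each attribute (stem1, decouple_I, fuse, and so on) merely stands in for the corresponding component described above; none of these names come from the patent.

```python
import torch.nn.functional as F

def color_and_discriminate(g, d, x1):
    """One pass through steps 1-5; g bundles the generator's sub-modules
    (hypothetical attribute names) and d is the discriminator D."""
    # Step 1: original gray-scale map plus two downsampled scales.
    x2, x3 = F.avg_pool2d(x1, 2), F.avg_pool2d(x1, 4)
    # Step 2.1: initial per-scale features, then decoupling (modules I and II).
    f_c, f_s, f_l = g.stem1(x1), g.stem2(x2), g.stem3(x3)
    f_s = g.decouple_I(f_c, f_s)   # decoupled semantic feature
    f_l = g.decouple_II(f_s, f_l)  # decoupled layout feature
    # Step 2.2: coloring, semantic-constraint and layout-partition branches.
    col, sem, lay = g.coloring(f_c), g.semantic(f_s), g.layout(f_l)
    # Steps 3-4: layout-first fusion yields the color image.
    y_hat = g.fuse(lay, sem, col)
    # Step 5: the discriminator scores the generated image.
    return y_hat, d(y_hat)
```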
Further, in step 2.1, the multi-scale feature decoupling module performs multi-scale feature decoupling, with the following specific operations:
First, convolution is applied separately to the original gray-scale map $x_1$ of step 1 and to the two downsampled gray-scale maps $x_2$ and $x_3$, obtaining the initial coloring feature $F_c$, the initial semantic feature $F_s$ and the initial layout feature $F_l$.
Second, multi-scale feature decoupling is completed with these feature maps. Specifically, in multi-scale feature decoupling module I, the feature $F_c$ is average-pooled to obtain $\mathrm{Pool}(F_c)$, and subtracting it from the feature $F_s$ gives the decoupled feature $D_1 = F_s - \mathrm{Pool}(F_c)$; similarly, the feature $F_s$ is upsampled to obtain $\mathrm{Up}(F_s)$, and subtracting the feature $F_c$ from it gives the decoupled feature $D_2 = \mathrm{Up}(F_s) - F_c$; $D_1$ and $D_2$ are added (with the two difference maps brought to a common resolution) to obtain the decoupled semantic feature $\hat{F}_s$. Expressed as a formula:

$\hat{F}_s = \big(F_s - \mathrm{Pool}(F_c)\big) + \big(\mathrm{Up}(F_s) - F_c\big)$

where $\mathrm{Up}(\cdot)$ denotes the upsampling operation and $\mathrm{Pool}(\cdot)$ denotes the average-pooling operation.
Likewise, the feature $\hat{F}_s$ and the feature $F_l$ undergo feature decoupling in multi-scale feature decoupling module II, which has the same structure as module I. Specifically, in module II, the feature $\hat{F}_s$ is average-pooled to obtain $\mathrm{Pool}(\hat{F}_s)$, and subtracting it from the feature $F_l$ gives the decoupled feature $D_3 = F_l - \mathrm{Pool}(\hat{F}_s)$; similarly, the feature $F_l$ is upsampled to obtain $\mathrm{Up}(F_l)$, and subtracting the feature $\hat{F}_s$ from it gives the decoupled feature $D_4 = \mathrm{Up}(F_l) - \hat{F}_s$; $D_3$ and $D_4$ are added to obtain the decoupled layout feature $\hat{F}_l$. Expressed as a formula:

$\hat{F}_l = \big(F_l - \mathrm{Pool}(\hat{F}_s)\big) + \big(\mathrm{Up}(F_l) - \hat{F}_s\big)$

where $\mathrm{Up}(\cdot)$ denotes the upsampling operation and $\mathrm{Pool}(\cdot)$ denotes the average-pooling operation.
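A minimal sketch of one decoupling module implementing the two formulas above, assuming PyTorch; the final average pooling that brings the fine-resolution difference to the coarse resolution before the sum is an alignment assumption on our part, since the patent only states that the two differences are added:

```python
import torch
import torch.nn.functional as F

def decouple(f_fine: torch.Tensor, f_coarse: torch.Tensor) -> torch.Tensor:
    """Shared form of decoupling modules I and II.

    f_fine   -- finer-scale feature, e.g. F_c at 256 x 256 (module I)
    f_coarse -- coarser-scale feature, e.g. F_s at 128 x 128 (module I)
    Returns the decoupled feature at the coarse resolution.
    """
    d1 = f_coarse - F.avg_pool2d(f_fine, kernel_size=2)      # F_s - Pool(F_c)
    d2 = F.interpolate(f_coarse, scale_factor=2.0) - f_fine  # Up(F_s) - F_c
    # Assumed alignment: pool d2 back to the coarse resolution, then add.
    return d1 + F.avg_pool2d(d2, kernel_size=2)

f_c = torch.randn(1, 32, 256, 256)   # initial coloring feature
f_s = torch.randn(1, 32, 128, 128)   # initial semantic feature
f_s_dec = decouple(f_c, f_s)         # module I output, (1, 32, 128, 128)
f_l = torch.randn(1, 32, 64, 64)     # initial layout feature
f_l_dec = decouple(f_s_dec, f_l)     # module II output, (1, 32, 64, 64)
```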
Further, the layout partition module in step 2.2 comprises two operations: one is to extract semantic features through a pre-trained U-net network for the semantic segmentation task, and the other is to calculate and merge similar semantic regions according to their correlation to generate the layout partition map.
Further, the specific operation of the layout partition module in step 2.2 is as follows:
First, a semantic segmentation map is generated by a pre-trained U-Net network, and the last-layer feature $F_u$ of the pre-trained U-Net is taken out. In $F_u$, the feature values at the positions corresponding to each semantic region of the semantic segmentation map are averaged to compute the centroid of each semantic region. In addition, $F_u$ is divided into $q^2/r^2$ blocks of size $r \times r$, where $q$ is the side length of $F_u$ and $r$, a common factor of $q$, is the side length of each block. Each block is represented by the centroid of the semantic region it belongs to, and taking the absolute differences between the centroid of a block P and the centroids of its 8 surrounding blocks gives the 8-dimensional feature representation of block P.
Second, similar semantic regions are merged to generate the layout-partition feature: if two adjacent blocks A and B do not belong to the same semantic region, their cosine similarity is computed as

$\mathrm{sim}(A, B) = \dfrac{\sum_{z=1}^{8} A_z B_z}{\sqrt{\sum_{z=1}^{8} A_z^2}\,\sqrt{\sum_{z=1}^{8} B_z^2}}$

where $A_z$ denotes the z-th component of the 8-dimensional vector of block A and $B_z$ the z-th component of the vector of block B, with z indexing the dimensions of the 8-dimensional vectors. If the cosine similarity $\mathrm{sim}(A, B)$ is greater than a threshold $\theta$, the semantic regions to which the two blocks belong are merged to generate the layout-partition feature. The concrete merging method is: compute the mean $\bar{z}$ of the centroids of the two semantic regions and set the pixels contained in both semantic regions to $\bar{z}$, obtaining a feature that represents the layout partition.
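The block representation and the merging test reduce to a few array operations; a minimal NumPy sketch, assuming the centroid map already assigns each pixel the centroid value of its semantic region, and using an arbitrary placeholder threshold (the patent leaves $\theta$ unspecified here):

```python
import numpy as np

def block_features(centroid_map: np.ndarray, r: int) -> np.ndarray:
    """8-d feature per r x r block of a (q, q) centroid map (q divisible by r)."""
    q = centroid_map.shape[0]
    # One centroid value per block (each block lies in one semantic region).
    blocks = centroid_map.reshape(q // r, r, q // r, r).mean(axis=(1, 3))
    padded = np.pad(blocks, 1, mode="edge")   # replicate borders for neighbors
    h, w = blocks.shape
    feats = np.empty((h, w, 8))
    k = 0
    for dy in (-1, 0, 1):
        for dx in (-1, 0, 1):
            if dy == 0 and dx == 0:
                continue
            neigh = padded[1 + dy:1 + dy + h, 1 + dx:1 + dx + w]
            feats[..., k] = np.abs(blocks - neigh)   # |i - a|, ..., |i - h|
            k += 1
    return feats

def should_merge(vec_a: np.ndarray, vec_b: np.ndarray, theta: float = 0.9) -> bool:
    """Cosine-similarity merge test for adjacent blocks of different regions."""
    sim = float(vec_a @ vec_b) / (np.linalg.norm(vec_a) * np.linalg.norm(vec_b) + 1e-8)
    return sim > theta   # if true, set both regions to their mean centroid
```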
Further, the layout-first multi-scale feature fusion module in step 3 specifically operates as follows:
Step 3.1, applying a Sigmoid function to the output features of the layout partition module, strengthening weakly salient layout-partition features to prevent their loss;
Step 3.2, multiplying the features processed in step 3.1 element-wise with the output features of the semantic constraint module, thereby generating semantic features under layout constraint;
Step 3.3, applying a Tanh function to the layout-constrained semantic features obtained in step 3.2 to realize feature mapping, then multiplying them with the generated features of the image coloring module as a constraint for the final coloring.
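Steps 3.1-3.3 amount to two gated element-wise products; a minimal PyTorch sketch, assuming the three feature maps have already been resampled to a common shape:

```python
import torch

def layout_first_fuse(lay: torch.Tensor, sem: torch.Tensor,
                      col: torch.Tensor) -> torch.Tensor:
    """lay, sem, col: layout-partition, semantic and coloring features."""
    gate = torch.sigmoid(lay)            # step 3.1: strengthen weak layout cues
    sem_constrained = gate * sem         # step 3.2: layout-constrained semantics
    return torch.tanh(sem_constrained) * col   # step 3.3: constrain the coloring
```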
Further, the network is optimized through a game between the generator G and the discriminator D, specifically as follows:
First, the generator G is trained with the discriminator fixed, using the following formulas:

$\mathcal{L}_{GAN} = \mathbb{E}_{X_1 \sim P(X_1), X_2 \sim P(X_2), X_3 \sim P(X_3)}\big[\log\big(1 - D(\hat{y})\big)\big] \quad (1)$

$\mathcal{L}_{L1} = \lambda\, \mathbb{E}\big[\lVert y - \hat{y} \rVert_1\big] \quad (2)$

$\mathcal{L}_{CE} = -\sum_{i=1}^{m} y_2^{(i)} \log p_i \quad (3)$

In formula (1), $x_1$ is the large-scale sample, $x_2$ the mesoscale sample and $x_3$ the small-scale sample; $\mathbb{E}[\cdot]$ is the expectation of $\log(1 - D(\hat{y}))$ with respect to $X_1 \sim P(X_1)$, $X_2 \sim P(X_2)$ and $X_3 \sim P(X_3)$, where $P(\cdot)$ denotes a probability distribution; $\hat{y} = G(x_1, x_2, x_3)$ is the generated color image sample and $y$ is the original image; $D(\hat{y})$ denotes the discriminator D judging the generated image, i.e. the probability that the discriminator judges the generated image to be real; when training the generator G, the smaller $\mathcal{L}_{GAN}$ the better.
In formula (2), $\mathcal{L}_{L1}$ is the L1 loss of the generated color image, which minimizes the sum of absolute differences between corresponding pixel values of the generated image and the real image, and $\lambda$ is a hyperparameter.
In formula (3), $\mathcal{L}_{CE}$ is the cross-entropy loss for the semantic segmentation map generated at the mesoscale, where $p_i$ is the output of the mesoscale network, i.e. the probability that the class is $i$; $i$ is the semantic class, $m$ is the total number of semantic classes, and $y_2$ is the ground truth of the mesoscale semantic segmentation result. The objective function (4) is thereby obtained:

$\min_{\theta_1, \theta_2, \theta_3} \mathcal{L}_G = \mathcal{L}_{GAN} + \mathcal{L}_{L1} + \mathcal{L}_{CE} \quad (4)$

In formula (4), $\theta_D$ denotes the model parameters of the discriminator D, $\theta_1$ the small-scale model parameters, $\theta_2$ the mesoscale model parameters, $\theta_3$ the large-scale model parameters, and $\mathcal{L}_G$ the loss of the generator G.
Then, the discriminator D is trained with the generator fixed, using the following formulas:

$\mathcal{L}_{real} = \mathbb{E}_{y \sim P(y)}\big[\log D(y_1)\big] \quad (5)$

$\mathcal{L}_{fake} = \mathbb{E}_{X_1 \sim P(X_1), X_2 \sim P(X_2), X_3 \sim P(X_3)}\big[\log\big(1 - D(\hat{y})\big)\big] \quad (6)$

In formula (5), $\mathbb{E}[\cdot]$ is the expectation of $\log D(y_1)$ with respect to $y$, where $P(\cdot)$ denotes a probability distribution; $D(y_1)$ denotes the discriminator D judging the real image $y_1$, i.e. the probability that the discriminator judges the real image to be real, which should be as large as possible.
In formula (6), $\mathbb{E}[\cdot]$ is the expectation of $\log(1 - D(\hat{y}))$ with respect to $X_1 \sim P(X_1)$, $X_2 \sim P(X_2)$ and $X_3 \sim P(X_3)$, where $P(\cdot)$ denotes a probability distribution; $D(\hat{y})$ denotes the discriminator judging the generated image, so $1 - D(\hat{y})$ is the probability that the discriminator judges the generated image to be fake, and the larger it is the better. The objective function (7) is thereby obtained:

$\max_{\theta_D} \mathcal{L}_D = \mathcal{L}_{real} + \mathcal{L}_{fake} \quad (7)$

In formula (7), $\mathcal{L}_D$ is the loss of the discriminator D.
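One alternating optimization step matching formulas (1)-(7) could look as follows; a hedged PyTorch sketch in which the packing of the three scale inputs into G, the mesoscale segmentation head, and the small epsilon inside the logs are assumptions, not details from the patent:

```python
import torch
import torch.nn.functional as F

def train_step(G, D, opt_G, opt_D, x1, x2, x3, y, y1, y2, lam=100.0, eps=1e-8):
    """Generator step (formulas 1-4), then discriminator step (formulas 5-7).
    y: real color image; y1: real image shown to D; y2: mesoscale
    ground-truth segmentation labels; D outputs a probability in (0, 1)."""
    # --- Train G with D fixed (objective 4) ---
    y_hat, seg_logits = G(x1, x2, x3)            # color image + mesoscale logits
    l_gan = torch.log(1.0 - D(y_hat) + eps).mean()          # formula (1)
    l_l1 = lam * F.l1_loss(y_hat, y)                        # formula (2)
    l_ce = F.cross_entropy(seg_logits, y2)                  # formula (3)
    loss_g = l_gan + l_l1 + l_ce
    opt_G.zero_grad(); loss_g.backward(); opt_G.step()
    # --- Train D with G fixed (objective 7, maximized, so negate) ---
    real = torch.log(D(y1) + eps).mean()                    # formula (5)
    fake = torch.log(1.0 - D(y_hat.detach()) + eps).mean()  # formula (6)
    loss_d = -(real + fake)
    opt_D.zero_grad(); loss_d.backward(); opt_D.step()
    return loss_g.item(), loss_d.item()
```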
The invention also provides a layout-first multi-scale decoupling ocean remote sensing image coloring system for implementing the above layout-first multi-scale decoupling ocean remote sensing image coloring method. The system comprises an input module, a generator and a discriminator. The input module feeds gray-scale images of different scales to the generator. The generator, which produces the color image, comprises a multi-scale decoupling feature extraction module, a layout-first multi-scale feature fusion module and a final coloring module; the multi-scale decoupling feature extraction module comprises a multi-scale feature decoupling module, a layout partition module, a semantic constraint module and an image coloring module, where the multi-scale feature decoupling module performs multi-scale feature decoupling on the input images to generate decoupled features for each scale task, and these decoupled features are processed by the layout partition module, the semantic constraint module and the image coloring module to produce the layout-partition features, semantic features and coloring features, respectively. The layout-first multi-scale feature fusion module guides the semantic features with the enhanced layout-partition features, guides the coloring features with the layout-constrained semantic features, and fuses the multi-scale decoupled features extracted by the multi-scale decoupling feature extraction module; the final coloring module then generates the color image. The discriminator compares the color image generated by the generator with the original color image and outputs the discrimination result.
Compared with the prior art, the invention has the advantages that:
(1) The invention designs a layout partition module and uses it to constrain the generation of semantic features, which constrains coloring and ensures coloring consistency. The semantic features are divided into several equal-size blocks, each represented by the centroid of its semantic region, and each block is characterized by its relation to the surrounding blocks (the absolute differences between the block's centroid and the centroids of the 8 surrounding blocks form the block's 8-dimensional feature representation). Compared with self-characterization alone, characterizing a block by its surroundings establishes relational descriptions between features and is more robust. Then the correlation between adjacent blocks belonging to different semantic regions is computed and correlated semantic regions are merged, generating the layout-partition features in an unsupervised manner; this overcomes the lack of layout labels while improving the generalization of the network.
(2) The invention uses a multi-scale feature decoupling method to avoid introducing noise into the layout partition module and the semantic constraint module, mining the latent features of each scale for scale-specific modeling. The initial coloring features are used to remove redundant noise from the initial semantic features, giving the decoupled semantic features; the decoupled semantic features are then used to remove redundant noise from the initial layout features, giving the decoupled layout features. This multi-scale decoupling yields the initial features for layout partition and semantic segmentation, so that effective layout-partition and semantic features are generated to guide image coloring and guarantee the spatial consistency of coloring. Concretely, the difference between the small-scale feature and the upsampled large-scale feature is computed, and at the same time the difference between the downsampled small-scale feature and the large-scale feature is computed; these two complementary differences fully decouple the macroscopic feature and better remove the noise.
(3) The invention realizes multi-scale feature fusion with a layout-first method, strengthening the dominant role of the layout features. First, the layout-partition features are activated with a Sigmoid function and multiplied with the semantic features for fusion, which reinforces the layout information, constrains the generation of semantic features, ensures the rationality of the generated semantics, and strengthens the dominant role of large-scale features in ocean remote sensing image coloring. Second, the semantic features are processed with a Tanh function to retain more accurate semantic information, and the processed features are multiplied with the coloring features for the final coloring. This layout-emphasizing fusion helps guarantee the spatial consistency of ocean remote sensing images, which contain many large-scale scenes.
Drawings
In order to more clearly illustrate the technical solutions of the embodiments of the present invention, the drawings needed to be used in the description of the embodiments are briefly introduced below, and it is obvious that the drawings in the following description are only some embodiments of the present invention, and it is obvious for those skilled in the art to obtain other drawings based on these drawings without creative efforts.
FIG. 1 is a structural diagram of a layout-first multi-scale decoupling marine remote sensing image coloring system according to an embodiment of the present invention;
FIG. 2 is a schematic diagram of a multi-scale feature decoupling module according to an embodiment of the invention.
Detailed Description
The invention is further described with reference to the following figures and specific embodiments.
Example 1
With reference to figs. 1-2, this embodiment designs a layout-first multi-scale decoupling ocean remote sensing image coloring method, which, based on a generative adversarial network architecture, generates a color image through a generator G and distinguishes real images from generated images through a discriminator D; the method specifically comprises the following steps:
Step 1, image input: the input image is the original gray-scale map $x_1$, which is downsampled into two gray-scale maps $x_2$ and $x_3$ of different scales, where the sizes of $x_1$, $x_2$ and $x_3$ decrease in turn.
In use, the input original gray-scale map $x_1$ may be downsampled to more scales; for ease of understanding, this embodiment takes only three different scales as an example. The gray-scale maps adopted in this embodiment are the original gray-scale map and the gray-scale maps downsampled to 1/2 and 1/4 of its size, and the input sizes of the three scales are 256×256, 128×128 and 64×64, respectively.
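As a quick illustration, the three scale inputs of this embodiment can be produced with two resampling calls; a minimal PyTorch sketch, assuming bilinear interpolation as the downsampler (the patent does not specify the resampling operator):

```python
import torch
import torch.nn.functional as F

x1 = torch.rand(1, 1, 256, 256)  # original gray-scale map, 256 x 256
x2 = F.interpolate(x1, size=(128, 128), mode="bilinear", align_corners=False)  # 1/2
x3 = F.interpolate(x1, size=(64, 64), mode="bilinear", align_corners=False)    # 1/4
```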
Step 2, designing a multi-scale decoupling feature extraction module to process the input images of step 1 and extract multi-scale decoupled features. The multi-scale decoupling feature extraction module comprises a multi-scale feature decoupling module, a layout partition module, a semantic constraint module and an image coloring module; the image coloring module and the semantic constraint module adopt a U-net structure, for which reference can be made to the prior art, so they are not described here, and the layout partition module is introduced in step 2.2. The multi-scale feature decoupling module comprises a multi-scale feature decoupling module I and a multi-scale feature decoupling module II of identical structure, used to perform multi-scale feature decoupling. The specific steps for extracting the multi-scale decoupled features are as follows:
Step 2.1, inputting the images of different scales from step 1 into the multi-scale feature decoupling module for multi-scale feature decoupling, generating decoupled features for each scale task. This step realizes feature decoupling using the features of different scales and solves the problem of a large amount of noise being introduced at the macro scale.
In step 2.1, the multi-scale feature decoupling module performs multi-scale feature decoupling, with the following specific operations:
First, convolution is applied separately to the original gray-scale map, the 2× downsampled gray-scale map and the 4× downsampled gray-scale map of step 1, obtaining the initial coloring feature $F_c$, the initial semantic feature $F_s$ and the initial layout feature $F_l$.
Second, multi-scale feature decoupling is completed with these feature maps. Specifically, in multi-scale feature decoupling module I, the feature $F_c$ is average-pooled to obtain $\mathrm{Pool}(F_c)$, and subtracting it from the feature $F_s$ gives the decoupled feature $D_1 = F_s - \mathrm{Pool}(F_c)$; similarly, the feature $F_s$ is upsampled to obtain $\mathrm{Up}(F_s)$, and subtracting the feature $F_c$ from it gives the decoupled feature $D_2 = \mathrm{Up}(F_s) - F_c$; $D_1$ and $D_2$ are added (with the two difference maps brought to a common resolution) to obtain the decoupled semantic feature $\hat{F}_s$. Expressed as a formula:

$\hat{F}_s = \big(F_s - \mathrm{Pool}(F_c)\big) + \big(\mathrm{Up}(F_s) - F_c\big)$

where $\mathrm{Up}(\cdot)$ denotes the upsampling operation and $\mathrm{Pool}(\cdot)$ denotes the average-pooling operation.
Likewise, the feature $\hat{F}_s$ and the feature $F_l$ undergo feature decoupling in multi-scale feature decoupling module II, which has the same structure as module I. Specifically, in module II, the feature $\hat{F}_s$ is average-pooled to obtain $\mathrm{Pool}(\hat{F}_s)$, and subtracting it from the feature $F_l$ gives the decoupled feature $D_3 = F_l - \mathrm{Pool}(\hat{F}_s)$; similarly, the feature $F_l$ is upsampled to obtain $\mathrm{Up}(F_l)$, and subtracting the feature $\hat{F}_s$ from it gives the decoupled feature $D_4 = \mathrm{Up}(F_l) - \hat{F}_s$; $D_3$ and $D_4$ are added to obtain the decoupled layout feature $\hat{F}_l$. Expressed as a formula:

$\hat{F}_l = \big(F_l - \mathrm{Pool}(\hat{F}_s)\big) + \big(\mathrm{Up}(F_l) - \hat{F}_s\big)$

where $\mathrm{Up}(\cdot)$ denotes the upsampling operation and $\mathrm{Pool}(\cdot)$ denotes the average-pooling operation.
Step 2.2, performing layout partition, semantic constraint and image coloring on the decoupled features generated in step 2.1 to generate the layout-partition features, semantic features and coloring features, wherein the features obtained from the original gray-scale map are directly input into the image coloring module, the decoupled features output by multi-scale feature decoupling module I are input into the semantic constraint module, and the decoupled features output by multi-scale feature decoupling module II are input into the layout partition module. At each scale, the input and output sizes of the image coloring module, the semantic constraint module and the layout partition module equal the input size of that scale, i.e. 256×256, 128×128 and 64×64, respectively.
The layout partition module in step 2.2 comprises two operations: one is to extract semantic features through a pre-trained U-net network for the semantic segmentation task, and the other is to calculate and merge similar semantic regions according to their correlation to generate the layout partition map.
The specific operation of the layout partition module is as follows:
First, a semantic segmentation map is generated by a pre-trained U-Net network, and the last-layer feature $F_u$ of the pre-trained U-Net is taken out. In $F_u$, the feature values at the positions corresponding to each semantic region of the semantic segmentation map are averaged to compute the centroid of each semantic region. In addition, $F_u$ is divided into $q^2/r^2$ blocks of size $r \times r$, where $q$ is the side length of $F_u$, which follows the input size of the third scale (64×64 in this embodiment), and $r$, a common factor of $q$, is the side length of each block, which guarantees that the whole map divides exactly. Each block is represented by the centroid of the semantic region it belongs to, and taking the absolute differences between the centroid of a block P and the centroids of its 8 surrounding blocks gives the 8-dimensional feature representation of block P. For example, if the centroid of block P is $i$ and the centroids of its 8 neighboring blocks are $a, b, c, d, e, f, g, h$, then block P is represented as the 8-dimensional vector $(|i-a|, |i-b|, |i-c|, |i-d|, |i-e|, |i-f|, |i-g|, |i-h|)$.
Second, similar semantic regions are merged to generate the layout-partition feature: if two adjacent blocks A and B do not belong to the same semantic region, their cosine similarity is computed as

$\mathrm{sim}(A, B) = \dfrac{\sum_{z=1}^{8} A_z B_z}{\sqrt{\sum_{z=1}^{8} A_z^2}\,\sqrt{\sum_{z=1}^{8} B_z^2}}$

where $A_z$ denotes the z-th component of the 8-dimensional vector of block A and $B_z$ the z-th component of the vector of block B, with z indexing the dimensions of the 8-dimensional vectors. If the cosine similarity $\mathrm{sim}(A, B)$ is greater than a threshold $\theta$, the semantic regions to which the two blocks belong are merged. The concrete merging method is: compute the mean $\bar{z}$ of the centroids of the two semantic regions and set the pixels contained in both semantic regions to $\bar{z}$, obtaining a feature that represents the layout partition.
Step 3, designing a layout-first multi-scale feature fusion module: guiding the semantic features with the enhanced layout-partition features, then guiding the coloring features with the layout-constrained semantic features, and fusing the multi-scale decoupled features extracted in step 2. The specific operations are as follows:
Step 3.1, applying a Sigmoid function to the output features of the layout partition module, strengthening weakly salient layout-partition features to prevent their loss and thereby strengthening the dominant role of large-scale features in ocean remote sensing image coloring;
Step 3.2, multiplying the features processed in step 3.1 element-wise with the output features of the semantic constraint module, so that the layout constrains the network to generate reasonable semantic features;
Step 3.3, applying a Tanh function to the layout-constrained semantic features obtained in step 3.2 to map the features, then multiplying them with the generated features of the image coloring module as a constraint for the final coloring. Using different activation functions strengthens the dominant role of large-scale features in ocean remote sensing image coloring, thereby guaranteeing the spatial consistency of the coloring.
Step 4, generating a color image.
Step 5, distinguishing the color image generated in step 4 from the original color image through the discriminator D, and outputting the discrimination result.
The following describes the training and loss functions of the generator G and the discriminator D. The invention optimizes the network through a game between the generator G and the discriminator D, as follows:
First, the generator G is trained with the discriminator fixed, using the following formulas:

$\mathcal{L}_{GAN} = \mathbb{E}_{X_1 \sim P(X_1), X_2 \sim P(X_2), X_3 \sim P(X_3)}\big[\log\big(1 - D(\hat{y})\big)\big] \quad (1)$

$\mathcal{L}_{L1} = \lambda\, \mathbb{E}\big[\lVert y - \hat{y} \rVert_1\big] \quad (2)$

$\mathcal{L}_{CE} = -\sum_{i=1}^{m} y_2^{(i)} \log p_i \quad (3)$

In formula (1), $x_1$ is the large-scale sample, $x_2$ the mesoscale sample and $x_3$ the small-scale sample; $\mathbb{E}[\cdot]$ is the expectation of $\log(1 - D(\hat{y}))$ with respect to $X_1 \sim P(X_1)$, $X_2 \sim P(X_2)$ and $X_3 \sim P(X_3)$, where $P(\cdot)$ denotes a probability distribution; $\hat{y} = G(x_1, x_2, x_3)$ is the generated color image sample and $y$ is the original image; $D(\hat{y})$ denotes the discriminator D judging the generated image, i.e. the probability that the discriminator judges the generated image to be real; when training the generator G, the smaller $\mathcal{L}_{GAN}$ the better. In formula (2), $\mathcal{L}_{L1}$ is the L1 loss of the generated color image, the sum of absolute differences between corresponding pixel values of the generated image and the real image, and $\lambda$ is a hyperparameter. In formula (3), $\mathcal{L}_{CE}$ is the cross-entropy loss for the semantic segmentation map generated at the mesoscale, where $p_i$ is the output of the mesoscale network, i.e. the probability that the class is $i$; $i$ is the semantic class, $m$ is the total number of semantic classes, and $y_2$ is the ground truth of the mesoscale semantic segmentation result. The objective function (4) is thereby obtained:

$\min_{\theta_1, \theta_2, \theta_3} \mathcal{L}_G = \mathcal{L}_{GAN} + \mathcal{L}_{L1} + \mathcal{L}_{CE} \quad (4)$

In formula (4), $\theta_D$ denotes the model parameters of the discriminator D, $\theta_1$ the small-scale model parameters, $\theta_2$ the mesoscale model parameters, $\theta_3$ the large-scale model parameters, and $\mathcal{L}_G$ the loss of the generator G; to minimize the loss of the generator G, the objective function (4) maximizes the score given by the discriminator D to the generated image and minimizes the pixel-wise difference between the generated image and the real image.
Then, the discriminator D is trained with the generator fixed, using the following formulas:

$\mathcal{L}_{real} = \mathbb{E}_{y \sim P(y)}\big[\log D(y_1)\big] \quad (5)$

$\mathcal{L}_{fake} = \mathbb{E}_{X_1 \sim P(X_1), X_2 \sim P(X_2), X_3 \sim P(X_3)}\big[\log\big(1 - D(\hat{y})\big)\big] \quad (6)$

In formula (5), $\mathbb{E}[\cdot]$ is the expectation of $\log D(y_1)$ with respect to $y$, where $P(\cdot)$ denotes a probability distribution; $D(y_1)$ denotes the discriminator D judging the real image $y_1$, i.e. the probability that the discriminator judges the real image to be real, which should be as large as possible. In formula (6), $\mathbb{E}[\cdot]$ is the expectation of $\log(1 - D(\hat{y}))$ with respect to $X_1 \sim P(X_1)$, $X_2 \sim P(X_2)$ and $X_3 \sim P(X_3)$, where $P(\cdot)$ denotes a probability distribution; $D(\hat{y})$ denotes the discriminator judging the generated image, so $1 - D(\hat{y})$ is the probability that the discriminator judges the generated image to be fake, and the larger it is the better. The objective function (7) is thereby obtained:

$\max_{\theta_D} \mathcal{L}_D = \mathcal{L}_{real} + \mathcal{L}_{fake} \quad (7)$

In formula (7), $\mathcal{L}_D$ is the loss of the discriminator D; the objective function (7) maximizes the score given by the discriminator D to the real image and minimizes the score given by the discriminator D to the generated image.
For the training process and the calculation of each loss function, reference can be made to the prior art; they are only briefly described here and are not detailed further.
As a preferred embodiment, this embodiment uses the NWPU-RESISC45 dataset. The dataset consists of 45 scene categories, each containing 700 images, for a total of 31,500 images of size 256×256. In each category, 500, 100 and 100 images are used as the training set, test set and validation set, respectively. An Adam optimizer is used, and the hyperparameter λ is set to 100. The network adopts an end-to-end batch training method, with each batch set to 8.
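The stated configuration maps directly onto an optimizer and data-loader setup; a minimal sketch with stand-in tensors and single-layer placeholder networks (a real run would load the NWPU-RESISC45 images and the full generator and discriminator described above):

```python
import torch
from torch import nn
from torch.utils.data import DataLoader, TensorDataset

# Stand-in data shaped like one training category: gray-scale inputs and
# color targets at 256 x 256 (real code would read NWPU-RESISC45 images).
train_set = TensorDataset(torch.rand(500, 1, 256, 256),
                          torch.rand(500, 3, 256, 256))
loader = DataLoader(train_set, batch_size=8, shuffle=True)  # batch size 8

G = nn.Conv2d(1, 3, 3, padding=1)   # placeholder generator
D = nn.Conv2d(3, 1, 3, padding=1)   # placeholder discriminator
opt_G = torch.optim.Adam(G.parameters())
opt_D = torch.optim.Adam(D.parameters())
lam = 100.0                          # hyperparameter lambda set to 100
```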
Example 2
This embodiment provides a layout-first multi-scale decoupling ocean remote sensing image coloring system comprising an input module, a generator and a discriminator. The input module feeds gray-scale images of different scales to the generator. The generator, which produces the color image, comprises a multi-scale decoupling feature extraction module, a layout-first multi-scale feature fusion module and a final coloring module; the multi-scale decoupling feature extraction module comprises a multi-scale feature decoupling module, a layout partition module, a semantic constraint module and an image coloring module, where the multi-scale feature decoupling module performs multi-scale feature decoupling on the input images to generate decoupled features for each scale task, and the layout partition module, the semantic constraint module and the image coloring module process these to produce the layout-partition features, semantic features and coloring features, respectively. The layout-first multi-scale feature fusion module guides the semantic features with the enhanced layout-partition features, guides the coloring features with the layout-constrained semantic features, and fuses the multi-scale decoupled features extracted by the multi-scale decoupling feature extraction module; the final coloring module then generates the color image. The discriminator compares the color image generated by the generator with the original color image and outputs the discrimination result. The system implements the layout-first multi-scale decoupling ocean remote sensing image coloring method described above; the functions of each module and the steps of the coloring method are as recorded in Example 1 and are not repeated here.
In summary, the invention designs a multi-scale decoupling feature extraction module and a layout-first multi-scale feature fusion module. For input gray-scale images of different scales, the multi-scale feature decoupling module first generates decoupled features for each scale task; the decoupled features then pass through the layout partition module, the semantic constraint module and the image coloring module to generate the layout-partition features, semantic features and coloring features; next, the layout-first multi-scale feature fusion module guides the semantic features with the enhanced layout-partition features and then guides the coloring features with the layout-constrained semantic features; finally, the final coloring module generates the color map. The invention solves the spatial-layout consistency problem of ocean remote sensing images, the problem that a large amount of noise is introduced when large-scale features are extracted after downsampling, and the problem that the large-scale constraint is weaker than the small-scale constraint during multi-scale information utilization.
It will be understood that the foregoing description is not intended to limit the invention, and that the invention is not limited to the examples described above, and that various changes, modifications, additions and substitutions which may be made by one of ordinary skill in the art without departing from the spirit of the invention are therefore intended to be included within the scope of the invention.

Claims (7)

1. A layout-first multi-scale decoupling ocean remote sensing image coloring method, based on a generative adversarial network architecture, in which a color image is generated through a generator G and real images and generated images are distinguished through a discriminator D, characterized by specifically comprising the following steps:
step 1, image input: the input image is the original gray-scale map $x_1$, which is downsampled into two gray-scale maps $x_2$ and $x_3$ of different scales, where the sizes of $x_1$, $x_2$ and $x_3$ decrease in turn;
step 2, designing a multi-scale decoupling feature extraction module to process the input images of step 1 and extract multi-scale decoupled features; the multi-scale decoupling feature extraction module comprises a multi-scale feature decoupling module, a layout partition module, a semantic constraint module and an image coloring module, wherein the multi-scale feature decoupling module comprises a multi-scale feature decoupling module I and a multi-scale feature decoupling module II of identical structure for performing multi-scale feature decoupling; the specific steps for extracting the multi-scale decoupled features are as follows:
step 2.1, inputting the input images with different scales in the step 1 into a multi-scale feature decoupling module for multi-scale feature decoupling, and respectively generating decoupling features aiming at scale tasks;
step 2.2, the decoupling features generated in the step 2.1 are subjected to layout division, semantic constraint and image coloring processing to generate layout division features, semantic features and coloring features, wherein the features obtained by the original gray-scale image are directly input into an image coloring module, the decoupling features output by the multi-scale feature decoupling module I are input into a semantic constraint module, and the decoupling features output by the multi-scale feature decoupling module II are input into a layout division module;
step 3, designing a layout-first multi-scale feature fusion module: inputting the multi-scale decoupled features extracted in step 2 into the layout-first multi-scale feature fusion module, guiding the semantic features with the enhanced layout-partition features, then guiding the coloring features with the layout-constrained semantic features, and fusing the multi-scale decoupled features extracted in step 2;
step 4, generating a color image;
and 5, distinguishing the color image generated in the step 4 from the original color image through a discriminator D, and outputting a distinguishing result.
2. The layout-first multi-scale decoupling marine remote sensing image coloring method according to claim 1, characterized in that in step 2.1, a multi-scale feature decoupling module performs multi-scale feature decoupling, and the specific operations are as follows:
First, the original gray-scale map $x_1$ from step 1 is downsampled into two gray-scale maps of different scales, $x_2$ and $x_3$, and a convolution operation is applied to each of the three maps to obtain the initial coloring features $F_1$, the initial semantic features $F_2$ and the initial layout features $F_3$, respectively.
Secondly, multi-scale feature decoupling is completed with these feature maps. Specifically, in multi-scale feature decoupling module I, feature $F_1$ is average-pooled to obtain the feature $\mathrm{AP}(F_1)$; subtracting $\mathrm{AP}(F_1)$ from feature $F_2$ yields the decoupled feature $D_a$. Similarly, feature $F_2$ is upsampled to obtain the feature $\mathrm{UP}(F_2)$; subtracting feature $F_1$ from $\mathrm{UP}(F_2)$ yields the decoupled feature $D_b$. Adding $D_a$ and $D_b$ (with $D_b$ average-pooled to the scale of $D_a$) yields the decoupled feature $D_{\mathrm{I}}$. Expressed as a formula:
$$D_{\mathrm{I}} = \left(F_2 - \mathrm{AP}(F_1)\right) + \mathrm{AP}\left(\mathrm{UP}(F_2) - F_1\right)$$
where $\mathrm{UP}(\cdot)$ denotes the upsampling operation applied to feature $F_2$ and $\mathrm{AP}(\cdot)$ denotes the average pooling operation applied to feature $F_1$.
In the same way, features $F_2$ and $F_3$ are decoupled by multi-scale feature decoupling module II, whose structure is identical to that of module I. Specifically, in multi-scale feature decoupling module II, feature $F_2$ is average-pooled to obtain the feature $\mathrm{AP}(F_2)$; subtracting $\mathrm{AP}(F_2)$ from feature $F_3$ yields the decoupled feature $D_c$. Similarly, feature $F_3$ is upsampled to obtain the feature $\mathrm{UP}(F_3)$; subtracting feature $F_2$ from $\mathrm{UP}(F_3)$ yields the decoupled feature $D_d$. Adding $D_c$ and $D_d$ (with $D_d$ average-pooled to the scale of $D_c$) yields the decoupled feature $D_{\mathrm{II}}$. Expressed as a formula:
$$D_{\mathrm{II}} = \left(F_3 - \mathrm{AP}(F_2)\right) + \mathrm{AP}\left(\mathrm{UP}(F_3) - F_2\right)$$
where $\mathrm{UP}(\cdot)$ denotes the upsampling operation applied to feature $F_3$ and $\mathrm{AP}(\cdot)$ denotes the average pooling operation applied to feature $F_2$.
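Under the reconstruction above, one decoupling module reduces to two resampled subtractions and an addition. The sketch below assumes a factor-2 scale ratio between the paired features; the function name, shapes and the final pooling that aligns the two difference maps are illustrative assumptions, not fixed by the claim:

```python
import torch
import torch.nn.functional as F

def decouple(f_fine, f_coarse):
    """One multi-scale feature decoupling module, following the reconstructed
    claim-2 formula D = (Fc - AP(Ff)) + AP(UP(Fc) - Ff). A factor-2 scale
    ratio between the two inputs is assumed for illustration."""
    d_a = f_coarse - F.avg_pool2d(f_fine, 2)                  # Fc - AP(Ff)
    d_b = F.interpolate(f_coarse, scale_factor=2.0) - f_fine  # UP(Fc) - Ff
    return d_a + F.avg_pool2d(d_b, 2)  # pool to the common scale, then add

# Module I: f_fine = F1 (large scale), f_coarse = F2 (mesoscale).
# Module II would be the same call with f_fine = F2 and f_coarse = F3.
f1 = torch.randn(1, 8, 64, 64)
f2 = torch.randn(1, 8, 32, 32)
d_I = decouple(f1, f2)  # (1, 8, 32, 32), fed to the semantic constraint module
```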
3. The layout-first multi-scale decoupling ocean remote sensing image coloring method according to claim 1, wherein the layout division module in step 2.2 comprises two operations: one is to extract semantic features through a U-Net network pre-trained for semantic segmentation tasks, and the other is to compute correlations and merge similar semantic regions accordingly to generate the layout division map.
4. The layout-first multi-scale decoupling ocean remote sensing image coloring method according to claim 3, wherein the layout division module in step 2.2 specifically operates as follows:
Firstly, a semantic segmentation map is generated by the pre-trained U-Net network, and the last-layer features $F_u$ of the pre-trained U-Net network are taken out. The values of $F_u$ at the positions corresponding to each semantic region of the semantic segmentation map are averaged to compute the centroid of each semantic region. In addition, $F_u$ is divided into $q^2/r^2$ blocks of size $r \times r$, where $q$ is the size of feature $F_u$ and $r$, a common factor of $q$, is the size of each block. Each block is represented by the centroid of the semantic region to which it belongs; taking the absolute values of the differences between the centroid of a block P and the centroids of its 8 surrounding blocks yields an 8-dimensional feature representation of block P.
Secondly, similar semantic regions are merged to generate the layout division features: if two adjacent blocks A and B do not belong to the same semantic region, the cosine similarity between them is calculated by the formula
$$S(A,B) = \frac{\sum_{z=1}^{8} A_z B_z}{\sqrt{\sum_{z=1}^{8} A_z^2}\,\sqrt{\sum_{z=1}^{8} B_z^2}}$$
where $A_z$ denotes the $z$-th dimension of the vector of block A, $B_z$ denotes the $z$-th dimension of the vector of block B, and $z$ indexes the dimensions of the 8-dimensional vectors; the similarity of blocks A and B is obtained by evaluating this cosine similarity. If the cosine similarity $S(A,B)$ is greater than a threshold $\tau$, the semantic regions to which the two blocks belong are merged to generate the layout division features. The specific merging method is: compute the mean $\bar{c}$ of the centroids of the two semantic regions and set the pixels contained in both semantic regions to $\bar{c}$, obtaining a feature that represents the layout division.
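As a worked illustration of the block representation and the merging test, the following sketch assumes scalar per-block centroids, skips border blocks for brevity, and uses an assumed threshold $\tau = 0.9$; none of these specifics are fixed by the claim:

```python
import numpy as np

def block_features(centroids):
    """8-D feature per block: absolute centroid differences to the 8 neighbours
    (claim 4). `centroids` holds, for each block, the scalar centroid of the
    semantic region the block belongs to; border blocks are skipped here."""
    rows, cols = centroids.shape
    feats = np.zeros((rows, cols, 8))
    offs = [(-1, -1), (-1, 0), (-1, 1), (0, -1),
            (0, 1), (1, -1), (1, 0), (1, 1)]
    for i in range(1, rows - 1):
        for j in range(1, cols - 1):
            feats[i, j] = [abs(centroids[i, j] - centroids[i + di, j + dj])
                           for di, dj in offs]
    return feats

def cosine_similarity(a, b, eps=1e-8):
    """The claim-4 similarity: sum_z A_z B_z / (||A|| * ||B||)."""
    return float(a @ b) / (np.linalg.norm(a) * np.linalg.norm(b) + eps)

centroids = np.random.rand(8, 8)   # q/r = 8 blocks per side (assumed)
feats = block_features(centroids)
s = cosine_similarity(feats[1, 1], feats[1, 2])
merge = s > 0.9                    # tau = 0.9 is an assumed threshold
```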
5. The layout-first multi-scale decoupling ocean remote sensing image coloring method according to claim 1, wherein the layout-first multi-scale feature fusion module in step 3 specifically operates as follows:
step 3.1, applying a Sigmoid function to the output features of the layout division module to strengthen weakly salient layout division features and prevent their loss;
step 3.2, multiplying the features processed in step 3.1 element-wise with the output features of the semantic constraint module, thereby generating layout-constrained semantic features;
step 3.3, applying a Tanh function to the layout-constrained semantic features obtained in step 3.2 to realize feature mapping, and then multiplying the result with the features generated by the image coloring module to constrain the final coloring.
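The three steps above amount to two gating operations. A minimal sketch, assuming all three feature maps have already been brought to one common shape (in the patent they come from different modules and scales):

```python
import torch

def layout_first_fusion(layout_feat, semantic_feat, coloring_feat):
    """Claim-5 gating chain over broadcast-compatible feature maps."""
    gate = torch.sigmoid(layout_feat)               # 3.1: strengthen weak layout cues
    constrained = gate * semantic_feat              # 3.2: layout-constrained semantics
    return torch.tanh(constrained) * coloring_feat  # 3.3: constrain the coloring

out = layout_first_fusion(torch.randn(1, 8, 32, 32),
                          torch.randn(1, 8, 32, 32),
                          torch.randn(1, 8, 32, 32))
```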
6. The layout-first multi-scale decoupling ocean remote sensing image coloring method according to claim 1, characterized in that the network is optimized through a game between the generator G and the discriminator D, specifically as follows:
First, the generator G is trained with the discriminator fixed, using the following formulas:
$$\mathcal{L}_{adv} = \mathbb{E}_{x_1,x_2,x_3\sim P(x_1,x_2,x_3)}\left[\log\left(1 - D(\hat{y})\right)\right] \tag{1}$$
In formula (1), $x_1$ is a large-scale sample, $x_2$ is a mesoscale sample, and $x_3$ is a small-scale sample; $\mathbb{E}_{x_1,x_2,x_3\sim P(x_1,x_2,x_3)}[\cdot]$ is the expected value of the bracketed term with respect to $x_1$, $x_2$ and $x_3$; $P(\cdot)$ represents a probability distribution; $\hat{y}=G(x_1,x_2,x_3)$ is the generated color image sample and $y$ is the original image; $D(\hat{y})$ denotes the discriminator D discriminating the generated image, i.e. the probability that the generated image is judged to be true. When the generator G is trained, the smaller $\log(1-D(\hat{y}))$ is, the better.
$$\mathcal{L}_{L1} = \lambda\,\mathbb{E}\left[\lVert y-\hat{y}\rVert_1\right] \tag{2}$$
In formula (2), $\mathcal{L}_{L1}$ is the L1 loss function, which drives color image generation by minimizing the sum of absolute differences between corresponding pixel values of the generated image and the real image, and $\lambda$ is a hyper-parameter.
$$\mathcal{L}_{ce} = -\sum_{i=1}^{m} y_2^{(i)}\log p_i \tag{3}$$
In formula (3), $\mathcal{L}_{ce}$ is the cross-entropy loss for the semantic segmentation map generated at the mesoscale, where $p_i$ is the output of the mesoscale network, i.e. the probability that the class is $i$; $i$ is the semantic class; $m$ is the total number of semantic classes; and $y_2$ is the ground truth of the mesoscale semantic segmentation result. From these terms, the objective function (4) is obtained:
$$\min_{\theta_1,\theta_2,\theta_3}\mathcal{L}_G(\theta_D,\theta_1,\theta_2,\theta_3) = \mathcal{L}_{adv} + \mathcal{L}_{L1} + \mathcal{L}_{ce} \tag{4}$$
In formula (4), $\theta_D$ denotes the model parameters of the discriminator D, $\theta_3$ the small-scale model parameters, $\theta_2$ the mesoscale model parameters, $\theta_1$ the large-scale model parameters, and $\mathcal{L}_G$ the loss of the generator G.
Then, the discriminator D is trained with the generator fixed; during the training of the discriminator D, the following formulas are applied:
$$\mathcal{L}_{real} = \mathbb{E}_{y\sim P(y)}\left[\log D(y_1)\right] \tag{5}$$
In formula (5), $\mathbb{E}_{y\sim P(y)}[\cdot]$ is the expected value of the bracketed term with respect to $y$; $D(y_1)$ denotes the discriminator D discriminating the real image $y_1$, i.e. the probability that the real image $y_1$ is judged to be true; the larger this probability, the better.
$$\mathcal{L}_{fake} = \mathbb{E}_{x_1,x_2,x_3\sim P(x_1,x_2,x_3)}\left[\log\left(1 - D(\hat{y})\right)\right] \tag{6}$$
In formula (6), $\mathbb{E}_{x_1,x_2,x_3\sim P(x_1,x_2,x_3)}[\cdot]$ is the expected value with respect to $x_1$, $x_2$ and $x_3$; $P(\cdot)$ represents a probability distribution; $D(\hat{y})$ denotes the discriminator discriminating the generated image, i.e. the probability that the generated image is judged to be true; the larger $\log(1-D(\hat{y}))$ is, the better. From these terms, the objective function (7) is obtained:
$$\max_{\theta_D}\mathcal{L}_D = \mathcal{L}_{real} + \mathcal{L}_{fake} \tag{7}$$
In formula (7), $\mathcal{L}_D$ is the loss of the discriminator D.
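Putting formulas (1) through (7) together, a hedged sketch of both objectives follows; the small epsilon terms for numerical stability and the value lam = 100 for the hyper-parameter $\lambda$ are assumptions, not values from the patent:

```python
import torch
import torch.nn.functional as F

def generator_loss(d_fake, fake, real, seg_logits, seg_gt, lam=100.0):
    """Formulas (1)-(4): adversarial term + lambda-weighted L1 + mesoscale
    cross-entropy. lam = 100 is an assumed value, not from the patent."""
    adv = torch.log(1.0 - d_fake + 1e-8).mean()   # (1): G wants D(fake) -> 1
    l1 = F.l1_loss(fake, real)                    # (2): pixel-wise |y - y_hat|
    ce = F.cross_entropy(seg_logits, seg_gt)      # (3): mesoscale segmentation
    return adv + lam * l1 + ce                    # (4)

def discriminator_loss(d_real, d_fake):
    """Formulas (5)-(7), negated so that minimizing this quantity maximizes
    the original objective."""
    return -(torch.log(d_real + 1e-8).mean()
             + torch.log(1.0 - d_fake + 1e-8).mean())

d_fake, d_real = torch.rand(4, 1), torch.rand(4, 1)  # D scores in (0, 1)
fake, real = torch.rand(4, 3, 64, 64), torch.rand(4, 3, 64, 64)
seg_logits = torch.randn(4, 5, 32, 32)               # m = 5 classes (assumed)
seg_gt = torch.randint(0, 5, (4, 32, 32))            # mesoscale truth y2
g = generator_loss(d_fake, fake, real, seg_logits, seg_gt)
d = discriminator_loss(d_real, d_fake)
```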
7. A layout-first multi-scale decoupling ocean remote sensing image coloring system, characterized in that it is used for implementing the layout-first multi-scale decoupling ocean remote sensing image coloring method according to any one of claims 1 to 6, and comprises an input module, a generator and a discriminator, wherein the input module inputs gray-scale maps of different scales into the generator; the generator comprises a multi-scale decoupling feature extraction module, a layout-first multi-scale feature fusion module and a final coloring module; and the final coloring module is used for generating the color image.
CN202211186781.9A 2022-09-28 2022-09-28 Layout-first multi-scale decoupling ocean remote sensing image coloring method and system Active CN115272529B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202211186781.9A CN115272529B (en) 2022-09-28 2022-09-28 Layout-first multi-scale decoupling ocean remote sensing image coloring method and system


Publications (2)

Publication Number Publication Date
CN115272529A CN115272529A (en) 2022-11-01
CN115272529B true CN115272529B (en) 2022-12-27

Family

ID=83757722


Country Status (1)

Country Link
CN (1) CN115272529B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116311253B (en) * 2023-05-18 2023-07-21 中国海洋大学 Ocean remote sensing image semantic segmentation method and system based on scale separation

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104050722A (en) * 2014-06-06 2014-09-17 北京航空航天大学 Indoor three-dimensional scene layout and color transfer generation method driven by image contents
CN107369158A (en) * 2017-06-13 2017-11-21 南京邮电大学 The estimation of indoor scene layout and target area extracting method based on RGB D images
CN114782568A (en) * 2022-03-10 2022-07-22 中国海洋大学 Multi-scale stage feature progressive fusion remote sensing image coloring method and system

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10657623B2 (en) * 2018-08-10 2020-05-19 Apple Inc. Two stage multi-scale processing of image data
US11386589B2 (en) * 2020-08-04 2022-07-12 Ping An Technology (Shenzhen) Co., Ltd. Method and device for image generation and colorization
CN114627006B (en) * 2022-02-28 2022-12-20 复旦大学 Progressive image restoration method based on depth decoupling network


Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
MIGN: Multiscale Image Generation Network for Remote Sensing Image Semantic Segmentation;Jie Nie et al.;《IEEE Transactions on Multimedia》;20220808;全文 *
Scanning Thermal Microscopy for Fast Multiscale Imaging and Manipulation;Rachel J. Cannara et al.;《IEEE Transactions on Nanotechnology》;20100318;全文 *
A color image representation method based on the non-symmetry and anti-packing model (in Chinese); Zheng Yunping et al.; Journal of Software (《软件学报》); 20071115 (No. 11); full text *
Portrait colorization based on jointly consistent cycle generative adversarial networks (in Chinese); Liu Changtong et al.; Computer Engineering and Applications (《计算机工程与应用》); 20200815 (No. 16); full text *


Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant