CN114612662A - Polyp image segmentation method based on boundary guidance - Google Patents

Polyp image segmentation method based on boundary guidance

Info

Publication number
CN114612662A
CN114612662A (application CN202210228391.7A)
Authority
CN
China
Prior art keywords
boundary
layer
polyp
image segmentation
convolution
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202210228391.7A
Other languages
Chinese (zh)
Inventor
陈舒涵
陆露
徐秀奇
俞锦豪
陈泽宇
汤浩楠
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Yangzhou University
Original Assignee
Yangzhou University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Yangzhou University filed Critical Yangzhou University
Priority to CN202210228391.7A priority Critical patent/CN114612662A/en
Publication of CN114612662A publication Critical patent/CN114612662A/en
Pending legal-status Critical Current

Classifications

    • G06F18/214: Physics; Computing; Electric digital data processing; Pattern recognition; Analysing; Design or setup of recognition systems or techniques; Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • G06N3/045: Physics; Computing; Computing arrangements based on specific computational models; Computing arrangements based on biological models; Neural networks; Architecture, e.g. interconnection topology; Combinations of networks
    • G06N3/048: Physics; Computing; Computing arrangements based on specific computational models; Computing arrangements based on biological models; Neural networks; Architecture, e.g. interconnection topology; Activation functions
    • G06N3/084: Physics; Computing; Computing arrangements based on specific computational models; Computing arrangements based on biological models; Neural networks; Learning methods; Backpropagation, e.g. using gradient descent

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Artificial Intelligence (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Software Systems (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • Biophysics (AREA)
  • Biomedical Technology (AREA)
  • Mathematical Physics (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Biology (AREA)
  • Image Analysis (AREA)

Abstract

The invention discloses a polyp image segmentation method based on boundary guidance, which comprises the following steps: constructing a polyp image segmentation model; inputting the polyp image to be segmented into the polyp image segmentation model to obtain the region prediction maps P_R4–P_R1 and boundary prediction maps P_B4–P_B1 output at each side, and upsampling them to the same resolution as the polyp image to be segmented to obtain the upsampled region prediction maps P̂_R4–P̂_R1 and boundary prediction maps P̂_B4–P̂_B1; computing the errors between the upsampled prediction maps and the ground-truth annotation maps with a cross-entropy loss function, and back-propagating these errors to update the parameters of the polyp image segmentation model, thereby training it; inputting the polyp image to be segmented into the trained polyp image segmentation model, normalizing the region prediction map output by residual refinement decoder A to (0,1) through a Sigmoid layer, and outputting the final segmentation result after restoring the original image resolution. The invention greatly improves the accuracy of polyp image segmentation and achieves more accurate region prediction.

Description

Polyp image segmentation method based on boundary guidance
Technical Field
The invention relates to the technical field of polyp image segmentation, in particular to a polyp image segmentation method based on boundary guidance.
Background
In recent years, the incidence of colon cancer has risen rapidly, endangering human health. Colonoscopy is the principal means of detecting colon cancer, and early treatment is an effective way to improve patient recovery. Accurately segmenting colorectal polyps from colonoscopy images contributes to the clinical treatment of colon cancer. Traditional manual segmentation is procedurally complex, inefficient, and demands considerable skill from medical staff, so automatic segmentation of such images is the development trend in medical image applications. However, automatic polyp segmentation requires high accuracy, and the complex tissue structure, variable shape, blurred boundaries, and high similarity to the background of polyps make the segmentation task very challenging. Most traditional polyp segmentation methods rely on low-level features such as texture or on iteratively clustered superpixels. These have improved polyp image segmentation accuracy to a degree, but two problems remain: first, polyp boundary segmentation is still not accurate enough, since boundary localization does not account for the low contrast between polyp and background or for blurred and tortuous boundaries; second, adaptability to the multi-scale characteristics of polyp tissue is limited, polyps varying widely in size and shape, so the multi-scale feature extraction capability of the model should be improved. Current polyp image segmentation methods therefore still have certain limitations.
Disclosure of Invention
This section is for the purpose of summarizing some aspects of embodiments of the invention and to briefly introduce some preferred embodiments. In this section, as well as in the abstract and the title of the invention of this application, simplifications or omissions may be made to avoid obscuring the purpose of the section, the abstract and the title, and such simplifications or omissions are not intended to limit the scope of the invention.
The present invention has been made in view of the above-mentioned conventional problems.
In order to solve the above technical problems, the present invention provides the following technical solution, comprising: constructing a polyp image segmentation model; inputting the polyp image to be segmented into the polyp image segmentation model to obtain the region prediction maps P_R4–P_R1 and boundary prediction maps P_B4–P_B1 output at each side, and upsampling them to the same resolution as the polyp image to be segmented to obtain the upsampled region prediction maps P̂_R4–P̂_R1 and boundary prediction maps P̂_B4–P̂_B1; computing the errors between the upsampled prediction maps and the ground-truth annotation maps with a cross-entropy loss function, and back-propagating these errors to update the parameters of the polyp image segmentation model, thereby training it; and inputting the polyp image to be segmented into the trained polyp image segmentation model, normalizing the region prediction map output by residual refinement decoder A to (0,1) through a Sigmoid layer, and outputting the final segmentation result after restoring the original image resolution.
As a preferred embodiment of the boundary-guided polyp image segmentation method proposed by the present invention: the construction of the polyp image segmentation model comprises: using Res2Net-50 as the backbone network, whose stages layerB to layerE provide the side-output features F_t (t = 1, 2, 3, 4), which are respectively fed into residual refinement decoders A to C and the initial positioning decoder, where t denotes the side-output stage index of the backbone and t = 1 to 4 corresponds to layerB to layerE; detecting the initial coarse position of the polyp tissue through the initial positioning decoder, the resolution of whose output map is 1/32 of the original image; and repairing the target parts missing from the initial coarse position of the polyp tissue and refining the boundary details through residual refinement decoders A to C.
As a preferred embodiment of the boundary-guided polyp image segmentation method proposed by the present invention: the operation of the initial positioning decoder comprises the following steps: the initial positioning decoder feeds the input feature F_4 into a 1×1 convolution layer that reduces the number of channels to 64, yielding the input feature x_4; the convolution parameters are {2048, 64, 1×1}, where in {in_channel, out_channel, k×k} in_channel denotes the number of input channels, out_channel the number of output channels, and k the convolution kernel size, and the convolution layer is followed by a batch normalization layer BatchNorm and a nonlinear activation layer ReLU; the input feature x_4 is fed into the hierarchical split convolution HSC to obtain the polyp tissue boundary feature F_B4 = HSC(x_4); the number of channels is then reduced to 1 through a convolution layer with convolution parameters {64, 1, 3×3}, followed by BatchNorm and ReLU, giving the initial boundary prediction map of the initial positioning decoder, P_B4 = Conv3×3(F_B4), where HSC(·) denotes the hierarchical split convolution operation and Conv3×3(·) a 3×3 convolution layer; the polyp tissue boundary feature F_B4 is concatenated with the input feature x_4 along the channel dimension for boundary-to-region guided learning and fed into 4 convolution layers that learn region features, with convolution parameters {128, 64, 3×3}, {64, 64, 3×3} and {64, 1, 3×3}, the convolution layers being followed in turn by BatchNorm and ReLU, finally outputting the region prediction map of the initial positioning decoder, P_R4 = Convs(Cat(F_B4, x_4)), where Convs denotes the four successive convolution layers and Cat(·) concatenation along the channel dimension.
As a preferred embodiment of the boundary-guided polyp image segmentation method proposed by the present invention: the working process of residual refinement decoders A–C comprises the following steps: the input features F_1 to F_3 are fed into a 1×1 convolution layer that reduces the number of channels to 64, yielding the input features x_t (t = 1, 2, 3); the convolution parameters are {in_channel, 64, 1×1}, and the convolution layer is followed by a batch normalization layer BatchNorm and a nonlinear activation layer ReLU; the result is then sent to the boundary branch and the region branch respectively for residual correction.
As a preferred embodiment of the boundary-guided polyp image segmentation method proposed by the present invention: feeding into the boundary branch for residual correction comprises the following steps: in the boundary branch, the deep region prediction P_R(t+1) is upsampled to the same resolution as the input feature x_t and concatenated with x_t to obtain the boundary feature F_Bt = Cat(Up(P_R(t+1)), x_t); the boundary feature F_Bt is fed into a convolution layer that reduces the number of channels to 1, with convolution parameters {64, 1, 3×3} followed by BatchNorm and ReLU, yielding the boundary prediction map output by this decoder, P_Bt = Conv3×3(F_Bt), where Up(·) denotes the upsampling operation and Conv3×3(·) a 3×3 convolution layer.
As a preferred embodiment of the boundary-guided polyp image segmentation method proposed by the present invention: feeding into the region branch for residual correction comprises the following steps: in the region branch, the region feature F′_Rt is obtained through the reverse attention module and then concatenated with the boundary feature F_Bt along the channel dimension to obtain the region feature fused with boundary information, F_Rt = Cat(F′_Rt, F_Bt); residual learning is performed on F_Rt through four convolution layers with convolution parameters {128, 64, 3×3}, {64, 64, 3×3} and {64, 1, 3×3}, the first three followed by BatchNorm and ReLU and the last reducing the number of channels to 1; the result is added to the upsampled deep region prediction Up(P_R(t+1)) to obtain the region prediction map output by residual refinement decoders A–C, P_Rt = Convs(F_Rt) + Up(P_R(t+1)).
As a preferred embodiment of the boundary-guided polyp image segmentation method proposed by the present invention, the hierarchical split convolution HSC comprises:
Step 1: the input feature x_4 is divided into 6 split groups along the channel dimension, [x_1, x_2, x_3, x_4, x_5, x_6], where x_1–x_5 have 11 channels each and x_6 has 9 channels, specifically denoted as:
[x_1, x_2, x_3, x_4, x_5, x_6] = Split(x_4)
where Split(·) denotes splitting along the channel dimension;
Step 2: the first split group x_1 is fed into a dilated convolution with dilation rate R_1 to obtain the output feature d_1; the dilated convolution is followed by a batch normalization layer BatchNorm and a nonlinear activation layer ReLU, and the output feature d_1 is computed as:
d_1 = DConv_R1(x_1)
where DConv_R1(·) denotes a dilated convolution with dilation rate R_1; the dilation rates of the dilated convolutions used by the 1st to 5th split groups are R_k = [1, 2, 4, 6, 8] (k = 1, 2, 3, 4, 5);
Step 3: the output feature d_1 is divided into two groups along the channel dimension, denoted d_1,1 and d_1,2, with 6 and 5 channels respectively, specifically expressed as:
d_1,1, d_1,2 = Split(d_1)
d_1,2 is concatenated with the next split group x_2 along the channel dimension to obtain the new split group x′_2, specifically expressed as:
x′_2 = Cat(d_1,2, x_2)
x′_2 is fed into another dilated convolution with dilation rate R_2, followed by BatchNorm and ReLU, to obtain the output feature d_2 with 16 channels:
d_2 = DConv_R2(x′_2)
Step 4: the output feature d_2 is divided into two groups, denoted d_2,1 and d_2,2, with 8 channels each, specifically expressed as:
d_2,1, d_2,2 = Split(d_2)
d_2,2 is concatenated with the next split group x_3 along the channel dimension to obtain the new split group x′_3, specifically expressed as:
x′_3 = Cat(d_2,2, x_3)
x′_3 is fed into another dilated convolution with dilation rate R_3 to obtain the output feature d_3 with 19 channels:
d_3 = DConv_R3(x′_3)
Step 5: steps 2 to 3 are repeated until all split groups have been processed; finally all outputs are concatenated to restore the 64 channels, yielding the polyp tissue boundary feature:
F_B4 = Cat(d_1,1, d_2,1, …, d_5, x_6)
where x_6 denotes the last split group.
As a preferred embodiment of the boundary-guided polyp image segmentation method proposed by the present invention: the reverse attention module comprises: upsampling the deep region prediction P_R(t+1) to the same resolution as the input feature x_t (t = 1, 2, 3), feeding it into a Sigmoid layer to normalize it to (0,1), and subtracting the result from 1 to generate the reverse weight map w_t = 1 − Sigmoid(Up(P_R(t+1))); multiplying the reverse weight map w_t with each channel of the region feature x_t to obtain the weighted region feature F′_Rt, whose c-th channel is F′_Rt,c = w_t ⊙ x_t,c, where x_t,c denotes the c-th channel of the region feature x_t.
The invention has the following beneficial effects: by designing the initial positioning decoder and the residual refinement decoders, the invention greatly improves the accuracy of polyp image segmentation; meanwhile, the boundary features are used to guide residual learning in the region branch, which combines the position information of the boundary features with the semantic information extracted by the region branch itself, addressing the challenges posed by the irregular deformation of polyp tissue and its similarity to the background and achieving more accurate region prediction; in addition, a hierarchical split convolution HSC is designed and embedded in the boundary branch, whose output features occupy different channels and correspond to different receptive fields, allowing polyp boundary features at different scales to be captured and improving the adaptability of the model to multi-scale polyp tissue.
Drawings
Fig. 1 is a schematic diagram of the overall network architecture of the polyp image segmentation model in the boundary-guided polyp image segmentation method according to the first embodiment of the present invention;
Fig. 2 is a block diagram of the initial positioning decoder and residual refinement decoders A–C of the boundary-guided polyp image segmentation method according to the first embodiment of the present invention;
Fig. 3 is a block diagram of the hierarchical split convolution HSC of the boundary-guided polyp image segmentation method according to the first embodiment of the present invention;
Fig. 4 shows the output results of each decoder of the polyp image segmentation model in the boundary-guided polyp image segmentation method according to the second embodiment of the present invention.
Detailed Description
In order to make the aforementioned objects, features and advantages of the present invention comprehensible, specific embodiments accompanied with figures are described in detail below, and it is apparent that the described embodiments are a part of the embodiments of the present invention, not all of the embodiments. All other embodiments, which can be obtained by a person skilled in the art without making creative efforts based on the embodiments of the present invention, shall fall within the protection scope of the present invention.
In the following description, numerous specific details are set forth in order to provide a thorough understanding of the present invention, but the present invention may be practiced in other ways than those specifically described and will be readily apparent to those of ordinary skill in the art without departing from the spirit of the present invention, and therefore the present invention is not limited to the specific embodiments disclosed below.
Furthermore, reference herein to "one embodiment" or "an embodiment" means that a particular feature, structure, or characteristic described in connection with the embodiment is included in at least one implementation of the invention. The appearances of the phrase "in one embodiment" in various places in the specification are not necessarily all referring to the same embodiment, nor are separate or alternative embodiments mutually exclusive of other embodiments.
Moreover, in describing the embodiments of the present invention in detail, the cross-sectional views illustrating the device structure are, for convenience of illustration, partially enlarged and not drawn to general scale; the drawings are provided as examples only and should not limit the scope of the present invention. In addition, the three dimensions of length, width and depth should be included in actual fabrication.
Also in the description of the present invention, it should be noted that the terms "upper, lower, inner and outer" and the like indicate orientations or positional relationships based on the orientations or positional relationships shown in the drawings, which are only for convenience of description and simplification of description, but do not indicate or imply that the device or element referred to must have a specific orientation, be constructed and operated in a specific orientation, and thus, should not be construed as limiting the present invention. Furthermore, the terms first, second, or third are used for descriptive purposes only and are not to be construed as indicating or implying relative importance.
The terms "mounted, connected and connected" in the present invention are to be understood broadly, unless otherwise explicitly specified or limited, for example: can be fixedly connected, detachably connected or integrally connected; they may be mechanically, electrically, or directly connected, or indirectly connected through intervening media, or may be interconnected between two elements. The specific meanings of the above terms in the present invention can be understood in specific cases to those skilled in the art.
Example 1
Referring to fig. 1 to 3, a first embodiment of the present invention provides a method for segmenting a polyp image based on boundary guiding, including:
s1: and constructing a polyp image segmentation model.
Referring to fig. 1, the polyp image segmentation model includes the backbone stages layerA to layerE, residual refinement decoders A to C, and an initial positioning decoder.
(1) Res2Net-50 is used as the backbone network, whose stages layerB to layerE provide the side-output features F_t (t = 1, 2, 3, 4), which are respectively fed into residual refinement decoders A to C and the initial positioning decoder (as shown in fig. 2), where t denotes the side-output stage index of the backbone and t = 1 to 4 corresponds to layerB to layerE.
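For concreteness, the following is a minimal sketch of how the four side-output features could be extracted from a Res2Net-50 backbone. The timm model name, the 352×352 input size, and the use of features_only are illustrative assumptions, not details fixed by this disclosure.

```python
# Sketch: extract the side-output features F1-F4 from a Res2Net-50 backbone.
import timm
import torch

backbone = timm.create_model(
    "res2net50_26w_4s",        # assumed Res2Net-50 variant available in timm
    pretrained=False,
    features_only=True,
    out_indices=(1, 2, 3, 4),  # layerB-layerE, strides 4/8/16/32
)

x = torch.randn(1, 3, 352, 352)     # polyp image to be segmented (assumed size)
f1, f2, f3, f4 = backbone(x)        # F1..F4 with 256/512/1024/2048 channels
print([f.shape for f in (f1, f2, f3, f4)])
```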
(2) Detecting an initial coarse position of polyp tissue through an initial positioning decoder, wherein the resolution of an output result image of the initial positioning decoder is 1/32 of an original image;
the working process of the initial positioning decoder is specifically as follows:
firstly, input characteristic F is processed by initial positioning decoder4Feeding into a 1 × 1 convolution layer, reducing the number of channels to 64 to obtain the input feature x4The parameter of the convolutional layer is 2048, 64,1 × 1, wherein in _ channel of { in _ channel, out _ channel, k × k } represents the number of channels of the input feature, out _ channel represents the number of channels of the output feature, k represents the size of the convolutional kernel, and a batch normalization layer Batchnorm and a nonlinear active layer ReLU are attached after the convolutional layer;
will input the feature x4Input into a hierarchical fission convolution HSC (as shown in FIG. 3) to obtain the boundary characteristics of polyp tissue
Figure BDA0003539687620000071
Figure BDA0003539687620000072
Specifically, the working process of the hierarchical split convolution HSC is as follows:
Step 1: the input feature x_4 is divided into 6 split groups along the channel dimension, [x_1, x_2, x_3, x_4, x_5, x_6], where x_1–x_5 have 11 channels each and x_6 has 9 channels, specifically denoted as:
[x_1, x_2, x_3, x_4, x_5, x_6] = Split(x_4)
where Split(·) denotes splitting along the channel dimension;
Step 2: the first split group x_1 is fed into a dilated convolution with dilation rate R_1 to obtain the output feature d_1; the dilated convolution is followed by a batch normalization layer BatchNorm and a nonlinear activation layer ReLU, and the output feature d_1 is computed as:
d_1 = DConv_R1(x_1)
where DConv_R1(·) denotes a dilated convolution with dilation rate R_1; the dilation rates of the dilated convolutions used by the 1st to 5th split groups are R_k = [1, 2, 4, 6, 8] (k = 1, 2, 3, 4, 5);
Step 3: the output feature d_1 is divided into two groups along the channel dimension, denoted d_1,1 and d_1,2, with 6 and 5 channels respectively, specifically expressed as:
d_1,1, d_1,2 = Split(d_1)
d_1,2 is concatenated with the next split group x_2 along the channel dimension to obtain the new split group x′_2, specifically expressed as:
x′_2 = Cat(d_1,2, x_2)
x′_2 is fed into another dilated convolution with dilation rate R_2, followed by BatchNorm and ReLU, to obtain the output feature d_2 with 16 channels:
d_2 = DConv_R2(x′_2)
Step 4: the output feature d_2 is divided into two groups, denoted d_2,1 and d_2,2, with 8 channels each, specifically expressed as:
d_2,1, d_2,2 = Split(d_2)
d_2,2 is concatenated with the next split group x_3 along the channel dimension to obtain the new split group x′_3, specifically expressed as:
x′_3 = Cat(d_2,2, x_3)
x′_3 is fed into another dilated convolution with dilation rate R_3 to obtain the output feature d_3 with 19 channels:
d_3 = DConv_R3(x′_3)
Step 5: steps 2 to 3 are repeated until all split groups have been processed; finally all outputs are concatenated to restore the 64 channels, yielding the polyp tissue boundary feature:
F_B4 = Cat(d_1,1, d_2,1, …, d_5, x_6)
where x_6 denotes the last split group.
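A minimal PyTorch sketch of the hierarchical split convolution described above follows. The split sizes of d_3 and d_4 (10/9 and 10/10) are assumptions chosen so that the retained halves, d_5 and x_6 concatenate back to exactly 64 channels (6 + 8 + 10 + 10 + 21 + 9 = 64); the patent figure may fix these details differently.

```python
# Sketch: hierarchical split convolution (HSC) on a 64-channel feature map.
import torch
import torch.nn as nn


def dconv(cin, cout, rate):
    """3x3 dilated convolution followed by BatchNorm and ReLU."""
    return nn.Sequential(
        nn.Conv2d(cin, cout, 3, padding=rate, dilation=rate, bias=False),
        nn.BatchNorm2d(cout),
        nn.ReLU(inplace=True),
    )


class HSC(nn.Module):
    def __init__(self):
        super().__init__()
        self.groups = [11, 11, 11, 11, 11, 9]   # split of the 64 input channels
        rates = [1, 2, 4, 6, 8]                 # dilation rates R_1..R_5
        # input widths: x1, then (carried half + next group): 5+11, 8+11, 9+11, 10+11
        cins = [11, 16, 19, 20, 21]
        self.convs = nn.ModuleList(dconv(c, c, r) for c, r in zip(cins, rates))
        self.keep = [6, 8, 10, 10]              # d_{k,1} channels retained per stage

    def forward(self, x):                       # x: (B, 64, H, W)
        xs = list(torch.split(x, self.groups, dim=1))   # [x1 .. x6]
        kept, carry = [], None
        for k, conv in enumerate(self.convs):
            inp = xs[k] if carry is None else torch.cat([carry, xs[k]], dim=1)
            d = conv(inp)
            if k < len(self.keep):              # split d into kept / carried halves
                d1, carry = torch.split(
                    d, [self.keep[k], d.shape[1] - self.keep[k]], dim=1)
                kept.append(d1)
            else:
                kept.append(d)                  # d5 is kept whole
        kept.append(xs[-1])                     # append the last split group x6
        return torch.cat(kept, dim=1)           # restore the 64 channels
```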
Thirdly, the number of channels is reduced to 1 through a convolution layer with convolution parameters {64, 1, 3×3}, followed by a batch normalization layer BatchNorm and a nonlinear activation layer ReLU, to obtain the initial boundary prediction map of the initial positioning decoder, P_B4 = Conv3×3(F_B4), where HSC(·) denotes the hierarchical split convolution operation and Conv3×3(·) a 3×3 convolution layer;
Fourthly, the polyp tissue boundary feature F_B4 is concatenated with the input feature x_4 along the channel dimension for boundary-to-region guided learning and fed into 4 convolution layers that learn region features, with convolution parameters {128, 64, 3×3}, {64, 64, 3×3} and {64, 1, 3×3}, the convolution layers being followed in turn by BatchNorm and ReLU, finally outputting the region prediction map of the initial positioning decoder, P_R4 = Convs(Cat(F_B4, x_4)), where Convs denotes the four successive convolution layers and Cat(·) concatenation along the channel dimension.
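The whole initial positioning decoder can then be sketched as follows, reusing the HSC module from the previous sketch. Since only three parameter sets are listed for the four region-head layers, the repeated {64, 64, 3×3} layer is an assumption; the BatchNorm and ReLU after the boundary head follow the description, with the later Sigmoid handling normalization.

```python
# Sketch: initial positioning decoder (assumes the HSC class above is in scope).
import torch
import torch.nn as nn


def cbr(cin, cout, k):
    """k x k convolution followed by BatchNorm and ReLU."""
    return nn.Sequential(
        nn.Conv2d(cin, cout, k, padding=k // 2, bias=False),
        nn.BatchNorm2d(cout),
        nn.ReLU(inplace=True),
    )


class InitialPositioningDecoder(nn.Module):
    def __init__(self):
        super().__init__()
        self.reduce = cbr(2048, 64, 1)        # {2048, 64, 1x1}
        self.hsc = HSC()                      # hierarchical split convolution
        self.boundary_head = cbr(64, 1, 3)    # {64, 1, 3x3} -> P_B4
        self.region_head = nn.Sequential(     # {128,64,3x3}, {64,64,3x3}, {64,1,3x3}
            cbr(128, 64, 3), cbr(64, 64, 3), cbr(64, 64, 3),
            nn.Conv2d(64, 1, 3, padding=1),   # last layer reduces channels to 1
        )

    def forward(self, f4):                    # f4: (B, 2048, H/32, W/32)
        x4 = self.reduce(f4)
        fb4 = self.hsc(x4)                    # polyp tissue boundary features
        pb4 = self.boundary_head(fb4)         # initial boundary prediction map
        pr4 = self.region_head(torch.cat([fb4, x4], dim=1))  # region prediction map
        return pr4, pb4
```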
(3) The target parts missing from the initial coarse position of the polyp tissue are repaired and the boundary details refined through residual refinement decoders A–C.
The working process of residual refinement decoders A–C is specifically as follows: the input features F_1 to F_3 are fed into a 1×1 convolution layer that reduces the number of channels to 64, yielding the input features x_t (t = 1, 2, 3); the convolution parameters are {in_channel, 64, 1×1}, and the convolution layer is followed by a batch normalization layer BatchNorm and a nonlinear activation layer ReLU; the result is then sent to the boundary branch and the region branch respectively for residual correction.
Wherein, (1) the specific steps of feeding into the boundary branch for residual correction are as follows:
Firstly, in the boundary branch, the deep region prediction P_R(t+1) is upsampled to the same resolution as the input feature x_t and concatenated with x_t to obtain the boundary feature F_Bt = Cat(Up(P_R(t+1)), x_t);
Secondly, the boundary feature F_Bt is fed into a convolution layer that reduces the number of channels to 1, with convolution parameters {64, 1, 3×3} followed by BatchNorm and ReLU, yielding the boundary prediction map output by this decoder, P_Bt = Conv3×3(F_Bt), where Up(·) denotes the upsampling operation and Conv3×3(·) a 3×3 convolution layer.
(2) The specific steps of feeding into the region branch for residual correction are as follows:
Firstly, in the region branch, the region feature F′_Rt is obtained through the reverse attention module and then concatenated with the boundary feature F_Bt along the channel dimension to obtain the region feature fused with boundary information, F_Rt = Cat(F′_Rt, F_Bt).
Wherein, the working process of the reverse attention module is as follows: the deep region prediction P_R(t+1) is upsampled to the same resolution as the input feature x_t (t = 1, 2, 3), fed into a Sigmoid layer to be normalized to (0,1), and subtracted from 1 to generate the reverse weight map w_t = 1 − Sigmoid(Up(P_R(t+1))); the reverse weight map w_t is multiplied with each channel of the region feature x_t to obtain the weighted region feature F′_Rt, whose c-th channel is F′_Rt,c = w_t ⊙ x_t,c, where x_t,c denotes the c-th channel of the region feature x_t.
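As a minimal sketch, the reverse attention module reduces to a few lines; the bilinear upsampling mode is an assumption.

```python
# Sketch: reverse attention weighting of the region features.
import torch
import torch.nn.functional as F


def reverse_attention(deep_region_pred, x_t):
    """deep_region_pred: (B, 1, h, w) logits; x_t: (B, 64, H, W) features."""
    up = F.interpolate(deep_region_pred, size=x_t.shape[2:],
                       mode="bilinear", align_corners=False)
    weight = 1.0 - torch.sigmoid(up)   # reverse weight map, broadcast over channels
    return weight * x_t                # weighted region features F'_Rt
```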
Secondly, residual learning is performed on F_Rt through four convolution layers with convolution parameters {128, 64, 3×3}, {64, 64, 3×3} and {64, 1, 3×3}, the first three followed by BatchNorm and ReLU and the last reducing the number of channels to 1; the result is added to the upsampled deep region prediction Up(P_R(t+1)) to obtain the region prediction map output by residual refinement decoders A–C, P_Rt = Convs(F_Rt) + Up(P_R(t+1)).
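A sketch of one complete residual refinement decoder follows, reusing cbr() and reverse_attention() from the sketches above. Because concatenating the 1-channel upsampled prediction with the 64-channel x_t yields 65 channels while the stated boundary-branch parameters are {64, 1, 3×3}, an intermediate 3×3 convolution mapping 65 to 64 channels is assumed here so that the stated parameter sets ({64, 1, 3×3} and {128, 64, 3×3}) stay consistent.

```python
# Sketch: residual refinement decoder with boundary and region branches.
# Assumes cbr() and reverse_attention() from the earlier sketches are in scope.
import torch
import torch.nn as nn
import torch.nn.functional as F


class ResidualRefinementDecoder(nn.Module):
    def __init__(self, in_channels):              # 256 / 512 / 1024 for F1 / F2 / F3
        super().__init__()
        self.reduce = cbr(in_channels, 64, 1)      # {in_channel, 64, 1x1}
        self.boundary_feat = cbr(65, 64, 3)        # assumed: cat(up, x_t) -> 64 channels
        self.boundary_head = cbr(64, 1, 3)         # {64, 1, 3x3} -> P_Bt
        self.region_head = nn.Sequential(          # {128,64,3x3}, {64,64,3x3}, {64,1,3x3}
            cbr(128, 64, 3), cbr(64, 64, 3), cbr(64, 64, 3),
            nn.Conv2d(64, 1, 3, padding=1),        # last layer reduces channels to 1
        )

    def forward(self, f_t, deep_region_pred):
        x_t = self.reduce(f_t)
        up = F.interpolate(deep_region_pred, size=x_t.shape[2:],
                           mode="bilinear", align_corners=False)
        # boundary branch: fuse the upsampled deep prediction with x_t
        fb_t = self.boundary_feat(torch.cat([up, x_t], dim=1))
        pb_t = self.boundary_head(fb_t)            # boundary prediction map P_Bt
        # region branch: reverse attention, fuse with boundary features,
        # residual learning, then add back the upsampled deep prediction
        fr_t = reverse_attention(deep_region_pred, x_t)
        pr_t = self.region_head(torch.cat([fr_t, fb_t], dim=1)) + up
        return pr_t, pb_t
```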
S2: will be divided intoThe polyp image is input into a polyp image segmentation model, and a region prediction map P output from each side is obtainedR4-PR1And a boundary prediction map PB4-PB1And upsampling to the same resolution as the polyp image to be segmented to obtain a region prediction map
Figure BDA0003539687620000101
And boundary prediction map
Figure BDA0003539687620000102
S3: computing region prediction maps using cross entropy loss functions
Figure BDA0003539687620000103
Boundary prediction graph
Figure BDA0003539687620000104
And (4) errors with the real label graph are transmitted reversely to update the parameters of the polyp image segmentation model so as to train the polyp image segmentation model.
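A minimal training-step sketch for S2–S3 follows, assuming the model returns the side-output lists and that a boundary ground truth (e.g. derived from the mask edge) is available; equal loss weights across sides and binary cross-entropy with logits are assumptions.

```python
# Sketch: one training step with deep supervision on all side outputs.
import torch.nn.functional as F


def train_step(model, optimizer, image, region_gt, boundary_gt):
    """image: (B,3,H,W); region_gt / boundary_gt: (B,1,H,W) binary maps."""
    region_preds, boundary_preds = model(image)   # lists [P_R4..P_R1], [P_B4..P_B1]
    loss = 0.0
    for pr, pb in zip(region_preds, boundary_preds):
        # upsample each side output to the input resolution
        pr = F.interpolate(pr, size=image.shape[2:], mode="bilinear", align_corners=False)
        pb = F.interpolate(pb, size=image.shape[2:], mode="bilinear", align_corners=False)
        loss = loss + F.binary_cross_entropy_with_logits(pr, region_gt)
        loss = loss + F.binary_cross_entropy_with_logits(pb, boundary_gt)
    optimizer.zero_grad()
    loss.backward()          # back-propagate the errors to update the model
    optimizer.step()
    return loss.item()
```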
S4: and inputting the polyp image to be segmented into a trained polyp image segmentation model, normalizing the region prediction graph output by the residual refinement decoder A to (0,1) through a Sigmoid layer, and outputting a final segmentation result after restoring the resolution of the original image.
Example 2
To verify the technical effect of the boundary-guided polyp image segmentation adopted in this method, this embodiment compares the method against different existing methods in a contrast test and verifies its real effect by means of the test results.
In the prior art, polyp boundary segmentation is still not accurate enough: boundary localization does not account for the low contrast between polyp and background or for blurred and tortuous boundaries; in addition, the adaptability of the prior art to the multi-scale features of polyp tissue is limited, so existing polyp image segmentation techniques have certain limitations.
Compared with existing methods, the proposed method (Ours) performs better in locating and segmenting the polyp boundary and in extracting multi-scale features. In this embodiment, the existing technical solutions and the proposed method are each evaluated on four public polyp segmentation datasets: CVC-ClinicDB, CVC-ColonDB, ETIS-LaribPolypDB and Kvasir. The experiment uses four measures widely adopted in the field of object detection: the weighted similarity measure F_β, the pixel-level accuracy measure MAE, the enhanced alignment measure E_φ, and the structural similarity measure S_α between the prediction and the ground-truth map. The experimental results are shown in the following table.
Experimental results
[Results table: rendered as an image in the original publication; it reports S_α, E_φ, F_β and MAE for each method on the four datasets.]
(In the table, the best performance indicators are marked in grey.)
The first column lists the existing method models, and the last row of the first column is the method model proposed by the present invention.
The first row gives the dataset names, CVC-ClinicDB, CVC-ColonDB, ETIS-LaribPolypDB and Kvasir, containing 62, 380, 196 and 100 pictures respectively.
The second row gives the performance indicators: larger values of S_α, E_φ and F_β are better, and smaller values of MAE are better.
The result is that the network model proposed by the present invention achieves essentially the best performance on all four datasets.
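Of the four measures, MAE is simple enough to sketch directly; S_α, E_φ and the weighted F_β come from the salient object detection literature and require their reference implementations.

```python
# Sketch: pixel-level mean absolute error between prediction and ground truth.
import torch


def mae(pred, gt):
    """pred: predicted probability map in [0,1]; gt: binary ground-truth map."""
    return torch.mean(torch.abs(pred.float() - gt.float())).item()
```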
It should be recognized that embodiments of the present invention can be realized and implemented by computer hardware, a combination of hardware and software, or by computer instructions stored in a non-transitory computer readable memory. The methods may be implemented in a computer program using standard programming techniques, including a non-transitory computer-readable storage medium configured with the computer program, where the storage medium so configured causes a computer to operate in a specific and predefined manner, according to the methods and figures described in the detailed description. Each program may be implemented in a high level procedural or object oriented programming language to communicate with a computer system. However, the program(s) can be implemented in assembly or machine language, if desired. In any case, the language may be a compiled or interpreted language. Furthermore, the program can be run on a programmed application specific integrated circuit for this purpose.
Further, the operations of processes described herein can be performed in any suitable order unless otherwise indicated herein or otherwise clearly contradicted by context. The processes described herein (or variations and/or combinations thereof) may be performed under the control of one or more computer systems configured with executable instructions, and may be implemented as code (e.g., executable instructions, one or more computer programs, or one or more applications) collectively executed on one or more processors, by hardware, or combinations thereof. The computer program includes a plurality of instructions executable by one or more processors.
Further, the method may be implemented in any type of computing platform operatively connected to a suitable interface, including but not limited to a personal computer, mini computer, mainframe, workstation, networked or distributed computing environment, separate or integrated computer platform, or in communication with a charged particle tool or other imaging device, and the like. Aspects of the invention may be embodied in machine-readable code stored on a non-transitory storage medium or device, whether removable or integrated into a computing platform, such as a hard disk, optically read and/or write storage medium, RAM, ROM, or the like, such that it may be read by a programmable computer, which when read by the storage medium or device, is operative to configure and operate the computer to perform the procedures described herein. Further, the machine-readable code, or portions thereof, may be transmitted over a wired or wireless network. The invention described herein includes these and other different types of non-transitory computer-readable storage media when such media include instructions or programs that implement the steps described above in conjunction with a microprocessor or other data processor. The invention also includes the computer itself when programmed according to the methods and techniques described herein. A computer program can be applied to input data to perform the functions described herein to transform the input data to generate output data that is stored to non-volatile memory. The output information may also be applied to one or more output devices, such as a display. In a preferred embodiment of the invention, the transformed data represents physical and tangible objects, including particular visual depictions of physical and tangible objects produced on a display.
As used in this application, the terms "component," "module," "system," and the like are intended to refer to a computer-related entity, either hardware, firmware, a combination of hardware and software, or software in execution. For example, a component may be, but is not limited to being: a process running on a processor, an object, an executable, a thread of execution, a program, and/or a computer. By way of example, both an application running on a computing device and the computing device can be a component. One or more components can reside within a process and/or thread of execution and a component can be localized on one computer and/or distributed between two or more computers. In addition, these components can execute from various computer readable media having various data structures thereon. The components may communicate by way of local and/or remote processes such as in accordance with a signal having one or more data packets (e.g., data from one component interacting with another component in a local system, distributed system, and/or across a network such as the internet with other systems by way of the signal).
It should be noted that the above-mentioned embodiments are only for illustrating the technical solutions of the present invention and not for limiting, and although the present invention has been described in detail with reference to the preferred embodiments, it should be understood by those skilled in the art that modifications or equivalent substitutions may be made on the technical solutions of the present invention without departing from the spirit and scope of the technical solutions of the present invention, which should be covered by the claims of the present invention.

Claims (8)

1. A boundary-guided polyp image segmentation method, comprising:
constructing a polyp image segmentation model;
inputting the polyp image to be segmented into the polyp image segmentation model to obtain the region prediction maps P_R4–P_R1 and boundary prediction maps P_B4–P_B1 output at each side, and upsampling them to the same resolution as the polyp image to be segmented to obtain the upsampled region prediction maps P̂_R4–P̂_R1 and boundary prediction maps P̂_B4–P̂_B1;
computing the errors between the upsampled region prediction maps P̂_R4–P̂_R1 and boundary prediction maps P̂_B4–P̂_B1 and the ground-truth annotation maps with a cross-entropy loss function, and back-propagating these errors to update the parameters of the polyp image segmentation model, thereby training it;
and inputting the polyp image to be segmented into the trained polyp image segmentation model, normalizing the region prediction map output by residual refinement decoder A to (0,1) through a Sigmoid layer, and outputting the final segmentation result after restoring the original image resolution.
2. The boundary-guided polyp image segmentation method as set forth in claim 1, wherein constructing the polyp image segmentation model comprises:
using Res2Net-50 as the backbone network, whose stages layerB to layerE provide the side-output features F_t (t = 1, 2, 3, 4), which are respectively fed into residual refinement decoders A to C and the initial positioning decoder, where t denotes the side-output stage index of the backbone and t = 1 to 4 corresponds to layerB to layerE;
detecting the initial coarse position of the polyp tissue through the initial positioning decoder, the resolution of whose output map is 1/32 of the original image;
and repairing the target parts missing from the initial coarse position of the polyp tissue and refining the boundary details through residual refinement decoders A to C.
3. The boundary-guided polyp image segmentation method as set forth in claim 2, wherein the operation of the initial positioning decoder comprises:
feeding the input feature F_4 into a 1×1 convolution layer that reduces the number of channels to 64, yielding the input feature x_4, the convolution parameters being {2048, 64, 1×1}, where in {in_channel, out_channel, k×k} in_channel denotes the number of input channels, out_channel the number of output channels, and k the convolution kernel size, the convolution layer being followed by a batch normalization layer BatchNorm and a nonlinear activation layer ReLU;
feeding the input feature x_4 into the hierarchical split convolution HSC to obtain the polyp tissue boundary feature F_B4 = HSC(x_4), then reducing the number of channels to 1 through a convolution layer with convolution parameters {64, 1, 3×3}, followed by BatchNorm and ReLU, to obtain the initial boundary prediction map of the initial positioning decoder, P_B4 = Conv3×3(F_B4), where HSC(·) denotes the hierarchical split convolution operation and Conv3×3(·) a 3×3 convolution layer;
concatenating the polyp tissue boundary feature F_B4 with the input feature x_4 along the channel dimension for boundary-to-region guided learning, feeding the result into 4 convolution layers that learn region features, with convolution parameters {128, 64, 3×3}, {64, 64, 3×3} and {64, 1, 3×3}, the convolution layers being followed in turn by BatchNorm and ReLU, and finally outputting the region prediction map of the initial positioning decoder, P_R4 = Convs(Cat(F_B4, x_4)), where Convs denotes the four successive convolution layers and Cat(·) concatenation along the channel dimension.
4. The boundary-guided polyp image segmentation method as set forth in claim 2 or 3, wherein the operation of residual refinement decoders A–C comprises:
feeding the input features F_1 to F_3 into a 1×1 convolution layer that reduces the number of channels to 64, yielding the input features x_t (t = 1, 2, 3), the convolution parameters being {in_channel, 64, 1×1}, the convolution layer being followed by a batch normalization layer BatchNorm and a nonlinear activation layer ReLU, the result then being sent to the boundary branch and the region branch respectively for residual correction.
5. The boundary-guided polyp image segmentation method as set forth in claim 4, wherein feeding into the boundary branch for residual correction comprises:
in the boundary branch, upsampling the deep region prediction P_R(t+1) to the same resolution as the input feature x_t and concatenating it with x_t to obtain the boundary feature F_Bt = Cat(Up(P_R(t+1)), x_t);
feeding the boundary feature F_Bt into a convolution layer that reduces the number of channels to 1, with convolution parameters {64, 1, 3×3} followed by BatchNorm and ReLU, to obtain the boundary prediction map output by the decoder, P_Bt = Conv3×3(F_Bt), where Up(·) denotes the upsampling operation and Conv3×3(·) a 3×3 convolution layer.
6. The boundary-guided polyp image segmentation method as set forth in claim 5, wherein feeding into the region branch for residual correction comprises:
in the region branch, obtaining the region feature F′_Rt through the reverse attention module and concatenating it with the boundary feature F_Bt along the channel dimension to obtain the region feature fused with boundary information, F_Rt = Cat(F′_Rt, F_Bt);
performing residual learning on F_Rt through four convolution layers with convolution parameters {128, 64, 3×3}, {64, 64, 3×3} and {64, 1, 3×3}, the first three followed by BatchNorm and ReLU and the last reducing the number of channels to 1, and adding the result to the upsampled deep region prediction Up(P_R(t+1)) to obtain the region prediction map output by residual refinement decoders A–C, P_Rt = Convs(F_Rt) + Up(P_R(t+1)).
7. The boundary-guided polyp image segmentation method as set forth in claim 5 or 6, wherein the hierarchical split convolution HSC comprises:
step 1: dividing the input feature x_4 into 6 split groups along the channel dimension, [x_1, x_2, x_3, x_4, x_5, x_6], where x_1–x_5 have 11 channels each and x_6 has 9 channels, specifically denoted as:
[x_1, x_2, x_3, x_4, x_5, x_6] = Split(x_4)
where Split(·) denotes splitting along the channel dimension;
step 2: feeding the first split group x_1 into a dilated convolution with dilation rate R_1 to obtain the output feature d_1, the dilated convolution being followed by a batch normalization layer BatchNorm and a nonlinear activation layer ReLU, the output feature d_1 being computed as:
d_1 = DConv_R1(x_1)
where DConv_R1(·) denotes a dilated convolution with dilation rate R_1, the dilation rates of the dilated convolutions used by the 1st to 5th split groups being R_k = [1, 2, 4, 6, 8] (k = 1, 2, 3, 4, 5);
step 3: dividing the output feature d_1 into two groups along the channel dimension, denoted d_1,1 and d_1,2, with 6 and 5 channels respectively, specifically expressed as:
d_1,1, d_1,2 = Split(d_1)
concatenating d_1,2 with the next split group x_2 along the channel dimension to obtain the new split group x′_2, specifically expressed as:
x′_2 = Cat(d_1,2, x_2)
feeding x′_2 into another dilated convolution with dilation rate R_2, followed by BatchNorm and ReLU, to obtain the output feature d_2 with 16 channels:
d_2 = DConv_R2(x′_2)
step 4: dividing the output feature d_2 into two groups, denoted d_2,1 and d_2,2, with 8 channels each, specifically expressed as:
d_2,1, d_2,2 = Split(d_2)
concatenating d_2,2 with the next split group x_3 along the channel dimension to obtain the new split group x′_3, specifically expressed as:
x′_3 = Cat(d_2,2, x_3)
feeding x′_3 into another dilated convolution with dilation rate R_3 to obtain the output feature d_3 with 19 channels:
d_3 = DConv_R3(x′_3)
step 5: repeating steps 2 to 3 until all split groups have been processed, and finally concatenating all outputs to restore the 64 channels, obtaining the polyp tissue boundary feature:
F_B4 = Cat(d_1,1, d_2,1, …, d_5, x_6)
where x_6 denotes the last split group.
8. The boundary-guided polyp image segmentation method as set forth in claim 7, wherein the reverse attention module comprises:
upsampling the deep region prediction P_R(t+1) to the same resolution as the input feature x_t (t = 1, 2, 3), feeding it into a Sigmoid layer to normalize it to (0,1), and subtracting the result from 1 to generate the reverse weight map w_t = 1 − Sigmoid(Up(P_R(t+1)));
multiplying the reverse weight map w_t with each channel of the region feature x_t to obtain the weighted region feature F′_Rt, whose c-th channel is F′_Rt,c = w_t ⊙ x_t,c, where x_t,c denotes the c-th channel of the region feature x_t.
CN202210228391.7A 2022-03-10 2022-03-10 Polyp image segmentation method based on boundary guidance Pending CN114612662A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210228391.7A CN114612662A (en) 2022-03-10 2022-03-10 Polyp image segmentation method based on boundary guidance

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210228391.7A CN114612662A (en) 2022-03-10 2022-03-10 Polyp image segmentation method based on boundary guidance

Publications (1)

Publication Number Publication Date
CN114612662A (en) 2022-06-10

Family

ID=81861910

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210228391.7A Pending CN114612662A (en) 2022-03-10 2022-03-10 Polyp image segmentation method based on boundary guidance

Country Status (1)

Country Link
CN (1) CN114612662A (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117197166A (en) * 2023-11-06 2023-12-08 中南大学 Polyp image segmentation method and imaging method based on edge and neighborhood information
CN117197166B (en) * 2023-11-06 2024-02-06 中南大学 Polyp image segmentation method and imaging method based on edge and neighborhood information
CN117830226A (en) * 2023-12-05 2024-04-05 广州恒沙云科技有限公司 Boundary constraint-based polyp segmentation method and system

Similar Documents

Publication Publication Date Title
CN113674253B (en) Automatic segmentation method for rectal cancer CT image based on U-transducer
WO2022001623A1 (en) Image processing method and apparatus based on artificial intelligence, and device and storage medium
CN111028242A (en) Automatic tumor segmentation system and method and electronic equipment
CN112102321A (en) Focal image segmentation method and system based on deep convolutional neural network
CN114612662A (en) Polyp image segmentation method based on boundary guidance
CN112288731A (en) Blood vessel stenosis analysis method and device
CN112884788B (en) Cup optic disk segmentation method and imaging method based on rich context network
CN113205524B (en) Blood vessel image segmentation method, device and equipment based on U-Net
CN113436173A (en) Abdomen multi-organ segmentation modeling and segmentation method and system based on edge perception
CN117132774B (en) Multi-scale polyp segmentation method and system based on PVT
CN112396605A (en) Network training method and device, image recognition method and electronic equipment
CN112561877B (en) Multi-scale double-channel convolution model training method, image processing method and device
Ta et al. Ble-net: boundary learning and enhancement network for polyp segmentation
CN112529930A (en) Context learning medical image segmentation method based on focus fusion
CN115512110A (en) Medical image tumor segmentation method related to cross-modal attention mechanism
CN112037212A (en) Pulmonary tuberculosis DR image identification method based on deep learning
CN116563533A (en) Medical image segmentation method and system based on target position priori information
CN113435469A (en) Kidney tumor enhanced CT image automatic identification system based on deep learning and training method thereof
CN116912829A (en) Small air channel segmentation method and device, electronic equipment and nonvolatile storage medium
CN113379770B (en) Construction method of nasopharyngeal carcinoma MR image segmentation network, image segmentation method and device
CN113989269B (en) Traditional Chinese medicine tongue image tooth trace automatic detection method based on convolutional neural network multi-scale feature fusion
CN115760868A (en) Colorectal and colorectal cancer segmentation method, system, device and medium based on topology perception
CN114998582A (en) Coronary artery blood vessel segmentation method, device and storage medium
CN114882282A (en) Neural network prediction method for colorectal cancer treatment effect based on MRI and CT images
CN113205454A (en) Segmentation model establishing and segmenting method and device based on multi-scale feature extraction

Legal Events

PB01: Publication
SE01: Entry into force of request for substantive examination