CN112258456B - Three-dimensional image segmentation method based on convolutional neural network supervision - Google Patents
- Publication number
- CN112258456B (application CN202011040012.9A)
- Authority
- CN
- China
- Prior art keywords
- neural network
- convolutional neural
- lstm
- layer
- convolution
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Classifications
- G06T7/0012—Biomedical image inspection
- G06N3/044—Recurrent networks, e.g. Hopfield networks
- G06N3/045—Combinations of networks
- G06N3/084—Backpropagation, e.g. using gradient descent
- G06T7/13—Edge detection
- G06T2207/10012—Stereo images
- G06T2207/10132—Ultrasound image
- G06T2207/20081—Training; Learning
- G06T2207/20084—Artificial neural networks [ANN]
- G06T2207/30068—Mammography; Breast
Abstract
The application discloses a three-dimensional image segmentation method based on convolutional neural network supervision. It combines a 3D U-Net under auxiliary supervision with a bidirectional convolutional recurrent neural network to construct a new ABVUS three-dimensional image segmentation network model architecture, and segments the three-dimensional tumor, gland layer, fat layer and subcutaneous tissue layer from a three-dimensional breast ultrasound image. The encoder of a convolutional neural network can suffer from gradient vanishing and performance degradation during training. Therefore, by adding deep boundary supervision to the encoder part of the convolutional neural network as an auxiliary boundary network, and by means of the boundary cues of the deep boundary supervision, the shallow layers of the network are trained more fully and gradient vanishing is avoided, while the learning capacity of the convolutional neural network is improved so that it better resists the noise of ultrasound images.
Description
Technical Field
The application relates to the field of ultrasonic imaging methods, in particular to a three-dimensional image segmentation method based on convolutional neural network supervision.
Background
The Automated Breast Volume Ultrasound System (ABVUS) is an emerging ultrasound technology; at present only three companies, Siemens of Germany, General Electric of the United States and the Shantou Institute of Ultrasonic Instruments (SIUI) of China, offer their own breast volume ultrasound imaging products. The application provides an artificial-intelligence segmentation scheme attached to the breast volume ultrasound imaging products of the Shantou Institute of Ultrasonic Instruments (SIUI).
High-quality segmentation of breast ultrasound medical images is of great significance for artificial-intelligence-aided diagnosis. Segmenting the three-dimensional tumor form from a three-dimensional breast ultrasound image helps doctors better judge the lesion, and accurate segmentation of the imaged tumor region also helps the clinician assess the surgical area. Meanwhile, breast density is regarded as one of the biomarkers of breast cancer risk; dividing and layering the three-dimensional breast image into gland layer, fat layer, subcutaneous tissue layer and so on can serve for breast density calculation and breast assessment, so the segmentation work has important clinical significance.
Disclosure of Invention
The application aims to provide a three-dimensional image segmentation method based on convolutional neural network supervision, so as to solve one or more technical problems in the prior art and at least provide a beneficial alternative or create useful conditions.
The application combines a 3D U-Net based on auxiliary supervision (3D Auxiliary Supervision U-Net) with a bidirectional convolutional recurrent neural network to construct a new ABVUS three-dimensional image segmentation network model architecture, and segments the three-dimensional tumor, gland layer, fat layer and subcutaneous tissue layer from a three-dimensional breast ultrasound image.
In order to achieve the above object, the present application provides a three-dimensional image segmentation method based on convolutional neural network supervision, the method comprising the steps of:
carrying out preliminary segmentation on the breast three-dimensional ultrasound image through a convolutional neural network to obtain a first segmentation atlas;
refining the first segmentation atlas through a recurrent neural network to obtain a second segmentation atlas;
wherein the first segmentation atlas and the second segmentation atlas are three-dimensional feature images of the breast three-dimensional ultrasound image.
Further, the breast three-dimensional ultrasound image is obtained by scanning the entire breast along a fixed trajectory with a full-automatic breast volume ultrasound imaging system (Automated Breast Volume Ultrasound System, ABVUS) from Siemens of Germany, General Electric of the United States, or the Shantou Institute of Ultrasonic Instruments (SIUI) of China; or it is read from an open medical database: LIDC-NLST (The Lung Image Database Consortium, a lung CT dataset), ADNI (Alzheimer's Disease Neuroimaging Initiative, a three-dimensional brain MR image dataset for Alzheimer's disease), BraTS (Brain Tumor Segmentation, a three-dimensional brain MR image dataset), or a CT/MR data source provided by the affiliated hospital of a medical university in Shandong.
Further, the convolutional neural network is a 3D U-Net composed of an encoder and a decoder, wherein the encoder analyzes the input data and learns features, and the decoder generates the segmentation result from the features output by the encoder; all operations are converted into 3D operations, such as 3D convolution, 3D max-pooling and 3D up-convolution layers. Characteristics of breast ultrasound images such as high noise, blurred edges, uneven gray values, and complex tumor shapes and tissue hierarchies increase the difficulty of breast tumor segmentation and layering.
Further, the deep boundary supervision network is formed by coupling a single convolution layer with a deconvolution layer, and is connected to the end of the encoder of the convolutional neural network to realize deep boundary supervision. Because the edge pixels and non-edge pixels of a breast ultrasound image are unevenly distributed, the auxiliary boundary network adopts a class-balanced cross entropy function to compute the loss function $\ell^{(m)}(W, w^{(m)})$ corresponding to the convolution layer of the $m$-th side branch; the optimized loss function is:

$$\ell^{(m)}(W, w^{(m)}) = -\frac{|E^{-}|}{|E|}\sum_{i \in E^{+}} \log \sigma\big(a_{i}^{(m)}\big) - \frac{|E^{+}|}{|E|}\sum_{i \in E^{-}} \log\big(1 - \sigma\big(a_{i}^{(m)}\big)\big)$$

where $W$ denotes the weight parameters of the neurons of the convolutional neural network and $w^{(m)}$ the parameters of the $m$-th side convolution layer; $\ell^{(m)}$ is the loss function of the convolution layer of the $m$-th connection; $E$ is the edge map set of the layering boundary, and $|E^{-}|$ and $|E^{+}|$ denote the sizes of the ground-truth non-edge and edge label sets, respectively; $\sigma(\cdot)$ is the sigmoid function and $\sigma(a_{i}^{(m)})$ is the activation value at pixel $i$. After each side output $\hat{A}^{(m)}$ of the convolutional neural network, an edge map prediction $\hat{E}^{(m)}$ is obtained, i.e. $\hat{E}^{(m)} = \sigma(\hat{A}^{(m)})$.

Finally, the function of the deep boundary supervision is realized by minimizing the loss corresponding to each layer; the total loss function of the convolutional neural network is:

$$L(E, X; W_{e}) = \sum_{p=1}^{Q} \ell\big(W, w^{(m)}; W_{e}^{p}\big) + \lambda \big\|W_{e}\big\|_{2}^{2}$$

where $X$ is the input volume, $W_{e}^{p}$ is the weight matrix parameter of the $p$-th convolution layer of the encoder, $1 \le p \le Q$, $Q$ is the total number of convolution layers in the encoder, $\ell(W, w^{(m)}; W_{e}^{p})$ denotes the label prediction loss of the $m$-th side layer, $\|\cdot\|_{2}$ denotes the L2 norm and $\lambda$ the regularization coefficient. By minimizing the total loss function $L(E, X; W_{e})$ with its boundary regularization term, the shallow network of the encoder part of the convolutional neural network can learn more useful features.
Further, the method of segmenting the first segmentation atlas through the recurrent neural network to obtain the second segmentation atlas is the accurate segmentation of the second stage: a 3D convolutional long short-term memory network (3D Convolution-LSTM, 3D C-LSTM) is constructed based on the bidirectional coding mechanism of the fully-connected long short-term memory network (FC-LSTM), so that the rich context information in the breast ultrasound 3D data is extracted for prediction enhancement and the boundary information is further refined, making the final segmentation edges and textures clear and continuous.
The FC-LSTM combines the long short-term memory network (LSTM) with a fully-connected network (FC); it fuses the probability volume of the preliminary segmentation result with the original ultrasound volume data, which effectively alleviates incomplete boundaries and serves for prediction enhancement. On the basis of acquiring more detailed boundary information, the FC-LSTM can therefore further refine the segmentation result. Although the FC-LSTM layer is powerful at handling temporal dependencies, it uses full connections in both the input-to-state and the state-to-state transitions and does not encode three-dimensional spatial information. Therefore, the LSTM is extended with 3D convolution to obtain the convolutional long short-term memory network 3D C-LSTM, which contains 3D convolution structures in both the input-to-state and the state-to-state transitions.
The construction method of the 3D C-LSTM comprises the following steps:
all inputs, cell states and gate operations of the fully-connected long short-term memory (FC-LSTM) framework are expanded into 3D tensors, 3D data is adopted directly as input, and the vector multiplications are replaced by convolution operations. The framework is defined as follows:

$$i_{t} = \sigma(W_{xi} * x_{t} + W_{hi} * H_{t-1} + W_{ci} \circ c_{t-1} + b_{i})$$
$$f_{t} = \sigma(W_{xf} * x_{t} + W_{hf} * H_{t-1} + W_{cf} \circ c_{t-1} + b_{f})$$
$$c_{t} = f_{t} \circ c_{t-1} + i_{t} \circ \tanh(W_{xc} * x_{t} + W_{hc} * H_{t-1} + b_{c})$$
$$o_{t} = \sigma(W_{xo} * x_{t} + W_{ho} * H_{t-1} + W_{co} \circ c_{t} + b_{o})$$
$$H_{t} = o_{t} \circ \tanh(c_{t})$$

Here $*$ denotes 3D convolution and $\circ$ the Hadamard (element-wise) product; $\sigma(\cdot)$ and $\tanh(\cdot)$ are the logistic sigmoid and hyperbolic tangent functions; $i_{t}$, $f_{t}$, $o_{t}$ and $c_{1}, \ldots, c_{t}$ are all 3D tensors; $x_{t}$ is the 3D data input of the network at time $t$; $H_{t}$ is the hidden-layer state and $H_{t-1}$ the hidden-layer state at the previous time step; $W_{xi}$, $W_{hi}$, $W_{ci}$, $W_{xf}$, $W_{hf}$, $W_{cf}$, $W_{xc}$, $W_{hc}$, $W_{xo}$, $W_{ho}$, $W_{co}$ are all weight matrices of the network; $b_{i}$, $b_{f}$, $b_{c}$, $b_{o}$ are all bias vectors.
Although the 3D C-LSTM can infer the output at the current time from the current input and the history information accumulated in the hidden state, to better recover the boundary information of breast ultrasound images the 3D C-LSTM is extended to a bidirectional convolutional LSTM (Bi-Directional Convolutional LSTM, BDC-LSTM) that works in two opposite directions by stacking two layers of 3D C-LSTM. Since the context information carried in the two layers is more detailed, the context information of the two layers is concatenated as the output, so that the structure can extract finer nonlinear features from the three-dimensional breast ultrasound image.
Further, because BDC-LSTM is difficult to tune in its early time steps and parameters, a suitable deep supervision strategy helps shorten the gradient back-propagation path while preserving the latent context information in the sequence. Therefore, a deep auxiliary supervision mechanism is added to the network structure to improve the training efficiency and generalization ability of the BDC-LSTM. An auxiliary loss function is attached after every $p$ BDC-LSTM sequences; the attached auxiliary loss functions are, in turn, $\ell_{0}, \ell_{1}, \ldots, \ell_{k}$, where $k$ is the number of auxiliary loss functions to be attached, and the final loss function is:

$$L(X, Y) = L_{N}(X, Y) + \sum_{n=0}^{k} \beta_{n}\,\ell_{n}(X, Y)$$

where $L_{N}$ is the main loss function of the BDC-LSTM, $\ell_{n}$ is the $n$-th auxiliary loss function, $X$ and $Y$ are the three-dimensional input and output respectively, $N$ is the number of BDC-LSTM sequences used, and $\beta_{n}$ is the weight of the $n$-th auxiliary loss in the final loss. Through the auxiliary loss functions, the BDC-LSTM obtains a faster convergence speed and a lower training error, and its generalization ability is improved at the same time.
The beneficial effects of the application are as follows: the application provides a three-dimensional image segmentation method based on convolutional neural network supervision, addressing the gradient-vanishing and performance-degradation problems that can occur when training the encoder of a convolutional neural network. By adding deep boundary supervision to the encoder part of the convolutional neural network as an auxiliary boundary network, and by means of the boundary cues of the deep boundary supervision, the shallow layers of the network are trained more fully and gradient vanishing is avoided, while the learning capacity of the convolutional neural network is improved so that it better resists the noise of ultrasound images.
Drawings
The above and other features of the present application will become more apparent from the detailed description of its embodiments given below in conjunction with the accompanying drawings, in which like reference characters designate like or similar elements. The drawings in the following description are merely some examples of the present application, and for those of ordinary skill in the art other drawings may be obtained from them without inventive effort. In the drawings:
FIG. 1 is a flow chart of image segmentation by an image segmentation model;
FIG. 2 is a flow chart for constructing an image segmentation model.
Detailed Description
The conception, specific structure and technical effects of the present application will be described clearly and completely below with reference to the embodiments and the accompanying drawings, so that its objects, aspects and effects may be fully understood. It should be noted that, in the absence of conflict, the embodiments of the present application and the features of the embodiments may be combined with each other.
The application provides a three-dimensional image segmentation method based on convolutional neural network supervision, which specifically comprises the following steps:
as shown in fig. 1, fig. 1 is a flowchart for segmenting an image by an image segmentation model; the convolution neural network and the circulation neural network are image segmentation models;
carrying out preliminary segmentation on the breast three-dimensional ultrasound image through the convolutional neural network to obtain a first segmentation atlas;
refining the first segmentation atlas through the recurrent neural network to obtain a second segmentation atlas;
wherein the first segmentation atlas and the second segmentation atlas are three-dimensional feature images of the breast three-dimensional ultrasound image.
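The two-stage data flow above can be sketched as follows. The stand-in functions are purely hypothetical placeholders (the patent's actual stages are the supervised 3D U-Net and the BDC-LSTM); only the shapes and value ranges of the first and second segmentation atlases are illustrated:

```python
import numpy as np

def stage1_coarse_segmentation(volume):
    """Placeholder for the 3D U-Net stage: maps a 3D ultrasound volume to a
    per-voxel probability map (the 'first segmentation atlas')."""
    # Hypothetical stand-in: squash intensities smoothly into (0, 1).
    return 1.0 / (1.0 + np.exp(-(volume - volume.mean())))

def stage2_refinement(volume, coarse_probs):
    """Placeholder for the BDC-LSTM stage: fuses the original volume with the
    coarse probabilities to produce the 'second segmentation atlas'."""
    # Hypothetical stand-in: average the two cues, preserving the shape.
    return 0.5 * (coarse_probs + 1.0 / (1.0 + np.exp(-(volume - volume.mean()))))

volume = np.random.rand(8, 16, 16)          # toy 3D breast-ultrasound volume
first_atlas = stage1_coarse_segmentation(volume)
second_atlas = stage2_refinement(volume, first_atlas)
assert first_atlas.shape == volume.shape and second_atlas.shape == volume.shape
```

Both atlases keep the dimensions of the input volume, matching the statement that they are three-dimensional feature images of the breast three-dimensional ultrasound image.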
As shown in fig. 2, fig. 2 is a flowchart of constructing the image segmentation model; the following is the flow of constructing the model structures of the convolutional neural network and the recurrent neural network of the image segmentation model:
further, the convolutional neural network is a 3D U-Net composed of an encoder and a decoder, wherein the encoder analyzes the input data and learns features, and the decoder generates the segmentation result from the features output by the encoder; all operations are converted into 3D operations, such as 3D convolution, 3D max-pooling and 3D up-convolution layers. Characteristics of breast ultrasound images such as high noise, blurred edges, uneven gray values, and complex tumor shapes and tissue hierarchies increase the difficulty of breast tumor segmentation and layering.
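As a small illustration of one of the 3D operations named above, here is a minimal NumPy sketch of 2×2×2 3D max-pooling over non-overlapping windows (an assumption for simplicity; the real 3D U-Net also uses learned 3D convolutions and up-convolutions):

```python
import numpy as np

def max_pool_3d(x, k=2):
    """Non-overlapping k x k x k 3D max-pooling, as used in a 3D U-Net encoder
    to halve each spatial dimension of a feature volume."""
    d, h, w = x.shape
    x = x[:d - d % k, :h - h % k, :w - w % k]   # crop to a multiple of k
    # Split each axis into (blocks, k) and take the max inside every block.
    return x.reshape(d // k, k, h // k, k, w // k, k).max(axis=(1, 3, 5))

x = np.arange(4 * 4 * 4, dtype=float).reshape(4, 4, 4)
y = max_pool_3d(x)
assert y.shape == (2, 2, 2)
assert y[0, 0, 0] == x[:2, :2, :2].max()
```

Each output voxel is the maximum of one 2×2×2 block, so a 4×4×4 volume reduces to 2×2×2.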
Further, the deep boundary supervision network is formed by coupling a single convolution layer with a deconvolution layer, and is connected to the end of the encoder of the convolutional neural network to realize deep boundary supervision. Because the edge pixels and non-edge pixels of a breast ultrasound image are unevenly distributed, the auxiliary boundary network adopts a class-balanced cross entropy function to compute the loss function $\ell^{(m)}(W, w^{(m)})$ corresponding to the convolution layer of the $m$-th side branch; the optimized loss function is:

$$\ell^{(m)}(W, w^{(m)}) = -\frac{|E^{-}|}{|E|}\sum_{i \in E^{+}} \log \sigma\big(a_{i}^{(m)}\big) - \frac{|E^{+}|}{|E|}\sum_{i \in E^{-}} \log\big(1 - \sigma\big(a_{i}^{(m)}\big)\big)$$

where $W$ denotes the weight parameters of the neurons of the convolutional neural network and $w^{(m)}$ the parameters of the $m$-th side convolution layer; $\ell^{(m)}$ is the loss function of the convolution layer of the $m$-th connection; $E$ is the edge map set of the layering boundary, and $|E^{-}|$ and $|E^{+}|$ denote the sizes of the ground-truth non-edge and edge label sets, respectively; $\sigma(\cdot)$ is the sigmoid function and $\sigma(a_{i}^{(m)})$ is the activation value at pixel $i$. After each side output $\hat{A}^{(m)}$ of the convolutional neural network, an edge map prediction $\hat{E}^{(m)}$ is obtained, i.e. $\hat{E}^{(m)} = \sigma(\hat{A}^{(m)})$.

Finally, the function of the deep boundary supervision is realized by minimizing the loss corresponding to each layer; the total loss function of the convolutional neural network is:

$$L(E, X; W_{e}) = \sum_{p=1}^{Q} \ell\big(W, w^{(m)}; W_{e}^{p}\big) + \lambda \big\|W_{e}\big\|_{2}^{2}$$

where $X$ is the input volume, $W_{e}^{p}$ is the weight matrix parameter of the $p$-th convolution layer of the encoder, $1 \le p \le Q$, $Q$ is the total number of convolution layers in the encoder, $\ell(W, w^{(m)}; W_{e}^{p})$ denotes the label prediction loss of the $m$-th side layer, $\|\cdot\|_{2}$ denotes the L2 norm and $\lambda$ the regularization coefficient. By minimizing the total loss function $L(E, X; W_{e})$ with its boundary regularization term, the shallow network of the encoder part of the convolutional neural network can learn more useful features.
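The class-balanced cross entropy described above can be sketched in NumPy as follows, assuming a binary edge mask and per-pixel sigmoid activations (the function and variable names are illustrative, not from the patent):

```python
import numpy as np

def class_balanced_bce(sigmoid_act, edge_mask, eps=1e-12):
    """Class-balanced cross entropy for edge supervision: the non-edge count
    |E-| weights the edge term and the edge count |E+| weights the non-edge
    term, so sparse edge pixels are not swamped by the background."""
    n_pos = edge_mask.sum()
    beta = (edge_mask.size - n_pos) / edge_mask.size        # |E-| / |E|
    pos_term = -beta * np.log(sigmoid_act[edge_mask == 1] + eps).sum()
    neg_term = -(1 - beta) * np.log(1 - sigmoid_act[edge_mask == 0] + eps).sum()
    return pos_term + neg_term

mask = np.zeros((8, 8))
mask[3, 3:6] = 1                        # a short, sparse "edge"
good = np.where(mask == 1, 0.9, 0.1)    # confident, mostly correct prediction
bad = np.full((8, 8), 0.5)              # uninformative prediction
assert class_balanced_bce(good, mask) < class_balanced_bce(bad, mask)
```

A confident, correct edge map yields a lower loss than an uninformative one, which is what the balancing weights are meant to preserve even when edges occupy only a few pixels.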
Further, the method of segmenting the first segmentation atlas through the recurrent neural network to obtain the second segmentation atlas is the accurate segmentation of the second stage: a 3D convolutional long short-term memory network (3D Convolution-LSTM, 3D C-LSTM) is constructed based on the bidirectional coding mechanism of the fully-connected long short-term memory network (FC-LSTM), so that the rich context information in the breast ultrasound 3D data is extracted for prediction enhancement and the boundary information is further refined, making the final segmentation edges and textures clear and continuous.
The FC-LSTM combines the long short-term memory network (LSTM) with a fully-connected network (FC); it fuses the probability volume of the preliminary segmentation result with the original ultrasound volume data, which effectively alleviates incomplete boundaries and serves for prediction enhancement. On the basis of acquiring more detailed boundary information, the FC-LSTM can therefore further refine the segmentation result. Although the FC-LSTM layer is powerful at handling temporal dependencies, it uses full connections in both the input-to-state and the state-to-state transitions and does not encode three-dimensional spatial information. Therefore, the LSTM is extended with 3D convolution to obtain the convolutional long short-term memory network 3D C-LSTM, which contains 3D convolution structures in both the input-to-state and the state-to-state transitions.
The construction method of the 3D C-LSTM comprises the following steps:
all inputs, cell states and gate operations of the fully-connected long short-term memory (FC-LSTM) framework are expanded into 3D tensors, 3D data is adopted directly as input, and the vector multiplications are replaced by convolution operations. The framework is defined as follows:

$$i_{t} = \sigma(W_{xi} * x_{t} + W_{hi} * H_{t-1} + W_{ci} \circ c_{t-1} + b_{i})$$
$$f_{t} = \sigma(W_{xf} * x_{t} + W_{hf} * H_{t-1} + W_{cf} \circ c_{t-1} + b_{f})$$
$$c_{t} = f_{t} \circ c_{t-1} + i_{t} \circ \tanh(W_{xc} * x_{t} + W_{hc} * H_{t-1} + b_{c})$$
$$o_{t} = \sigma(W_{xo} * x_{t} + W_{ho} * H_{t-1} + W_{co} \circ c_{t} + b_{o})$$
$$H_{t} = o_{t} \circ \tanh(c_{t})$$

Here $*$ denotes 3D convolution and $\circ$ the Hadamard (element-wise) product; $\sigma(\cdot)$ and $\tanh(\cdot)$ are the logistic sigmoid and hyperbolic tangent functions; $i_{t}$, $f_{t}$, $o_{t}$ and $c_{1}, \ldots, c_{t}$ are all 3D tensors; $x_{t}$ is the 3D data input of the network at time $t$; $H_{t}$ is the hidden-layer state and $H_{t-1}$ the hidden-layer state at the previous time step; $W_{xi}$, $W_{hi}$, $W_{ci}$, $W_{xf}$, $W_{hf}$, $W_{cf}$, $W_{xc}$, $W_{hc}$, $W_{xo}$, $W_{ho}$, $W_{co}$ are all weight matrices of the network; $b_{i}$, $b_{f}$, $b_{c}$, $b_{o}$ are all bias vectors.
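The gate equations above can be sketched in NumPy as follows; the naive single-channel "same"-padded convolution, the 3×3×3 kernel size and the random weights are assumptions for illustration only, not the patent's trained parameters:

```python
import numpy as np

def conv3d_same(x, k):
    """Naive 'same'-padded single-channel 3D convolution (correlation form,
    as in deep-learning convention) with a cubic kernel."""
    r = k.shape[0] // 2
    xp = np.pad(x, r)
    out = np.empty_like(x)
    for z, y, w in np.ndindex(x.shape):
        out[z, y, w] = (xp[z:z + k.shape[0], y:y + k.shape[1], w:w + k.shape[2]] * k).sum()
    return out

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def clstm_step(x_t, h_prev, c_prev, W, b):
    """One 3D C-LSTM step: '*' in the equations is 3D convolution, and the
    Hadamard product couples the peephole weights to the cell state."""
    i = sigmoid(conv3d_same(x_t, W["xi"]) + conv3d_same(h_prev, W["hi"]) + W["ci"] * c_prev + b["i"])
    f = sigmoid(conv3d_same(x_t, W["xf"]) + conv3d_same(h_prev, W["hf"]) + W["cf"] * c_prev + b["f"])
    c = f * c_prev + i * np.tanh(conv3d_same(x_t, W["xc"]) + conv3d_same(h_prev, W["hc"]) + b["c"])
    o = sigmoid(conv3d_same(x_t, W["xo"]) + conv3d_same(h_prev, W["ho"]) + W["co"] * c + b["o"])
    return o * np.tanh(c), c

rng = np.random.default_rng(1)
shape = (4, 4, 4)
W = {k: 0.1 * rng.standard_normal((3, 3, 3)) for k in ("xi", "hi", "xf", "hf", "xc", "hc", "xo", "ho")}
W.update({k: 0.1 * rng.standard_normal(shape) for k in ("ci", "cf", "co")})  # peephole weights
b = {k: 0.0 for k in ("i", "f", "c", "o")}
h, c = clstm_step(rng.standard_normal(shape), np.zeros(shape), np.zeros(shape), W, b)
assert h.shape == shape
```

Because the gates are convolutional, the hidden state keeps the 3D shape of the input volume, which is exactly the spatial information a fully-connected LSTM would discard.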
Although the 3D C-LSTM can infer the output at the current time from the current input and the history information accumulated in the hidden state, to better recover the boundary information of breast ultrasound images the 3D C-LSTM is extended to a bidirectional convolutional LSTM (Bi-Directional Convolutional LSTM, BDC-LSTM) that works in two opposite directions by stacking two layers of 3D C-LSTM. Since the context information carried in the two layers is more detailed, the context information of the two layers is concatenated as the output, so that the structure can extract finer nonlinear features from the three-dimensional breast ultrasound image.
Further, because BDC-LSTM is difficult to tune in its early time steps and parameters, a suitable deep supervision strategy helps shorten the gradient back-propagation path while preserving the latent context information in the sequence. Therefore, a deep auxiliary supervision mechanism is added to the network structure to improve the training efficiency and generalization ability of the BDC-LSTM. An auxiliary loss function is attached after every $p$ BDC-LSTM sequences; the attached auxiliary loss functions are, in turn, $\ell_{0}, \ell_{1}, \ldots, \ell_{k}$, where $k$ is the number of auxiliary loss functions to be attached, and the final loss function is:

$$L(X, Y) = L_{N}(X, Y) + \sum_{n=0}^{k} \beta_{n}\,\ell_{n}(X, Y)$$

where $L_{N}$ is the main loss function of the BDC-LSTM, $\ell_{n}$ is the $n$-th auxiliary loss function, $X$ and $Y$ are the three-dimensional input and output respectively, $N$ is the number of BDC-LSTM sequences used, and $\beta_{n}$ is the weight of the $n$-th auxiliary loss in the final loss. Through the auxiliary loss functions, the BDC-LSTM obtains a faster convergence speed and a lower training error, and its generalization ability is improved at the same time.
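A minimal sketch of combining the main loss with the weighted auxiliary losses, assuming scalar per-branch loss values and weights (names are illustrative):

```python
def deeply_supervised_loss(main_loss, aux_losses, betas):
    """Final loss = main BDC-LSTM loss plus beta-weighted auxiliary losses
    attached along the sequence (each branch shortens the gradient path)."""
    assert len(aux_losses) == len(betas)
    return main_loss + sum(b * l for b, l in zip(betas, aux_losses))

# Two auxiliary branches, each weighted by 0.3:
total = deeply_supervised_loss(1.0, [0.5, 0.25], [0.3, 0.3])  # 1.0 + 0.15 + 0.075
```

In training, each auxiliary branch would receive its own gradient directly, which is what yields the faster convergence and lower training error claimed above.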
Although the present application has been described in considerable detail with reference to certain embodiments, it is not intended to be limited to any such detail or embodiment or to any particular embodiment, and it should be interpreted so as to effectively cover the intended scope of the application. Furthermore, the foregoing describes the application in terms of embodiments foreseen by the inventors; insubstantial modifications of the application not presently foreseen may nonetheless represent equivalents thereof.
Claims (4)
1. A three-dimensional image segmentation method based on convolutional neural network supervision, which is characterized by comprising the following steps:
carrying out preliminary segmentation on the breast three-dimensional ultrasound image through a convolutional neural network to obtain a first segmentation atlas;
refining the first segmentation atlas through a recurrent neural network to obtain a second segmentation atlas;
the convolutional neural network is a 3D U-Net composed of an encoder and a decoder, wherein the encoder is used for analyzing the input data and learning features, and the decoder is used for generating the segmentation result according to the features output by the encoder;
the convolutional neural network is formed by coupling a single convolution layer and a deconvolution layer, and a class-balanced cross-entropy function is adopted for the auxiliary boundary network to compute the loss function l^(m)(W, w^(m)) corresponding to each convolution layer of the convolutional neural network; the loss function to be optimized is:
l^(m)(W, w^(m)) = -β Σ_{i∈E+} log σ(a_i^(m)) - (1-β) Σ_{i∈E-} log(1 - σ(a_i^(m)))
wherein W represents the weight parameters of the convolutional neural network neurons and w^(m) the parameters of the m-th convolution layer; l^(m)(W, w^(m)) is the loss function of the m-th connected convolution layer; E is the edge-mapping set of the hierarchical boundary; β = |E-|/|E| and 1-β = |E+|/|E|, where |E-| and |E+| denote the sizes of the ground-truth non-edge and edge label sets respectively; σ(a_i^(m)) is the activation value computed with the sigmoid function σ(·) at pixel i; after each side output of the convolutional neural network, the edge-map prediction Ê^(m) = σ(Â^(m)) is obtained, where Â^(m) is the side output of the convolutional neural network. Finally, the total loss function of the convolutional neural network is:
L(E, X; W_e) = Σ_{p=1}^{Q} l(W, w^(m); W_e^p) + λ‖W‖_2^2
wherein p indexes the network layers of the convolutional neural network, W_e^p is the weight-matrix parameter of the p-th layer, 1 ≤ p ≤ Q, Q is the total number of convolution layers in the encoder, l(W, w^(m); W_e^p) denotes the label-prediction loss of the m-th convolution layer, and ‖·‖_2 denotes the L2 norm. By minimizing the total loss function L(E, X; W_e) and regularizing the boundaries of the convolutional neural network, the shallow layers of the encoder part of the convolutional neural network learn more useful features.
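As an illustrative sketch of the class-balanced cross-entropy in claim 1 (the flat pixel-list interface and all names are assumptions, not from the patent): since edge pixels are far rarer than non-edge pixels, the positive term is weighted by β = |E-|/|E| and the negative term by 1-β.

```python
import math

def balanced_edge_loss(probs, labels):
    """Class-balanced binary cross-entropy over a flat list of pixels.

    probs  : sigmoid activations sigma(a_i) per pixel
    labels : 1 = edge pixel (E+), 0 = non-edge pixel (E-)
    """
    n_pos = sum(labels)
    n_neg = len(labels) - n_pos
    beta = n_neg / len(labels)          # edges are rare, so weight them up
    loss = 0.0
    for p, y in zip(probs, labels):
        if y == 1:
            loss -= beta * math.log(p)          # -beta * log sigma(a_i)
        else:
            loss -= (1.0 - beta) * math.log(1.0 - p)
    return loss
```

With a balanced toy input such as `balanced_edge_loss([0.9, 0.1], [1, 0])` both terms carry weight 0.5, reducing to ordinary averaged cross-entropy.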
2. The three-dimensional image segmentation method based on convolutional neural network supervision according to claim 1, wherein the breast three-dimensional ultrasonic image is obtained by scanning the whole breast along a fixed track with a fully automatic breast volume ultrasound imaging system, or by reading CT or MR data from one of the open medical databases LIDC, NLST, ADNI, or BraTS.
3. The three-dimensional image segmentation method based on convolutional neural network supervision according to claim 1, wherein the method for segmenting the first segmentation atlas with the recurrent neural network to obtain the second segmentation atlas comprises: performing accurate segmentation in a second stage, namely constructing a 3D convolutional long short-term memory network 3DC-LSTM based on a fully connected long short-term memory bidirectional encoding mechanism;
the construction method of the 3D C-LSTM comprises the following steps:
all inputs, neurons, and gate operations of the fully connected long short-term memory (FC-LSTM) framework are expanded into 3D tensors, 3D data are directly adopted as input, and vector multiplication is replaced by the convolution operation; the framework is defined as follows:
i_t = σ(x_t * W_xi + H_{t-1} * W_hi + W_ci ∘ c_{t-1} + b_i),
f_t = σ(x_t * W_xf + H_{t-1} * W_hf + W_cf ∘ c_{t-1} + b_f),
c_t = f_t ∘ c_{t-1} + i_t ∘ tanh(x_t * W_xc + H_{t-1} * W_hc + b_c),
o_t = σ(x_t * W_xo + H_{t-1} * W_ho + W_co ∘ c_t + b_o),
H_t = o_t ∘ tanh(c_t);
here σ(·) and tanh(·) are the logistic sigmoid and hyperbolic tangent functions, * denotes the convolution operation, and ∘ denotes the Hadamard (elementwise) product; i_t, f_t, o_t, c_1, …, c_t are all 3D tensors; the 3D data input to the network at time t is x_t; H_t is the hidden-layer state and H_{t-1} the hidden-layer state at the previous time; W_xi, W_hi, W_ci, W_xf, W_hf, W_cf, W_xc, W_hc, W_xo, W_ho, W_co are all weight matrices of the network; b_i, b_f, b_c, b_o are all bias vectors.
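A minimal sketch of one 3DC-LSTM time step following the gate equations above. The 3D convolution `*` is abstracted behind a caller-supplied `conv` function (the example below uses degenerate 1×1×1 convolutions, i.e. elementwise scaling, for brevity), and all names are illustrative, not from the patent:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def convlstm_step(x_t, h_prev, c_prev, W, b, conv):
    """One 3DC-LSTM time step per the gate equations in claim 3.

    conv(w, a) stands in for the 3D convolution '*'; the Hadamard products
    (the elementwise terms) are plain '*'. W is a dict of the weight tensors
    W_xi ... W_co; b is a dict of the bias tensors b_i ... b_o.
    """
    i_t = sigmoid(conv(W["xi"], x_t) + conv(W["hi"], h_prev) + W["ci"] * c_prev + b["i"])
    f_t = sigmoid(conv(W["xf"], x_t) + conv(W["hf"], h_prev) + W["cf"] * c_prev + b["f"])
    c_t = f_t * c_prev + i_t * np.tanh(conv(W["xc"], x_t) + conv(W["hc"], h_prev) + b["c"])
    o_t = sigmoid(conv(W["xo"], x_t) + conv(W["ho"], h_prev) + W["co"] * c_t + b["o"])
    h_t = o_t * np.tanh(c_t)
    return h_t, c_t
```

Because every gate is a 3D tensor of the same shape as the input sub-volume, the hidden state H_t preserves spatial structure across slices, which is what distinguishes a ConvLSTM from the fully connected FC-LSTM it extends.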
4. The three-dimensional image segmentation method based on convolutional neural network supervision according to claim 3, wherein the 3DC-LSTM is extended into a bidirectional convolutional neural network BDC-LSTM by stacking two layers of 3DC-LSTM operating in two opposite directions; an auxiliary loss function is added after every p BDC-LSTM sequences, the appended auxiliary loss functions being, in turn, l_0, l_1, …, l_k, where k is the number of auxiliary loss functions to be appended; the final loss function is: L(X, Y) = L_N(X, Y) + Σ_{n=0}^{k} β_n · L_n(X, Y), wherein L_N is the main loss function of BDC-LSTM, L_n is the n-th auxiliary loss function, X and Y are the three-dimensional input and output respectively, N is the number of BDC-LSTM sequences used, and β_n is the weight of the n-th auxiliary loss in the final loss.
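The bidirectional pass of claim 4 can be sketched as follows, with the two stacked 3DC-LSTM cells abstracted as step functions and the combination rule left pluggable; all names are assumptions for illustration:

```python
# Hedged sketch of BDC-LSTM: two 3DC-LSTM passes over the ordered slice
# sequence in opposite directions, with per-slice outputs combined (here by
# summation). step_fwd / step_bwd stand in for the two stacked cells; each
# takes (slice, state) and returns (output, new_state).

def bdc_lstm(slices, step_fwd, step_bwd, combine=lambda a, b: a + b):
    """slices: ordered list of 3D sub-volumes; returns combined outputs."""
    fwd, state = [], None
    for s in slices:                      # forward direction
        out, state = step_fwd(s, state)
        fwd.append(out)
    bwd, state = [], None
    for s in reversed(slices):            # reverse direction
        out, state = step_bwd(s, state)
        bwd.append(out)
    bwd.reverse()                         # realign with the slice order
    return [combine(f, b) for f, b in zip(fwd, bwd)]
```

With identity step functions, each output is simply the slice combined with itself, e.g. `bdc_lstm([1, 2, 3], lambda s, st: (s, st), lambda s, st: (s, st))` yields `[2, 4, 6]`; real cells would carry spatial state across slices in both directions.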
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011040012.9A CN112258456B (en) | 2020-09-28 | 2020-09-28 | Three-dimensional image segmentation method based on convolutional neural network supervision |
Publications (2)
Publication Number | Publication Date |
---|---|
CN112258456A CN112258456A (en) | 2021-01-22 |
CN112258456B true CN112258456B (en) | 2023-10-17 |
Family
ID=74234169
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202011040012.9A Active CN112258456B (en) | 2020-09-28 | 2020-09-28 | Three-dimensional image segmentation method based on convolutional neural network supervision |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN112258456B (en) |
Families Citing this family (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112801282A (en) * | 2021-03-24 | 2021-05-14 | 东莞中国科学院云计算产业技术创新与育成中心 | Three-dimensional image processing method, three-dimensional image processing device, computer equipment and storage medium |
CN113269691B (en) * | 2021-05-27 | 2022-10-21 | 北京卫星信息工程研究所 | SAR image denoising method for noise affine fitting based on convolution sparsity |
CN113538693B (en) * | 2021-07-06 | 2022-10-14 | 太原理工大学 | Microwave mammary gland image reconstruction method based on deep learning |
CN113837931A (en) * | 2021-09-27 | 2021-12-24 | 海南长光卫星信息技术有限公司 | Method and device for detecting transformation of remote sensing image, electronic equipment and storage medium |
CN114283164B (en) * | 2022-03-02 | 2022-06-14 | 华南理工大学 | Breast cancer pathological section image segmentation prediction system based on UNet3+ |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109410220A (en) * | 2018-10-16 | 2019-03-01 | 腾讯科技(深圳)有限公司 | Image partition method, device, computer equipment and storage medium |
CN109690554A (en) * | 2016-07-21 | 2019-04-26 | 西门子保健有限责任公司 | Method and system for the medical image segmentation based on artificial intelligence |
CN110414481A (en) * | 2019-08-09 | 2019-11-05 | 华东师范大学 | A kind of identification of 3D medical image and dividing method based on Unet and LSTM |
CN110516740A (en) * | 2019-08-28 | 2019-11-29 | 电子科技大学 | A kind of fault recognizing method based on Unet++ convolutional neural networks |
CN111179237A (en) * | 2019-12-23 | 2020-05-19 | 北京理工大学 | Image segmentation method and device for liver and liver tumor |
CN111242940A (en) * | 2020-01-19 | 2020-06-05 | 复旦大学 | Tongue image segmentation method based on weak supervised learning |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10922816B2 (en) * | 2018-08-27 | 2021-02-16 | Siemens Healthcare Gmbh | Medical image segmentation from raw data using a deep attention neural network |
Non-Patent Citations (6)
Title |
---|
A deep-learning based automatic pulmonary nodule detection system;Zhao, Yiyuan 等;《Proceedings of the SPIE》;1-8 * |
An RDAU-NET model for lesion segmentation in breast ultrasound images;Zhemin Zhuang 等;《PLOS ONE》;1-23 * |
Boundary Loss for Remote Sensing Imagery Semantic Segmentation;Alexey Bokhovkin 等;《arXiv:1905.07852》;1- * |
Combining Fully Convolutional and Recurrent Neural Networks for 3D Biomedical Image Segmentation;Jianxu Chen 等;《arXiv:1609.01006v2》;1-9 * |
Deep Image-to-Image Recurrent Network with Shape Basis Learning for Automatic Vertebra Labeling in Large-Scale 3D CT Volumes;Dong Yang 等;《Medical Image Computing and Computer Assisted Intervention − MICCAI 2017》;第10435卷;498-506 * |
Research on Left Atrium Segmentation Algorithms Based on Deep Learning; Dai Yangyang; China Master's Theses Full-text Database, Medicine & Health Sciences (No. 2); E062-257 *
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN112258456B (en) | Three-dimensional image segmentation method based on convolutional neural network supervision | |
Asgari Taghanaki et al. | Deep semantic segmentation of natural and medical images: a review | |
CN111192245B (en) | Brain tumor segmentation network and method based on U-Net network | |
CN112465827B (en) | Contour perception multi-organ segmentation network construction method based on class-by-class convolution operation | |
Biffi et al. | Explainable anatomical shape analysis through deep hierarchical generative models | |
CN110475505A (en) | Utilize the automatic segmentation of full convolutional network | |
Xing et al. | Lesion segmentation in ultrasound using semi-pixel-wise cycle generative adversarial nets | |
CN116309650B (en) | Medical image segmentation method and system based on double-branch embedded attention mechanism | |
Wang et al. | MMNet: A multi-scale deep learning network for the left ventricular segmentation of cardiac MRI images | |
Chen et al. | Generative adversarial U-Net for domain-free medical image augmentation | |
CN114972362A (en) | Medical image automatic segmentation method and system based on RMAU-Net network | |
Suganthi | Review of medical image synthesis using GAN techniques | |
CN115908463A (en) | 3D coronary artery image segmentation method based on semi-supervised consistency learning | |
CN117058307A (en) | Method, system, equipment and storage medium for generating heart three-dimensional nuclear magnetic resonance image | |
CN114387282A (en) | Accurate automatic segmentation method and system for medical image organs | |
Zhang et al. | Multi-modality reconstruction attention and difference enhancement network for brain mri image segmentation | |
Qiu et al. | A deep learning approach for segmentation, classification, and visualization of 3-D high-frequency ultrasound images of mouse embryos | |
Shoaib et al. | YOLO Object Detector and Inception-V3 Convolutional Neural Network for Improved Brain Tumor Segmentation. | |
Ruan et al. | An efficient tongue segmentation model based on u-net framework | |
Poonkodi et al. | 3D-MedTranCSGAN: 3D medical image transformation using CSGAN | |
CN110570416B (en) | Method for visualization and 3D printing of multi-modal cardiac images | |
CN115731444A (en) | Medical image fusion method based on artificial intelligence and superpixel segmentation | |
Mansouri Musolu et al. | Deep learning and its applications in medical imaging | |
Wang et al. | Ultrasonic breast tumor extraction based on adversarial mechanism and active contour | |
Song et al. | A survey of deep learning based methods in medical image processing |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||