CN115018874A - Fundus blood vessel segmentation domain generalization method based on frequency domain analysis - Google Patents
- Publication number
- CN115018874A (application CN202210786220.6A)
- Authority
- CN
- China
- Prior art keywords
- fundus
- frequency domain
- blood vessel
- domain
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/10—Segmentation; Edge detection
- G06T7/136—Segmentation; Edge detection involving thresholding
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/10—Segmentation; Edge detection
- G06T7/168—Segmentation; Edge detection involving transform domain methods
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/10—Segmentation; Edge detection
- G06T7/194—Segmentation; Edge detection involving foreground-background segmentation
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16H—HEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
- G16H50/00—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
- G16H50/20—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for computer-aided diagnosis, e.g. based on medical expert systems
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20048—Transform domain processing
- G06T2207/20052—Discrete cosine transform [DCT]
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20081—Training; Learning
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20084—Artificial neural networks [ANN]
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/30—Subject of image; Context of image processing
- G06T2207/30004—Biomedical image processing
- G06T2207/30041—Eye; Retina; Ophthalmic
Abstract
A fundus blood vessel segmentation domain generalization method based on frequency domain analysis comprises the following steps: 1) preprocessing the fundus image, including numerical processing and size adjustment; 2) extracting features from the preprocessed fundus image, including the construction and training of a neural network model; 3) segmenting the fundus blood vessels and the retinal background with the trained model. The method adopts a feature normalization algorithm based on frequency domain analysis and, by learning a unified semantic frequency distribution space, strengthens the feature representation and generalization ability of the model. It effectively alleviates the data dependence of supervised learning and the accuracy drop that occurs when a segmentation algorithm is applied to data from different domains, thereby further improving the generalization ability of fundus blood vessel segmentation.
Description
Technical Field
The invention belongs to the field of medical image analysis and relates to domain generalization in fundus blood vessel segmentation. It uses a frequency domain analysis method based on the Discrete Cosine Transform (DCT) and is a deep learning technique trained on labeled data.
Background
The fundus is richly supplied with capillaries that nourish the eyeball. Modern imaging technology lets doctors directly observe the retinal vascular structure in fundus images and assess a patient's condition. For example, capillary hemangiomas, exudates and small hemorrhagic spots are often seen in the fundus images of diabetic patients, and arteriosclerosis can be observed in the fundus vessels of hypertensive patients. Lesions of the ocular vessels reflect the overall vascular condition to a certain extent, and doctors can analyze the type and severity of a disease through fundus examination. Fundus examination is therefore an important means of diagnosing fundus disease, and fundus vessel segmentation can provide a quantitative aid for disease analysis and diagnosis.
Fundus images are rich in detail, and relying on manual analysis places high demands on a doctor's observational skill and clinical experience. Computer-aided diagnosis can help physicians improve both the efficiency and the accuracy of diagnosis. Conventional computer-aided diagnosis techniques extract manually designed features with an algorithm, and the doctor makes a judgment based on those features. The rapid progress of deep learning in recent years has pushed computer visual recognition accuracy beyond human performance in certain domains. A trained deep neural network learns semantic information in the image by extracting features layer by layer and automatically outputs the fundus vessel segmentation result at the network output, significantly reducing doctors' workload and improving screening and diagnosis efficiency.
In practical applications, the samples in the training and test sets of a deep learning system differ in illumination, contrast, acquisition equipment and other aspects; that is, a domain shift arises between the source domain used for training and the target domain encountered at test time. This degrades the generalization performance of the model, and a deep neural network often fails to produce satisfactory vessel segmentation at the test stage. Moreover, a model with good generalization requires a large amount of diverse labeled data for training, yet fundus vessel segmentation datasets are difficult to collect and annotate, and public datasets typically suffer from small sample sizes and insufficient diversity.
Disclosure of Invention
To overcome the data dependence of supervised learning and the accuracy drop that occurs when a segmentation algorithm is applied to target-domain data, and to further improve the generalization ability of fundus blood vessel segmentation, the invention provides a fundus blood vessel segmentation domain generalization method based on frequency domain analysis.
The technical solution adopted by the invention to solve these problems is as follows:
a fundus blood vessel segmentation domain generalization method based on frequency domain analysis, the method comprising the steps of:
1) preprocessing the fundus image;
2) performing characteristic extraction on the preprocessed fundus image;
3) segmenting the blood vessels from the retinal background by thresholding the extracted features.
Further, the step 1) comprises the following steps:
11) augmenting the original image with random rotation, random horizontal flipping, random vertical flipping, and random brightness and saturation jitter, and enhancing image contrast with gamma (nonlinear) encoding;
12) mapping pixel values from 0-255 to 0-1 with min-max normalization;
13) cropping redundant pixels from the fundus image so that only the retinal area remains, then resizing the cropped image to a uniform 512 × 512 pixels by bilinear interpolation to ensure scale invariance of the input. To fully exploit the image information and introduce randomness, each 512 × 512 fundus image is cropped 64 times at random positions into 64 × 64 patches.
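The normalization and random cropping in steps 12)-13) can be sketched as follows. This is an illustrative NumPy sketch, not the patent's implementation; the gamma value, the random seed, and the single-channel input are assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

def preprocess(img, gamma=1.2, n_crops=64, crop=64):
    """Sketch of steps 11)-13): min-max normalization to [0, 1],
    gamma (nonlinear) encoding, and 64 random 64x64 crops."""
    x = img.astype(np.float32) / 255.0       # step 12: map 0-255 to 0-1
    x = np.power(x, 1.0 / gamma)             # step 11: gamma encoding (assumed gamma)
    h, w = x.shape[:2]
    patches = []
    for _ in range(n_crops):                 # step 13: random-position crops
        i = rng.integers(0, h - crop + 1)
        j = rng.integers(0, w - crop + 1)
        patches.append(x[i:i + crop, j:j + crop])
    return np.stack(patches)

img = rng.integers(0, 256, size=(512, 512), dtype=np.uint8)
patches = preprocess(img)
print(patches.shape)   # (64, 64, 64)
```

The geometric and photometric augmentations of step 11) would be applied before this, per image, with the same random-sampling pattern.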
Still further, the step 2) comprises the following steps:
21) constructing a neural network model, and taking the preprocessed data as the input of model training;
22) training the neural network and updating the model parameters with an optimization algorithm;
23) The predicted probabilities output by the model are used as features extracted from the input data.
Further, in the step 21), the process of network construction is as follows:
211) constructing a frequency domain-space domain normalization layer;
212) setting an encoder, wherein each layer of module consists of a residual block, a frequency domain-space domain normalization layer and a down-sampling layer;
213) setting a decoder, where each layer module consists of a residual block, a frequency domain-space domain normalization layer and an upsampling layer;
214) connecting the encoder and decoder with cross-layer skip connections.
Further, in the step 22), the training process of the network is as follows:
221) setting a loss function;
222) setting model parameters for initialization;
223) setting an optimizer.
The process of step 3) is as follows: the blood vessels and the retinal background are segmented according to the extracted features with 0.5 as the threshold; feature pixels above 0.5 are labeled as blood vessel features and feature pixels below 0.5 as fundus background features.
Compared with the prior art, the invention has the following beneficial effects: the fundus dataset is preprocessed, a model fusing frequency-domain and spatial-domain information is built, and the features of different domains are normalized to a unified latent space by introducing an attention mechanism. The method improves the robustness of the vessel segmentation model to perturbations of the data distribution and thereby achieves generalization across fundus images from different sources.
Drawings
Fig. 1 is a flowchart of a fundus blood vessel segmentation domain generalization method based on frequency domain analysis.
Fig. 2 is a schematic diagram of preprocessing of fundus images.
Fig. 3 is a schematic diagram of a frequency domain normalization module.
FIG. 4 is a schematic illustration of spatial domain normalization.
Fig. 5 is a schematic view of a fundus blood vessel segmentation model.
Fig. 6 shows the fundus blood vessel segmentation results, where (a) is the original image, (b) is the expert label, and (c) is the segmentation result.
Detailed Description
The present invention is further described below with reference to the flow chart.
Referring to fig. 1 to 5, a fundus blood vessel segmentation domain generalization method based on frequency domain analysis includes the following steps:
1) Referring to fig. 2, the fundus image is preprocessed as follows: 11) the original image is augmented with random rotation, random horizontal flipping, random vertical flipping, and random brightness and saturation jitter, and image contrast is enhanced with gamma (nonlinear) encoding; 12) pixel values are mapped from 0-255 to 0-1 with min-max normalization; 13) redundant pixels are cropped from the fundus image so that only the retinal area remains, and the cropped image is then resized to a uniform 512 × 512 pixels by bilinear interpolation to ensure scale invariance of the input. To fully exploit the image information and introduce randomness, each 512 × 512 fundus image is cropped 64 times at random positions into 64 × 64 patches;
2) Referring to figs. 3, 4 and 5, the blood vessels and the retinal background are segmented according to the extracted features, where feature extraction comprises: 21) constructing a neural network model and taking the preprocessed data as the input for model training; 22) training the neural network and updating the model parameters with an optimization algorithm; 23) taking the predicted probabilities output by the model as the features extracted from the input data;
the process of network construction is as follows:
211) With reference to figs. 2 and 3, a frequency domain-space domain normalization layer is constructed from a frequency domain normalization module and a space domain normalization module. The frequency domain normalization module decomposes the features with the DCT and introduces a learnable weight to obtain a normalized frequency-domain latent space; the space domain normalization module dynamically generates a scaling coefficient γ_s and a shift coefficient β_s from the input image to obtain normalized spatial-domain features.
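As a hedged illustration of step 211), the sketch below implements a DCT-based frequency normalization and a dynamic spatial normalization in NumPy. The per-frequency weight, the square feature-map size, and the way γ_s and β_s are applied are assumptions for illustration, since the patent does not give formulas.

```python
import numpy as np

rng = np.random.default_rng(1)

def dct_matrix(n):
    """Orthonormal DCT-II basis matrix (satisfies C @ C.T == I)."""
    k = np.arange(n)[:, None]
    i = np.arange(n)[None, :]
    c = np.sqrt(2.0 / n) * np.cos(np.pi * (2 * i + 1) * k / (2 * n))
    c[0] /= np.sqrt(2.0)
    return c

def freq_norm(feat, weight):
    """Hypothetical frequency normalization: DCT-decompose the feature
    map, rescale each frequency with a learnable weight, transform back."""
    c = dct_matrix(feat.shape[0])
    coeffs = c @ feat @ c.T              # 2-D DCT decomposition
    return c.T @ (coeffs * weight) @ c   # weighted inverse transform

def spatial_norm(feat, gamma_s, beta_s):
    """Hypothetical spatial normalization with dynamic scale and shift."""
    mu, sigma = feat.mean(), feat.std() + 1e-6
    return gamma_s * (feat - mu) / sigma + beta_s

feat = rng.standard_normal((64, 64))
out = freq_norm(feat, np.ones((64, 64)))   # identity weight
print(np.allclose(out, feat))              # True: orthonormal DCT round-trips
```

With an all-ones weight the transform is the identity; training would learn a non-uniform weight that emphasizes or suppresses particular frequency bands.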
212) With reference to fig. 5, an encoder is set; its purpose is to obtain latent-space features. Each layer module consists of a residual block, a frequency domain-space domain normalization layer and a downsampling layer;
213) The decoder is set with reference to fig. 5; its purpose is to reconstruct the segmentation image from the feature vector z. Each layer module consists of a residual block, a frequency domain-space domain normalization layer and an upsampling layer;
214) The encoder and decoder are connected with cross-layer skip connections; the purpose of this step is to fuse features of different levels and resolutions.
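The encoder-decoder wiring of steps 212)-214) can be illustrated with a minimal NumPy sketch. The choice of max pooling for downsampling, nearest-neighbour upsampling, and channel concatenation for the skip connection are assumptions for illustration; the patent does not fix these operators.

```python
import numpy as np

def downsample(x):
    """2x2 max pooling, standing in for the encoder's downsampling layer."""
    c, h, w = x.shape
    return x.reshape(c, h // 2, 2, w // 2, 2).max(axis=(2, 4))

def upsample(x):
    """Nearest-neighbour 2x upsampling, standing in for the decoder's layer."""
    return x.repeat(2, axis=1).repeat(2, axis=2)

enc = np.random.rand(8, 64, 64)              # encoder feature map (C, H, W)
bottleneck = downsample(enc)                 # (8, 32, 32)
dec = upsample(bottleneck)                   # back to (8, 64, 64)
fused = np.concatenate([dec, enc], axis=0)   # cross-layer skip: channel concat
print(fused.shape)                           # (16, 64, 64)
```

The skip connection lets the decoder combine coarse, semantically rich features with the encoder's full-resolution detail, which is important for thin vessel boundaries.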
Further, in the step 22), the training process of the network is as follows:
221) setting a loss function: the binary cross entropy between the network output features and the segmentation labels is computed;
222) setting the initial model parameters: the parameters of the fundus vessel segmentation network are randomly initialized with Kaiming initialization;
223) setting an optimizer: the Adam optimization algorithm is adopted with momentum 0.99 and a default learning rate of 0.002, and the network is trained iteratively, updating the parameters for 4000 rounds.
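The binary cross entropy of step 221) can be written out explicitly. This minimal NumPy version is a sketch of the quantity computed between the network's predicted probabilities and the labels; the clipping epsilon and the example values are assumptions.

```python
import numpy as np

def bce_loss(pred, target, eps=1e-7):
    """Binary cross entropy between predicted vessel probabilities
    and binary segmentation labels (step 221)."""
    p = np.clip(pred, eps, 1 - eps)   # avoid log(0)
    return float(-np.mean(target * np.log(p) + (1 - target) * np.log(1 - p)))

pred = np.array([0.9, 0.1, 0.8])      # hypothetical per-pixel probabilities
target = np.array([1.0, 0.0, 1.0])    # hypothetical vessel labels
print(round(bce_loss(pred, target), 4))   # 0.1446
```

In training this loss would be minimized over all pixels of each batch by the Adam optimizer configured in step 223).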
3) The blood vessels and the retinal background are segmented by thresholding the extracted features at 0.5: feature pixels above 0.5 are labeled as blood vessel features and feature pixels below 0.5 as fundus background features.
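The thresholding in step 3) amounts to a single comparison against 0.5; the probability values below are made up for illustration.

```python
import numpy as np

# Hypothetical 2x2 patch of predicted vessel probabilities from the model
probs = np.array([[0.91, 0.40],
                  [0.07, 0.63]])

vessel_mask = probs > 0.5   # step 3: True = vessel, False = retinal background
print(vessel_mask.astype(int))   # 1 = vessel pixel, 0 = background pixel
```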
The above description is intended to be illustrative of the preferred embodiment of the present invention and should not be taken as limiting the invention, but rather, the intention is to cover all modifications, equivalents, and alternatives falling within the spirit and scope of the invention.
Claims (6)
1. A fundus blood vessel segmentation domain generalization method based on frequency domain analysis is characterized by comprising the following steps:
1) preprocessing the fundus image;
2) performing characteristic extraction on the preprocessed fundus image;
3) segmenting the blood vessels and the retinal background according to the extracted features.
2. A fundus blood vessel segmentation domain generalization method based on frequency domain analysis according to claim 1, wherein said step 1) comprises the steps of:
11) enhancing the original image with random rotation, random horizontal flipping, random vertical flipping, and random brightness and saturation jitter, and enhancing image contrast with gamma (nonlinear) encoding;
12) mapping pixel values from 0-255 to 0-1 with min-max normalization;
13) cropping redundant pixels from the fundus image so that only the retinal area remains, and then applying bilinear interpolation to the cropped fundus image to ensure scale invariance of the input.
3. A fundus blood vessel segmentation domain generalization method based on frequency domain analysis according to claim 1 or 2, wherein said step 2) comprises the steps of:
21) constructing a neural network model, and taking the preprocessed data as the input of model training;
22) training a neural network, and updating model parameters through an algorithm;
23) the predicted probabilities output by the model are used as features extracted from the input data.
4. A fundus blood vessel segmentation domain generalization method based on frequency domain analysis according to claim 3 wherein in said step 21), the process of network construction is as follows:
211) constructing a frequency domain-space domain normalization layer;
212) setting an encoder, wherein each layer of module consists of a residual block, a frequency domain-space domain normalization layer and a down-sampling layer;
213) a decoder is arranged, and each layer module consists of a residual block, a frequency domain-space domain normalization layer and an upsampling layer;
214) the encoder and decoder are connected with cross-layer skip connections.
5. A fundus blood vessel segmentation domain generalization method based on frequency domain analysis according to claim 3 wherein in said step 22), the training process of the network is as follows:
221) setting a loss function;
222) setting model parameters for initialization;
223) setting an optimizer.
6. The fundus blood vessel segmentation domain generalization method based on frequency domain analysis according to claim 1 or 2, wherein the process of step 3) is: segmenting the blood vessels and the retinal background according to the extracted features with 0.5 as the threshold, marking feature pixels above 0.5 as blood vessel features and feature pixels below 0.5 as fundus background features.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210786220.6A CN115018874A (en) | 2022-07-04 | 2022-07-04 | Fundus blood vessel segmentation domain generalization method based on frequency domain analysis |
Publications (1)
Publication Number | Publication Date |
---|---|
CN115018874A true CN115018874A (en) | 2022-09-06 |
Family
ID=83079396
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202210786220.6A Pending CN115018874A (en) | 2022-07-04 | 2022-07-04 | Fundus blood vessel segmentation domain generalization method based on frequency domain analysis |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN115018874A (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN116071268A (en) * | 2023-03-01 | 2023-05-05 | 中国民用航空飞行学院 | Image illumination removal model based on contrast learning and training method thereof |
Legal Events
Date | Code | Title | Description
---|---|---|---
| PB01 | Publication | ||
| SE01 | Entry into force of request for substantive examination | ||