CN109949302A - Pixel-based retinal feature structure segmentation method - Google Patents
Pixel-based retinal feature structure segmentation method
- Publication number
- CN109949302A (application number CN201910235344.3A)
- Authority
- CN
- China
- Prior art keywords
- feature
- retinal
- pixel
- network
- image
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Landscapes
- Eye Examination Apparatus (AREA)
- Image Analysis (AREA)
Abstract
The present invention relates to a pixel-based retinal feature structure segmentation method, comprising: 1) acquiring color fundus images; 2) annotating the feature structures in the acquired images, separately labeling blood vessels, exudates, optic cup and optic disc, hemorrhage points, and macula; 3) dividing the acquired images into five classes according to health status and the degree of non-proliferative or proliferative diabetic retinopathy, and constructing a dataset; 4) performing FOV extraction on the color retinal images and determining the extraction threshold; 5) designing a multi-scale segmentation network and training models on four retinal image input sizes; 6) plotting the network's ROC curves and comparing sensitivity and specificity across the segmented structures. The results show that this method improves the accuracy of retinal feature segmentation and is robust.
Description
Technical field
The present invention relates to a pixel-based retinal feature structure segmentation method. When segmenting structures in color fundus images, its accuracy is markedly higher than that of the prior art.
Background technique
Retinopathy is a consequence of several common diseases, such as diabetes, arteriosclerosis, and leukemia. These diseases can change the length, width, and angle of retinal vessels, and can even lead to vessel proliferation. Clinically, fundus images are frequently used to screen for, analyze, and diagnose disease: the retinal arteries and veins indicate whether a patient has arteriosclerosis, the number of hemorrhage points and exudates indicates whether a patient has diabetes, and detection of the optic cup and optic disc indicates whether a patient has glaucoma. Therefore, for quantitative disease analysis and early prevention, segmenting each retinal structure plays a key role: it supports efficient diagnosis by medical experts and the prevention of chronic diseases, and it embodies the use of artificial intelligence to improve human well-being.
Fundus image segmentation has long attracted experts worldwide, and many traditional algorithms for segmenting fundus structures exist; they can be broadly divided into unsupervised and supervised methods. In recent years, deep learning has achieved key breakthroughs: it learns shallow features, composes them into progressively more abstract deep features, and thereby discovers the distribution of the data. Compared with traditional methods, deep learning lets computers learn autonomously from observed data and, based on what is learned, solve problems that traditional algorithms handle poorly. Many fundus segmentation methods have been published. For example, Khalaf simplified a convolutional neural network (CNN) structure to distinguish large vessels, thin vessels, and background in fundus images, tuned it with convolution kernels of different sizes, and obtained good segmentation results on the DRIVE digital retinal vessel extraction database. Fu et al. proposed a retinal vessel segmentation method based on a CNN and a conditional random field (CRF): treating vessel segmentation as a boundary-detection problem, the CNN generates a segmentation probability map, which is combined with the CRF to obtain a binary segmentation. After CNN, AlexNet and VGGNet have also been applied to segmentation. However, drawbacks remain:
(1) datasets are too small and dataset annotations are inaccurate;
(2) in retinal images with large pathological regions, interference between lesion regions, and between lesion and normal regions, severely degrades segmentation;
(3) some retinal images are strongly affected by the acquisition environment, so feature details have low contrast against the background; manually annotating such datasets depends on the operator's technical experience, is subjective, and is inefficient.
Comparing these networks, we found that the PixelNet model, which learns segmentation from pixels sampled from feature maps, is well suited to retinal fundus image segmentation. The model extracts convolutional features with VGG-16; for each sampled pixel it gathers the corresponding features from multiple convolutional feature maps to build a hypercolumn descriptor, feeds this descriptor into a multilayer perceptron (MLP), and finally outputs the classification result. The network's main idea lies in its training-time sampling strategy, which accelerates training: it uses pixel-based stratified, sparse sampling, and shows that sampling only a few pixels from each training image already yields good results. The present invention first describes how to build our own improved dataset to provide a broader training set. Then, building on part of the PixelNet structure, the original VGG-16 is replaced with the deeper VGG-19 to enlarge the network. Meanwhile, the num_output parameter of each layer, i.e. the number of feature maps each layer outputs, is adjusted to compress the model as much as possible while preserving accuracy, improving test speed. In addition, the number of layers in the three-layer perceptron part is adjusted according to the complexity of the different fundus features to improve testing efficiency. Finally, the network model is distributed across multiple GPUs to run segmentation on the test set, achieving individual segmentation of each structure in the image.
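The hypercolumn descriptor described above can be sketched in plain Python. This is an illustrative toy, not the patent's implementation: feature maps are nested lists, and coarser maps are read with a nearest-neighbour coordinate lookup instead of true upsampling.

```python
def hypercolumn(feature_maps, y, x, full_h, full_w):
    """Build a hypercolumn descriptor for pixel (y, x) of the input image.

    feature_maps: list of 3D nested lists [height][width][channels], each at
    a coarser resolution than the full_h x full_w input image.
    """
    descriptor = []
    for fmap in feature_maps:
        h, w = len(fmap), len(fmap[0])
        # Nearest-neighbour lookup: project the pixel coordinate into this map.
        fy = min(h - 1, y * h // full_h)
        fx = min(w - 1, x * w // full_w)
        descriptor.extend(fmap[fy][fx])  # concatenate this map's channels
    return descriptor

# Toy example: two feature maps (4x4x2 and 2x2x3) for an 8x8 input image.
fm1 = [[[float(i + j), 0.5] for j in range(4)] for i in range(4)]
fm2 = [[[1.0, 2.0, 3.0] for _ in range(2)] for _ in range(2)]
hc = hypercolumn([fm1, fm2], y=5, x=3, full_h=8, full_w=8)
# hc concatenates 2 + 3 = 5 channel values; an MLP would classify it.
```

In PixelNet only a sparse subset of pixels per image gets such a descriptor during training, which is what makes the sampling strategy cheap.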
Summary of the invention
In view of this, the invention proposes a pixel-based retinal feature structure segmentation method that improves the accuracy of retinal feature segmentation and is robust.
To achieve the above objectives, a first aspect of the invention provides a pixel-based retinal feature structure segmentation method whose technical solution comprises the following steps:
Step 1: acquire color fundus images;
Step 2: annotate the feature structures in the acquired images, separately labeling blood vessels, exudates, optic cup and optic disc, hemorrhage points, and macula;
Step 3: divide the acquired images into five classes according to health status and the degree of non-proliferative or proliferative diabetic retinopathy, and construct a dataset;
Step 4: perform FOV extraction on the color retinal images and determine the extraction threshold;
Step 5: design a multi-scale segmentation network and train models on four retinal image input sizes;
Step 6: plot the network's ROC curves and compare sensitivity and specificity across the segmented structures.
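The six steps can be sketched as a pipeline skeleton. All function names and dictionary fields here are illustrative stubs, not from the patent; only the threshold of 23, the five classes, and the four input scales come from the text.

```python
def build_dataset(images, labels):
    """Steps 1-3: pair acquired images with their class labels (0-4)."""
    return [{"image": img, "grade": lab} for img, lab in zip(images, labels)]

def split_by_grade(dataset):
    """Step 3: five classes - healthy plus NPDR/PDR severity grades."""
    classes = {g: [] for g in range(5)}
    for sample in dataset:
        classes[sample["grade"]].append(sample)
    return classes

def pipeline(images, labels, fov_threshold=23, scales=(224, 448, 896, "full")):
    """Steps 1-6 as stubs: dataset, FOV threshold, per-scale models, metrics."""
    classes = split_by_grade(build_dataset(images, labels))
    models = {s: None for s in scales}          # step 5: one branch per input size
    metrics = {"sensitivity": None, "specificity": None, "roc": []}  # step 6
    return classes, fov_threshold, models, metrics

classes, thr, models, metrics = pipeline(["img0", "img1", "img2"], [0, 0, 4])
```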
Compared with prior art, the beneficial effects of the present invention are:
First, the invention uses a dataset we built over the past two years; with transfer learning, it is more comprehensive than existing segmentation methods in both the quality and the quantity of retinal photographs. Second, the dataset is distributed across multiple GPUs for training and testing with the improved multi-scale network, greatly increasing segmentation efficiency and accuracy. Finally, the segmentation results are displayed through an MFC interface. The method balances speed and accuracy: vessel accuracy is 0.927, optic disc/optic cup accuracy is 0.997, exudate accuracy is 0.939, macula accuracy is 0.997, and hemorrhage-point accuracy is 0.904. The method suits a wide range of retinal fundus pictures and is robust.
Brief description of the drawings
Fig. 1 Overall framework schematic (also the abstract figure);
Fig. 2 Multi-scale network structure;
Fig. 3 Distribution of fundus images by diabetic stage in the dataset;
Fig. 4 Original retinal images (a-c) and the corresponding segmented FOVs (d-f);
Fig. 5 ROC curves of the multi-scale network;
Fig. 6 Retinal structure segmentation results;
Fig. 7 Flow chart of the network training-parameter algorithm;
Fig. 8 ROC curves of the different networks compared.
Specific embodiment
The present invention is described in further detail below with reference to specific embodiments.
The overall network framework of the invention is shown in Fig. 2. The four branch networks extract features independently and share no parameters, so at the data input each of the four branches trains on the raw data; at the end of the four channels, the feature vectors extracted by the last convolutional layer of each channel are concatenated, enriching the feature information. The feature vectors of the four channels are spliced in a 1:1:1:1 ratio and passed to the next fully connected layer, which fully combines the features of the four channel networks, balancing the generic features of the pre-trained model with features specific to the target dataset. Each path's pre-training structure is identical to the PixelNet network except for the patch-size parameters.
The specific implementation of the technical solution is illustrated below with reference to embodiments.
1. Multi-scale segmentation network structure
The multi-scale segmentation network is the four-branch architecture described above (Fig. 2): the branches extract features independently without sharing parameters, each trains on the raw data, their last-convolutional-layer feature vectors are concatenated in a 1:1:1:1 ratio into the next fully connected layer, and each path's pre-training structure matches PixelNet except for the patch-size parameters.
2. Experiments
2.1 Dataset
The dataset used in this experiment contains 1500 retinal fundus images from 283 patients: 560 images at 1444 × 1444 pixels and 940 images at 2124 × 2056 pixels. Male patients range from 25 to 83 years old and female patients from 24 to 75; the images cover multiple pathological grades, and all annotations were approved by hospital ophthalmology experts. To make the dataset more comprehensive, we also adopted the vessel annotations of the DRIVE dataset and selected the feature annotations of 20 photographs from the STARE dataset, for a total of 1520 retinal images. Of these, 1320 are used for training and the rest for validation. Before input, all images undergo adaptive thresholding to separate the FOV boundary from the background of the fundus image, with a threshold of 23. To match the network's multi-scale input, the 1320 FOV-segmented retinal images are divided into 224 × 224, 448 × 448, and 896 × 896 patches, which are input together with the complete image.
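The FOV masking and patch tiling can be sketched in pure Python on a toy grayscale image (nested lists). The threshold of 23 is the value reported above; everything else, including the non-overlapping tiling, is an illustrative assumption.

```python
def fov_mask(gray, threshold=23):
    """Mark pixels brighter than the threshold as inside the field of view."""
    return [[1 if px > threshold else 0 for px in row] for row in gray]

def tile(h, w, patch):
    """Top-left corners of non-overlapping patch x patch tiles that fit fully."""
    return [(y, x) for y in range(0, h - patch + 1, patch)
                   for x in range(0, w - patch + 1, patch)]

# Toy 2x3 "image": background near 0, retina well above the threshold.
mask = fov_mask([[5, 200, 180], [10, 22, 90]])
corners = tile(1444, 1444, 224)  # 224x224 patches from a 1444x1444 image
```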
2.2 Segmentation criteria
To compare the segmentation results with the manual ground truth more objectively, four statistical indicators are introduced: (1) true positives: pixels that are truly vessel and identified as vessel by the model; (2) false negatives: pixels that are truly vessel but identified as non-vessel; (3) true negatives: pixels that are truly non-vessel and identified as non-vessel; (4) false positives: pixels that are truly non-vessel but identified as vessel. On this basis, sensitivity, specificity, accuracy, and related parameters are defined to evaluate the trained models. From the relationship between Se and 1-Sp, the receiver operating characteristic (ROC) curve is drawn; the area under the ROC curve (AUC) reflects the performance of the segmentation method, and a classifier with an AUC of 1 is a perfect classifier.
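These definitions translate directly into code. This minimal sketch uses the standard formulas Se = TP/(TP+FN), Sp = TN/(TN+FP), and Acc = (TP+TN)/total over flat 0/1 pixel lists.

```python
def confusion(pred, truth):
    """Count TP/FN/TN/FP over flat 0/1 pixel lists (1 = vessel)."""
    tp = sum(1 for p, t in zip(pred, truth) if p == 1 and t == 1)
    fn = sum(1 for p, t in zip(pred, truth) if p == 0 and t == 1)
    tn = sum(1 for p, t in zip(pred, truth) if p == 0 and t == 0)
    fp = sum(1 for p, t in zip(pred, truth) if p == 1 and t == 0)
    return tp, fn, tn, fp

def metrics(tp, fn, tn, fp):
    se = tp / (tp + fn)                    # sensitivity
    sp = tn / (tn + fp)                    # specificity
    acc = (tp + tn) / (tp + fn + tn + fp)  # accuracy
    return se, sp, acc

tp, fn, tn, fp = confusion([1, 1, 0, 0, 1, 0], [1, 0, 0, 1, 1, 0])
se, sp, acc = metrics(tp, fn, tn, fp)
```

Sweeping a decision threshold and plotting Se against 1-Sp at each setting yields the ROC curve described above.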
2.3 Training
The retinal structures differ, but according to our network structure each can be assigned to one of two classes. One class we call concentrated structures, such as the optic cup, optic disc, and macula; the other we call covering structures, such as vessels, exudates, and hemorrhage points. Concentrated structures appear only in one particular region of the image, occupy a small proportion of the whole image, and are compact. Covering structures generally appear throughout the image, occupy a larger proportion of the whole image, and therefore have a certain global character.
For covering structures we use the multi-scale input network of Fig. 2 described in the previous section, with inputs of 224 × 224, 448 × 448, and 896 × 896 patches plus the complete image, so that details are input without losing the global character of covering structures. Weight updates are derived from the back-propagation algorithm; the flow of the training-parameter algorithm is shown in Fig. 7.
In the present invention, the above predictions are integrated and set as the objective function of multi-task network training. Here Loss_K1, Loss_K2, Loss_K3, and Loss_K4 denote the losses of branches one to four, and Loss_Bin the binary loss of the final output. The objective function of the whole network can be expressed as Loss = Loss_K1 + Loss_K2 + Loss_K3 + Loss_K4 + Loss_Bin. In each training step the loss propagates back through branches one to four, so the network can be updated from five different classification tasks. The design considers the five predictions in a generalized framework, which suits our multi-stage deep neural network back-propagation. For each detection layer m there are training samples; within an image the distribution of object and non-object pixels is very uneven, so sampling is used to eliminate this imbalance.
2.4 Comparison with other networks
The network is compared on segmentation performance with three open-source networks: PixelNet, U-Net, and DeepLabv3+. All four networks use our dataset and processing, and are compared on three indicators: accuracy, sensitivity, and specificity. Note that, to obtain an accurate segmentation of each fundus feature structure, the three indicators are evaluated separately for each structure: vessels, exudates, hemorrhage points, optic cup and optic disc, and macula. The ROC curves output by the four networks are shown in Fig. 8. The multi-scale model achieves higher accuracy than the single-channel networks. Compared with prior-art methods, the method achieves higher precision, addressing the small scale and inaccurate annotation of fundus datasets that otherwise degrade fundus structure segmentation, while detecting all of the feature structures: fundus vessels, exudates, optic cup and optic disc, and hemorrhage points.
The foregoing is only a preferred embodiment of the present invention and is not intended to limit its scope. It should be understood that the invention is not limited to the schemes described herein, whose purpose is to help those skilled in the art practice the invention. Any skilled artisan may make further improvements and refinements without departing from the spirit and scope of the invention; the invention is therefore limited only by the content and scope of its claims, and is intended to cover all alternatives and equivalents within the spirit and scope defined by the appended claims.
Claims (5)
1. A pixel-based retinal feature structure segmentation method, comprising the following steps:
Step 1: acquire color fundus images;
Step 2: annotate the feature structures in the acquired images, separately labeling blood vessels, exudates, optic cup and optic disc, hemorrhage points, and macula;
Step 3: divide the acquired images into five classes according to health status and the degree of non-proliferative or proliferative diabetic retinopathy, and construct a dataset;
Step 4: perform FOV extraction on the color retinal images and determine the extraction threshold;
Step 5: design a multi-scale segmentation network and train models on four retinal image input sizes;
Step 6: plot the network's ROC curves and compare sensitivity and specificity across the segmented structures.
2. The pixel-based retinal feature structure segmentation method according to claim 1, characterized in that, in Step 3, the method constructs its own dataset: unlike the public fundus datasets, our dataset separately annotates and documents patients of different ages, sexes, and conditions, covering every pathological grade, all approved by hospital ophthalmology experts. Most fundus photographs of a patient form a group taken over a period of time, which makes later analysis of the patient's condition more convenient.
3. The pixel-based retinal feature structure segmentation method according to claim 1, characterized in that, in Step 4, to separate the retinal image from the background, we group image pixels with similar attributes using k-means clustering, reducing the number of distinct colors in the retinal fundus image, and then apply adaptive thresholding to separate the FOV boundary from the background. Since the purpose is to distinguish FOV from background, the threshold must exceed the intensity of the background pixels; it is selected by varying the threshold from 0 (corresponding to black) to 30 and comparing the observations.
4. The pixel-based retinal feature structure segmentation method according to claim 1, characterized in that, in Step 5, a pixel-based multi-scale segmentation network is designed: its four branch networks extract features independently and share no parameters, so at the data input the four branches each train on the raw data; at the end of the four channels the feature vectors extracted by the last convolutional layer of each channel are concatenated, enriching the feature information; the feature vectors of the four channels are fully connected in a 1:1:1:1 ratio and output to the next fully connected layer, which fully integrates the features of the four channel networks, balancing the generic features of the pre-trained model with features specific to the target dataset; the combination of the four channels enhances the generalization ability of the network.
5. The pixel-based retinal feature structure segmentation method according to claim 1, characterized in that, in Step 5, the first to fourth paths correspond, from top to bottom in Fig. 2, to input sizes of 224 × 224, 448 × 448, and 896 × 896 patches and the full-size image; the first three paths let the network learn local features of the segmentation target, while the last path lets it learn the target's global features; each path's pre-training structure is identical to the PixelNet network except for the patch-size parameters.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910235344.3A CN109949302A (en) | 2019-03-27 | 2019-03-27 | Retinal feature Structural Techniques based on pixel |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910235344.3A CN109949302A (en) | 2019-03-27 | 2019-03-27 | Retinal feature Structural Techniques based on pixel |
Publications (1)
Publication Number | Publication Date |
---|---|
CN109949302A true CN109949302A (en) | 2019-06-28 |
Family
ID=67010752
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201910235344.3A Pending CN109949302A (en) | 2019-03-27 | 2019-03-27 | Retinal feature Structural Techniques based on pixel |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN109949302A (en) |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110599491A (en) * | 2019-09-04 | 2019-12-20 | 腾讯医疗健康(深圳)有限公司 | Priori information-based eye image segmentation method, device, equipment and medium |
CN111652866A (en) * | 2020-05-29 | 2020-09-11 | 天津医科大学眼科医院 | Iris angiography quantitative analysis method |
CN112001928A (en) * | 2020-07-16 | 2020-11-27 | 北京化工大学 | Retinal vessel segmentation method and system |
CN112652394A (en) * | 2021-01-14 | 2021-04-13 | 浙江工商大学 | Multi-focus target detection-based retinopathy of prematurity diagnosis system |
CN112837322A (en) * | 2019-11-22 | 2021-05-25 | 北京深睿博联科技有限责任公司 | Image segmentation method and device, equipment and storage medium |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108021916A (en) * | 2017-12-31 | 2018-05-11 | 南京航空航天大学 | Deep learning diabetic retinopathy sorting technique based on notice mechanism |
CN108520522A (en) * | 2017-12-31 | 2018-09-11 | 南京航空航天大学 | Retinal fundus images dividing method based on the full convolutional neural networks of depth |
CN109448006A (en) * | 2018-11-01 | 2019-03-08 | 江西理工大学 | A kind of U-shaped intensive connection Segmentation Method of Retinal Blood Vessels of attention mechanism |
Non-Patent Citations (2)
Title |
---|
AAYUSH BANSAL: "PixelNet: Representation of the pixels, by the pixels, and for the pixels", 《ARXIV:1702.06506V1》 * |
MING LI: "Retinal Blood Vessel Segmentation Based on Multi-Scale Deep Learning", 《PROCEEDINGS OF THE FEDERATED CONFERENCE ON COMPUTER SCIENCE AND INFORMATION SYSTEMS》 * |
Cited By (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110599491A (en) * | 2019-09-04 | 2019-12-20 | 腾讯医疗健康(深圳)有限公司 | Priori information-based eye image segmentation method, device, equipment and medium |
CN110599491B (en) * | 2019-09-04 | 2024-04-12 | 腾讯医疗健康(深圳)有限公司 | Priori information-based eye image segmentation method, apparatus, device and medium |
CN112837322A (en) * | 2019-11-22 | 2021-05-25 | 北京深睿博联科技有限责任公司 | Image segmentation method and device, equipment and storage medium |
CN111652866A (en) * | 2020-05-29 | 2020-09-11 | 天津医科大学眼科医院 | Iris angiography quantitative analysis method |
CN111652866B (en) * | 2020-05-29 | 2022-12-27 | 天津医科大学眼科医院 | Iris angiography quantitative analysis method |
CN112001928A (en) * | 2020-07-16 | 2020-11-27 | 北京化工大学 | Retinal vessel segmentation method and system |
CN112001928B (en) * | 2020-07-16 | 2023-12-15 | 北京化工大学 | Retina blood vessel segmentation method and system |
CN112652394A (en) * | 2021-01-14 | 2021-04-13 | 浙江工商大学 | Multi-focus target detection-based retinopathy of prematurity diagnosis system |
Legal Events
Date | Code | Title | Description
---|---|---|---
| PB01 | Publication | |
| SE01 | Entry into force of request for substantive examination | |
| WD01 | Invention patent application deemed withdrawn after publication | Application publication date: 20190628 |