CN110263804B - Medical image segmentation method based on safe semi-supervised clustering - Google Patents

Medical image segmentation method based on safe semi-supervised clustering Download PDF

Info

Publication number
CN110263804B
CN110263804B CN201910371366.2A CN201910371366A CN110263804B CN 110263804 B CN110263804 B CN 110263804B CN 201910371366 A CN201910371366 A CN 201910371366A CN 110263804 B CN110263804 B CN 110263804B
Authority
CN
China
Prior art keywords
sample
clustering
labeled
unlabeled
density
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201910371366.2A
Other languages
Chinese (zh)
Other versions
CN110263804A (en
Inventor
郭丽
甘海涛
夏思雨
厉振华
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hangzhou Dianzi University
Original Assignee
Hangzhou Dianzi University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hangzhou Dianzi University filed Critical Hangzhou Dianzi University
Priority to CN201910371366.2A priority Critical patent/CN110263804B/en
Publication of CN110263804A publication Critical patent/CN110263804A/en
Application granted granted Critical
Publication of CN110263804B publication Critical patent/CN110263804B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/23Clustering techniques
    • G06F18/232Non-hierarchical techniques
    • G06F18/2321Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions
    • G06F18/23213Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions with fixed number of clusters, e.g. K-means clustering
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/10Segmentation; Edge detection
    • G06T7/136Segmentation; Edge detection involving thresholding

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • General Physics & Mathematics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Artificial Intelligence (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • General Engineering & Computer Science (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Probability & Statistics with Applications (AREA)
  • Image Analysis (AREA)
  • Medical Treatment And Welfare Office Work (AREA)

Abstract

The invention discloses a medical image segmentation method based on safe semi-supervised clustering, and relates to a semi-supervised FCM clustering and density peak value clustering method. Firstly, a local graph is constructed by adopting a k-nearest neighbor method to obtain a graph regular term. Second, the FCM clustering and density clustering methods are used to estimate the confidence of the labeled and unlabeled samples. Then, confidence weighting of the samples and a regular term based on a local graph are introduced into the target function of the original semi-supervised FCM clustering method to obtain the target function of the safe semi-supervised clustering method. And finally, obtaining a clustering result by iteratively optimizing the membership matrix and the clustering center. The invention solves the safe use problem of the marked sample, simultaneously solves the safe use problem of the unmarked sample, and improves the accuracy and the robustness of the medical image segmentation.

Description

Medical image segmentation method based on safe semi-supervised clustering
Technical Field
The invention relates to a medical image segmentation method based on semi-supervised clustering, in particular to a medical image segmentation method based on safe semi-supervised clustering, and belongs to the field of data mining based on medical images.
Background
With the continuous development of visualization technology, modern medicine has become more and more unable to process information of medical images, and medical images play an important role in clinical diagnosis, teaching and scientific research, and the like. The medical image segmentation method based on semi-supervised clustering integrates limited manual supervision information, namely, a plurality of limited points are clicked on an image to identify the relation between corresponding regions, the points are used as sample data with label information in the medical image segmentation method based on semi-supervised clustering, and the sample data is used for guiding clustering, so that the algorithm performance is improved, and the image segmentation is more accurate. The marking in the medical image is generally finished by experts, but wrong marking may occur due to various conditions in the marking process, and the medical image often carries noise points and outliers, and the traditional medical image segmentation method based on semi-supervised clustering does not consider the two aspects in the clustering process.
In this case, the performance of the conventional semi-supervised clustering method may be worse than that of the corresponding unsupervised learning method, which limits the application of the semi-supervised clustering in the medical image segmentation to a certain extent. In other words, the marked data may be detrimental to performance, while noise and outliers in the unmarked data also have a large impact on performance. The traditional semi-supervised clustering generally considers that the prior knowledge is beneficial to learning effect, but the collected prior knowledge (such as error labeled samples and noise) can possibly cause the degradation of learning performance. Xuesong Yin indicates that wrong a priori knowledge can lead to a degradation of learning performance. Based on the two aspects, it makes sense to design a safe semi-supervised learning method. Therefore, the invention tries to develop a mechanism that different samples have different safety degrees, so as to realize that the clustering performance is not lower than that of the original unsupervised clustering and semi-supervised clustering methods.
Disclosure of Invention
The invention provides a medical image segmentation method based on safe semi-supervised clustering, aiming at the defect that the risk of a marked sample and an unmarked sample is not considered simultaneously in the traditional medical image segmentation method based on semi-supervised clustering, which can cause the final segmentation effect to be reduced.
Firstly, the invention adopts a k-nearest neighbor method to construct a local graph to obtain a graph regular term. Second, the FCM clustering and density clustering methods are used to estimate the confidence of the labeled and unlabeled samples. Then, confidence weighting of the samples and a regular term based on a local graph are introduced into the target function of the original semi-supervised FCM clustering method to obtain the target function of the safe semi-supervised clustering method. And finally, obtaining a clustering result by iteratively optimizing the membership matrix and the clustering center. The technical scheme is as follows: a medical image segmentation method based on safe semi-supervised clustering comprises the following steps:
the method comprises the following steps: inputting labeled and unlabeled medical image datasets;
step two: FCM clustering is carried out on the data set to obtain a prediction label of the data set;
step three: obtaining the confidence coefficient of the unmarked sample by using a density peak value clustering method and according to the local density of the unmarked sample and the minimum distance between the unmarked sample and the point with higher density, obtaining the confidence coefficient of the marked sample according to the local density of the marked sample in the same marked sample cluster and the minimum distance between the marked sample and the point with higher density, and normalizing the confidence coefficient;
step four: constructing a local graph with the aim of limiting the output of the labeled samples with low confidence to the output of the adjacent samples;
step five: integrating information to construct a target function;
step six: solving the optimization problem by adopting an iterative optimization method;
step seven: and judging the category of the unlabeled sample to realize medical image segmentation.
Compared with the traditional semi-supervised clustering method, the method measures the confidence coefficient of the samples by using the density and the distance between the samples, and limits the marked samples with low confidence coefficient to the output of the adjacent samples by constructing the local graph, so that each sample can be safely and reasonably used, and the clustering is more accurate and robust. The invention solves the safe use problem of the marked sample, simultaneously solves the safe use problem of the unmarked sample, and improves the accuracy and the robustness of the medical image segmentation.
Drawings
FIG. 1 is a flow chart of an embodiment of the present invention.
Detailed Description
While the invention has been described in connection with what is presently considered to be the most practical and preferred embodiment, it is to be understood that the invention is not to be limited to the disclosed embodiment, but on the contrary, is intended to cover various modifications and equivalent arrangements included within the spirit and scope of the appended claims.
To better illustrate the objects and advantages of the present invention, an embodiment of the method of the present invention is described in further detail below with reference to fig. 1 and examples.
The method comprises the following steps: inputting labeled and unlabeled medical image datasets;
a subset of labeled samples of the input medical image dataset: xl=[x1,...,xl]The corresponding label is ykE { 1.. c }, unlabeled sample subset: xu=[xl+1,...,xn]。
Step two: FCM clustering is carried out on the data set to obtain a prediction label of the data set;
obtaining a predictive label of the data set through FCM clustering:
Figure GDA0003109484700000021
tagging predictions using the Kuhn-Munkres algorithm
Figure GDA0003109484700000031
Mapping to equivalent labels
Figure GDA0003109484700000032
With a given label
Figure GDA0003109484700000033
Are consistent.
Step three: obtaining the confidence coefficient of the unmarked sample by using a density peak value clustering method and according to the local density of the unmarked sample and the minimum distance between the unmarked sample and the point with higher density, obtaining the confidence coefficient of the marked sample according to the local density of the marked sample in the same marked sample cluster and the minimum distance between the marked sample and the point with higher density, and normalizing the confidence coefficient;
local density of unlabeled sample:
Figure GDA0003109484700000034
wherein j ═ 1, 2.. times, n],k=[l+1,...,n]Dist (k, j) is a point xkAnd xjEuclidean distance of dcIs the truncation distance.
Minimum distance of unlabeled sample from points with higher density:
Figure GDA0003109484700000035
and for the data point with the greatest density:
Figure GDA0003109484700000036
unlabeled sample confidence: gamma rayk=ρkk (4)
Unlabeled sample confidence normalization:
Figure GDA0003109484700000037
local density of labeled samples in the same labeled sample cluster:
Figure GDA0003109484700000038
wherein j isy=[1,2,...q],k′=[1,2,...,l],jyRepresenting sample set and labeled sample point xk′A set of samples with the same label.
Minimum distance of the marked sample from the point with higher density in the same marked sample cluster:
Figure GDA0003109484700000039
and for the data point with the greatest density:
Figure GDA00031094847000000310
labeled sample confidence:
Figure GDA00031094847000000311
labeled sample confidence normalization:
Figure GDA00031094847000000312
step four: constructing a k-nearest neighbor local graph with the aim of limiting labeled sample outputs with low confidence to those of neighboring samples;
constructing a local neighborhood graph of the marked sample, and then weighting W ═ W of the local graph edgek′r]n×nThe calculation is as follows:
Figure GDA0003109484700000041
wherein N isp(xk′) Finger xk′P data of nearest neighbor, xk′To mark sample points, xrσ represents the width parameter of the gaussian kernel for the neighboring sample points.
Step five: and integrating the information to construct an objective function.
The objective function is as follows:
Figure GDA0003109484700000042
the limiting conditions are as follows:
Figure GDA0003109484700000043
step six: solving the optimization problem by adopting an iterative optimization method;
by minimizing the above optimization problem, an optimal solution can be obtained. To simplify the calculation, the value of m is set to 2. The method solves the sample membership degree and the clustering center by adopting a Lagrange multiplier method.
Membership u of unlabeled samplesik
Figure GDA0003109484700000044
Wherein the content of the first and second substances,
Figure GDA0003109484700000045
Figure GDA0003109484700000046
membership u of labeled sampleik′
Figure GDA0003109484700000051
Wherein the content of the first and second substances,
Figure GDA0003109484700000052
Figure GDA0003109484700000053
cluster center vi
Figure GDA0003109484700000054
And obtaining a final membership matrix U and a clustering center V through iterative calculation. When in use
Figure GDA0003109484700000055
Or when the maximum iteration number is reached, the iteration is terminated, wherein t is the current iteration number, and eta is a set threshold value.
Step seven: and judging the category of the unlabeled sample to realize the segmentation of the medical image.
And after obtaining the membership matrix U, defuzzifying according to the maximum membership principle to obtain the category of the unlabeled sample, and finally, carrying out image segmentation to obtain a result.

Claims (1)

1. A medical image segmentation method based on safe semi-supervised clustering is characterized by comprising the following steps:
the method comprises the following steps: inputting labeled and unlabeled medical image datasets;
a subset of labeled samples of the input medical image dataset: xl=[x1,...,xl]The corresponding label is ykE { 1.. c }, unlabeled sample subset: xu=[xl+1,...,xn];
Step two: FCM clustering is carried out on the data set to obtain a prediction label of the data set;
obtaining a data set by FCM clusteringThe predictive label of (2):
Figure FDA0003109484690000011
tagging predictions using the Kuhn-Munkres algorithm
Figure FDA0003109484690000012
Is mapped as
Figure FDA0003109484690000013
Mapping tags
Figure FDA0003109484690000014
With a given label ykKeeping consistent on categories;
step three: obtaining the confidence coefficient of the unlabeled sample by using a density peak value clustering method and through the local density of the unlabeled sample and the minimum distance between the unlabeled sample and a point with higher density, obtaining the confidence coefficient of the labeled sample through the local density of the labeled sample in the same labeled sample cluster and the minimum distance between the labeled sample and the point with higher density, and normalizing the confidence coefficient;
local density of unlabeled sample:
Figure FDA0003109484690000015
wherein j ═ 1, 2.. times, n],k=[l+1,...,n]Dist (k, j) is a point xkAnd xjEuclidean distance of dcIs a truncation distance;
minimum distance of unlabeled sample from points with higher density:
Figure FDA0003109484690000016
and for the data point with the greatest density:
Figure FDA0003109484690000017
unlabeled sample confidence: gamma rayk=ρkk (4)
Unlabeled sample confidence normalization:
Figure FDA0003109484690000018
local density of labeled samples in the same labeled sample cluster:
Figure FDA0003109484690000021
wherein j isy=[1,2,...q],k′=[1,2,...,l],jyRepresenting sample set and labeled sample point xk′A set of identically labeled samples;
minimum distance of the marked sample from the point with higher density in the same marked sample cluster:
Figure FDA0003109484690000022
and for the data point with the greatest density:
Figure FDA0003109484690000023
labeled sample confidence:
Figure FDA0003109484690000024
labeled sample confidence normalization:
Figure FDA0003109484690000025
step four: constructing a k-nearest neighbor local graph with the aim of limiting labeled sample outputs with low confidence to those of neighboring samples;
constructing local neighborhoods of labeled samplesIf the graph has a partial graph edge weight W [ < W >k′r]n×nThe calculation is as follows:
Figure FDA0003109484690000026
wherein N isp(xk′) Finger xk′P data of nearest neighbor, xk′To mark sample points, xrIs a neighboring sample point, and sigma represents a width parameter of the Gaussian kernel function;
step five: integrating information to construct a target function;
the objective function is as follows:
Figure FDA0003109484690000031
the limiting conditions are as follows:
Figure FDA0003109484690000032
step six: solving the optimization problem by adopting an iterative optimization method;
by minimizing the above optimization problem, an optimal solution can be obtained; to simplify the calculation, the value of m is set to 2; the method adopts a Lagrange multiplier method to solve the sample membership degree and the clustering center;
membership u of unlabeled samplesik
Figure FDA0003109484690000033
Wherein the content of the first and second substances,
Figure FDA0003109484690000034
Figure FDA0003109484690000035
membership u of labeled sampleik′
Figure FDA0003109484690000036
Wherein the content of the first and second substances,
Figure FDA0003109484690000037
Figure FDA0003109484690000041
cluster center vi
Figure FDA0003109484690000042
Obtaining a final membership matrix U and a clustering center V through iterative calculation; when in use
Figure FDA0003109484690000043
Or when the maximum iteration times is reached, the iteration is terminated, wherein t is the current iteration times, and eta is a set threshold;
step seven: judging the category of the unmarked sample to realize the segmentation of the medical image;
and after obtaining the membership matrix U, defuzzifying according to the maximum membership principle to obtain the category of the unlabeled sample, and finally, carrying out image segmentation to obtain a result.
CN201910371366.2A 2019-05-06 2019-05-06 Medical image segmentation method based on safe semi-supervised clustering Active CN110263804B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910371366.2A CN110263804B (en) 2019-05-06 2019-05-06 Medical image segmentation method based on safe semi-supervised clustering

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910371366.2A CN110263804B (en) 2019-05-06 2019-05-06 Medical image segmentation method based on safe semi-supervised clustering

Publications (2)

Publication Number Publication Date
CN110263804A CN110263804A (en) 2019-09-20
CN110263804B true CN110263804B (en) 2021-08-03

Family

ID=67914306

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910371366.2A Active CN110263804B (en) 2019-05-06 2019-05-06 Medical image segmentation method based on safe semi-supervised clustering

Country Status (1)

Country Link
CN (1) CN110263804B (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111612735A (en) * 2020-04-08 2020-09-01 杭州电子科技大学 Lung nodule image classification method based on information fusion safety semi-supervised clustering
CN111898704B (en) * 2020-08-17 2024-05-10 腾讯科技(深圳)有限公司 Method and device for clustering content samples
CN113780750B (en) * 2021-08-18 2024-03-01 同济大学 Medical risk assessment method and device based on medical image segmentation
CN115131610B (en) * 2022-06-13 2024-02-27 西北工业大学 Robust semi-supervised image classification method based on data mining

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106447676A (en) * 2016-10-12 2017-02-22 浙江工业大学 Image segmentation method based on rapid density clustering algorithm
CN107341812A (en) * 2017-07-04 2017-11-10 太原理工大学 A kind of sequence Lung neoplasm image partition method based on super-pixel and Density Clustering

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8170306B2 (en) * 2007-04-25 2012-05-01 Siemens Aktiengesellschaft Automatic partitioning and recognition of human body regions from an arbitrary scan coverage image
CN104156438A (en) * 2014-08-12 2014-11-19 德州学院 Unlabeled sample selection method based on confidence coefficients and clustering
CN104881687A (en) * 2015-06-02 2015-09-02 四川理工学院 Magnetic resonance image classification method based on semi-supervised Gaussian mixed model
CN105825226A (en) * 2016-03-11 2016-08-03 江苏畅远信息科技有限公司 Association-rule-based distributed multi-label image identification method
CN106611418A (en) * 2016-03-29 2017-05-03 四川用联信息技术有限公司 Image segmentation algorithm
CN108629783B (en) * 2018-05-02 2021-05-04 山东师范大学 Image segmentation method, system and medium based on image feature density peak search
CN109409400A (en) * 2018-08-28 2019-03-01 西安电子科技大学 Merge density peaks clustering method, image segmentation system based on k nearest neighbor and multiclass

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106447676A (en) * 2016-10-12 2017-02-22 浙江工业大学 Image segmentation method based on rapid density clustering algorithm
CN107341812A (en) * 2017-07-04 2017-11-10 太原理工大学 A kind of sequence Lung neoplasm image partition method based on super-pixel and Density Clustering

Also Published As

Publication number Publication date
CN110263804A (en) 2019-09-20

Similar Documents

Publication Publication Date Title
CN110263804B (en) Medical image segmentation method based on safe semi-supervised clustering
CN112115995B (en) Image multi-label classification method based on semi-supervised learning
Azadi et al. Auxiliary image regularization for deep cnns with noisy labels
CN113408605B (en) Hyperspectral image semi-supervised classification method based on small sample learning
Cao et al. A multi-kernel based framework for heterogeneous feature selection and over-sampling for computer-aided detection of pulmonary nodules
CN110188827B (en) Scene recognition method based on convolutional neural network and recursive automatic encoder model
Mao et al. Feature representation using deep autoencoder for lung nodule image classification
Li et al. Adaptive metric learning for saliency detection
CN113326731A (en) Cross-domain pedestrian re-identification algorithm based on momentum network guidance
CN112614131A (en) Pathological image analysis method based on deformation representation learning
CN113674288B (en) Automatic segmentation method for digital pathological image tissue of non-small cell lung cancer
CN111062277B (en) Sign language-lip language conversion method based on monocular vision
CN110555459A (en) Score prediction method based on fuzzy clustering and support vector regression
CN110458022B (en) Autonomous learning target detection method based on domain adaptation
CN110647907A (en) Multi-label image classification algorithm using multi-layer classification and dictionary learning
Khanykov et al. Image segmentation improvement by reversible segment merging
Cho et al. Effective pseudo-labeling based on heatmap for unsupervised domain adaptation in cell detection
CN111581466B (en) Partial multi-mark learning method for characteristic information noise
CN113222072A (en) Lung X-ray image classification method based on K-means clustering and GAN
CN116258978A (en) Target detection method for weak annotation of remote sensing image in natural protection area
Wang et al. Chromosome detection in metaphase cell images using morphological priors
CN116363460A (en) High-resolution remote sensing sample labeling method based on topic model
CN113469270B (en) Semi-supervised intuitive clustering method based on decomposition multi-target differential evolution superpixel
CN113592045B (en) Model adaptive text recognition method and system from printed form to handwritten form
CN114692746A (en) Information entropy based classification method of fuzzy semi-supervised support vector machine

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant