CN110263804B - Medical image segmentation method based on safe semi-supervised clustering - Google Patents
Medical image segmentation method based on safe semi-supervised clustering Download PDFInfo
- Publication number
- CN110263804B CN110263804B CN201910371366.2A CN201910371366A CN110263804B CN 110263804 B CN110263804 B CN 110263804B CN 201910371366 A CN201910371366 A CN 201910371366A CN 110263804 B CN110263804 B CN 110263804B
- Authority
- CN
- China
- Prior art keywords
- sample
- clustering
- labeled
- unlabeled
- density
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/23—Clustering techniques
- G06F18/232—Non-hierarchical techniques
- G06F18/2321—Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions
- G06F18/23213—Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions with fixed number of clusters, e.g. K-means clustering
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/10—Segmentation; Edge detection
- G06T7/136—Segmentation; Edge detection involving thresholding
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- General Physics & Mathematics (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Artificial Intelligence (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Evolutionary Biology (AREA)
- Evolutionary Computation (AREA)
- General Engineering & Computer Science (AREA)
- Life Sciences & Earth Sciences (AREA)
- Probability & Statistics with Applications (AREA)
- Image Analysis (AREA)
- Medical Treatment And Welfare Office Work (AREA)
Abstract
The invention discloses a medical image segmentation method based on safe semi-supervised clustering, and relates to a semi-supervised FCM clustering and density peak value clustering method. Firstly, a local graph is constructed by adopting a k-nearest neighbor method to obtain a graph regular term. Second, the FCM clustering and density clustering methods are used to estimate the confidence of the labeled and unlabeled samples. Then, confidence weighting of the samples and a regular term based on a local graph are introduced into the target function of the original semi-supervised FCM clustering method to obtain the target function of the safe semi-supervised clustering method. And finally, obtaining a clustering result by iteratively optimizing the membership matrix and the clustering center. The invention solves the safe use problem of the marked sample, simultaneously solves the safe use problem of the unmarked sample, and improves the accuracy and the robustness of the medical image segmentation.
Description
Technical Field
The invention relates to a medical image segmentation method based on semi-supervised clustering, in particular to a medical image segmentation method based on safe semi-supervised clustering, and belongs to the field of data mining based on medical images.
Background
With the continuous development of visualization technology, modern medicine has become more and more unable to process information of medical images, and medical images play an important role in clinical diagnosis, teaching and scientific research, and the like. The medical image segmentation method based on semi-supervised clustering integrates limited manual supervision information, namely, a plurality of limited points are clicked on an image to identify the relation between corresponding regions, the points are used as sample data with label information in the medical image segmentation method based on semi-supervised clustering, and the sample data is used for guiding clustering, so that the algorithm performance is improved, and the image segmentation is more accurate. The marking in the medical image is generally finished by experts, but wrong marking may occur due to various conditions in the marking process, and the medical image often carries noise points and outliers, and the traditional medical image segmentation method based on semi-supervised clustering does not consider the two aspects in the clustering process.
In this case, the performance of the conventional semi-supervised clustering method may be worse than that of the corresponding unsupervised learning method, which limits the application of the semi-supervised clustering in the medical image segmentation to a certain extent. In other words, the marked data may be detrimental to performance, while noise and outliers in the unmarked data also have a large impact on performance. The traditional semi-supervised clustering generally considers that the prior knowledge is beneficial to learning effect, but the collected prior knowledge (such as error labeled samples and noise) can possibly cause the degradation of learning performance. Xuesong Yin indicates that wrong a priori knowledge can lead to a degradation of learning performance. Based on the two aspects, it makes sense to design a safe semi-supervised learning method. Therefore, the invention tries to develop a mechanism that different samples have different safety degrees, so as to realize that the clustering performance is not lower than that of the original unsupervised clustering and semi-supervised clustering methods.
Disclosure of Invention
The invention provides a medical image segmentation method based on safe semi-supervised clustering, aiming at the defect that the risk of a marked sample and an unmarked sample is not considered simultaneously in the traditional medical image segmentation method based on semi-supervised clustering, which can cause the final segmentation effect to be reduced.
Firstly, the invention adopts a k-nearest neighbor method to construct a local graph to obtain a graph regular term. Second, the FCM clustering and density clustering methods are used to estimate the confidence of the labeled and unlabeled samples. Then, confidence weighting of the samples and a regular term based on a local graph are introduced into the target function of the original semi-supervised FCM clustering method to obtain the target function of the safe semi-supervised clustering method. And finally, obtaining a clustering result by iteratively optimizing the membership matrix and the clustering center. The technical scheme is as follows: a medical image segmentation method based on safe semi-supervised clustering comprises the following steps:
the method comprises the following steps: inputting labeled and unlabeled medical image datasets;
step two: FCM clustering is carried out on the data set to obtain a prediction label of the data set;
step three: obtaining the confidence coefficient of the unmarked sample by using a density peak value clustering method and according to the local density of the unmarked sample and the minimum distance between the unmarked sample and the point with higher density, obtaining the confidence coefficient of the marked sample according to the local density of the marked sample in the same marked sample cluster and the minimum distance between the marked sample and the point with higher density, and normalizing the confidence coefficient;
step four: constructing a local graph with the aim of limiting the output of the labeled samples with low confidence to the output of the adjacent samples;
step five: integrating information to construct a target function;
step six: solving the optimization problem by adopting an iterative optimization method;
step seven: and judging the category of the unlabeled sample to realize medical image segmentation.
Compared with the traditional semi-supervised clustering method, the method measures the confidence coefficient of the samples by using the density and the distance between the samples, and limits the marked samples with low confidence coefficient to the output of the adjacent samples by constructing the local graph, so that each sample can be safely and reasonably used, and the clustering is more accurate and robust. The invention solves the safe use problem of the marked sample, simultaneously solves the safe use problem of the unmarked sample, and improves the accuracy and the robustness of the medical image segmentation.
Drawings
FIG. 1 is a flow chart of an embodiment of the present invention.
Detailed Description
While the invention has been described in connection with what is presently considered to be the most practical and preferred embodiment, it is to be understood that the invention is not to be limited to the disclosed embodiment, but on the contrary, is intended to cover various modifications and equivalent arrangements included within the spirit and scope of the appended claims.
To better illustrate the objects and advantages of the present invention, an embodiment of the method of the present invention is described in further detail below with reference to fig. 1 and examples.
The method comprises the following steps: inputting labeled and unlabeled medical image datasets;
a subset of labeled samples of the input medical image dataset: xl=[x1,...,xl]The corresponding label is ykE { 1.. c }, unlabeled sample subset: xu=[xl+1,...,xn]。
Step two: FCM clustering is carried out on the data set to obtain a prediction label of the data set;
tagging predictions using the Kuhn-Munkres algorithmMapping to equivalent labelsWith a given labelAre consistent.
Step three: obtaining the confidence coefficient of the unmarked sample by using a density peak value clustering method and according to the local density of the unmarked sample and the minimum distance between the unmarked sample and the point with higher density, obtaining the confidence coefficient of the marked sample according to the local density of the marked sample in the same marked sample cluster and the minimum distance between the marked sample and the point with higher density, and normalizing the confidence coefficient;
wherein j ═ 1, 2.. times, n],k=[l+1,...,n]Dist (k, j) is a point xkAnd xjEuclidean distance of dcIs the truncation distance.
unlabeled sample confidence: gamma rayk=ρk/δk (4)
local density of labeled samples in the same labeled sample cluster:
wherein j isy=[1,2,...q],k′=[1,2,...,l],jyRepresenting sample set and labeled sample point xk′A set of samples with the same label.
Minimum distance of the marked sample from the point with higher density in the same marked sample cluster:
step four: constructing a k-nearest neighbor local graph with the aim of limiting labeled sample outputs with low confidence to those of neighboring samples;
constructing a local neighborhood graph of the marked sample, and then weighting W ═ W of the local graph edgek′r]n×nThe calculation is as follows:
wherein N isp(xk′) Finger xk′P data of nearest neighbor, xk′To mark sample points, xrσ represents the width parameter of the gaussian kernel for the neighboring sample points.
Step five: and integrating the information to construct an objective function.
The objective function is as follows:
the limiting conditions are as follows:
step six: solving the optimization problem by adopting an iterative optimization method;
by minimizing the above optimization problem, an optimal solution can be obtained. To simplify the calculation, the value of m is set to 2. The method solves the sample membership degree and the clustering center by adopting a Lagrange multiplier method.
Membership u of unlabeled samplesik:
membership u of labeled sampleik′:
cluster center vi:
And obtaining a final membership matrix U and a clustering center V through iterative calculation. When in useOr when the maximum iteration number is reached, the iteration is terminated, wherein t is the current iteration number, and eta is a set threshold value.
Step seven: and judging the category of the unlabeled sample to realize the segmentation of the medical image.
And after obtaining the membership matrix U, defuzzifying according to the maximum membership principle to obtain the category of the unlabeled sample, and finally, carrying out image segmentation to obtain a result.
Claims (1)
1. A medical image segmentation method based on safe semi-supervised clustering is characterized by comprising the following steps:
the method comprises the following steps: inputting labeled and unlabeled medical image datasets;
a subset of labeled samples of the input medical image dataset: xl=[x1,...,xl]The corresponding label is ykE { 1.. c }, unlabeled sample subset: xu=[xl+1,...,xn];
Step two: FCM clustering is carried out on the data set to obtain a prediction label of the data set;
tagging predictions using the Kuhn-Munkres algorithmIs mapped asMapping tagsWith a given label ykKeeping consistent on categories;
step three: obtaining the confidence coefficient of the unlabeled sample by using a density peak value clustering method and through the local density of the unlabeled sample and the minimum distance between the unlabeled sample and a point with higher density, obtaining the confidence coefficient of the labeled sample through the local density of the labeled sample in the same labeled sample cluster and the minimum distance between the labeled sample and the point with higher density, and normalizing the confidence coefficient;
wherein j ═ 1, 2.. times, n],k=[l+1,...,n]Dist (k, j) is a point xkAnd xjEuclidean distance of dcIs a truncation distance;
unlabeled sample confidence: gamma rayk=ρk/δk (4)
local density of labeled samples in the same labeled sample cluster:
wherein j isy=[1,2,...q],k′=[1,2,...,l],jyRepresenting sample set and labeled sample point xk′A set of identically labeled samples;
minimum distance of the marked sample from the point with higher density in the same marked sample cluster:
and for the data point with the greatest density:
labeled sample confidence:
step four: constructing a k-nearest neighbor local graph with the aim of limiting labeled sample outputs with low confidence to those of neighboring samples;
constructing local neighborhoods of labeled samplesIf the graph has a partial graph edge weight W [ < W >k′r]n×nThe calculation is as follows:
wherein N isp(xk′) Finger xk′P data of nearest neighbor, xk′To mark sample points, xrIs a neighboring sample point, and sigma represents a width parameter of the Gaussian kernel function;
step five: integrating information to construct a target function;
the objective function is as follows:
the limiting conditions are as follows:
step six: solving the optimization problem by adopting an iterative optimization method;
by minimizing the above optimization problem, an optimal solution can be obtained; to simplify the calculation, the value of m is set to 2; the method adopts a Lagrange multiplier method to solve the sample membership degree and the clustering center;
membership u of unlabeled samplesik:
membership u of labeled sampleik′:
cluster center vi:
Obtaining a final membership matrix U and a clustering center V through iterative calculation; when in useOr when the maximum iteration times is reached, the iteration is terminated, wherein t is the current iteration times, and eta is a set threshold;
step seven: judging the category of the unmarked sample to realize the segmentation of the medical image;
and after obtaining the membership matrix U, defuzzifying according to the maximum membership principle to obtain the category of the unlabeled sample, and finally, carrying out image segmentation to obtain a result.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910371366.2A CN110263804B (en) | 2019-05-06 | 2019-05-06 | Medical image segmentation method based on safe semi-supervised clustering |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910371366.2A CN110263804B (en) | 2019-05-06 | 2019-05-06 | Medical image segmentation method based on safe semi-supervised clustering |
Publications (2)
Publication Number | Publication Date |
---|---|
CN110263804A CN110263804A (en) | 2019-09-20 |
CN110263804B true CN110263804B (en) | 2021-08-03 |
Family
ID=67914306
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201910371366.2A Active CN110263804B (en) | 2019-05-06 | 2019-05-06 | Medical image segmentation method based on safe semi-supervised clustering |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN110263804B (en) |
Families Citing this family (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111612735A (en) * | 2020-04-08 | 2020-09-01 | 杭州电子科技大学 | Lung nodule image classification method based on information fusion safety semi-supervised clustering |
CN111898704B (en) * | 2020-08-17 | 2024-05-10 | 腾讯科技(深圳)有限公司 | Method and device for clustering content samples |
CN113780750B (en) * | 2021-08-18 | 2024-03-01 | 同济大学 | Medical risk assessment method and device based on medical image segmentation |
CN115131610B (en) * | 2022-06-13 | 2024-02-27 | 西北工业大学 | Robust semi-supervised image classification method based on data mining |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106447676A (en) * | 2016-10-12 | 2017-02-22 | 浙江工业大学 | Image segmentation method based on rapid density clustering algorithm |
CN107341812A (en) * | 2017-07-04 | 2017-11-10 | 太原理工大学 | A kind of sequence Lung neoplasm image partition method based on super-pixel and Density Clustering |
Family Cites Families (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8170306B2 (en) * | 2007-04-25 | 2012-05-01 | Siemens Aktiengesellschaft | Automatic partitioning and recognition of human body regions from an arbitrary scan coverage image |
CN104156438A (en) * | 2014-08-12 | 2014-11-19 | 德州学院 | Unlabeled sample selection method based on confidence coefficients and clustering |
CN104881687A (en) * | 2015-06-02 | 2015-09-02 | 四川理工学院 | Magnetic resonance image classification method based on semi-supervised Gaussian mixed model |
CN105825226A (en) * | 2016-03-11 | 2016-08-03 | 江苏畅远信息科技有限公司 | Association-rule-based distributed multi-label image identification method |
CN106611418A (en) * | 2016-03-29 | 2017-05-03 | 四川用联信息技术有限公司 | Image segmentation algorithm |
CN108629783B (en) * | 2018-05-02 | 2021-05-04 | 山东师范大学 | Image segmentation method, system and medium based on image feature density peak search |
CN109409400A (en) * | 2018-08-28 | 2019-03-01 | 西安电子科技大学 | Merge density peaks clustering method, image segmentation system based on k nearest neighbor and multiclass |
-
2019
- 2019-05-06 CN CN201910371366.2A patent/CN110263804B/en active Active
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106447676A (en) * | 2016-10-12 | 2017-02-22 | 浙江工业大学 | Image segmentation method based on rapid density clustering algorithm |
CN107341812A (en) * | 2017-07-04 | 2017-11-10 | 太原理工大学 | A kind of sequence Lung neoplasm image partition method based on super-pixel and Density Clustering |
Also Published As
Publication number | Publication date |
---|---|
CN110263804A (en) | 2019-09-20 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110263804B (en) | Medical image segmentation method based on safe semi-supervised clustering | |
CN112115995B (en) | Image multi-label classification method based on semi-supervised learning | |
Azadi et al. | Auxiliary image regularization for deep cnns with noisy labels | |
CN113408605B (en) | Hyperspectral image semi-supervised classification method based on small sample learning | |
Cao et al. | A multi-kernel based framework for heterogeneous feature selection and over-sampling for computer-aided detection of pulmonary nodules | |
CN110188827B (en) | Scene recognition method based on convolutional neural network and recursive automatic encoder model | |
Mao et al. | Feature representation using deep autoencoder for lung nodule image classification | |
Li et al. | Adaptive metric learning for saliency detection | |
CN113326731A (en) | Cross-domain pedestrian re-identification algorithm based on momentum network guidance | |
CN112614131A (en) | Pathological image analysis method based on deformation representation learning | |
CN113674288B (en) | Automatic segmentation method for digital pathological image tissue of non-small cell lung cancer | |
CN111062277B (en) | Sign language-lip language conversion method based on monocular vision | |
CN110555459A (en) | Score prediction method based on fuzzy clustering and support vector regression | |
CN110458022B (en) | Autonomous learning target detection method based on domain adaptation | |
CN110647907A (en) | Multi-label image classification algorithm using multi-layer classification and dictionary learning | |
Khanykov et al. | Image segmentation improvement by reversible segment merging | |
Cho et al. | Effective pseudo-labeling based on heatmap for unsupervised domain adaptation in cell detection | |
CN111581466B (en) | Partial multi-mark learning method for characteristic information noise | |
CN113222072A (en) | Lung X-ray image classification method based on K-means clustering and GAN | |
CN116258978A (en) | Target detection method for weak annotation of remote sensing image in natural protection area | |
Wang et al. | Chromosome detection in metaphase cell images using morphological priors | |
CN116363460A (en) | High-resolution remote sensing sample labeling method based on topic model | |
CN113469270B (en) | Semi-supervised intuitive clustering method based on decomposition multi-target differential evolution superpixel | |
CN113592045B (en) | Model adaptive text recognition method and system from printed form to handwritten form | |
CN114692746A (en) | Information entropy based classification method of fuzzy semi-supervised support vector machine |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |