CN114708200A - Method for extracting and screening CT (computed tomography) image characteristics for constructing chronic hepatitis B cirrhosis prediction model - Google Patents

Method for extracting and screening CT (computed tomography) image characteristics for constructing chronic hepatitis B cirrhosis prediction model Download PDF

Info

Publication number
CN114708200A
CN114708200A CN202210273537.XA CN202210273537A CN114708200A CN 114708200 A CN114708200 A CN 114708200A CN 202210273537 A CN202210273537 A CN 202210273537A CN 114708200 A CN114708200 A CN 114708200A
Authority
CN
China
Prior art keywords
image
cirrhosis
constructing
chronic hepatitis
prediction model
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202210273537.XA
Other languages
Chinese (zh)
Inventor
何健
王锦程
毛应凡
徐珊珊
汤盛楠
吴锦
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Priority to CN202210273537.XA priority Critical patent/CN114708200A/en
Publication of CN114708200A publication Critical patent/CN114708200A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/0002Inspection of images, e.g. flaw detection
    • G06T7/0012Biomedical image inspection
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/10Image acquisition modality
    • G06T2207/10072Tomographic images
    • G06T2207/10081Computed x-ray tomography [CT]
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/30Subject of image; Context of image processing
    • G06T2207/30004Biomedical image processing
    • G06T2207/30056Liver; Hepatic

Landscapes

  • Engineering & Computer Science (AREA)
  • Quality & Reliability (AREA)
  • General Health & Medical Sciences (AREA)
  • Medical Informatics (AREA)
  • Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
  • Radiology & Medical Imaging (AREA)
  • Health & Medical Sciences (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Apparatus For Radiation Diagnosis (AREA)
  • Image Processing (AREA)

Abstract

An image omics feature extraction and screening method for a CT image used for constructing a chronic hepatitis B cirrhosis prediction model. The existing liver fibrosis and cirrhosis prediction methods cannot be widely applied due to the cost problem, and the selection method of the characteristics used for prediction is not sound enough. The invention sets exclusion criteria for patients infected by HBV and having liver fibrosis pathological results of flat-scan CT examination, and counts the total number of patients capable of being included in CT image acquisition, and the acquired patient adopts an upper supine position to receive CT examination; dividing the obtained flat-scan CT image into two parts according to the total number of statistics, wherein one part is used as a training set and used for constructing a chronic hepatitis B cirrhosis prediction model, and the other part is used as a verification set and used for verifying the effect of the chronic hepatitis B cirrhosis prediction model; and carrying out image segmentation and image omics feature extraction. The method can accurately extract and screen the characteristics for constructing and predicting the liver cirrhosis model.

Description

Method for extracting and screening iconomics characteristics of CT (computed tomography) image for constructing chronic hepatitis B cirrhosis prediction model
Technical Field
The invention relates to an image omics feature extraction and screening method for a CT image for constructing a chronic hepatitis B cirrhosis prediction model.
Background
Current noninvasive methods for predicting the extent of liver fibrosis include serum indices and elastography. TE and MRE are known to have excellent diagnostic properties for liver fibrosis staging. However, these well-behaved equipment methods have not been widely used due to their high cost. In china, enhanced CT examination is usually recommended for HBV carriers to clarify their presence or absence of tumorigenesis, but many patients receive only flat scan CT examination due to limited cost effectiveness. While enhanced CT or MRI may provide more information than flat-scan CT, it is desirable to study noninvasive models to predict cirrhosis of the liver at a relatively low cost based on readily available data. In addition, the selection method of the characteristics used for constructing the prediction model is not sound enough.
Disclosure of Invention
The invention aims to solve the problems that the existing liver fibrosis and cirrhosis prediction method cannot be widely applied due to the cost problem and the selection method of the characteristics used for prediction is not sound enough, and provides an image omics characteristic extraction and screening method for constructing a CT image of a chronic hepatitis B cirrhosis prediction model.
A method for extracting and screening the characteristic of the imaging group of CT images used for constructing a chronic hepatitis B cirrhosis prediction model comprises the following steps:
step one, setting exclusion criteria aiming at patients infected by HBV and having liver fibrosis pathological results of flat-scan CT examination, carrying out next CT image acquisition on patients not in the exclusion criteria, and counting the total number of patients capable of being included in CT image acquisition, wherein the exclusion criteria are as follows:
(1) lack of detailed pathological records of liver fibrosis;
(2) lack of abdominal plain scan CT images of 1.5mm layer thickness;
(3) the interval between the flat scan CT examination and the biopsy is more than 3 months;
(4) poor image quality; the poor image quality refers to an image with a low score estimated by a PSNR (peak signal to noise ratio), structure similarity SSIM (structural similarity) or multi-scale structure similarity method;
(5) co-infecting with other viruses including one or more of hepatitis C virus, hepatitis D virus or human immunodeficiency virus;
(6) the liver cancer has a focal liver disease focus, and comprises one or more of HCC and hepatic tuberculosis;
(7) ingestion of large amounts of alcohol; said substantial amounts are greater than 20 grams per day;
(8) incomplete clinical data (n 39);
step two, the person to be collected adopts the supine position to receive CT examination; the parameters of the CT scan are: tube voltage is 120kVp, tube current is 250-350 mA, collimation slice thickness is 5mm, reconstruction slice thickness is 1.25mm, slice interval is 5mm, rotation time is 0.6s, spiral thread pitch is 1.375, visual field is 35-40 cm, and matrix is 512 multiplied by 512;
dividing the obtained flat-scan CT image into two parts according to the total number of statistics, wherein one part is used as a training set and used for constructing a chronic hepatitis B cirrhosis prediction model, and the other part is used as a verification set and used for verifying the effect of the chronic hepatitis B cirrhosis prediction model;
and step four, image segmentation and image omics feature extraction are carried out.
Preferably, the image omics feature extraction step comprises:
performing image preprocessing and feature extraction by adopting an open source Pyradiomics software package;
second, the voxel spacing is normalized to a size of 1 × 1 × 1mm, and the voxel intensity values are discretized with a bin width of 25 HU;
thirdly, extracting 828 imagery omics features from each ROI, including 18 first order statistics, 74 texture features and 736 wavelet-based transformation features;
fourthly, normalizing the characteristic values by using z scores in a training queue; verifying the standard score applied in the queue using the mean and standard deviation determined in the training queue;
fifthly, the step of selecting the image omics characteristics by repeatability among observers and lasso regression specifically comprises the following steps:
(1) randomly selecting a right portal horizontal CT scout image of the liver, and quantifying the repeatability of each radiological feature using intra-and inter-group correlation coefficients; one quantitative method is to repeat the ROI segmentation twice a week, another quantitative method is to perform the ROI segmentation independently and then calculate the inter-observer and inter-observer repeatability separately, wherein the minimum acceptable threshold of ICC is 0.8;
wherein, the intra-group and inter-group correlation coefficients are ICC for short and are called intraclass correlation coefficient in English;
(2) selecting an image omics feature;
first, features with high reproducibility are retained for subsequent analysis; the characteristic of high reproducibility means that the value of ICC inside an observer and between observers is more than 0.8;
and secondly, using a minimum absolute shrinkage and selection operator logistic regression algorithm, adjusting the penalty parameters through 10-time cross validation, and taking the characteristics of the non-zero coefficients as the characteristics related to the cirrhosis.
The invention has the beneficial effects that:
the invention is used for constructing a chronic hepatitis B cirrhosis prediction model, particularly provides a data base for constructing the chronic hepatitis B cirrhosis prediction model, makes up for the technical defect of feature selection used for constructing the model, and realizes the construction of the chronic hepatitis B cirrhosis prediction model through the specific image omics feature extraction and screening of CT images.
The method of the invention selects the characteristics of the image group by repeatability between observers and lasso regression, establishes repeatability tests, comprises ICC calculation between an observer and an observer, and has a minimum threshold value of 0.80. By selecting a smaller subset of the imaging features with higher repeatability, the functional clustering reproducibility of the imaging features can be obviously improved, important features related to liver cirrhosis prediction can be screened out, and the method has the advantage of accurate feature selection, thereby providing a basis for constructing a chronic hepatitis B cirrhosis prediction model.
Drawings
FIG. 1 is a flow chart of the method of the present invention;
FIG. 2 is a schematic representation of the principle of image omics feature extraction and screening according to the present invention;
FIG. 3a is a schematic representation of the LASSO model of the present invention determining the optimal λ value by minimum criteria using 10-fold cross-validation;
fig. 3b is a LASSO coefficient plot listing 85 selected features to which the present invention relates.
Detailed Description
The first embodiment is as follows:
in the embodiment, as shown in the flowchart of fig. 1, the method for extracting and screening the omics features of the CT image used for constructing the model for predicting chronic hepatitis b cirrhosis is as follows:
step one, setting exclusion criteria aiming at patients infected by HBV and having liver fibrosis pathological results of flat-scan CT examination, carrying out next CT image acquisition on patients not in the exclusion criteria, and counting the total number of patients capable of being included in CT image acquisition, wherein the exclusion criteria are as follows:
(1) lack of detailed pathological records of liver fibrosis (n-27);
(2) abdominal plain scan CT images lacking a 1.5mm layer thickness (n-128);
(3) the interval between the flat scan CT examination and the biopsy exceeds 3 months (n-16);
(4) poor image quality (n-42); the poor image quality refers to an image with a low score estimated by a PSNR (peak signal to noise ratio), structure similarity SSIM (structural similarity) or multi-scale structure similarity method;
(5) co-infecting with other viruses, including one or more of hepatitis C virus, hepatitis D virus or human immunodeficiency virus (n ═ 17);
(6) the liver cancer has a focal liver disease focus, and comprises one or more of HCC and hepatic tuberculosis (n is 45);
(7) ingestion of large amounts of alcohol; said plurality means greater than 20 grams per day (n-24);
(8) incomplete clinical data (n 39);
finally, 294 patients were included in the study. 144 patients between 2018 and 12 months were assigned to the training cohort and 150 patients between 2019 and 1 month to 2019 and 12 months were assigned to the validation cohort according to the biopsy date.
Step two, regarding the rest of the collected persons after elimination as the collected persons, the collected persons adopt the supine position to receive CT examination; the parameters of the CT scan are: tube voltage is 120kVp, tube current is 250-350 mA, collimation section thickness is 5mm, reconstruction section thickness is 1.25mm, section interval is 5mm, rotation time is 0.6s, spiral thread pitch is 1.375, visual field is between 35 and 40cm, and matrix is 512 multiplied by 512;
dividing the obtained flat-scan CT image into two parts according to the total number of statistics, wherein one part is used as a training set and used for constructing a chronic hepatitis B cirrhosis prediction model, and the other part is used as a verification set and used for verifying the effect of the chronic hepatitis B cirrhosis prediction model;
and step four, image segmentation and image omics feature extraction are carried out.
The second embodiment is as follows:
different from the specific embodiment, the method for extracting and screening the characteristic of the imaging group of the CT image for constructing the chronic hepatitis B cirrhosis prediction model of the embodiment,
the image omics feature extraction step comprises the following steps:
performing image preprocessing and feature extraction by adopting an open source Pyradiomics software package (http:// www.radiomics.io/Pyradiomics. html);
secondly, the voxel spacing is normalized in a size of 1 × 1 × 1mm, and the voxel intensity values are discretized by a bin width of 25HU to reduce the interference of image noise and normalize the intensity;
thirdly, extracting 828 image group characteristics from each ROI, including 18 first order statistics, 74 texture characteristics and 736 wavelet-based transformation characteristics;
fourthly, normalizing the characteristic values by using z scores in a training queue; the standard score (z-score) applied in the cohort was validated using the mean and standard deviation determined in the training cohort.
As shown in fig. 2, of the 828 extracted features, 85 features (8 first order statistics, 21 texture features and 56 wavelet-based transforms) with high repeatability were selected for subsequent analysis. 25 cirrhosis-related features with non-zero coefficients in the lasso regression model were selected according to the training cohort.
Fifthly, the step of selecting the image omics characteristics by repeatability among observers and lasso regression specifically comprises the following steps:
(1) randomly selecting right portal horizontal CT panned images of the liver, quantifying the reproducibility of each radiological feature using intra-and inter-group correlation coefficients based on 50 randomly selected patients; one quantitative method is to repeat the ROI segmentation twice a week, another quantitative method is to perform the ROI segmentation independently and then calculate the inter-observer and inter-observer repeatability separately, wherein the minimum acceptable threshold of ICC is 0.8;
wherein, the intra-group and inter-group correlation coefficients are ICC for short and are called intraclass correlation coefficient in English;
(2) selecting important image omics characteristics;
first, features with high reproducibility are retained for subsequent analysis; the characteristic of high reproducibility means that the value of ICC inside the observer and between the observers is more than 0.8;
next, a Least Absolute Shrinkage and Selection Operator (LASSO) logistic regression algorithm is used, and the penalty parameters are adjusted by 10-fold cross validation to take the features of non-zero coefficients as the liver cirrhosis related features, as shown in fig. 3a and 3 b.
Figures 3a and 3b are diagrams showing selection of imagery omics features using LASSO regression; FIG. 3a shows the LASSO model determining the optimal λ value using 10-fold cross validation with the minimum norm, plotting a binomial deviation curve against log (λ), and plotting the dashed vertical line at the optimal value using the minimum norm and 1 standard error of the minimum norm (1-standard error condition), where the optimal λ value is 0.0383. Fig. 3b shows a LASSO coefficient curve listing 85 selected features.

Claims (2)

1. A method for extracting and screening the characteristic of the imagery omics of the CT image used for constructing the model for predicting the chronic hepatitis B cirrhosis is characterized in that: the method comprises the following steps:
step one, setting exclusion criteria aiming at patients infected by HBV and having liver fibrosis pathological results of flat-scan CT examination, carrying out next CT image acquisition on patients not in the exclusion criteria, and counting the total number of patients capable of being included in CT image acquisition, wherein the exclusion criteria are as follows:
(1) lack of detailed pathological records of liver fibrosis;
(2) abdominal scout CT images lacking a 1.5mm layer thickness;
(3) the interval between the flat scan CT examination and the biopsy is more than 3 months;
(4) poor image quality; the poor image quality refers to an image with a low score estimated by a PSNR (peak signal to noise ratio), structure similarity SSIM (structural similarity) or multi-scale structure similarity method;
(5) co-infecting with other viruses including one or more of hepatitis C virus, hepatitis D virus or human immunodeficiency virus;
(6) the liver cancer has a focal liver disease focus, and comprises one or more of HCC and hepatic tuberculosis;
(7) ingestion of large amounts of alcohol; said substantial amounts are greater than 20 grams per day;
(8) incomplete clinical data (n = 39);
step two, the person to be collected adopts the supine position to receive CT examination; the parameters of the CT scan are: tube voltage is 120kVp, tube current is 250-350 mA, collimation slice thickness is 5mm, reconstruction slice thickness is 1.25mm, slice interval is 5mm, rotation time is 0.6s, spiral thread pitch is 1.375, visual field is 35-40 cm, and matrix is 512 multiplied by 512;
dividing the obtained flat-scan CT image into two parts according to the total number of statistics, wherein one part is used as a training set and used for constructing a chronic hepatitis B cirrhosis prediction model, and the other part is used as a verification set and used for verifying the effect of the chronic hepatitis B cirrhosis prediction model;
and step four, image segmentation and image omics feature extraction are carried out.
2. The method for extracting and screening the iconomics features of the CT images for constructing the model for predicting chronic hepatitis b cirrhosis according to claim 1, wherein:
the image omics feature extraction step comprises the following steps:
performing image preprocessing and feature extraction by adopting an open source Pyradiomics software package;
second, the voxel spacing is normalized in size 1 × 1 × 1mm, and the voxel intensity values are discretized with a bin width of 25 HU;
thirdly, extracting 828 image group characteristics from each ROI, including 18 first order statistics, 74 texture characteristics and 736 wavelet-based transformation characteristics;
fourthly, normalizing the characteristic values by using z scores in a training queue; verifying the standard score applied in the queue using the mean and standard deviation determined in the training queue;
fifthly, the step of selecting the image omics characteristics by repeatability among observers and lasso regression specifically comprises the following steps:
(1) randomly selecting a right portal horizontal CT scout image of the liver, and quantifying the repeatability of each radiological feature using intra-and inter-group correlation coefficients; one quantitative method is to repeat the ROI segmentation twice a week, another quantitative method is to perform the ROI segmentation independently and then calculate the inter-observer and inter-observer repeatability separately, wherein the minimum acceptable threshold of ICC is 0.8;
wherein, the intra-group and inter-group correlation coefficients are ICC for short and are called intraclass correlation coefficient in English;
(2) selecting an image omics feature;
first, features with high reproducibility are retained for subsequent analysis; the characteristic of high reproducibility means that the value of ICC inside the observer and between the observers is more than 0.8;
and secondly, using a minimum absolute shrinkage and selection operator logistic regression algorithm, adjusting the penalty parameters through 10-time cross validation, and taking the characteristics of the non-zero coefficients as the characteristics related to the cirrhosis.
CN202210273537.XA 2022-03-18 2022-03-18 Method for extracting and screening CT (computed tomography) image characteristics for constructing chronic hepatitis B cirrhosis prediction model Pending CN114708200A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210273537.XA CN114708200A (en) 2022-03-18 2022-03-18 Method for extracting and screening CT (computed tomography) image characteristics for constructing chronic hepatitis B cirrhosis prediction model

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210273537.XA CN114708200A (en) 2022-03-18 2022-03-18 Method for extracting and screening CT (computed tomography) image characteristics for constructing chronic hepatitis B cirrhosis prediction model

Publications (1)

Publication Number Publication Date
CN114708200A true CN114708200A (en) 2022-07-05

Family

ID=82169119

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210273537.XA Pending CN114708200A (en) 2022-03-18 2022-03-18 Method for extracting and screening CT (computed tomography) image characteristics for constructing chronic hepatitis B cirrhosis prediction model

Country Status (1)

Country Link
CN (1) CN114708200A (en)

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CA2565646A1 (en) * 2006-10-26 2008-04-26 Mcgill University Systems and methods of clinical state prediction utilizing medical image data
CN110895817A (en) * 2019-11-01 2020-03-20 复旦大学 MRI image hepatic fibrosis automatic grading method based on image omics analysis
CN113096757A (en) * 2021-04-29 2021-07-09 中国科学院深圳先进技术研究院 Esophageal squamous carcinoma survival prediction method and system based on dual-region imaging omics
CN113610845A (en) * 2021-09-09 2021-11-05 汕头大学医学院附属肿瘤医院 Tumor local control prediction model construction method, prediction method and electronic equipment
CN114121227A (en) * 2021-02-24 2022-03-01 首都医科大学附属北京佑安医院 Liver cirrhosis patient upper abdomen enhanced CT (computed tomography) image omics feature processing method and application thereof
US11276173B1 (en) * 2021-05-24 2022-03-15 Qure.Ai Technologies Private Limited Predicting lung cancer risk

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CA2565646A1 (en) * 2006-10-26 2008-04-26 Mcgill University Systems and methods of clinical state prediction utilizing medical image data
CN110895817A (en) * 2019-11-01 2020-03-20 复旦大学 MRI image hepatic fibrosis automatic grading method based on image omics analysis
CN114121227A (en) * 2021-02-24 2022-03-01 首都医科大学附属北京佑安医院 Liver cirrhosis patient upper abdomen enhanced CT (computed tomography) image omics feature processing method and application thereof
CN113096757A (en) * 2021-04-29 2021-07-09 中国科学院深圳先进技术研究院 Esophageal squamous carcinoma survival prediction method and system based on dual-region imaging omics
US11276173B1 (en) * 2021-05-24 2022-03-15 Qure.Ai Technologies Private Limited Predicting lung cancer risk
CN113610845A (en) * 2021-09-09 2021-11-05 汕头大学医学院附属肿瘤医院 Tumor local control prediction model construction method, prediction method and electronic equipment

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
徐珊珊等: "CT 影像组学预测胰腺神经内分泌肿瘤的病理分级", 东南国防医药, vol. 23, no. 5, 30 September 2021 (2021-09-30) *

Similar Documents

Publication Publication Date Title
van Rikxoort et al. Supervised enhancement filters: Application to fissure detection in chest CT scans
Arimura et al. Computerized scheme for automated detection of lung nodules in low-dose computed tomography images for lung cancer screening1
Erickson et al. Computer-aided detection and diagnosis at the start of the third millennium
Klifa et al. Quantification of breast tissue index from MR data using fuzzy clustering
US9953413B2 (en) Identification and analysis of lesions in medical imaging
CN1934589A (en) Systems and methods providing automated decision support for medical imaging
KR101144964B1 (en) System for Detection of Interstitial Lung Diseases and Method Therefor
JP2004000609A (en) Computer assisted diagnosis by multiple energy image
JP2004105731A (en) Processing of computer aided medical image
CN101028196A (en) Method and tomography unit for carrying out an analysis of a movement of an object
Maitra et al. Automated digital mammogram segmentation for detection of abnormal masses using binary homogeneity enhancement algorithm
Maitra et al. Detection of abnormal masses using divide and conquer algorithmin digital mammogram
CN110728239A (en) Gastric cancer enhanced CT image automatic identification system utilizing deep learning
CN112489788A (en) Multi-modal image analysis method and system for cancer diagnosis
Shahzad et al. A patient-specific coronary density estimate
Nishikawa et al. Computer-aided detection of clustered microcalcifications
Buckler et al. Inter-method performance study of tumor volumetry assessment on computed tomography test-retest data
CN114708200A (en) Method for extracting and screening CT (computed tomography) image characteristics for constructing chronic hepatitis B cirrhosis prediction model
KR101162599B1 (en) An automatic detection method of Cardiac Cardiomegaly through chest radiograph analyses and the recording medium thereof
CN112150406A (en) CT image-based pneumothorax lung collapse degree accurate calculation method
Lu et al. Computing mammographic density from a multiple regression model constructed with image-acquisition parameters from a full-field digital mammographic unit
CN110738649A (en) training method of Faster RCNN network for automatic identification of stomach cancer enhanced CT images
Wu et al. A fully automated multi-modal computer aided diagnosis approach to coronary calcium scoring of MSCT images
US20170178338A1 (en) Identification and analysis of lesions in medical imaging
Felinto et al. Automatic segmentation and quantification of thigh tissues in CT images

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination