Preliminary investigation into sources of uncertainty in quantitative imaging features

doi:10.1016/j.compmedimag.2015.04.006

Computerized Medical Imaging and Graphics

Volume 44, September 2015, Pages 54-61

https://doi.org/10.1016/j.compmedimag.2015.04.006

Highlights

• Features measured at end-of-exhale phase did not significantly differ from T20 to T90.
• Features are highly correlated between end-of-exhale phase and average images.
• The impact of tube voltage on features may not be statistically significant.
• The impact of tube current modeled as Gaussian noise was statistically significant.

Abstract

Several recent studies have demonstrated the potential for quantitative imaging features to classify non-small cell lung cancer (NSCLC) patients as high or low risk. However, applying the results from one institution to another has been difficult because of variations in imaging techniques and feature measurement. Our study was designed to determine the effect of some of these sources of uncertainty on image features extracted from computed tomography (CT) images of NSCLC tumors. CT images from 20 NSCLC patients were obtained to investigate the impact of four sources of uncertainty: two region of interest (ROI) selection conditions (breathing phase and single-slice vs. whole volume) and two imaging protocol parameters (peak tube voltage and current). Texture values did not vary substantially with the choice of breathing phase; however, almost half (12 out of 28) of the measured textures did change significantly when measured from the average images compared to the end-of-exhale phase. Of the 28 features, 8 showed a significant variation when measured from the largest cross-sectional slice compared to the entire tumor, but 14 were correlated to the entire tumor value. While simulating a decrease in tube voltage had a negligible impact on texture features, simulating a decrease in tube current (mA) resulted in significant changes for 13 of the 23 texture values. Our results suggest that substantial variation exists when textures are measured under different conditions, and thus the development of a texture analysis standard would be beneficial for comparing features between patients and institutions.

Introduction

Lung cancer is the leading cause of cancer deaths both globally and within the United States [1]. Non-small cell lung cancer (NSCLC) accounts for 85% of all lung cancer cases [2]. Survival rates in NSCLC have remained low despite progress in imaging and treatment techniques over the past forty years [1]. This problem is compounded by the substantial variability in outcomes among patients of the same stage or risk group, which can make choosing the optimal treatment strategy for any individual patient difficult. Because of these well-known statistics, a variety of research avenues have been explored to produce novel methods of predicting patient outcome or individualizing patient treatment. One method that has recently garnered considerable attention is the use of image texture analysis to classify patients as high or low risk before they begin treatment.

Texture analysis is a computational method that assigns quantitative values to medical images of tumors. Textures are designed to assess the amount of heterogeneity or identify the patterns within a region of interest (ROI) [3], [4]. As a result, tumors with higher or lower values than a chosen threshold can be grouped and the results correlated to patient outcomes in order to predict survival. Several groups have demonstrated the usefulness of this technique in NSCLC [5], [6], [7], [8], [9], [10]. Taken together, these studies suggest that image texture analysis may in the future play a large role in identifying patients with a worse prognosis or high risk factors. This could substantially alter the way we select treatment regimens for NSCLC patients and potentially increase observed survival rates.
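To make the idea of assigning quantitative values to an ROI concrete, the following is a minimal sketch of a few first-order (histogram-based) descriptors. It is an illustration only, not the feature set or software used in this study: the function name `first_order_features`, the choice of 32 histogram bins, and the synthetic voxel values are all assumptions made for the example.

```python
import numpy as np


def first_order_features(roi_values, n_bins=32):
    """Compute simple histogram-based descriptors of the voxel values in an ROI.

    roi_values: array-like of intensities (e.g., CT numbers) inside the ROI.
    n_bins: number of histogram bins for the entropy estimate (assumed value).
    """
    x = np.asarray(roi_values, dtype=float).ravel()
    mean = x.mean()
    std = x.std()
    # Standardized values; a perfectly uniform ROI (std == 0) is mapped to zeros.
    z = (x - mean) / std if std > 0 else np.zeros_like(x)
    skewness = float(np.mean(z ** 3))
    kurtosis = float(np.mean(z ** 4))
    # Shannon entropy (in bits) of the binned intensity histogram.
    counts, _ = np.histogram(x, bins=n_bins)
    p = counts[counts > 0] / counts.sum()
    entropy = float(-np.sum(p * np.log2(p)))
    return {
        "mean": float(mean),
        "std": float(std),
        "skewness": skewness,
        "kurtosis": kurtosis,
        "entropy": entropy,
    }


if __name__ == "__main__":
    # Synthetic ROI standing in for tumor voxels; not patient data.
    rng = np.random.default_rng(0)
    synthetic_roi = rng.normal(loc=40.0, scale=15.0, size=(16, 16, 8))
    for name, value in first_order_features(synthetic_roi).items():
        print(f"{name}: {value:.3f}")
```

Higher-order textures such as co-occurrence or run-length features [3], [4] additionally depend on the spatial arrangement of voxels, which this sketch does not capture.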

One obstacle to applying this technique in a clinical setting is the large uncertainty about whether features measured at different institutions can be fairly compared. Major sources of uncertainty for features measured from CT images of tumors range from imaging parameters (such as tube current, tube voltage, exposure, reconstruction algorithm, pixel dimensions, slice thickness, and CT manufacturer) to patient-specific factors (such as motion artifacts and tumor size) to variations in feature measurement (such as ROI delineation, software, feature parameters, and pre-processing filters). Identifying the impact of each of these factors is nearly impossible due to the scale of the task and the fact that a ground truth does not exist for quantitative imaging features the way it might for histology or other imaging goals. However, a few of these factors could be controlled for if we knew that they have a substantial effect on extracted texture values. For example, while many studies have used textures measured from the entire tumor volume, certain studies used only the largest cross-sectional tumor slice [6], [11]. Whether the results from one technique can be applied to tumors delineated with the alternative technique is not known. Similarly, studies have not previously been conducted on the effect the choice of breathing phase for ROI delineation may have on an individual patient's measurement. It is also still not well understood how the choice of imaging parameters affects the values measured. Because imaging parameters are known to vary from institution to institution, current texture studies are limited to local patients imaged with the same technique. The only study attempting to probe this question found that tube voltage variations had a larger effect on measured values than changes in tube current when textures were measured from a water phantom [12]. However, it is not known if this same relationship applies to heterogeneous images such as actual patient tumors.

Our study was designed to begin investigating these specific issues and to suggest guidelines for the comparison of texture values obtained under different or unknown conditions.

Section snippets

Patients

For this study, the 4D computed tomography (CT) thoracic scans of 20 patients with stage III NSCLC were obtained. These images are routinely collected as part of the radiation therapy treatment planning process. They are non-contrast images, as is routine in radiation therapy clinics. These images were selected because of the large number of published results demonstrating that image features calculated from non-contrast CT images may be used to predict patient outcomes [5], [6], [9], [11], [13].

Respiratory phase

Table 6 summarizes the results of the Wilcoxon signed rank tests after multiplicity correction comparing the texture values measured from the T50 phase images to the Tavg images and the other breathing phase images (T0–T90). Of the 28 total features, 12 were significantly different when they were measured from the Tavg versus the T50 phase images. The 12 significantly different features included at least one from every category but shape. Features measured from phases closer in time to T50 were
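A paired, per-feature comparison of this kind can be sketched as follows. This is only an illustration under stated assumptions: the snippet above does not say which multiplicity correction was applied, so the Holm step-down procedure is used here as one common choice, the helper names `holm_adjust` and `compare_conditions` are hypothetical, and the data are synthetic rather than patient measurements.

```python
import numpy as np
from scipy.stats import wilcoxon


def holm_adjust(p_values):
    """Holm step-down adjusted p-values (an assumed choice of correction)."""
    p = np.asarray(p_values, dtype=float)
    m = p.size
    order = np.argsort(p)
    # Scale the k-th smallest p-value by (m - k), then enforce monotonicity.
    scaled = p[order] * (m - np.arange(m))
    adjusted_sorted = np.minimum(np.maximum.accumulate(scaled), 1.0)
    adjusted = np.empty(m)
    adjusted[order] = adjusted_sorted
    return adjusted


def compare_conditions(features_a, features_b, alpha=0.05):
    """Wilcoxon signed rank test for each feature, paired by patient.

    features_a, features_b: arrays of shape (n_patients, n_features) holding
    the same features measured under two conditions (e.g., T50 vs. Tavg).
    Note: a feature with identical values in both conditions (all paired
    differences zero) is not handled here and may raise or yield NaN.
    """
    a = np.asarray(features_a, dtype=float)
    b = np.asarray(features_b, dtype=float)
    raw_p = np.array(
        [wilcoxon(a[:, j], b[:, j]).pvalue for j in range(a.shape[1])]
    )
    adjusted_p = holm_adjust(raw_p)
    return raw_p, adjusted_p, adjusted_p < alpha


if __name__ == "__main__":
    # Synthetic stand-in for 20 patients and 28 features; not study data.
    rng = np.random.default_rng(1)
    t50 = rng.normal(size=(20, 28))
    tavg = t50 + rng.normal(scale=0.1, size=(20, 28))
    tavg[:, :5] += 0.5  # shift a few features to mimic a systematic change
    _, adj_p, significant = compare_conditions(t50, tavg)
    print("features flagged as significantly different:", int(significant.sum()))
```

The signed rank test is a natural fit because each patient contributes a matched pair of measurements and no normality assumption is needed for a cohort of 20.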

Respiratory phase

From our analysis, the choice of individual breathing phase did not result in statistically significant differences in values. However, comparing texture values from the T50 and Tavg image sets did result in significant differences for nearly half of the features (12 of 28), including at least one from each category except for shape. We suggest that either the end-of-exhale (T50) phase or the Tavg image set be used in future studies for consistency with already published results. Several features (standard deviation, kurtosis, skewness, all of the shape

Conclusion

Several studies have suggested that image texture analysis may be a useful tool for identifying non-small cell lung cancer (NSCLC) patients at high risk for poor survival. Before this technique can be clinically implemented, the effect of different approaches to measuring texture and of different imaging parameters must be analyzed to ensure features can be reliably and consistently evaluated. This paper investigated the susceptibility of texture features to four variables and may serve as a

Conflict of interest statement

We have no conflicts of interest to disclose.

Acknowledgment

Xenia Fave is a recipient of the AAPM and RSNA Doctoral Fellowship. Molly Cook is a recipient of the AAPM Summer Undergraduate Fellowship.

References (20)

J.R. Molina et al. Non-small cell lung cancer: epidemiology, risk factors, treatment, and survivorship. Mayo Clin Proc (2008).
M.M. Galloway. Texture analysis using gray level run lengths. Comput Graph Image Process (1975).
H. Wang et al. Multilevel binomial logistic prediction model for malignant pulmonary nodules based on texture features of CT image. Eur J Radiol (2010).
Y. Balagurunathan et al. Reproducibility and prognosis of quantitative features extracted from CT images. Transl Oncol (2014).
T.P. Coroller et al. CT-based radiomic signature predicts distant metastasis in lung adenocarcinoma. Radiother Oncol (2015).
National Cancer Institute. SEER stat fact sheets: lung and bronchus cancer (2014).
R.M. Haralick et al. Textural features for image classification. IEEE Trans Syst Man Cybern (1973).
B. Ganeshan et al. Texture analysis of non-small cell lung cancer on unenhanced computed tomography: initial evidence for a relationship with tumour glucose metabolism and stage. Cancer Imaging (2010).
B. Ganeshan et al. Tumour heterogeneity in non-small cell lung carcinoma assessed by CT texture analysis: a potential marker of survival. Eur Radiol (2012).
S. Basu et al. Developing a classifier model for lung tumors in CT-scan images.

There are more references available in the full text version of this article.

