Proton computed tomography (pCT) has been proposed as an alternative to X-ray computed tomography (CT) for acquiring relative to water stopping power (RSP) maps used for proton treatment planning dose calculations. In parallel, it has been shown that dual energy X-ray CT (DECT) improves RSP accuracy when compared to conventional single energy X-ray CT. This study aimed at directly comparing the RSP accuracy of both modalities using phantoms scanned at an advanced prototype pCT scanner and a state-of-the-art DECT scanner. Two phantoms containing 13 tissue-mimicking inserts of known RSP were scanned at the pCT phase II prototype and a latest generation dual-source DECT scanner (Siemens SOMATOM Definition FORCE). RSP accuracy was compared by mean absolute percent error (MAPE) over all inserts. A highly realistic Monte Carlo (MC) simulation was used to gain insight on pCT image artifacts which degraded MAPE. MAPE was 0.55% for pCT and 0.67% for DECT. The realistic MC simulation agreed well with pCT measurements (MAPE = 0.69%).
We present a method to accurately predict image noise in proton computed tomography (pCT) using data generated from a Monte Carlo simulation and a patient or object model that may be generated from a prior x-ray CT image. This enables noise prediction for arbitrary beam fluence settings and, therefore, the application of fluence-modulated pCT (FMpCT), which can achieve prescribed noise targets and may significantly reduce the integral patient dose. We extended an existing Monte Carlo simulation of a prototype pCT scanner to include effects of quenching in the energy detector scintillators and constructed a beam model from experimental tracking data. Simulated noise predictions were compared to experimental data both in the projection domain and in the reconstructed image. Noise prediction agreement between simulated and experimental data in terms of the root-mean-square (RMS) error was better than 7% for a homogeneous water phantom and a sensitometry phantom with tubular inserts. For an anthropomorphic head phantom, modeling the anatomy of a five-year-old child, the RMS error was better than 9% in three evaluated slices. We were able to reproduce subtle noise features near heterogeneities. To demonstrate the feasibility of Monte Carlo simulated noise maps for fluence modulation, we calculated a fluence profile that yields a homogeneous noise level in the image. Unlike for bow-tie filters in x-ray CT this does not require constant fluence at the detector and the shape of the fluence profile is fundamentally different. Using an improved Monte Carlo simulation, we demonstrated the feasibility of using simulated data for accurate image noise prediction for pCT. We believe that the agreement with experimental data is sufficient to enable the future optimization of FMpCT fluence plans to achieve prescribed noise targets in a fluence-modulated acquisition.
Objectives We hypothesized that published performances of algorithms for artificial intelligence (AI) pneumothorax (PTX) detection in chest radiographs (CXRs) do not sufficiently consider the influence of PTX size and confounding effects caused by thoracic tubes (TTs). Therefore, we established a radiologically annotated benchmarking cohort (n = 6446) allowing for a detailed subgroup analysis. Materials and Methods We retrospectively identified 6434 supine CXRs, among them 1652 PTX-positive cases and 4782 PTX-negative cases. Supine CXRs were radiologically annotated for PTX size, PTX location, and inserted TTs. The diagnostic performances of 2 AI algorithms (“AI_CheXNet” [Rajpurkar et al], “AI_1.5” [Guendel et al]), both trained on publicly available datasets with labels obtained from automatic report interpretation, were quantified. The algorithms' discriminative power for PTX detection was quantified by the area under the receiver operating characteristics (AUROC), and significance analysis was based on the corresponding 95% confidence interval. A detailed subgroup analysis was performed to quantify the influence of PTX size and the confounding effects caused by inserted TTs. Results Algorithm performance was quantified as follows: overall performance with AUROCs of 0.704 (AI_1.5) / 0.765 (AI_CheXNet) for unilateral PTXs, AUROCs of 0.666 (AI_1.5) / 0.722 (AI_CheXNet) for unilateral PTXs smaller than 1 cm, and AUROCs of 0.735 (AI_1.5) / 0.818 (AI_CheXNet) for unilateral PTXs larger than 2 cm. Subgroup analysis identified TTs to be strong confounders that significantly influence algorithm performance: Discriminative power is completely eliminated by analyzing PTX-positive cases without TTs referenced to control PTX-negative cases with inserted TTs. Contrarily, AUROCs increased up to 0.875 (AI_CheXNet) for large PTX-positive cases with inserted TTs referenced to control cases without TTs. Conclusions Our detailed subgroup analysis demonstrated that the performance of established AI algorithms for PTX detection trained on public datasets strongly depends on PTX size and is significantly biased by confounding image features, such as inserted TTS. Our established, clinically relevant and radiologically annotated benchmarking cohort might be of great benefit for ongoing algorithm development.
Background: CT colonography does not enable definite differentiation between benign and premalignant colorectal polyps.Purpose: To perform machine learning-based differentiation of benign and premalignant colorectal polyps detected with CT colonography in an average-risk asymptomatic colorectal cancer screening sample with external validation using radiomics. Materials and Methods:In this secondary analysis of a prospective trial, colorectal polyps of all size categories and morphologies were manually segmented on CT colonographic images and were classified as benign (hyperplastic polyp or regular mucosa) or premalignant (adenoma) according to the histopathologic reference standard. Quantitative image features characterizing shape (n = 14), gray level histogram statistics (n = 18), and image texture (n = 68) were extracted from segmentations after applying 22 image filters, resulting in 1906 feature-filter combinations. Based on these features, a random forest classification algorithm was trained to predict the individual polyp character. Diagnostic performance was validated in an external test set. Results:The random forest model was fitted using a training set consisting of 107 colorectal polyps in 63 patients (mean age, 63 years 6 8 [standard deviation]; 40 men) comprising 169 segmentations on CT colonographic images. The external test set included 77 polyps in 59 patients comprising 118 segmentations. Random forest analysis yielded an area under the receiver operating characteristic curve of 0.91 (95% CI: 0.85, 0.96), a sensitivity of 82% (65 of 79) (95% CI: 74%, 91%), and a specificity of 85% (33 of 39) (95% CI: 72%, 95%) in the external test set. In two subgroup analyses of the external test set, the area under the receiver operating characteristic curve was 0.87 in the size category of 6-9 mm and 0.90 in the size category of 10 mm or larger. The most important image feature for decision making (relative importance of 3.7%) was quantifying first-order gray level histogram statistics. Conclusion:In this proof-of-concept study, machine learning-based image analysis enabled noninvasive differentiation of benign and premalignant colorectal polyps with CT colonography.
Objectives Diagnostic accuracy of artificial intelligence (AI) pneumothorax (PTX) detection in chest radiographs (CXR) is limited by the noisy annotation quality of public training data and confounding thoracic tubes (TT). We hypothesize that in-image annotations of the dehiscent visceral pleura for algorithm training boosts algorithm’s performance and suppresses confounders. Methods Our single-center evaluation cohort of 3062 supine CXRs includes 760 PTX-positive cases with radiological annotations of PTX size and inserted TTs. Three step-by-step improved algorithms (differing in algorithm architecture, training data from public datasets/clinical sites, and in-image annotations included in algorithm training) were characterized by area under the receiver operating characteristics (AUROC) in detailed subgroup analyses and referenced to the well-established “CheXNet” algorithm. Results Performances of established algorithms exclusively trained on publicly available data without in-image annotations are limited to AUROCs of 0.778 and strongly biased towards TTs that can completely eliminate algorithm’s discriminative power in individual subgroups. Contrarily, our final “algorithm 2” which was trained on a lower number of images but additionally with in-image annotations of the dehiscent pleura achieved an overall AUROC of 0.877 for unilateral PTX detection with a significantly reduced TT-related confounding bias. Conclusions We demonstrated strong limitations of an established PTX-detecting AI algorithm that can be significantly reduced by designing an AI system capable of learning to both classify and localize PTX. Our results are aimed at drawing attention to the necessity of high-quality in-image localization in training data to reduce the risks of unintentionally biasing the training process of pathology-detecting AI algorithms. Key Points • Established pneumothorax-detecting artificial intelligence algorithms trained on public training data are strongly limited and biased by confounding thoracic tubes. • We used high-quality in-image annotated training data to effectively boost algorithm performance and suppress the impact of confounding thoracic tubes. • Based on our results, we hypothesize that even hidden confounders might be effectively addressed by in-image annotations of pathology-related image features.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.