Abstract:Background: Metastatic epidural spinal cord compression (MESCC) is a disastrous complication of advanced malignancy. Deep learning (DL) models for automatic MESCC classification on staging CT were developed to aid earlier diagnosis. Methods: This retrospective study included 444 CT staging studies from 185 patients with suspected MESCC who underwent MRI spine studies within 60 days of the CT studies. The DL model training/validation dataset consisted of 316/358 (88%) and the test set of 42/358 (12%) CT studies… Show more
“…DL tools for spine conditions on cross-sectional imaging include MRI assessment of spinal canal stenosis ( 12 , 31 ), radiotherapy planning for vertebral tumours to exclude organs at risk including the spinal cord ( 32 ), and for differentiating benign versus malignant spinal tumours ( 33 ). DL algorithms for automated Bilsky grading of MSCC have also been explored on MRI ( 34 ) and CT ( 15 ). The preliminary DL algorithms for MSCC grading on CT showed kappas of 0.873-0.911 (p<0.001) on an internal test set and were used as the basis for this study ( 15 ).…”
Section: Discussionmentioning
confidence: 99%
“…We developed a consecutive region of interest (ROI) detector and classification/grading deep learning pipeline following the study by Hallinan and Zhu et al., 2022 ( 15 ). First, we build Faster R-CNN ( 19 ) combined with ResNet50 ( 20 ) as its backbone network architecture, which consists of 50 convolutional layers with ReLU ( 21 ) activation function for non-linearity.…”
Section: Methodsmentioning
confidence: 99%
“…The fine-tuned model is used for ROI detection. Second, the combined window learning (CWL) ( 15 ) method with average fusion is adopted for ROI classification with multi-window information due to its robustness to missing input window information and reduced model size. Specifically, we build the CWL model with a convolutional prototypical network ( 23 ) using ResNeXt50 ( 24 ) as its backbone network architecture.…”
Section: Methodsmentioning
confidence: 99%
“…Most recently, Hallinan et al. (2022) trained and tested preliminary CT deep learning algorithms for classification of MSCC ( 15 ). The DL algorithms showed good interrater variability compared to an expert radiologist (reference standard) with kappa values between 0.873 to 0.911 (p<0.001).…”
IntroductionMetastatic spinal cord compression (MSCC) is a disastrous complication of advanced malignancy. A deep learning (DL) algorithm for MSCC classification on CT could expedite timely diagnosis. In this study, we externally test a DL algorithm for MSCC classification on CT and compare with radiologist assessment.MethodsRetrospective collection of CT and corresponding MRI from patients with suspected MSCC was conducted from September 2007 to September 2020. Exclusion criteria were scans with instrumentation, no intravenous contrast, motion artefacts and non-thoracic coverage. Internal CT dataset split was 84% for training/validation and 16% for testing. An external test set was also utilised. Internal training/validation sets were labelled by radiologists with spine imaging specialization (6 and 11-years post-board certification) and were used to further develop a DL algorithm for MSCC classification. The spine imaging specialist (11-years expertise) labelled the test sets (reference standard). For evaluation of DL algorithm performance, internal and external test data were independently reviewed by four radiologists: two spine specialists (Rad1 and Rad2, 7 and 5-years post-board certification, respectively) and two oncological imaging specialists (Rad3 and Rad4, 3 and 5-years post-board certification, respectively). DL model performance was also compared against the CT report issued by the radiologist in a real clinical setting. Inter-rater agreement (Gwet’s kappa) and sensitivity/specificity/AUCs were calculated.ResultsOverall, 420 CT scans were evaluated (225 patients, mean age=60 ± 11.9[SD]); 354(84%) CTs for training/validation and 66(16%) CTs for internal testing. The DL algorithm showed high inter-rater agreement for three-class MSCC grading with kappas of 0.872 (p<0.001) and 0.844 (p<0.001) on internal and external testing, respectively. On internal testing DL algorithm inter-rater agreement (κ=0.872) was superior to Rad 2 (κ=0.795) and Rad 3 (κ=0.724) (both p<0.001). DL algorithm kappa of 0.844 on external testing was superior to Rad 3 (κ=0.721) (p<0.001). CT report classification of high-grade MSCC disease was poor with only slight inter-rater agreement (κ=0.027) and low sensitivity (44.0), relative to the DL algorithm with almost-perfect inter-rater agreement (κ=0.813) and high sensitivity (94.0) (p<0.001).ConclusionDeep learning algorithm for metastatic spinal cord compression on CT showed superior performance to the CT report issued by experienced radiologists and could aid earlier diagnosis.
“…DL tools for spine conditions on cross-sectional imaging include MRI assessment of spinal canal stenosis ( 12 , 31 ), radiotherapy planning for vertebral tumours to exclude organs at risk including the spinal cord ( 32 ), and for differentiating benign versus malignant spinal tumours ( 33 ). DL algorithms for automated Bilsky grading of MSCC have also been explored on MRI ( 34 ) and CT ( 15 ). The preliminary DL algorithms for MSCC grading on CT showed kappas of 0.873-0.911 (p<0.001) on an internal test set and were used as the basis for this study ( 15 ).…”
Section: Discussionmentioning
confidence: 99%
“…We developed a consecutive region of interest (ROI) detector and classification/grading deep learning pipeline following the study by Hallinan and Zhu et al., 2022 ( 15 ). First, we build Faster R-CNN ( 19 ) combined with ResNet50 ( 20 ) as its backbone network architecture, which consists of 50 convolutional layers with ReLU ( 21 ) activation function for non-linearity.…”
Section: Methodsmentioning
confidence: 99%
“…The fine-tuned model is used for ROI detection. Second, the combined window learning (CWL) ( 15 ) method with average fusion is adopted for ROI classification with multi-window information due to its robustness to missing input window information and reduced model size. Specifically, we build the CWL model with a convolutional prototypical network ( 23 ) using ResNeXt50 ( 24 ) as its backbone network architecture.…”
Section: Methodsmentioning
confidence: 99%
“…Most recently, Hallinan et al. (2022) trained and tested preliminary CT deep learning algorithms for classification of MSCC ( 15 ). The DL algorithms showed good interrater variability compared to an expert radiologist (reference standard) with kappa values between 0.873 to 0.911 (p<0.001).…”
IntroductionMetastatic spinal cord compression (MSCC) is a disastrous complication of advanced malignancy. A deep learning (DL) algorithm for MSCC classification on CT could expedite timely diagnosis. In this study, we externally test a DL algorithm for MSCC classification on CT and compare with radiologist assessment.MethodsRetrospective collection of CT and corresponding MRI from patients with suspected MSCC was conducted from September 2007 to September 2020. Exclusion criteria were scans with instrumentation, no intravenous contrast, motion artefacts and non-thoracic coverage. Internal CT dataset split was 84% for training/validation and 16% for testing. An external test set was also utilised. Internal training/validation sets were labelled by radiologists with spine imaging specialization (6 and 11-years post-board certification) and were used to further develop a DL algorithm for MSCC classification. The spine imaging specialist (11-years expertise) labelled the test sets (reference standard). For evaluation of DL algorithm performance, internal and external test data were independently reviewed by four radiologists: two spine specialists (Rad1 and Rad2, 7 and 5-years post-board certification, respectively) and two oncological imaging specialists (Rad3 and Rad4, 3 and 5-years post-board certification, respectively). DL model performance was also compared against the CT report issued by the radiologist in a real clinical setting. Inter-rater agreement (Gwet’s kappa) and sensitivity/specificity/AUCs were calculated.ResultsOverall, 420 CT scans were evaluated (225 patients, mean age=60 ± 11.9[SD]); 354(84%) CTs for training/validation and 66(16%) CTs for internal testing. The DL algorithm showed high inter-rater agreement for three-class MSCC grading with kappas of 0.872 (p<0.001) and 0.844 (p<0.001) on internal and external testing, respectively. On internal testing DL algorithm inter-rater agreement (κ=0.872) was superior to Rad 2 (κ=0.795) and Rad 3 (κ=0.724) (both p<0.001). DL algorithm kappa of 0.844 on external testing was superior to Rad 3 (κ=0.721) (p<0.001). CT report classification of high-grade MSCC disease was poor with only slight inter-rater agreement (κ=0.027) and low sensitivity (44.0), relative to the DL algorithm with almost-perfect inter-rater agreement (κ=0.813) and high sensitivity (94.0) (p<0.001).ConclusionDeep learning algorithm for metastatic spinal cord compression on CT showed superior performance to the CT report issued by experienced radiologists and could aid earlier diagnosis.
“…Radiologists using dedicated CT windows showed high interobserver agreement (kappa values ranging from 0.866 to 0.947) and sensitivities (ranging from 91.51 to 98.11) for recognition of any grade of MESCC. The original radiology reports showed relatively reduced performance (kappa = 0.095, sensitivity = 44.34), suggesting that dedicated training sessions for assessment of MESCC on CT or even the use of a deep learning algorithm for automatic classification of MESCC could improve patient care [ 31 , 32 ]. Future prospective studies are planned to assess the accuracy of MESCC detection on staging CT across a more extensive range of oncological patients.…”
Background: Early diagnosis of metastatic epidural spinal cord compression (MESCC) is vital to expedite therapy and prevent paralysis. Staging CT is performed routinely in cancer patients and presents an opportunity for earlier diagnosis. Methods: This retrospective study included 123 CT scans from 101 patients who underwent spine MRI within 30 days, excluding 549 CT scans from 216 patients due to CT performed post-MRI, non-contrast CT, or a gap greater than 30 days between modalities. Reference standard MESCC gradings on CT were provided in consensus via two spine radiologists (11 and 7 years of experience) analyzing the MRI scans. CT scans were labeled using the original reports and by three radiologists (3, 13, and 14 years of experience) using dedicated CT windowing. Results: For normal/none versus low/high-grade MESCC per CT scan, all radiologists demonstrated almost perfect agreement with kappa values ranging from 0.866 (95% CI 0.787–0.945) to 0.947 (95% CI 0.899–0.995), compared to slight agreement for the reports (kappa = 0.095, 95%CI −0.098–0.287). Radiologists also showed high sensitivities ranging from 91.51 (95% CI 84.49–96.04) to 98.11 (95% CI 93.35–99.77), compared to 44.34 (95% CI 34.69–54.31) for the reports. Conclusion: Dedicated radiologist review for MESCC on CT showed high interobserver agreement and sensitivity compared to the current standard of care.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.