We are developing a computer-aided diagnosis (CAD) system to classify malignant and benign lung nodules found on CT scans. A fully automated system was designed to segment the nodule from its surrounding structured background in a local volume of interest (VOI) and to extract image features for classification. Image segmentation was performed with a three-dimensional (3D) active contour (AC) method. A data set of 96 lung nodules (44 malignant, 52 benign) from 58 patients was used in this study. The 3D AC model is based on two-dimensional AC with the addition of three new energy components to take advantage of 3D information: (1) 3D gradient, which guides the active contour to seek the object surface, (2) 3D curvature, which imposes a smoothness constraint in the z direction, and (3) mask energy, which penalizes contours that grow beyond the pleura or thoracic wall. The search for the best energy weights in the 3D AC model was guided by a simplex optimization method. Morphological and gray-level features were extracted from the segmented nodule. The rubber band straightening transform (RBST) was applied to the shell of voxels surrounding the nodule. Texture features based on run-length statistics were extracted from the RBST image. A linear discriminant analysis classifier with stepwise feature selection was designed using a second simplex optimization to select the most effective features. Leave-one-case-out resampling was used to train and test the CAD system. The system achieved a test area under the receiver operating characteristic curve (A z ) of 0.83±0.04. Our preliminary results indicate that use of the 3D AC model and the 3D texture features surrounding the nodule is a promising approach to the segmentation and classification of lung nodules with CAD. The segmentation performance of the 3D AC model trained with our data set was evaluated with 23 nodules available in the Lung Image Database Consortium (LIDC). The lung nodule volumes segmented by the 3D AC model for best classification were generally larger than those outlined by the LIDC radiologists using visual judgment of nodule boundaries.
The purpose of this work is to develop a computer-aided diagnosis ͑CAD͒ system to differentiate malignant and benign lung nodules on CT scans. A fully automated system was designed to segment the nodule from its surrounding structured background in a local volume of interest ͑VOI͒ and to extract image features for classification. Image segmentation was performed with a 3D active contour method. The initial contour was obtained as the boundary of a binary object generated by k-means clustering within the VOI and smoothed by morphological opening. A data set of 256 lung nodules ͑124 malignant and 132 benign͒ from 152 patients was used in this study. In addition to morphological and texture features, the authors designed new nodule surface features to characterize the lung nodule surface smoothness and shape irregularity. The effects of two demographic features, age and gender, as adjunct to the image features were also investigated. A linear discriminant analysis ͑LDA͒ classifier built with features from stepwise feature selection was trained using simplex optimization to select the most effective features. A two-loop leave-one-out resampling scheme was developed to reduce the optimistic bias in estimating the test performance of the CAD system. The area under the receiver operating characteristic curve, A z , for the test cases improved significantly ͑p Ͻ 0.05͒ from 0.821Ϯ 0.026 to 0.857Ϯ 0.023 when the newly developed image features were included with the original morphological and texture features. A similar experiment performed on the data set restricted to primary cancers and benign nodules, excluding the metastatic cancers, also resulted in an improved test A z , though the improvement did not reach statistical significance ͑p = 0.07͒. The two demographic features did not significantly affect the performance of the CAD system ͑p Ͼ 0.05͒ when they were added to the feature space containing the morphological, texture, and new gradient field and radius features. To investigate if a support vector machine ͑SVM͒ classifier can achieve improved performance over the LDA classifier, we compared the performance of the LDA and SVMs with various kernels and parameters. Principal component analysis was used to reduce the dimensionality of the feature space for both the LDA and the SVM classifiers. When the number of selected principal components was varied, the highest test A z among the SVMs of various kernels and parameters was slightly higher than that of the LDA in one-loop leave-one-case-out resampling. However, no SVM with fixed architecture consistently performed better than the LDA in the range of principal components selected. This study demonstrated that the authors' proposed segmentation and feature extraction techniques are promising for classifying lung nodules on CT images.
Purpose:The small number of samples available for training and testing is often the limiting factor in finding the most effective features and designing an optimal computer-aided diagnosis ͑CAD͒ system. Training on a limited set of samples introduces bias and variance in the performance of a CAD system relative to that trained with an infinite sample size. In this work, the authors conducted a simulation study to evaluate the performances of various combinations of classifiers and feature selection techniques and their dependence on the class distribution, dimensionality, and the training sample size. The understanding of these relationships will facilitate development of effective CAD systems under the constraint of limited available samples. Methods: Three feature selection techniques, the stepwise feature selection ͑SFS͒, sequential floating forward search ͑SFFS͒, and principal component analysis ͑PCA͒, and two commonly used classifiers, Fisher's linear discriminant analysis ͑LDA͒ and support vector machine ͑SVM͒, were investigated. Samples were drawn from multidimensional feature spaces of multivariate Gaussian distributions with equal or unequal covariance matrices and unequal means, and with equal covariance matrices and unequal means estimated from a clinical data set. Classifier performance was quantified by the area under the receiver operating characteristic curve A z . The mean A z values obtained by resubstitution and hold-out methods were evaluated for training sample sizes ranging from 15 to 100 per class. The number of simulated features available for selection was chosen to be 50, 100, and 200. Results: It was found that the relative performance of the different combinations of classifier and feature selection method depends on the feature space distributions, the dimensionality, and the available training sample sizes. The LDA and SVM with radial kernel performed similarly for most of the conditions evaluated in this study, although the SVM classifier showed a slightly higher hold-out performance than LDA for some conditions and vice versa for other conditions. PCA was comparable to or better than SFS and SFFS for LDA at small samples sizes, but inferior for SVM with polynomial kernel. For the class distributions simulated from clinical data, PCA did not show advantages over the other two feature selection methods. Under this condition, the SVM with radial kernel performed better than the LDA when few training samples were available, while LDA performed better when a large number of training samples were available. Conclusions: None of the investigated feature selection-classifier combinations provided consistently superior performance under the studied conditions for different sample sizes and feature space distributions. In general, the SFFS method was comparable to the SFS method while PCA may have an advantage for Gaussian feature spaces with unequal covariance matrices. The performance of the SVM with radial kernel was better than, or comparable to, that of the SVM with polynomial kernel un...
The purpose of this study is to investigate the effects of CT scanning and reconstruction parameters on automated segmentation and volumetric measurements of nodules in CT images. Phantom nodules of known sizes were used so that segmentation accuracy could be quantified in comparison to ground-truth volumes. Spherical nodules having 4.8, 9.5, and 16 mm diameters and 50 and 100 mg/cc calcium contents were embedded in lung-tissue-simulating foam which was inserted in the thoracic cavity of a chest section phantom. CT scans of the phantom were acquired with a 16-slice scanner at various tube currents, pitches, fields-of-view, slice intervals, and slice thicknesses. Scans were also taken using identical techniques either within the same day or five months apart for study of reproducibility. The phantom nodules were segmented with a 3-dimensional active contour (3DAC) model that we previously developed for use on patient nodules. The percentage volume errors relative to the ground-truth volumes were estimated under the various imaging conditions. There was no statistically significant difference in volume error for repeated CT scans or scans taken with techniques where only pitch, field-of-view, or tube current (mA) was changed. However, slice thickness significantly (p < 0.05) affected volume error. Therefore, to evaluate nodule growth, consistent imaging conditions and high resolution should be used for acquisition of the serial CT scans, especially for smaller nodules. Understanding the effects of scanning and reconstruction parameters on volume measurements by 3DAC allows better interpretation of data and assessment of growth. Tracking nodule growth with computerized segmentation methods would reduce inter-and intraobserver variabilities.
A study was performed to determine the accuracies and reproducibilities of the CT numbers of simulated lung nodules imaged with multi-detector CT scanners. The nodules were simulated by spherical balls of three diameters (4.8, 9.5, and 16 mm) and two compositions (50 and 100 mg/cc CaCO 3 in water-equivalent plastic). All were scanned in a liquid-water-filled container at the center of a water-equivalent-plastic phantom and in air cavities within the same phantom using GE multidetector CT scanners. The nodules were also scanned within simulated lung regions in an anthropomorphic thorax section phantom that was bolused on both sides with water-equivalent slabs. Results were compared for three scanning protocols-the protocol for the National Lung Screening Trial (NLST), the protocol for the Lung Tissue Research Consortium (LTRC) study, and a high resolution (small pitch, thin slice and small scan interval) higher dose "gold standard" protocol. Scans were repeated three times with each protocol to assess reproducibility. The CT numbers of the nodules in water were found to be nearly independent of nodule size. However, the presence and the size of an air cavity surrounding a nodule had a significant effect (e.g., the CT number of a 50 mg/cc nodule was 64 HU in water, 37 HU in a 1.8 cm diameter air cavity, and 19 HU in a 4.4 cm diameter air cavity). This variability of CT number with size of air cavity may affect the results of the LTRC study in which patients are scanned at both full inspiration and full expiration. The CT numbers of the 9.5 and 16 mm diameter nodules within the anthropomorphic phantom were highly reproducible (average standard deviations of 2 HU or less) for all protocols. On the other hand, both accuracy and reproducibility were significantly degraded for the 4.8 mm diameter nodules, especially for the NLST (2.5 mm thickness, 2 mm slice interval) technique. Use of thinner slice (1.25 mm) and slice interval (1.25 mm) scans that can be reconstructed retrospectively from the multi-detector helical CT projection data of the standard NLST protocol yield CT numbers for the 4.8 mm diameter nodules that are more accurate and reproducible than those of the standard NLST technique. In general, the CT numbers of the nodules were found to be lower at positions near the centers of the lungs and near the spine, which is probably due to increased beam hardening in those regions. Also, larger nodules were found to have higher CT numbers than smaller nodules, consistent with results obtained on early single slice GE CT scanners. Until manufacturers develop quantitative CT scanners with improved x-ray beam hardening and scatter corrections, it is recommended that reference phantoms be employed to more accurately assess the calcium contents of patient lung nodules in screening and tissue characterization studies and in eventual computer-aided detection and diagnosis applications.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.