In this small data set, FFDM appears to be slightly more sensitive than digital breast tomosynthesis for the detection of calcification. However, diagnostic performance as measured by area under the curve using BI-RADS was not significantly different. With improvements in processing algorithms and display, digital breast tomosynthesis could potentially be improved for this purpose.
Purpose:To assess interpretation performance and radiation dose when two-dimensional synthesized mammography (SM) images versus standard full-field digital mammography (FFDM) images are used alone or in combination with digital breast tomosynthesis images.
Materials and Methods:A fully crossed, mode-balanced multicase (n = 123), multireader (n = 8), retrospective observer performance study was performed by using deidentified images acquired between 2008 and 2011 with institutional review board approved, HIPAA-compliant protocols, during which each patient signed informed consent. The cohort included 36 cases of biopsy-proven cancer, 35 cases of biopsy-proven benign lesions, and 52 normal or benign cases (Breast Imaging Reporting and Data System [BI-RADS] score of 1 or 2) with negative 1-year follow-up results. Accuracy of sequentially reported probability of malignancy ratings and seven-category forced BI-RADS ratings was evaluated by using areas under the receiver operating characteristic curve (AUCs) in the random-reader analysis.
Results:Probability of malignancy-based mean AUCs for SM and FFDM images alone was 0.894 and 0.889, respectively (difference, 20.005; 95% confidence interval [CI]: 20.062, 0.054; P = .85). Mean AUC for SM with tomosynthesis and FFDM with tomosynthesis was 0.916 and 0.939, respectively (difference, 0.023; 95% CI: 20.011, 0.057; P = .19). In terms of the reader-specific AUCs, five readers performed better with SM alone versus FFDM alone, and all eight readers performed better with combined FFDM and tomosynthesis (absolute differences from 0.003 to 0.052). Similar results were obtained by using a nonparametric analysis of forced BI-RADS ratings.
Rationale and Objectives
Retrospectively compare interpretive performance of synthetically reconstructed two-dimensional images in combination with DBT versus FFDM plus DBT.
Materials and Methods
Ten radiologists trained in reading tomosynthesis examinations interpreted retrospectively, under two modes, 114 mammograms. One mode included the directly acquired FFDM combined with DBT and the other, synthetically reconstructed projection images combined with DBT. The reconstructed images do not require additional radiation exposure. We compared the two modes with respect to “sensitivity”, namely recommendation to recall a breast with either a pathology proven cancer (n=48) or a high risk lesion (n=6); and “specificity”, namely no recommendation to recall a breast not depicting an abnormality (n=144) or depicting only benign abnormalities (n=30).
Results
The average sensitivity for FFDM with DBT was 0.826 versus 0.772 for synthetic FFDM with DBT (difference=0.054, p=0.017 and p=0.053 for fixed and random reader effect, respectively). The fraction of breasts with no, or benign, abnormalities recommended to be recalled were virtually the same: 0.298 and 0.297 for the two modalities, respectively (95% confidence intervals for the difference CI= −0.028, 0.036 and CI = −0.070, 0.066 for fixed and random reader effects, correspondingly). Sixteen additional clusters of micro-calcifications (“positive” breasts) were missed by all readers combined when interpreting the mode with synthesized images versus FFDM.
Conclusion
Lower sensitivity with comparable specificity was observed with the tested version of synthetically generated images versus FFDM, both combined with DBT. Improved synthesized images with experimentally verified acceptable diagnostic quality will be needed to eliminate double exposure during DBT based screening.
Performance of CAD systems for mass detection at mammography varies significantly, depending on examination and system used. Actual performance of all systems in clinical environment can be improved.
DBT may be an alternative to obtaining additional mammographic views in most but not all cases of patients with a lesion that is not solely calcifications. In a fraction of cases, the use of DBT may eliminate the need for ultrasound.
Rationale and Objectives: The clinical utility of interactive computer-aided diagnosis (ICAD) systems depends on clinical relevance and visual similarity between the queried breast lesions and the ICAD-selected reference regions. The objective of this study is to develop and test a new ICAD scheme that aims improve visual similarity of ICAD-selected reference regions.
Materials and Methods:A large and diverse reference library involving 3000 regions of interests was established. For each queried breast mass lesion by the observer, the ICAD scheme segments the lesion, classifies its boundary spiculation level, and computes 14 image features representing the segmented lesion and its surrounding tissue background. A conditioned k-nearest neighbor algorithm is applied to select a set of the 25 most "similar" lesions from the reference library. After computing the mutual information between the queried lesion and each of these initially selected 25 lesions, the scheme displays the six reference lesions with the highest mutual information scores. To evaluate the automated selection process of the six "visually similar" lesions to the queried lesion, we conducted a two-alternative forced-choice observer preference study using 85 queried mass lesions. Two sets of reference lesions selected by one new automated ICAD scheme and the other previously reported scheme using a subjective rating method were randomly displayed on the left and right side of the queried lesion. Nine observers were asked to decide for each of the 85 queried lesions which one of the two reference sets was "more visually similar" to the queried lesion.
Results:In classification of mass boundary spiculation levels, the overall agreement rate between the automated scheme and an observer is 58.8% (Kappa = 0.31). In observer preference study, the nine observers preferred on average the reference lesion sets selected by the automated scheme as being more visually similar than the set selected by the subjective rating approach in 53.2% of the queried lesions. The results were not significantly different for the two methods (p = 0.128).
Conclusion:This study suggests that using the new automated ICAD scheme, the inter-observer variability related issues can thus be avoided. Furthermore, the new scheme maintains the similar performance level as the previous scheme using the subjective rating method that can select reference sets that are significantly more visually similar (p < 0.05) than when using traditional ICAD schemes in which the mass boundary spiculation levels are not accurately detected and quantified.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.