These data suggest consideration of a more conservative approach for most patients with thyroid nodules that are cytologically indeterminate on fine-needle aspiration and benign according to gene-expression classifier results. (Funded by Veracyte.).
The genomic sequencing classifier demonstrates high sensitivity and accuracy for identifying benign nodules. Its 36% increase in specificity compared with the gene expression classifier potentially increases the number of patients with benign nodules who can safely avoid unnecessary diagnostic surgery.
Introduction: The Afirma® Xpression Atlas (XA) detects gene variants and fusions in thyroid nodule FNA samples from a curated panel of 511 genes using whole-transcriptome RNA-sequencing. Its intended use is among cytologically indeterminate nodules that are Afirma GSC suspicious, Bethesda V/VI nodules, or known thyroid metastases. Here we report its analytical and clinical validation.Methods: DNA and RNA were purified from the same sample across 943 blinded FNAs and compared by multiple methodologies, including whole-transcriptome RNA-seq, targeted RNA-seq, and targeted DNA-seq. An additional 695 blinded FNAs were used to define performance for fusions between whole-transcriptome RNA-seq and targeted RNA-seq. We quantified the reproducibility of the whole-transcriptome RNA-seq assay across laboratories and reagent lots. Finally, variants and fusions were compared to histopathology results.Results: Of variants detected in DNA at 5 or 20% variant allele frequency, 74 and 88% were also detected by XA, respectively. XA variant detection was 89% when compared to an alternative RNA-based detection method. Low levels of expression of the DNA allele carrying the variant, compared with the wild-type allele, was found in some variants not detected by XA. 82% of gene fusions detected in a targeted RNA fusion assay were detected by XA. Conversely, nearly all variants or fusions detected by XA were confirmed by an alternative method. Analytical validation studies demonstrated high intra-plate reproducibility (89%-94%), inter-plate reproducibility (86–91%), and inter-lab accuracy (90%). Multiple variants and fusions previously described across the spectrum of thyroid cancers were identified by XA, including some with approved or investigational targeted therapies. Among 190 Bethesda III/IV nodules, the sensitivity of XA as a standalone test was 49%.Conclusion: When the Afirma Genomic Sequencing Classifier (GSC) is used first among Bethesda III/IV nodules as a rule-out test, XA supplements genomic insight among those that are GSC suspicious. Our data clinically and analytically validate XA for use among GSC suspicious, or Bethesda V/VI nodules. Genomic information provided by XA may inform clinical decision-making with precision medicine insights across a broad range of FNA sample types encountered in the care of patients with thyroid nodules and thyroid cancer.
We demonstrate proof of principle that genomic analysis and machine learning improves the utility of TBB for the diagnosis of UIP, with greater sensitivity and specificity than pathology in TBB alone. Combining multiple individual subject samples results in increased test accuracy over single sample testing. This approach requires validation in an independent cohort of subjects before application in the clinic.
BRAF mutations are uncommon in nodules with atypia of undetermined significance or follicular lesion of undetermined significance or follicular neoplasm or suspicious for follicular neoplasm cytology. Most cytologically indeterminate nodules that proved to be malignant were also BRAF-, and all nodules that were false-negative by GEC were also BRAF-. Similarly, all BRAF+ specimens were also GEC Suspicious. Neither GEC test sensitivity nor specificity was improved by addition of BRAF mutation testing.
Analytical sensitivity, analytical specificity, robustness, and quality control of the GEC were successfully verified, indicating its suitability for clinical use.
Background
Identification of Hürthle cell cancers by non-operative fine-needle aspiration biopsy (FNAB) of thyroid nodules is challenging. Resultingly, non-cancerous Hürthle lesions were conventionally distinguished from Hürthle cell cancers by histopathological examination of tissue following surgical resection. Reliance on histopathological evaluation requires patients to undergo surgery to obtain a diagnosis despite most being non-cancerous. It is highly desirable to avoid surgery and to provide accurate classification of benignity versus malignancy from FNAB preoperatively. In our first-generation algorithm, Gene Expression Classifier (GEC), we achieved this goal by using machine learning (ML) on gene expression features. The classifier is sensitive, but not specific due in part to the presence of non-neoplastic benign Hürthle cells in many FNAB.
Results
We sought to overcome this low-specificity limitation by expanding the feature set for ML using next-generation whole transcriptome RNA sequencing and called the improved algorithm the Genomic Sequencing Classifier (GSC). The Hürthle identification leverages mitochondrial expression and we developed novel feature extraction mechanisms to measure chromosomal and genomic level loss-of-heterozygosity (LOH) for the algorithm. Additionally, we developed a multi-layered system of cascading classifiers to sequentially triage Hürthle cell-containing FNAB, including: 1. presence of Hürthle cells, 2. presence of neoplastic Hürthle cells, and 3. presence of benign Hürthle cells. The final Hürthle cell Index utilizes 1048 nuclear and mitochondrial genes; and Hürthle cell Neoplasm Index leverages LOH features as well as 2041 genes. Both indices are Support Vector Machine (SVM) based. The third classifier, the GSC Benign/Suspicious classifier, utilizes 1115 core genes and is an ensemble classifier incorporating 12 individual models.
Conclusions
The accurate algorithmic depiction of this complex biological system among Hürthle subtypes results in a dramatic improvement of classification performance; specificity among Hürthle cell neoplasms increases from 11.8% with the GEC to 58.8% with the GSC, while maintaining the same sensitivity of 89%.
Electronic supplementary material
The online version of this article (10.1186/s12918-019-0693-z) contains supplementary material, which is available to authorized users.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.