Johan Staaf scite author profile

Multigene assays for molecular subtypes and biomarkers can aid management of early invasive breast cancer. Using RNA-sequencing we aimed to develop single-sample predictor (SSP) models for clinical markers, subtypes, and risk of recurrence (ROR). A cohort of 7743 patients was divided into training and test set. We trained SSPs for subtypes and ROR assigned by nearest-centroid (NC) methods and SSPs for biomarkers from histopathology. Classifications were compared with Prosigna in two external cohorts (ABiM, n = 100 and OSLO2-EMIT0, n = 103). Prognostic value was assessed using distant recurrence-free interval. Agreement between SSP and NC for PAM50 (five subtypes) was high (85%, Kappa = 0.78) for Subtype (four subtypes) very high (90%, Kappa = 0.84) and for ROR risk category high (84%, Kappa = 0.75, weighted Kappa = 0.90). Prognostic value was assessed as equivalent and clinically relevant. Agreement with histopathology was very high or high for receptor status, while moderate for Ki67 status and poor for Nottingham histological grade. SSP and Prosigna concordance was high for subtype (OSLO-EMIT0 83%, Kappa = 0.73 and ABiM 80%, Kappa = 0.72) and moderate and high for ROR risk category (68 and 84%, Kappa = 0.50 and 0.70, weighted Kappa = 0.70 and 0.78). Pooled concordance for emulated treatment recommendation dichotomized for chemotherapy was high (85%, Kappa = 0.66). Retrospective evaluation suggested that SSP application could change chemotherapy recommendations for up to 17% of postmenopausal ER+/HER2-/N0 patients with balanced escalation and de-escalation. Results suggest that NC and SSP models are interchangeable on a group-level and nearly so on a patient level and that SSP models can be derived to closely match clinical tests.

show abstract

Random Forest Modelling of High-Dimensional Mixed-Type Data for Breast Cancer Classification

Quist

Taylor

Staaf

et al. 2021

Cancers

View full text Add to dashboard Cite

Advances in high-throughput technologies encourage the generation of large amounts of multiomics data to investigate complex diseases, including breast cancer. Given that the aetiologies of such diseases extend beyond a single biological entity, and that essential biological information can be carried by all data regardless of data type, integrative analyses are needed to identify clinically relevant patterns. To facilitate such analyses, we present a permutation-based framework for random forest methods which simultaneously allows the unbiased integration of mixed-type data and assessment of relative feature importance. Through simulation studies and machine learning datasets, the performance of the approach was evaluated. The results showed minimal multicollinearity and limited overfitting. To further assess the performance, the permutation-based framework was applied to high-dimensional mixed-type data from two independent breast cancer cohorts. Reproducibility and robustness of our approach was demonstrated by the concordance in relative feature importance between the cohorts, along with consistencies in clustering profiles. One of the identified clusters was shown to be prognostic for clinical outcome after standard-of-care adjuvant chemotherapy and outperformed current intrinsic molecular breast cancer classifications.

show abstract

Methylation Patterns and Chromatin Accessibility in Neuroendocrine Lung Cancer

Arbajian

Aine

Karlsson

et al. 2020

Cancers

View full text Add to dashboard Cite

Lung cancer is the worldwide leading cause of death from cancer. Epigenetic modifications such as methylation and changes in chromatin accessibility are major gene regulatory mechanisms involved in tumorigenesis and cellular lineage commitment. We aimed to characterize these processes in the context of neuroendocrine (NE) lung cancer. Illumina 450K DNA methylation data were collected for 1407 lung cancers including 27 NE tumors. NE differentially methylated regions (NE-DMRs) were identified and correlated with gene expression data for 151 lung cancers and 31 human tissue entities from the Genotype-Tissue Expression (GTEx) consortium. Assay for transposase-accessible chromatin sequencing (ATAC-seq) and RNA sequencing (RNA-seq) were performed on eight lung cancer cell lines, including three NE cell lines, to identify neuroendocrine specific gene regulatory elements. We identified DMRs with methylation patterns associated with differential gene expression and an NE tumor phenotype. DMR-associated genes could further be split into six functional modules, including one highly specific gene module for NE lung cancer showing high expression in both normal and malignant brain tissue. The regulatory potential of NE-DMRs was further validated in vitro using paired ATAC- and RNA-seq and revealed both proximal and distal regulatory elements of canonical NE-marker genes such as CHGA, NCAM1, INSM1, as well as a number of novel candidate markers of NE lung cancer. Using multilevel genomic analyses of both tumor bulk tissue and lung cancer cell lines, we identified a large catalogue of gene regulatory elements related to the NE phenotype of lung cancer.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Johan Staaf

RNA sequencing-based single sample predictors of molecular subtype and risk of recurrence for clinical assessment of early-stage breast cancer

Random Forest Modelling of High-Dimensional Mixed-Type Data for Breast Cancer Classification

Methylation Patterns and Chromatin Accessibility in Neuroendocrine Lung Cancer

Contact Info

Product

Resources

About