Motivation
RNA sequencing of tumor tissue is typically only used to measure gene expression. Here, we present a statistical approach that leverages existing RNA sequencing data (RNA-seq) to also detect somatic copy number alterations (SCNAs), a pervasive phenomenon in human cancers, without a need to sequence the corresponding DNA.
Results
We present an analysis of 4942 participant samples from 28 cancers in The Cancer Genome Atlas, demonstrating robust detection of SCNAs from RNA-seq. Using genotype imputation and haplotype information, our RNA-based method had a median sensitivity of 85% to detect SCNAs defined by DNA analysis, at high specificity (∼95%). As an example of translational potential, we successfully replicated SCNA features associated with breast cancer subtypes. Our results credential haplotype-based inference based on RNA-seq to detect SCNAs in clinical and population-based settings.
Availability of data and materials
The analyses presented use the data publicly available from The Cancer Genome Atlas Research Network (http://cancergenome.nih.gov/). See Methods for details regarding data downloads. hapLOHseq software is freely available under The MIT license and can be downloaded from http://scheet.org/software.html.
Supplementary information
Supplementary data are available at Bioinformatics online.