Pausing of transcribing RNA polymerase is regulated and creates opportunities to control gene expression. Research in metazoans has so far mainly focused on RNA polymerase II (Pol II) promoter-proximal pausing leaving the pervasive nature of pausing and its regulatory potential in mammalian cells unclear. Here, we developed a pause detecting algorithm (PDA) for nucleotide-resolution occupancy data and a new native elongating transcript sequencing approach, termed nested NET-seq, that strongly reduces artifactual peaks commonly misinterpreted as pausing sites. Leveraging PDA and nested NET-seq reveal widespread genome-wide Pol II pausing at single-nucleotide resolution in human cells. Notably, the majority of Pol II pauses occur outside of promoter-proximal gene regions primarily along the gene-body of transcribed genes. Sequence analysis combined with machine learning modeling reveals DNA sequence properties underlying widespread transcriptional pausing including a new pause motif. Interestingly, key sequence determinants of RNA polymerase pausing are conserved between human cells and bacteria. These studies indicate pervasive sequence-induced transcriptional pausing in human cells and the knowledge of exact pause locations implies potential functional roles in gene expression.
We have generated a human subtelomere probe panel, utilizing well characterized CEPH YACs, for the investigation of human chromosome pathology and evolution through fluorescent in situ hybridization (FISH). Region-specific FISH probes will be extremely valuable for detecting cytogenetically cryptic telomere abnormalities. Here, we present the first comparative mapping study (with 29 subtelomere probes and 6 chromosome paints) to the Old World monkey Presbytis cristαtα, followed by hybridizations to the great apes, gorilla and orangutan, when rearrangements were detected. We observed that the position of telomere-associated genomic sequences has been only moderately conserved during primate evolution. YAC 364f9, specific for the subtelomeric long arm of human chromosome 3, contains an evolutionary inversion breakpoint that was involved in independent chromosome rearrangements in P. cristαtα and gorilla.
Exome sequencing has introduced a paradigm shift for the identification of germline variations responsible for Mendelian diseases. However, non‐coding regions, which make up 98% of the genome, cannot be captured. The lack of functional annotation for intronic and intergenic variants makes RNA‐seq a powerful companion diagnostic. Here, we illustrate this point by identifying six patients with a recessive Osteogenesis Imperfecta (OI) and neonatal progeria syndrome. By integrating homozygosity mapping and RNA‐seq, we delineated a deep intronic TAPT1 mutation (c.1237‐52 G>A) that segregated with the disease. Using SI‐NET‐seq, we document that TAPT1's nascent transcription was not affected in patients' fibroblasts, indicating instead that this variant leads to an alteration of pre‐mRNA processing. Predicted to serve as an alternative splicing branchpoint, this mutation enhances TAPT1 exon 12 skipping, creating a protein‐null allele. Additionally, our study reveals dysregulation of pathways involved in collagen and extracellular matrix biology in disease‐relevant cells. Overall, our work highlights the power of transcriptomic approaches in deciphering the repercussions of non‐coding variants, as well as in illuminating the molecular mechanisms of human diseases.
Exome sequencing has introduced a paradigm shift for the identification of germline variations responsible for Mendelian diseases. However, non-coding regions, which make up 98% of the genome, cannot be captured. The lack of functional annotation for intronic and intergenic variants makes RNA-seq a powerful companion diagnostic. Here, we illustrate this point by identifying six patients with a recessive Osteogenesis Imperfecta (OI) and neonatal progeria syndrome. By integrating homozygosity mapping and RNA-seq, we delineated a deep intronic TAPT1 mutation (c.1237-52G>A) that segregated with the disease. Using patients’ fibroblasts, we document that TAPT1’s nascent transcription was not affected, indicating instead that this variant leads to an alteration of pre-mRNA processing. Predicted to serve as an alternative splicing branchpoint, this mutation causes TAPT1 exon 12 skipping, creating a protein-null allele. Additionally, our study reveals dysregulation of pathways involved in collagen and extracellular matrix biology in disease-relevant cells. Overall, our work highlights the power of transcriptomic approaches in deciphering the repercussion of non-coding variants as well as in illuminating the molecular mechanisms and underlying pathways of human diseases.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.