High throughput cDNA sequencing technologies have advanced our understanding of transcriptome complexity and regulation. However, these methods lose information contained in biological RNA because the copied reads are often short and because modifications are not retained. We address these limitations using a native poly(A) RNA sequencing strategy developed by Oxford Nanopore Technologies (ONT). Our study generated 9.9 million aligned sequence reads for the human cell line GM12878, using thirty MinION flow cells at six institutions. These native RNA reads had a median length of 771 bases, and a maximum aligned length of over 21,000 bases. Mitochondrial poly(A) reads provided an internal measure of read length quality. We combined these long nanopore reads with higher accuracy short-reads and annotated GM12878 promoter regions, to identify 33,984 plausible RNA isoforms. We describe strategies for assessing 3′ poly(A) tail length, base modifications, and transcript haplotypes.
Nanopore sequencers can be used to selectively sequence certain DNA molecules in a pool by reversing the voltage across individual nanopores to reject specific sequences, enabling enrichment and depletion to address biological questions. Previously, we achieved this using dynamic time warping to map the signal to a reference genome, but the method required substantial computational resources and did not scale to gigabase-sized references. Here we overcome this limitation by using GPU base calling. We show enrichment of specific chromosomes from the human genome and of low-abundance organisms in mixed populations without a priori knowledge of sample composition. Finally, we enrich targeted panels comprising 25,600 exons from 10,000 human genes and 717 genes implicated in cancer, identifying PML-RARA fusions in the NB4 cell line in <15 hours sequencing. These methods can be used to efficiently screen any target panel of genes without specialised sample preparation using any computer and suitable GPU. Our toolkit, readfish, is available at https://www.github.com/looselab/readfish .
Motivation The Oxford Nanopore Technologies (ONT) MinION is used for sequencing a wide variety of sample types with diverse methods of sample extraction. Nanopore sequencers output FAST5 files containing signal data subsequently base called to FASTQ format. Optionally, ONT devices can collect data from all sequencing channels simultaneously in a bulk FAST5 file enabling inspection of signal in any channel at any point. We sought to visualize this signal to inspect challenging or difficult to sequence samples. Results The BulkVis tool can load a bulk FAST5 file and overlays MinKNOW (the software that controls ONT sequencers) classifications on the signal trace and can show mappings to a reference. Users can navigate to a channel and time or, given a FASTQ header from a read, jump to its specific position. BulkVis can export regions as Nanopore base caller compatible reads. Using BulkVis, we find long reads can be incorrectly divided by MinKNOW resulting in single DNA molecules being split into two or more reads. The longest seen to date is 2 272 580 bases in length and reported in eleven consecutive reads. We provide helper scripts that identify and reconstruct split reads given a sequencing summary file and alignment to a reference. We note that incorrect read splitting appears to vary according to input sample type and is more common in ’ultra-long’ read preparations. Availability and implementation The software is available freely under an MIT license at https://github.com/LooseLab/bulkvis. Supplementary information Supplementary data are available at Bioinformatics online.
Nanopore sequencers enable selective sequencing of single molecules in real time by individually reversing the voltage across specific nanopores. Thus DNA molecules can be rejected and replaced with new molecules enabling targeted sequencing to enrich, deplete or achieve specific coverage in a set of reads to address a biological question. We previously demonstrated this method worked using dynamic time warping mapping signal to reference, but required significant compute and did not scale to gigabase references. Using direct base calling with GPU we can now scale to gigabase references. We enrich for specific chromosomes mapping against the human genome and we develop pipelines enriching low abundance organisms from mixed populations without prior knowledge of sample composition. Finally, we enrich panels including 25,600 exon targets from 10,000 human genes and 717 genes implicated in cancer. Using this approach we identify PML-RARA fusions in the NB4 cell line in under 15 hours sequencing. These methods can be used to efficiently screen any target panel of genes without specialised sample preparation using a single computer and suitably powerful GPU.
Motivation : The Oxford Nanopore Technologies (ONT) MinION is used for sequencing a wide variety of sample types with diverse methods of sample extraction. Nanopore sequencers output fast5 files containing signal data subsequently base called to fastq format. Optionally, ONT devices can collect data from all sequencing channels simultaneously in a bulk fast5 file enabling inspection of signal in any channel at any point.We sought to visualise this signal to inspect challenging or difficult to sequence samples, or where flow cell performance is modified by an external agent, such as 'Read Until'.Results : The BulkVis tool can load a bulk fast5 file and overlays MinKNOW classifications on the signal trace. Users can navigate to a channel and time or, given a fastq header from a read, jump to its specific position. BulkVis can export regions as Nanopore base caller compatible reads. Using BulkVis, we find long reads can be incorrectly divided by MinKNOW resulting in single DNA molecules being split into two or more reads. The longest seen to date is 2,272,580 bases in length and reported in eleven consecutive reads. We provide helper scripts that identify and reconstruct split reads given a sequencing summary file and alignment to a reference. We note that incorrect read splitting appears to vary according to input sample type and is more common in 'ultra long' read preparations.Availability : The software is available freely under an MIT license at https://github.com/LooseLab/bulkVis . The software requires python3 to run.
Male genital structures are among the most rapidly evolving morphological traits and are often the only features that can distinguish closely related species. This process is thought to be driven by sexual selection and may reinforce species separation. However, while the genetic bases of many phenotypic differences have been identified, we still lack knowledge about the genes underlying evolutionary differences in male genital organs and organ size more generally. The claspers (surstyli) are periphallic structures that play an important role in copulation in insects. Here, we show that divergence in clasper size and bristle number between Drosophila mauritiana and Drosophila simulans is caused by evolutionary changes in tartan (trn), which encodes a transmembrane leucine-rich repeat domain protein that mediates cell–cell interactions and affinity. There are no fixed amino acid differences in trn between D. mauritiana and D. simulans, but differences in the expression of this gene in developing genitalia suggest that cis-regulatory changes in trn underlie the evolution of clasper morphology in these species. Finally, analyses of reciprocal hemizygotes that are genetically identical, except for the species from which the functional allele of trn originates, determined that the trn allele of D. mauritiana specifies larger claspers with more bristles than the allele of D. simulans. Therefore, we have identified a gene underlying evolutionary change in the size of a male genital organ, which will help to better understand not only the rapid diversification of these structures, but also the regulation and evolution of organ size more broadly.
Male genital structures are among the most rapidly evolving morphological traits and are often the only features that can distinguish closely related species. This process is thought to be driven by sexual selection and may reinforce species separation. However, while the genetic basis of many phenotypic differences have been identified, we still lack knowledge about the genes underlying evolutionary differences in male genital organs and organ size more generally. The claspers (surstyli) are periphallic structures that play an important role in copulation in insects. Here we show that natural variation in clasper size and bristle number between Drosophila mauritiana and D. simulans is caused by evolutionary changes in tartan (trn), which encodes a transmembrane leucine-rich repeat domain protein that mediates cellcell interactions and affinity differences. There are no fixed amino acid differences in trn between D. mauritiana and D. simulans but differences in the expression of this gene in developing genitalia suggest cis-regulatory changes in trn underlie the evolution of clasper morphology in these species. Finally, analysis of reciprocal hemizyotes that are genetically identical, except for which species the functional allele of trn is from, determined that the trn allele of D. mauritiana specifies larger claspers with more bristles than the allele of D. simulans. Therefore we have identified the first gene underlying evolutionary change in the size of a male genital organ, which will help to better understand the rapid diversification of these structures and the regulation and evolution of organ size more broadly.3 Significance StatementThe morphology of male genital organs evolves rapidly driven by sexual selection. However, little is known about the genes underlying genitalia differences between species. Identifying these genes is key to understanding how sexual selection acts on development to produce rapid phenotypic change. We have found that the gene tartan underlies differences between male Drosophila mauritiana and D. simulans in the size and bristle number of the claspersgenital projections that grasp the female during copulation. Moreover, since tartan encodes a protein that is involved in cell affinity, this may represent a new developmental mechanism for morphological change. Therefore, our study provides new insights into genetic and developmental bases for the rapid evolution of male genitalia and organ size more generally.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.