Mutations that add, subtract, rearrange, or otherwise refashion genome
structure often affect phenotypes, though the fragmented nature of most
contemporary assemblies obscure them. To discover such mutations, we assembled
the first new reference quality genome of Drosophila
melanogaster since its initial sequencing. By comparing this genome
to the existing D. melanogaster assembly, we create a
structural variant map of unprecedented resolution, revealing extensive genetic
variation that has remained hidden until now. Many of these variants constitute
strong candidates underlying phenotypic variation, including tandem duplications
and a transposable element insertion that dramatically amplifies the expression
of detoxification genes associated with nicotine resistance. The abundance of
important genetic variation that still evades discovery highlights how crucial
high-quality references are to deciphering phenotypes.
Mutations that add, subtract, rearrange, or otherwise refashion genome structure often affect phenotypes, though the fragmented nature of most contemporary assemblies obscure them. To discover such mutations, we assembled the first reference quality genome of Drosophila melanogaster since its initial sequencing. By comparing this genome to the existing D. melanogaster assembly, we create a structural variant map of unprecedented resolution, revealing extensive genetic variation that has remained hidden until now. Many of these variants constitute strong candidates underlying phenotypic variation, including tandem duplications and a transposable element insertion that dramatically amplifies the expression of detoxification genes associated with nicotine resistance. The abundance of important genetic variation that still evades discovery highlights how crucial high quality references are to deciphering phenotypes.
The accurate determination of allele frequencies is crucially important across a wide range of problems in genetics, such as developing population genetic models, making inferences from genome-wide association studies, determining genetic risk for diseases, as well as other scientific and medical applications. Furthermore, understanding how allele frequencies change over time in populations is central to ascertaining their evolutionary dynamics. We present a precise, efficient, and economical method (FREQ-Seq2) for quantifying the relative frequencies of different alleles at loci of interest in mixed population samples. Through the creative use of paired barcode sequences, we exponentially increased the throughput of the original FREQ-Seq method from 48 to 2,304 samples. FREQ-Seq2 can be targeted to specific genomic regions of interest, which are amplified using universal barcoded adapters to generate Illumina sequencing libraries. Our enhanced method, available as a kit along with open-source software for analyzing sequenced libraries, enables detection and removal of errors that are undetectable in the original FREQ-Seq method as well as other conventional methods for allele frequency quantification. Finally, we validated the performance of our NGS-based approach with a highly multiplexed set of control samples as well as a competitive evolution experiment in E. coli, and compare the latter to estimates derived from manual colony counting. Our analyses demonstrate that FREQ-Seq2 is flexible, inexpensive, and produces large amounts of data with low error, low noise, and desirable statistical properties. In summary, FREQ-Seq2 is a powerful method for quantifying allele frequency that provides a versatile approach for profiling mixed populations.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.