For nonmodel organisms, genome-wide information that describes functionally relevant variation may be obtained by RNA-Seq following de novo transcriptome assembly. While sequencing has become relatively inexpensive, the preparation of a large number of sequencing libraries remains prohibitively expensive for population genetic analyses of nonmodel species. Pooling samples may be then an attractive alternative. To test whether pooled RNA-Seq accurately predicts true allele frequencies, we analysed the liver transcriptomes of 10 bank voles. Each sample was sequenced both as an individually barcoded library and as a part of a pool. Equal amounts of total RNA from each vole were pooled prior to mRNA selection and library construction. Reads were mapped onto the de novo assembled reference transcriptome. High-quality genotypes for individual voles, determined for 23,682 SNPs, provided information on 'true' allele frequencies; allele frequencies estimated from the pool were then compared with these values. 'True' frequencies and those estimated from the pool were highly correlated. Mean relative estimation error was 21% and did not depend on expression level. However, we also observed a minor effect of interindividual variation in gene expression and allele-specific gene expression influencing allele frequency estimation accuracy. Moreover, we observed strong negative relationship between minor allele frequency and relative estimation error. Our results indicate that pooled RNA-Seq exhibits accuracy comparable with pooled genome resequencing, but variation in expression level between individuals should be assessed and accounted for. This should help in taking account the difference in accuracy between conservatively expressed transcripts and these which are variable in expression level.
Since its introduction by Wright (1931), the concept of effective population size (Ne) has been enormously influential. The main reasons are its conceptual simplicity and broad utility, as it brings under a common denominator species and populations that may differ profoundly in their biological characteristics. According to a standard textbook definition (Hartl & Clark, 2006) 'the effective population size of an
Current species distributions at high latitudes are the product of expansion from glacial refugia into previously uninhabitable areas at the end of the last glaciation. The traditional view of postglacial colonization is that southern populations expanded their ranges into unoccupied northern territories. Recent findings on mitochondrial DNA (mtDNA) of British small mammals have challenged this simple colonization scenario by demonstrating a more complex genetic turnover in Britain during the Pleistocene-Holocene transition where one mtDNA clade of each species was replaced by another mtDNA clade of the same species. Here, we provide evidence from one of those small mammals, the bank vole (), that the replacement was genome-wide. Using more than 10 000 autosomal SNPs we found that similar to mtDNA, bank vole genomes in Britain form two (north and south) clusters which admix. Therefore, the genome of the original postglacial colonists (the northern cluster) was probably replaced by another wave of migration from a different continental European population (the southern cluster), and we gained support for this by modelling with approximate Bayesian computation. This finding emphasizes the importance of analysis of genome-wide diversity within species under changing climate in creating opportunities for sophisticated testing of population history scenarios.
DNA sequences derived from multiple regions of the nuclear genome are essential for historical inferences in the fields of phylogeography and phylogenetics. The appropriate markers should be single-copy, variable, easy to amplify from multiple samples and easy to sequence using high-throughput technologies. This may be difficult to achieve for species lacking sequenced genomes and particularly challenging for species possessing large genomes, which consist mostly of repetitive sequences. Here, we present a cost-effective, broadly applicable framework for designing, validating and high-throughput sequencing of multiple markers in nonmodel species without sequenced genomes. We demonstrate its utility in two closely related species of newts, representatives of urodeles, a vertebrate group characterized by large genomes. We show that over 80 markers, c. 600 bp each, developed mainly from 3' untranslated transcript regions (3'UTR) may be effectively multiplexed and sequenced. Data are further processed using standard, freely available bioinformatic tools, producing phase-resolved sequences. The approach does not require barcoded PCR primers, and the cost of library preparation is independent of the number of markers investigated. We hope that this approach will be of broad interest for researchers working at the interface of population genetics and phylogenetics, exploring deep intraspecific genetic structure, species boundaries and phylogeographies of closely related species.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.