High ethanol tolerance is an exquisite characteristic of the yeast Saccharomyces cerevisiae, which enables this microorganism to dominate in natural and industrial fermentations. Up to now, ethanol tolerance has only been analyzed in laboratory yeast strains with moderate ethanol tolerance. The genetic basis of the much higher ethanol tolerance in natural and industrial yeast strains is unknown. We have applied pooled-segregant whole-genome sequence analysis to map all quantitative trait loci (QTL) determining high ethanol tolerance. We crossed a highly ethanol-tolerant segregant of a Brazilian bioethanol production strain with a laboratory strain with moderate ethanol tolerance. Out of 5974 segregants, we pooled 136 segregants tolerant to at least 16% ethanol and 31 segregants tolerant to at least 17%. Scoring of SNPs using whole-genome sequence analysis of DNA from the two pools and parents revealed three major loci and additional minor loci. The latter were more pronounced or only present in the 17% pool compared to the 16% pool. In the locus with the strongest linkage, we identified three closely located genes affecting ethanol tolerance: MKT1, SWS2, and APJ1, with SWS2 being a negative allele located in between two positive alleles. SWS2 and APJ1 probably contained significant polymorphisms only outside the ORF, and lower expression of APJ1 may be linked to higher ethanol tolerance. This work has identified the first causative genes involved in high ethanol tolerance of yeast. It also reveals the strong potential of pooled-segregant sequence analysis using relatively small numbers of selected segregants for identifying QTL on a genome-wide scale.
Recent advances in high-throughput sequencing (HTS) technologies and computing capacity have produced unprecedented amounts of genomic data that have unraveled the genetics of phenotypic variability in several species. However, operating and integrating current software tools for data analysis still require important investments in highly skilled personnel. Developing accurate, efficient and user-friendly software packages for HTS data analysis will lead to a more rapid discovery of genomic elements relevant to medical, agricultural and industrial applications. We therefore developed Next-Generation Sequencing Eclipse Plug-in (NGSEP), a new software tool for integrated, efficient and user-friendly detection of single nucleotide variants (SNVs), indels and copy number variants (CNVs). NGSEP includes modules for read alignment, sorting, merging, functional annotation of variants, filtering and quality statistics. Analysis of sequencing experiments in yeast, rice and human samples shows that NGSEP has superior accuracy and efficiency, compared with currently available packages for variants detection. We also show that only a comprehensive and accurate identification of repeat regions and CNVs allows researchers to properly separate SNVs from differences between copies of repeat elements. We expect that NGSEP will become a strong support tool to empower the analysis of sequencing data in a wide range of research projects on different species.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.