BackgroundRecent developments in third-gen long read sequencing and diploid-aware assemblers have resulted in the rapid release of numerous reference-quality assemblies for diploid genomes. However, assembly of highly heterozygous genomes is still problematic when regional heterogeneity is so high that haplotype homology is not recognised during assembly. This results in regional duplication rather than consolidation into allelic variants and can cause issues with downstream analysis, for example variant discovery, or haplotype reconstruction using the diploid assembly with unpaired allelic contigs.ResultsA new pipeline—Purge Haplotigs—was developed specifically for third-gen sequencing-based assemblies to automate the reassignment of allelic contigs, and to assist in the manual curation of genome assemblies. The pipeline uses a draft haplotype-fused assembly or a diploid assembly, read alignments, and repeat annotations to identify allelic variants in the primary assembly. The pipeline was tested on a simulated dataset and on four recent diploid (phased) de novo assemblies from third-generation long-read sequencing, and compared with a similar tool. After processing with Purge Haplotigs, haploid assemblies were less duplicated with minimal impact on genome completeness, and diploid assemblies had more pairings of allelic contigs.ConclusionsPurge Haplotigs improves the haploid and diploid representations of third-gen sequencing based genome assemblies by identifying and reassigning allelic contigs. The implementation is fast and scales well with large genomes, and it is less likely to over-purge repetitive or paralogous elements compared to alignment-only based methods. The software is available at https://bitbucket.org/mroachawri/purge_haplotigs under a permissive MIT licence.Electronic supplementary materialThe online version of this article (10.1186/s12859-018-2485-7) contains supplementary material, which is available to authorized users.
Characterization of interspecies differences in gene regulation is crucial for understanding the molecular basis of both phenotypic diversity and evolution. By means of chromatin immunoprecipitation and DNA microarray analysis, the divergence in the binding sites of the pseudohyphal regulators Ste12 and Tec1 was determined in the yeasts Saccharomyces cerevisiae, S. mikatae, and S. bayanus under pseudohyphal conditions. We have shown that most of these sites have diverged across these species, far exceeding the interspecies variation in orthologous genes. A group of Ste12 targets was shown to be bound only in S. mikatae and S. bayanus under pseudohyphal conditions. Many of these genes are targets of Ste12 during mating in S. cerevisiae, indicating that specialization between the two pathways has occurred in this species. Transcription factor binding sites have therefore diverged substantially faster than ortholog content. Thus, gene regulation resulting from transcription factor binding is likely to be a major cause of divergence between related species.
Human intervention has subjected the yeast Saccharomyces cerevisiae to multiple rounds of independent domestication and thousands of generations of artificial selection. As a result, this species comprises a genetically diverse collection of natural isolates as well as domesticated strains that are used in specific industrial applications. However the scope of genetic diversity that was captured during the domesticated evolution of the industrial representatives of this important organism remains to be determined. To begin to address this, we have produced whole-genome assemblies of six commercial strains of S. cerevisiae (four wine and two brewing strains). These represent the first genome assemblies produced from S. cerevisiae strains in their industrially-used forms and the first high-quality assemblies for S. cerevisiae strains used in brewing. By comparing these sequences to six existing high-coverage S. cerevisiae genome assemblies, clear signatures were found that defined each industrial class of yeast. This genetic variation was comprised of both single nucleotide polymorphisms and large-scale insertions and deletions, with the latter often being associated with ORF heterogeneity between strains. This included the discovery of more than twenty probable genes that had not been identified previously in the S. cerevisiae genome. Comparison of this large number of S. cerevisiae strains also enabled the characterization of a cluster of five ORFs that have integrated into the genomes of the wine and bioethanol strains on multiple occasions and at diverse genomic locations via what appears to involve the resolution of a circular DNA intermediate. This work suggests that, despite the scrutiny that has been directed at the yeast genome, there remains a significant reservoir of ORFs and novel modes of genetic transmission that may have significant phenotypic impact in this important model and industrial species.
In Candida albicans, the a1-␣2 complex represses white-opaque switching, as well as mating. Based upon the assumption that the a1-␣2 corepressor complex binds to the gene that regulates white-opaque switching, a chromatin immunoprecipitation-microarray analysis strategy was used to identify 52 genes that bound to the complex. One of these genes, TOS9, exhibited an expression pattern consistent with a "master switch gene." TOS9 was only expressed in opaque cells, and its gene product, Tos9p, localized to the nucleus. Deletion of the gene blocked cells in the white phase, misexpression in the white phase caused stable mass conversion of cells to the opaque state, and misexpression blocked temperature-induced mass conversion from the opaque state to the white state. A model was developed for the regulation of spontaneous switching between the opaque state and the white state that includes stochastic changes of Tos9p levels above and below a threshold that induce changes in the chromatin state of an as-yet-unidentified switching locus. TOS9 has also been referred to as EAP2 and WOR1.White-opaque switching was first observed in a strain of Candida albicans isolated from a fatal bloodstream infection (40). The switch affected the cellular phenotype (1, 41, 42), gene expression (24), and a variety of putative virulence traits (40, 41). It also conferred the capacity to colonize skin (23). An early analysis of clinical strains performed in 1986, however, revealed that only 8% underwent the switch (D. R. Soll, unpublished observation; 44). This observation was enigmatic, since all strains of C. albicans possessed opaque state-specific genes (33). In 2002, Miller and Johnson (31) provided not only an explanation for this enigma but also a role for the whiteopaque transition. They found that while an a/␣ laboratory strain could not switch, MTLa1 and MTL␣2 deletion derivatives of that strain, which were ␣ and a, respectively, could. They demonstrated that switching in the a/␣ strain was repressed by the a1-␣2 complex, the same complex that repressed mating (31). Their results suggested that in order to switch, natural strains, which are predominantly a/␣ (26,30,48), first had to undergo homozygosis to a/a or ␣/␣. Lockhart et al. (30) generalized this observation by demonstrating that while natural a/␣ strains did not undergo white-opaque switching, spontaneously generated MTL-homozygous offspring and natural MTL-homozygous strains did switch. Miller and Johnson (31) further demonstrated that in order to mate, the a and ␣ strains they derived by deleting MTL␣2 and MTLa1, respectively, first had to switch from white to opaque. Lockhart et al. (29) generalized this observation by demonstrating that only natural a/a and ␣/␣ strains that expressed the opaque phenotype could mate. The white-opaque transition, therefore, was an essential and unique step in the C. albicans mating process (3,42,43).In spite of the fundamental role white-opaque switching plays in mating, very little is known about the molecular mechanisms that re...
To understand the organization of the transcriptional networks that govern cell differentiation, we have investigated the transcriptional circuitry controlling pseudohyphal development in Saccharomyces cerevisiae.
Despite its industrial importance, the yeast species Dekkera (Brettanomyces) bruxellensis has remained poorly understood at the genetic level. In this study we describe whole genome sequencing and analysis for a prevalent wine spoilage strain, AWRI1499. The 12.7 Mb assembly, consisting of 324 contigs in 99 scaffolds (super-contigs) at 26-fold coverage, exhibits a relatively high density of single nucleotide polymorphisms (SNPs). Haplotype sampling for 1.2% of open reading frames suggested that the D. bruxellensis AWRI1499 genome is comprised of a moderately heterozygous diploid genome, in combination with a divergent haploid genome. Gene content analysis revealed enrichment in membrane proteins, particularly transporters, along with oxidoreductase enzymes. Availability of this assembly and annotation provides a resource for further investigation of genomic organization in this species, and functional characterization of genes that may confer important phenotypic traits.
Penicillium marneffei is the only known species of its genus that is dimorphic. At 25°C, P. marneffei exhibits true filamentous growth and undergoes asexual development producing spores borne on complex structures called conidiophores. At 37°C, P. marneffei undergoes a dimorphic transition to produce uninucleate yeast cells that divide by fission. We have cloned a homologue of the Aspergillus nidulans abaA gene encoding an ATTS/TEA DNA‐binding domain transcriptional regulator and shown that it is involved in both these developmental programs. Targeted deletion of abaA blocks asexual development at 25°C before spore production, resulting in aberrant conidiophores with reiterated terminal cells. At 37°C, the abaA deletion strain fails to switch correctly from multinucleate filamentous to uninucleate yeast cells. Both the transitional hyphal cells, which produce the yeast cells, and the yeast cells themselves contain multiple nuclei. Expression of the abaA gene is activated during both conidiation and the hyphal–yeast switch. Interestingly, the abaA gene of the filamentous monomorphic fungus A. nidulans can complement both conidiation and dimorphic switching defects in the P. marneffei abaA mutant. In addition, ectopic overexpression of abaA results in anucleate yeast cells and multinucleate vegetative filamentous cells. These data suggest that abaA regulates cell cycle events and morphogenesis in two distinct developmental programmes.
The yeast Dekkera bruxellensis is a major contaminant of industrial fermentations, such as those used for the production of biofuel and wine, where it outlasts and, under some conditions, outcompetes the major industrial yeast Saccharomyces cerevisiae. In order to investigate the level of inter-strain variation that is present within this economically important species, the genomes of four diverse D. bruxellensis isolates were compared. While each of the four strains was shown to contain a core diploid genome, which is clearly sufficient for survival, two of the four isolates have a third haploid complement of chromosomes. The sequences of these additional haploid genomes were both highly divergent from those comprising the diploid core and divergent between the two triploid strains. Similar to examples in the Saccharomyces spp. clade, where some allotriploids have arisen on the basis of enhanced ability to survive a range of environmental conditions, it is likely these strains are products of two independent hybridisation events that may have involved multiple species or distinct sub-species of Dekkera. Interestingly these triploid strains represent the vast majority (92%) of isolates from across the Australian wine industry, suggesting that the additional set of chromosomes may confer a selective advantage in winery environments that has resulted in these hybrid strains all-but replacing their diploid counterparts in Australian winery settings. In addition to the apparent inter-specific hybridisation events, chromosomal aberrations such as strain-specific insertions and deletions and loss-of-heterozygosity by gene conversion were also commonplace. While these events are likely to have affected many phenotypes across these strains, we have been able to link a specific deletion to the inability to utilise nitrate by some strains of D. bruxellensis, a phenotype that may have direct impacts in the ability for these strains to compete with S. cerevisiae.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
334 Leonard St
Brooklyn, NY 11211
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.