Cryptococcus neoformans is a pathogenic basidiomycetous yeast responsible for more than 600,000 deaths each year. It occurs as two serotypes (A and D) representing two varieties (i.e. grubii and neoformans, respectively). Here, we sequenced the genome and performed an RNA-Seq-based analysis of the C. neoformans var. grubii transcriptome structure. We determined the chromosomal locations, analyzed the sequence/structural features of the centromeres, and identified origins of replication. The genome was annotated based on automated and manual curation. More than 40,000 introns populating more than 99% of the expressed genes were identified. Although most of these introns are located in the coding DNA sequences (CDS), over 2,000 introns in the untranslated regions (UTRs) were also identified. Poly(A)-containing reads were employed to locate the polyadenylation sites of more than 80% of the genes. Examination of the sequences around these sites revealed a new poly(A)-site-associated motif (AUGHAH). In addition, 1,197 miscRNAs were identified. These miscRNAs can be spliced and/or polyadenylated, but do not appear to have obvious coding capacities. Finally, this genome sequence enabled a comparative analysis of strain H99 variants obtained after laboratory passage. The spectrum of mutations identified provides insights into the genetics underlying the micro-evolution of a laboratory strain, and identifies mutations involved in stress responses, mating efficiency, and virulence.
R code which implements two-level external cross-validation with the PAMR package, experiment code, dataset details and additional figures are freely available for non-commercial use from http://www.maths.qut.edu.au/profiles/wood/permr.jsp
The subtelomeric regions of organisms ranging from protists to fungi undergo a much higher rate of rearrangement than is observed in the rest of the genome. While characterizing these ~40-kb regions of the human fungal pathogen Cryptococcus neoformans, we have identified a recent gene amplification event near the right telomere of chromosome 3 that involves a gene encoding an arsenite efflux transporter (ARR3). The 3,177-bp amplicon exists in a tandem array of 2-15 copies and is present exclusively in strains with the C. neoformans var. grubii subclade VNI A5 MLST profile. Strains bearing the amplification display dramatically enhanced resistance to arsenite that correlates with the copy number of the repeat; the origin of increased resistance was verified as transport-related by functional complementation of an arsenite transporter mutant of Saccharomyces cerevisiae. Subsequent experimental evolution in the presence of increasing concentrations of arsenite yielded highly resistant strains with the ARR3 amplicon further amplified to over 50 copies, accounting for up to ~1% of the whole genome and making the copy number of this repeat as high as that seen for the ribosomal DNA. The example described here therefore represents a rare evolutionary intermediate-an array that is currently in a state of dynamic flux, in dramatic contrast to relatively common, static relics of past tandem duplications that are unable to further amplify due to nucleotide divergence. Beyond identifying and engineering fungal isolates that are highly resistant to arsenite and describing the first reported instance of microevolution via massive gene amplification in C. neoformans, these results suggest that adaptation through gene amplification may be an important mechanism that C. neoformans employs in response to environmental stresses, perhaps including those encountered during infection. More importantly, the ARR3 array will serve as an ideal model for further molecular genetic analyses of how tandem gene duplications arise and expand.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.