ABSTRACTThe fungi kingdom is composed of eukaryotic heterotrophs, which are responsible for balancing the ecosystem and play a major role as decomposers. They also produce a vast diversity of secondary metabolites, which have antibiotic or pharmacological properties. However, our lack of knowledge of gene function in fungi precludes us from tailoring them to our needs and tapping into their metabolic diversity. To remedy this, we gathered genomic and gene expression data of 19 most widely-researched fungi to build a database, fungi.guru, which contains tools for cross-species identification of conserved pathways, functional gene modules, and gene families. We exemplify how our database can elucidate the molecular function, biological process and cellular component of genes involved in various biological processes, by identifying a secondary metabolite pathway producing gliotoxin in Aspergillus fumigatus, the catabolic pathway of cellulose in Coprinopsis cinerea and the conserved DNA replication pathway in Fusarium graminearum and Pyricularia oryzae. The database is available at www.fungi.guru.
Summary: During the last few decades, the study of microbial ecology has been enabled by molecular and genomic data. DNA sequencing has revealed the surprising extent of microbial diversity and how microbial processes run global ecosystems. However, significant gaps in our understanding of the microbial world remain, and one example is that microbial eukaryotes, or protists, are still neglected. To address this gap, we used gene expression data from 15 distinct protist species to create protist.guru: an online database equipped with tools for identifying functional co-expression networks, gene families, and enriched gene clusters. Here, we show how our database can be used to reveal genes involved in essential pathways, such as the synthesis of secondary carotenoids in Haematococcus lacustris. We expect protist.guru to serve as a valuable resource for protistologists, as well as a catalyst for discoveries and new insights into the biological processes of microbial eukaryotes. Availability: The database and co-expression networks are freely available from http://protist.guru/. The expression matrices and sample annotations are found in the supplementary data.
SummaryDuring the last few decades, the study of microbial ecology has been enabled by molecular and genomic data. DNA sequencing has revealed the surprising extent of microbial diversity and how microbial processes run global ecosystems. However, significant gaps in our understanding of the microbial world remain, and one example is that microbial eukaryotes, or protists, are still neglected. To address this gap, we used gene expression data from 15 distinct protist species to create protist.guru: an online database equipped with tools for identifying functional co-expression networks, gene families, and enriched gene clusters. Here, we show how our database can be used to reveal genes involved in essential pathways, such as the synthesis of secondary carotenoids in Haematococcus lacustris. We expect protist.guru to serve as a valuable resource for protistologists, as well as a catalyst for discoveries and new insights into the biological processes of microbial eukaryotes.AvailabilityThe database and co-expression networks are freely available from http://protist.guru/. The expression matrices and sample annotations are found in the supplementary data.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.