Teresia Buza scite author profile

Gene Ontology Annotations and Resources

The Gene Ontology (GO) Consortium (GOC, http://www.geneontology.org) is a community-based bioinformatics resource that classifies gene product function through the use of structured, controlled vocabularies. Over the past year, the GOC has implemented several processes to increase the quantity, quality and specificity of GO annotations. First, the number of manual, literature-based annotations has grown at an increasing rate. Second, as a result of a new ‘phylogenetic annotation’ process, manually reviewed, homology-based annotations are becoming available for a broad range of species. Third, the quality of GO annotations has been improved through a streamlined process for, and automated quality checks of, GO annotations deposited by different annotation groups. Fourth, the consistency and correctness of the ontology itself has increased by using automated reasoning tools. Finally, the GO has been expanded not only to cover new areas of biology through focused interaction with experts, but also to capture greater specificity in all areas of the ontology using tools for adding new combinatorial terms. The GOC works closely with other ontology developers to support integrated use of terminologies. The GOC supports its user community through the use of e-mail lists, social media and web-based resources.

Gene Ontology annotation quality analysis in model eukaryotes

Buza, McCarthy, Wang, et al. 2008

Functional analysis using the Gene Ontology (GO) is crucial for array analysis, but it is often difficult for researchers to assess the amount and quality of GO annotations associated with different sets of gene products. In many cases the source of the GO annotations and the date the GO annotations were last updated are not apparent, further complicating a researcher's ability to assess the quality of the GO data provided. Moreover, GO biocurators need to ensure that the GO quality is maintained and optimal for the functional processes that are most relevant for their research community. We report the GO Annotation Quality (GAQ) score, a quantitative measure of GO quality that includes breadth of GO annotation, the level of detail of annotation and the type of evidence used to make the annotation. As a case study, we apply the GAQ scoring method to a set of diverse eukaryotes and demonstrate how the GAQ score can be used to track changes in GO annotations over time and to assess the quality of GO annotations available for specific biological processes. The GAQ score also allows researchers to quantitatively assess the functional data available for their experimental systems (arrays or databases).
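
The abstract above names three ingredients of the GAQ score (breadth, level of detail, evidence type) but gives no formula. The sketch below is one possible way to combine them, not the published method: it assumes each annotation contributes its term depth multiplied by an evidence weight, and that breadth enters through the sum over all annotations. The weights in `EVIDENCE_WEIGHT`, the `Annotation` type and both function names are hypothetical.

```python
from dataclasses import dataclass

# Hypothetical evidence weights (assumption): experimental codes count more
# than sequence-similarity or electronic ones. The abstract gives no values.
EVIDENCE_WEIGHT = {"EXP": 5, "IDA": 5, "ISS": 3, "IEA": 1}


@dataclass(frozen=True)
class Annotation:
    gene_product: str
    go_term: str
    depth: int      # level of detail: depth of the term in the GO graph
    evidence: str   # GO evidence code


def gaq_like_score(annotations):
    """Sum depth x evidence weight over all annotations (assumed combination)."""
    return sum(a.depth * EVIDENCE_WEIGHT.get(a.evidence, 1) for a in annotations)


def mean_score_per_product(annotations):
    """Normalise the total by the number of distinct annotated gene products."""
    products = {a.gene_product for a in annotations}
    return gaq_like_score(annotations) / len(products) if products else 0.0


if __name__ == "__main__":
    demo = [
        Annotation("P1", "GO:0006915", depth=6, evidence="IDA"),
        Annotation("P1", "GO:0005515", depth=2, evidence="IEA"),
        Annotation("P2", "GO:0008150", depth=1, evidence="IEA"),
    ]
    print(gaq_like_score(demo), mean_score_per_product(demo))
```

Computing the same quantity on two dated annotation files would give a simple way to track change over time, which is the use the abstract describes.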

AgBase: supporting functional modeling in agricultural organisms

McCarthy, Gresham, Buza, et al. 2010

AgBase (http://www.agbase.msstate.edu/) provides resources to facilitate modeling of functional genomics data and structural and functional annotation of agriculturally important animal, plant, microbe and parasite genomes. The website is redesigned to improve accessibility and ease of use, including improved search capabilities. Expanded capabilities include new dedicated pages for horse, cat, dog, cotton, rice and soybean. We currently provide 590 240 Gene Ontology (GO) annotations to 105 454 gene products in 64 different species, including GO annotations linked to transcripts represented on agricultural microarrays. For many of these arrays, this provides the only functional annotation available. GO annotations are available for download and we provide comprehensive, species-specific GO annotation files for 18 different organisms. The tools available at AgBase have been expanded and several existing tools improved based upon user feedback. One of seven new tools available at AgBase, GOModeler, supports hypothesis testing from functional genomics data. We host several associated databases and provide genome browsers for three agricultural pathogens. Moreover, we provide comprehensive training resources (including worked examples and tutorials) via links to Educational Resources at the AgBase website.

iMAP: an integrated bioinformatics and visualization pipeline for microbiome data analysis

et al. 2019

Background: One of the major challenges facing investigators in the microbiome field is turning large numbers of reads generated by next-generation sequencing (NGS) platforms into biological knowledge. Effective analytical workflows that guarantee reproducibility, repeatability, and result provenance are essential requirements of modern microbiome research. For nearly a decade, several state-of-the-art bioinformatics tools have been developed for understanding microbial communities living in a given sample. However, most of these tools are built with many functions that require an in-depth understanding of their implementation and the choice of additional tools for visualizing the final output. Furthermore, microbiome analysis can be time-consuming and may even require more advanced programming skills which some investigators may be lacking. Results: We have developed a wrapper named iMAP (Integrated Microbiome Analysis Pipeline) to provide the microbiome research community with a user-friendly and portable tool that integrates bioinformatics analysis and data visualization. The iMAP tool wraps functionalities for metadata profiling, quality control of reads, sequence processing and classification, and diversity analysis of operational taxonomic units. This pipeline is also capable of generating web-based progress reports for enhancing an approach referred to as review-as-you-go (RAYG). For the most part, the profiling of microbial community is done using functionalities implemented in Mothur or QIIME2 platform. Also, it uses different R packages for graphics and R-markdown for generating progress reports. We have used a case study to demonstrate the application of the iMAP pipeline. Conclusions: The iMAP pipeline integrates several functionalities for better identification of microbial communities present in a given sample. The pipeline performs in-depth quality control that guarantees high-quality results and accurate conclusions. The vibrant visuals produced by the pipeline facilitate a better understanding of the complex and multidimensional microbiome data. The integrated RAYG approach enables the generation of web-based reports, which provides the investigators with the intermediate output that can be reviewed progressively. The intensively analyzed case study set a model for microbiome data analysis. Electronic supplementary material: The online version of this article (10.1186/s12859-019-2965-4) contains supplementary material, which is available to authorized users.

Experimental-confirmation and functional-annotation of predicted proteins in the chicken genome

2007

Background: The chicken genome was sequenced because of its phylogenetic position as a nonmammalian vertebrate, its use as a biomedical model especially to study embryology and development, its role as a source of human disease organisms and its importance as the major source of animal derived food protein. However, genomic sequence data is, in itself, of limited value; generally it is not equivalent to understanding biological function. The benefit of having a genome sequence is that it provides a basis for functional genomics. However, the sequence data currently available is poorly structurally and functionally annotated and many genes do not have standard nomenclature assigned.

Computational prediction of disease microRNAs in domestic animals

et al. 2014

Background: The most important means of identifying diseases before symptoms appear is through the discovery of disease-associated biomarkers. Recently, microRNAs (miRNAs) have become highly useful biomarkers of infectious, genetic and metabolic diseases in human but they have not been well studied in domestic animals. It is probable that many of the animal homologs of human disease-associated miRNAs may be involved in domestic animal diseases. Here we describe a computational biology study in which human disease miRNAs were utilized to predict orthologous miRNAs in cow, chicken, pig, horse, and dog. Results: We identified 287 human disease-associated miRNAs which had at least one 100% identical animal homolog. The 287 miRNAs were associated with 359 human diseases referenced in 2,863 Pubmed articles. Multiple sequence analysis indicated that over 60% of known horse mature miRNAs found perfect matches in human disease-associated miRNAs, followed by dog (50%). As expected, chicken had the least number of perfect matches (5%). Phylogenetic analysis of miRNA precursors indicated that 85% of human disease pre-miRNAs were highly conserved in animals, showing less than 5% nucleotide substitution rates over evolutionary time. As an example we demonstrated conservation of human hsa-miR-143-3p which is associated with type 2 diabetes and targets AKT1 gene which is highly conserved in pig, horse and dog. Functional analysis of AKT1 gene using Gene Ontology (GO) showed that it is involved in glucose homeostasis, positive regulation of glucose import, positive regulation of glycogen biosynthetic process, glucose transport and response to food. Conclusions: This data provides the animal and veterinary research community with a resource to assist in generating hypothesis-driven research for discovering animal disease-related miRNA from their datasets and expedite development of prophylactic and disease-treatment strategies and also influence research efforts to identify novel disease models in large animals. Integrated data is available for download at http://agbase.hpc.msstate.edu/cgi-bin/animal_mirna.cgi.
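
The "100% identical animal homolog" criterion in the abstract above can be illustrated with an exact-match lookup on mature miRNA sequences. This is only a minimal sketch of that one criterion, not the study's pipeline: the function name, the miRNA names and the sequences are all made-up placeholders, and treating DNA-alphabet input (T) as RNA (U) is an assumption added for convenience.

```python
def normalise(seq):
    """Upper-case a sequence and map T to U so DNA- and RNA-style input compare equal."""
    return seq.upper().replace("T", "U")


def perfect_homologs(human, animals):
    """Map each human miRNA to the animal miRNAs with an identical mature sequence.

    human:   {mirna_name: sequence}
    animals: {species: {mirna_name: sequence}}
    """
    # Index every animal sequence once so each human lookup is an exact-match dict access.
    by_sequence = {}
    for species, mirnas in animals.items():
        for name, seq in mirnas.items():
            by_sequence.setdefault(normalise(seq), []).append((species, name))

    hits = {}
    for name, seq in human.items():
        matches = by_sequence.get(normalise(seq), [])
        if matches:
            hits[name] = matches
    return hits


if __name__ == "__main__":
    # Placeholder names and sequences, for illustration only.
    human = {"hsa-demo-1": "UGAGAUGAAGC", "hsa-demo-2": "AAAACCCCGGG"}
    animals = {
        "horse": {"eca-demo-1": "TGAGATGAAGC"},
        "chicken": {"gga-demo-9": "AAAACCCCGGA"},
    }
    print(perfect_homologs(human, animals))
```

A single mismatch, as in the chicken placeholder, excludes a candidate, which is what separates this criterion from the looser conservation measure the abstract reports for precursors.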

Transcriptomic dissection of the rice – Burkholderia glumae interaction

et al. 2014

Background: Bacterial panicle blight caused by the bacterium Burkholderia glumae is an emerging disease of rice in the United States. Not much is known about this disease, the disease cycle or any source of disease resistance. To understand the interaction between rice and Burkholderia glumae, we used transcriptomics via next-generation sequencing (RNA-Seq) and bioinformatics to identify differentially expressed transcripts between resistant and susceptible interactions and formulate a model for rice resistance to the disease. Results: Using inoculated young seedlings as sample tissues, we identified unique transcripts involved with resistance to bacterial panicle blight, including a PIF-like ORF1 and verified differential expression of some selected genes using qRT-PCR. These transcripts, which include resistance genes of the NBS-LRR type, kinases, transcription factors, transporters and expressed proteins with functions that are not known, have not been reported in other pathosystems including rice blast or bacterial blight. Further, functional annotation analysis reveals enrichment of defense response and programmed cell death (biological processes); ATP and protein binding (molecular functions); and mitochondrion-related (cell component) transcripts in the resistant interaction. Conclusion: Taken together, we formulated a model for rice resistance to bacterial panicle blight that involves an activation of previously unknown resistance genes and their activation partners upon challenge with B. glumae. Other interesting findings are that 1) though these resistance transcripts were up-regulated upon inoculation in the resistant interaction, some of them were already expressed in the water-inoculated control from the resistant genotype, but not in the water- and bacterium-inoculated samples from the susceptible genotype; 2) rice may have co-opted an ORF that was previously a part of a transposable element to aid in the resistance mechanism; and 3) resistance may have existed immediately prior to rice domestication. Electronic supplementary material: The online version of this article (doi:10.1186/1471-2164-15-755) contains supplementary material, which is available to authorized users.

Microbial Diversity in Bushmeat Samples Recovered from the Serengeti Ecosystem in Tanzania

Katani, Schilling, Lyimo, et al. 2019, Sci Rep

Bushmeat, the meat and organs derived from wildlife species, is a common source of animal protein in the diets of those living in sub-Saharan Africa and is frequently associated with zoonotic spillover of dangerous pathogens. Given the frequent consumption of bushmeat in this region and the lack of knowledge about the microbial communities associated with this meat, the microbiome of 56 fresh and processed bushmeat samples ascertained from three districts in the Western Serengeti ecosystem in Tanzania was characterized using 16S rRNA metagenomic sequencing. The results show that the most abundant phyla present in bushmeat samples include Firmicutes (67.8%), Proteobacteria (18.4%), Cyanobacteria (8.9%), and Bacteroidetes (3.1%). Regardless of wildlife species, sample condition, season, or region, the microbiome is diverse across all samples, with no significant difference in alpha or beta diversity. The findings also suggest the presence of DNA signatures of potentially dangerous zoonotic pathogens, including those from the genera Bacillus, Brucella, Coxiella, and others, in bushmeat. Together, this investigation provides a better understanding of the microbiome associated with this major food source in samples collected from the Western Serengeti in Tanzania and highlights a need for future investigations on the potential health risks associated with the harvesting, trade, and consumption of bushmeat in sub-Saharan Africa.
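
The abstract above reports phylum-level relative abundances and alpha diversity without saying which diversity index was used. As an illustration only, the sketch below computes relative abundance and the Shannon index, one common alpha diversity measure, from per-taxon read counts. Choosing Shannon is an assumption, the function names are hypothetical, and the counts in the usage example are invented rather than taken from the study.

```python
import math


def relative_abundance(counts):
    """Convert per-taxon read counts into fractions of the sample total."""
    total = sum(counts.values())
    if total == 0:
        return {taxon: 0.0 for taxon in counts}
    return {taxon: n / total for taxon, n in counts.items()}


def shannon_index(counts):
    """Shannon diversity H = -sum(p * ln p) over taxa with non-zero counts."""
    fractions = [p for p in relative_abundance(counts).values() if p > 0]
    return sum(-p * math.log(p) for p in fractions)


if __name__ == "__main__":
    # Invented read counts for a single sample, for illustration only.
    sample = {"Firmicutes": 700, "Proteobacteria": 200, "Bacteroidetes": 100}
    print(relative_abundance(sample))
    print(round(shannon_index(sample), 3))
```

Comparing such per-sample values across groups (species, season, region) is the kind of test behind a statement like "no significant difference in alpha diversity".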
