Naryttza N. Diaz scite author profile

The Subsystems Approach to Genome Annotation and its Use in the Project to Annotate 1000 Genomes

The release of the 1000th complete microbial genome will occur in the next two to three years. In anticipation of this milestone, the Fellowship for Interpretation of Genomes (FIG) launched the Project to Annotate 1000 Genomes. The project is built around the principle that the key to improved accuracy in high-throughput annotation technology is to have experts annotate single subsystems over the complete collection of genomes, rather than having an annotation expert attempt to annotate all of the genes in a single genome. Using the subsystems approach, all of the genes implementing the subsystem are analyzed by an expert in that subsystem. An annotation environment was created where populated subsystems are curated and projected to new genomes. A portable notion of a populated subsystem was defined, and tools were developed for exchanging and curating these objects. Tools were also developed to resolve conflicts between populated subsystems. The SEED is the first annotation environment that supports this model of annotation. Here, we describe the subsystem approach, and offer the first release of our growing library of populated subsystems. The initial release of data includes 180 177 distinct proteins with 2133 distinct functional roles. These data come from 173 subsystems and 383 different organisms.

The metagenome of a biogas-producing microbial community of a production-scale biogas plant fermenter analysed by the 454-pyrosequencing technology

Schlüter, Bekel, Diaz, et al. 2008

Journal of Biotechnology

Phylogenetic classification of short environmental DNA fragments

et al. 2008

Metagenomics is providing striking insights into the ecology of microbial communities. The recently developed massively parallel 454 pyrosequencing technique gives the opportunity to rapidly obtain metagenomic sequences at a low cost and without cloning bias. However, the phylogenetic analysis of the short reads produced represents a significant computational challenge. The phylogenetic algorithm CARMA for predicting the source organisms of environmental 454 reads is described. The algorithm searches for conserved Pfam domains and protein families in the unassembled reads of a sample. These gene fragments (environmental gene tags, EGTs) are classified into a higher-order taxonomy based on the reconstruction of a phylogenetic tree of each matching Pfam family. The method exhibits high accuracy for a wide range of taxonomic groups, and EGTs as short as 27 amino acids can be phylogenetically classified up to the rank of genus. The algorithm was applied in a comparative study of three aquatic microbial samples obtained by 454 pyrosequencing. Profound differences in the taxonomic composition of these samples could be clearly revealed.

TACOA – Taxonomic classification of environmental genomic fragments using a kernelized nearest neighbor approach

et al. 2009

Background: Metagenomics, or the sequencing and analysis of collective genomes (metagenomes) of microorganisms isolated from an environment, promises direct access to the "unculturable majority". This emerging field offers the potential to lay a solid basis for our understanding of the entire living world. However, taxonomic classification, an essential task in the analysis of metagenomic data sets, is still far from being solved. We present a novel strategy to predict the taxonomic origin of environmental genomic fragments. The proposed classifier combines the idea of the k-nearest neighbor with strategies from kernel-based learning.
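To illustrate the general idea of combining k-nearest neighbor with a kernel, here is a minimal sketch. It is not the TACOA implementation: the dinucleotide-frequency features, the Gaussian kernel, its width, the toy reference sequences, and all function names are assumptions chosen only for illustration.

```python
import math
from collections import Counter
from itertools import product


def kmer_profile(seq, k=2):
    """Relative frequency of every DNA k-mer in seq, in a fixed ACGT order.

    Assumption: k-mer frequencies stand in for whatever features a real
    classifier would use.
    """
    kmers = ["".join(p) for p in product("ACGT", repeat=k)]
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    total = sum(counts[m] for m in kmers) or 1  # avoid division by zero
    return [counts[m] / total for m in kmers]


def gaussian_kernel(x, y, width=0.1):
    """Similarity in (0, 1]; identical profiles give 1."""
    sq_dist = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-sq_dist / (2 * width ** 2))


def classify(fragment, references, k_neighbors=3, kmer_size=2, width=0.1):
    """Kernel-weighted vote among the k most similar reference sequences."""
    query = kmer_profile(fragment, kmer_size)
    scored = [
        (gaussian_kernel(query, kmer_profile(seq, kmer_size), width), label)
        for seq, label in references
    ]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    votes = Counter()
    for similarity, label in scored[:k_neighbors]:
        votes[label] += similarity  # closer neighbors count for more
    return votes.most_common(1)[0][0]


# Hypothetical toy reference set: (sequence, taxon label).
references = [
    ("ATATATTTAAATATTATAAT", "AT-rich taxon"),
    ("TTTAAATATATAAATTTATA", "AT-rich taxon"),
    ("GCGCGGCCGCGGGCCCGCGC", "GC-rich taxon"),
    ("CCGGGCGCGCCCGGCGGGCC", "GC-rich taxon"),
]

print(classify("ATTTATAATATTAAAT", references))
```

Weighting each neighbor's vote by its kernel similarity, rather than counting votes equally, lets a distant neighbor among the k nearest contribute almost nothing to the decision.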

Phylogenetic characterization of a biogas plant microbial community integrating clone library 16S-rDNA sequences and metagenome sequence data obtained by 454-pyrosequencing

Kröber, Bekel, Diaz, et al. 2009

Journal of Biotechnology

Taxonomic composition and gene content of a methane-producing microbial community isolated from a biogas reactor

Krause, Diaz, Edwards, et al. 2008

Journal of Biotechnology

Finding novel genes in bacterial communities isolated from the environment

Krause, Diaz, Bartels, et al. 2006

Hyperbolic SOM-based clustering of DNA fragment features for taxonomic visualization and classification

Martin, Diaz, Ontrup, et al. 2008


The Subsystems Approach to Genome Annotation and its Use in the Project to Annotate 1000 Genomes
