James M. Tiedje scite author profile

The Ribosomal Database Project (RDP) Classifier, a naïve Bayesian classifier, can rapidly and accurately classify bacterial 16S rRNA sequences into the new higher-order taxonomy proposed in Bergey's Taxonomic Outline of the Prokaryotes (2nd ed., release 5.0, Springer-Verlag, New York, NY, 2004). It provides taxonomic assignments from domain to genus, with confidence estimates for each assignment. The majority of classifications (98%) were of high estimated confidence (>95%) and high accuracy (98%). In addition to being tested with the corpus of 5,014 type strain sequences from Bergey's outline, the RDP Classifier was tested with a corpus of 23,095 rRNA sequences as assigned by the NCBI into their alternative higher-order taxonomy. The results from leave-one-out testing on both corpora show that the overall accuracies at all levels of confidence for near-full-length and 400-base segments were 89% or above down to the genus level, and the majority of the classification errors appear to be due to anomalies in the current taxonomies. For shorter rRNA segments, such as those that might be generated by pyrosequencing, the error rate varied greatly over the length of the 16S rRNA gene, with segments around the V2 and Starting in the mid-1980s, Carl Woese revolutionized the field of microbiology with his rRNA-based phylogenetic comparisons delineating the three main branches of life (28). Today, rRNA-based analysis remains a central method in microbiology, used not only to explore microbial diversity but also as a day-to-day method for bacterial identification. Identification methods are conceptually easier to interpret than molecular phylogenetic analyses and are often preferred when the groups are well understood. Most rRNA identification (classification) methods, as opposed to phylogenetic (clustering) methods, have been nearest-neighbor-based classification schemes (10, 18; however, see reference 4). In some part, this was due to the lack of a consistent, higher-level bacterial classification structure (taxonomy). Several recent events have helped change this situation. In 2002, an ad hoc committee for the reevaluation of species definition in bacteriology (24) advised that all new bacterial species descriptions include an rRNA sequence from the type strain, and in 2001, Bergey's Trust published a revised higher-order taxonomy attempting to reconcile bacterial taxonomy with rRNA-based phylogeny (12, 13).The naïve Bayesian classification method is simple yet can be extremely efficient. "Naïve" refers to the (naïve) assumption that data attributes are independent. Domingos and Pazzani (11) showed that the Bayesian method can still be optimal even when this attribute independency is violated. The method has also been reported to perform well on problems similar to the classification of sequence data, such as the classification of text documents, that have a high-dimensional feature space and sparse data (16).The Ribosomal Database Project II (RDP) provides data, tools, and services related to rRNA sequences to the rese...

show abstract

The Ribosomal Database Project: improved alignments and new tools for rRNA analysis

Cole

Wang

Cardenas

et al. 2009

Nucleic Acids Research

4,243

3,205

View full text Add to dashboard Cite

The Ribosomal Database Project (RDP) provides researchers with quality-controlled bacterial and archaeal small subunit rRNA alignments and analysis tools. An improved alignment strategy uses the Infernal secondary structure aware aligner to provide a more consistent higher quality alignment and faster processing of user sequences. Substantial new analysis features include a new Pyrosequencing Pipeline that provides tools to support analysis of ultra high-throughput rRNA sequencing data. This pipeline offers a collection of tools that automate the data processing and simplify the computationally intensive analysis of large sequencing libraries. In addition, a new Taxomatic visualization tool allows rapid visualization of taxonomic inconsistencies and suggests corrections, and a new class Assignment Generator provides instructors with a lesson plan and individualized teaching materials. Details about RDP data and analytical functions can be found at http://rdp.cme.msu.edu/.

show abstract

DNA–DNA hybridization values and their relationship to whole-genome sequence similarities

et al. 2007

View full text Add to dashboard Cite

DNA-DNA hybridization (DDH) values have been used by bacterial taxonomists since the 1960s to determine relatedness between strains and are still the most important criterion in the delineation of bacterial species. Since the extent of hybridization between a pair of strains is ultimately governed by their respective genomic sequences, we examined the quantitative relationship between DDH values and genome sequence-derived parameters, such as the average nucleotide identity (ANI) of common genes and the percentage of conserved DNA. A total of 124 DDH values were determined for 28 strains for which genome sequences were available. The strains belong to six important and diverse groups of bacteria for which the intra-group 16S rRNA gene sequence identity was greater than 94 %. The results revealed a close relationship between DDH values and ANI and between DNA-DNA hybridization and the percentage of conserved DNA for each pair of strains. The recommended cut-off point of 70 % DDH for species delineation corresponded to 95 % ANI and 69 % conserved DNA. When the analysis was restricted to the protein-coding portion of the genome, 70 % DDH corresponded to 85 % conserved genes for a pair of strains. These results reveal extensive gene diversity within the current concept of 'species'. Examination of reciprocal values indicated that the level of experimental error associated with the DDH method is too high to reveal the subtle differences in genome size among the strains sampled. It is concluded that ANI can accurately replace DDH values for strains for which genome sequences are available.

show abstract

Ribosomal Database Project: data and tools for high throughput rRNA analysis

et al. 2013

View full text Add to dashboard Cite

Ribosomal Database Project (RDP; http://rdp.cme.msu.edu/) provides the research community with aligned and annotated rRNA gene sequence data, along with tools to allow researchers to analyze their own rRNA gene sequences in the RDP framework. RDP data and tools are utilized in fields as diverse as human health, microbial ecology, environmental microbiology, nucleic acid chemistry, taxonomy and phylogenetics. In addition to aligned and annotated collections of bacterial and archaeal small subunit rRNA genes, RDP now includes a collection of fungal large subunit rRNA genes. RDP tools, including Classifier and Aligner, have been updated to work with this new fungal collection. The use of high-throughput sequencing to characterize environmental microbial populations has exploded in the past several years, and as sequence technologies have improved, the sizes of environmental datasets have increased. With release 11, RDP is providing an expanded set of tools to facilitate analysis of high-throughput data, including both single-stranded and paired-end reads. In addition, most tools are now available as open source packages for download and local use by researchers with high-volume needs or who would like to develop custom analysis pipelines.

show abstract

Genomic insights that advance the species definition for prokaryotes

Konstantinidis

Tiedje

2005

Proc. Natl. Acad. Sci. U.S.A.

1,703

1,424

View full text Add to dashboard Cite

To help advance the species definition for prokaryotes, we have compared the gene content of 70 closely related and fully sequenced bacterial genomes to identify whether species boundaries exist, and to determine the role of the organism's ecology on its shared gene content. We found the average nucleotide identity (ANI) of the shared genes between two strains to be a robust means to compare genetic relatedness among strains, and that ANI values of Ϸ94% corresponded to the traditional 70% DNA-DNA reassociation standard of the current species definition. At the 94% ANI cutoff, current species includes only moderately homogeneous strains, e.g., most of the >4-Mb genomes share only 65-90% of their genes, apparently as a result of the strains having evolved in different ecological settings. Furthermore, diagnostic genetic signatures (boundaries) are evident between groups of strains of the same species, and the intergroup genetic similarity can be as high as 98 -99% ANI, indicating that justifiable species might be found even among organisms that are nearly identical at the nucleotide level. Notably, a large fraction, e.g., up to 65%, of the differences in gene content within species is associated with bacteriophage and transposase elements, revealing an important role of these elements during bacterial speciation. Our findings are consistent with a definition for species that would include a more homogeneous set of strains than provided by the current definition and one that considers the ecology of the strains in addition to their evolutionary distance.prokaryotic diversity ͉ species concept ͉ nucleotide identity ͉ comparative genomics ͉ evolution A bacterial species is essentially considered to be a collection of strains that are characterized by at least one diagnostic phenotypic trait and whose purified DNA molecules show 70% or higher reassociation values, following the recommendations in the classical paper by Wayne et al. (1). This species definition, while pragmatic and universally applicable within the prokaryotic world (2-4), has been criticized for being difficult to implement because of technological limitations in identifying diagnostic traits and in performing the pairwise DNA-DNA reassociation experiments, and for being often not adequately predictive of phenotype (5-7). Furthermore, this definition is much broader and is not encompassed by any of the eukaryotic species definitions (8). Indeed, applying this standard to eukaryotic species would lead to the inclusion of members of many taxonomic tribes in the same species, e.g., all of the primates should then belong to the same species (8-10). Last, several strains that show Ͼ70% DNA-DNA reassociation values are classified into different species, even different genera, usually on the basis of pathogenicity or host range, such as strains of Escherichia coli and Shigella spp. (11), making the current prokaryotic classification somehow inconsistent.To gain insight into these issues, we performed pairwise, wholegenome comparisons between all related (i.e...

show abstract

DNA recovery from soils of diverse composition

Zhou

Bruns

Tiedje

1996

Appl Environ Microbiol

2,491

1,080

View full text Add to dashboard Cite

A simple, rapid method for bacterial lysis and direct extraction of DNA from soils with minimal shearing was developed to address the risk of chimera formation from small template DNA during subsequent PCR. The method was based on lysis with a high-salt extraction buffer (1.5 M NaCl) and extended heating (2 to 3 h) of the soil suspension in the presence of sodium dodecyl sulfate (SDS), hexadecyltrimethylammonium bromide, and proteinase K. The extraction method required 6 h and was tested on eight soils differing in organic carbon, clay content, and pH, including ones from which DNA extraction is difficult. The DNA fragment size in crude extracts from all soils was >23 kb. Preliminary trials indicated that DNA recovery from two soils seeded with gram-negative bacteria was 92 to 99%. When the method was tested on all eight unseeded soils, microscopic examination of indigenous bacteria in soil pellets before and after extraction showed variable cell lysis efficiency (26 to 92%). Crude DNA yields from the eight soils ranged from 2.5 to 26.9 g of DNA g ؊1 , and these were positively correlated with the organic carbon content in the soil (r ‫؍‬ 0.73). DNA yields from gram-positive bacteria from pure cultures were two to six times higher when the high-salt-SDS-heat method was combined with mortar-and-pestle grinding and freeze-thawing, and most DNA recovered was of high molecular weight. Four methods for purifying crude DNA were also evaluated for percent recovery, fragment size, speed, enzyme restriction, PCR amplification, and DNA-DNA hybridization. In general, all methods produced DNA pure enough for PCR amplification. Since soil type and microbial community characteristics will influence DNA recovery, this study provides guidance for choosing appropriate extraction and purification methods on the basis of experimental goals.

show abstract

Diverse and abundant antibiotic resistance genes in Chinese swine farms

Zhu

Johnson

et al. 2013

Proc. Natl. Acad. Sci. U.S.A.

1,922

1,042

View full text Add to dashboard Cite

Antibiotic resistance genes (ARGs) are emerging contaminants posing a potential worldwide human health risk. Intensive animal husbandry is believed to be a major contributor to the increased environmental burden of ARGs. Despite the volume of antibiotics used in China, little information is available regarding the corresponding ARGs associated with animal farms. We assessed type and concentrations of ARGs at three stages of manure processing to land disposal at three large-scale (10,000 animals per year) commercial swine farms in China. In-feed or therapeutic antibiotics used on these farms include all major classes of antibiotics except vancomycins. High-capacity quantitative PCR arrays detected 149 unique resistance genes among all of the farm samples, the top 63 ARGs being enriched 192-fold (median) up to 28,000-fold (maximum) compared with their respective antibiotic-free manure or soil controls. Antibiotics and heavy metals used as feed supplements were elevated in the manures, suggesting the potential for coselection of resistance traits. The potential for horizontal transfer of ARGs because of transposon-specific ARGs is implicated by the enrichment of transposases—the top six alleles being enriched 189-fold (median) up to 90,000-fold in manure—as well as the high correlation ( r 2 = 0.96) between ARG and transposase abundance. In addition, abundance of ARGs correlated directly with antibiotic and metal concentrations, indicating their importance in selection of resistance genes. Diverse, abundant, and potentially mobile ARGs in farm samples suggest that unmonitored use of antibiotics and metals is causing the emergence and release of ARGs to the environment.

show abstract

The Ribosomal Database Project (RDP-II): sequences and tools for high-throughput rRNA analysis

Cole

Chai

Farris

et al. 2004

Nucleic Acids Research

1,263

901

View full text Add to dashboard Cite

The Ribosomal Database Project (RDP-II) provides the research community with aligned and annotated rRNA gene sequences, along with analysis services and a phylogenetically consistent taxonomic framework for these data. Updated monthly, these services are made available through the RDP-II website (http://rdp.cme.msu.edu/). RDP-II release 9.21 (August 2004) contains 101 632 bacterial small subunit rRNA gene sequences in aligned and annotated format. High-throughput tools for initial taxonomic placement, identification of related sequences, probe and primer testing, data navigation and subalignment download are provided. The RDP-II email address for questions or comments is rdpstaff@msu.edu.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

James M. Tiedje

Naïve Bayesian Classifier for Rapid Assignment of rRNA Sequences into the New Bacterial Taxonomy

The Ribosomal Database Project: improved alignments and new tools for rRNA analysis

DNA–DNA hybridization values and their relationship to whole-genome sequence similarities

Ribosomal Database Project: data and tools for high throughput rRNA analysis

Genomic insights that advance the species definition for prokaryotes

DNA recovery from soils of diverse composition

Diverse and abundant antibiotic resistance genes in Chinese swine farms

The Ribosomal Database Project (RDP-II): sequences and tools for high-throughput rRNA analysis

Contact Info

Product

Resources

About