2017
DOI: 10.1038/nchembio.2319
|View full text |Cite
|
Sign up to set email alerts
|

A new genome-mining tool redefines the lasso peptide biosynthetic landscape

Abstract: Ribosomally synthesized and post-translationally modified peptide (RiPP) natural products are attractive for genome-driven discovery and re-engineering, but limitations in bioinformatic methods and exponentially increasing genomic data make large-scale mining difficult. We report RODEO (Rapid ORF Description and Evaluation Online), which combines hidden Markov model-based analysis, heuristic scoring, and machine learning to identify biosynthetic gene clusters and predict RiPP precursor peptides. We initially f… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

12
571
1

Year Published

2017
2017
2022
2022

Publication Types

Select...
7

Relationship

0
7

Authors

Journals

citations
Cited by 376 publications
(586 citation statements)
references
References 59 publications
12
571
1
Order By: Relevance
“…Many of these tools identify complete clusters of various RiPP families through comparison to a training set of known RiPP clusters, leveraging combinations of hidden Markov models, “rules” for precursor peptide identification (e.g., short open reading frames [ORFs] rich in residues that are acted on by the associated biosynthetic enzymes), and comparison to databases of known RiPP sequences. The identification of precursor peptides remains a challenge, and a recent approach called RODEO leverages a machine-learning based approach for this purpose [20]. …”
Section: Large Scale Ripp Identification Using Automated Genome Mininmentioning
confidence: 99%
See 2 more Smart Citations
“…Many of these tools identify complete clusters of various RiPP families through comparison to a training set of known RiPP clusters, leveraging combinations of hidden Markov models, “rules” for precursor peptide identification (e.g., short open reading frames [ORFs] rich in residues that are acted on by the associated biosynthetic enzymes), and comparison to databases of known RiPP sequences. The identification of precursor peptides remains a challenge, and a recent approach called RODEO leverages a machine-learning based approach for this purpose [20]. …”
Section: Large Scale Ripp Identification Using Automated Genome Mininmentioning
confidence: 99%
“…Biochemical characterization of lasso peptides has indeed born out that assertion as many of the original “rules” for lasso precursor identification have been greatly expanded [20,32]. Lasso peptides are characterized by a knotted structure where the C-terminus is threaded through a macrocycle formed by a covalent bond between the N-terminal amine and an aspartate or glutamate side chain [4].…”
Section: Examples Of Bottom-up Ripp Discoverymentioning
confidence: 99%
See 1 more Smart Citation
“…Recent breakthroughs in crystallization allow for the X-ray single-crystal diffraction of submicrogram quantities of natural products, including those that are otherwise noncrystalline (22). Moreover, at least partially predicting small-molecule structure directly from genome sequence is becoming increasingly sophisticated (16), which has the potential to shift the role of spectroscopy from de novo structure determination into predicted structure confirmation (23) and even enable the discovery of new natural products via direct synthesis of genome-predicted structures (24).…”
mentioning
confidence: 99%
“…Just as the application of more systematic approaches has powerfully enabled functional characterization in related fields, new systematic tools for characterizing the functions of natural products holds similar promise. For example, computer algorithms can increasingly predict biosynthetic gene clusters responsible for a specific functional domain of a natural product and rapidly identify many other natural products with predicted activities (23). And small-molecule microarrays allow for several thousand protein binding assays to be run in parallel rapidly providing information about the binding targets of small molecules (26).…”
mentioning
confidence: 99%