2014
DOI: 10.1093/database/bau106
|View full text |Cite
|
Sign up to set email alerts
|

BactPepDB: a database of predicted peptides from a exhaustive survey of complete prokaryote genomes

Abstract: With the recent progress in complete genome sequencing, mining the increasing amount of genomic information available should in theory provide the means to discover new classes of peptides. However, annotation pipelines often do not consider small reading frames likely to be expressed. BactPepDB, available online at http://bactpepdb.rpbs.univ-paris-diderot.fr, is a database that aims at providing an exhaustive re-annotation of all complete prokaryotic genomes—chromosomal and plasmid DNA—available in RefSeq for… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
17
0

Year Published

2015
2015
2024
2024

Publication Types

Select...
9
1

Relationship

0
10

Authors

Journals

citations
Cited by 22 publications
(17 citation statements)
references
References 64 publications
(61 reference statements)
0
17
0
Order By: Relevance
“…Firstly, RiPP “gene clusters” can be as small as two genes: a precursor peptide and a tailoring protein, especially when further hydrolytic processing can be carried out by endogenous peptidases [ 120 ]. The prevalence of putative small peptides encoded throughout genomes [ 160 ] make it difficult to predict which of these are post-translationally processed, and some small genes are overlooked by automated gene annotation software, which means that some putative RiPP precursors are not even listed in databases. Furthermore, novel classes are difficult to identify precisely due to their novelty compared to known pathways.…”
Section: Reviewmentioning
confidence: 99%
“…Firstly, RiPP “gene clusters” can be as small as two genes: a precursor peptide and a tailoring protein, especially when further hydrolytic processing can be carried out by endogenous peptidases [ 120 ]. The prevalence of putative small peptides encoded throughout genomes [ 160 ] make it difficult to predict which of these are post-translationally processed, and some small genes are overlooked by automated gene annotation software, which means that some putative RiPP precursors are not even listed in databases. Furthermore, novel classes are difficult to identify precisely due to their novelty compared to known pathways.…”
Section: Reviewmentioning
confidence: 99%
“…Owing to the progress of sequencing techniques, an increasing number of peptide sequences, validated or hypothetical is available. Over 1 800 000 candidate sequences are identified among all prokaryotes ( 2 ), and several millions of new sequences are expected from venom analyses ( 3 ), but their structures remain largely unknown. The rational design of peptides at protein/protein ( 4 6 ) and protein/membrane ( 7 ) interfaces is also raising interest ( 8 , 9 ).…”
Section: Introductionmentioning
confidence: 99%
“…Pseudogenes are also a problem in metagenome functional annotation, and they could represent up to 35% in prokaryotic genomes (Liu et al, 2004 ). To address this annotation challenge, there are databases like BactPepDB (Rey et al, 2014 ) and Pseudogene.org for short sequences and pseudogenes of prokaryotic and eukaryotic organisms (Karro et al, 2007 ). A search in such databases before further analysis could be useful to discard non-coding sequences.…”
Section: Reconstructing the Genomic Content Of The Microbial Communitmentioning
confidence: 99%