2012
DOI: 10.1002/pmic.201100534
|View full text |Cite
|
Sign up to set email alerts
|

PRDB: Protein Repeat DataBase

Abstract: Rapidly increasing genomic data present new challenges for scientists: making sense of millions of amino acid sequences requires a systematic approach and information about their 3D structure, function, and evolution. Over the last decade, numerous studies demonstrated the fundamental importance of protein tandem repeats and their involvement in human diseases. Bioinformatics analysis of these regions requires special computer programs and databases, since the conventional approaches predominantly developed fo… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
8
0

Year Published

2013
2013
2023
2023

Publication Types

Select...
9

Relationship

1
8

Authors

Journals

citations
Cited by 13 publications
(8 citation statements)
references
References 26 publications
(29 reference statements)
0
8
0
Order By: Relevance
“…A de novo approach was preferred over annotating domain repeats with profile hidden Markov models (HMMs) for known domain repeats, such as those provided by Pfam (Punta et al 2012) or SUPERFAMILY (Wilson et al 2009) for two reasons. First, the set of known domain repeats in these databases is not exhaustive and, accordingly, de novo approaches find many repeats that do not match any already known domain (Ponting et al 2001; Jorda et al 2012). Second, many HMMs provided by Pfam and SUPERFAMILY do not specifically match an individual repeat unit, but rather stretches of several repeat units in tandem.…”
Section: Resultsmentioning
confidence: 99%
“…A de novo approach was preferred over annotating domain repeats with profile hidden Markov models (HMMs) for known domain repeats, such as those provided by Pfam (Punta et al 2012) or SUPERFAMILY (Wilson et al 2009) for two reasons. First, the set of known domain repeats in these databases is not exhaustive and, accordingly, de novo approaches find many repeats that do not match any already known domain (Ponting et al 2001; Jorda et al 2012). Second, many HMMs provided by Pfam and SUPERFAMILY do not specifically match an individual repeat unit, but rather stretches of several repeat units in tandem.…”
Section: Resultsmentioning
confidence: 99%
“…Repeat proteins play key roles in biological processes ranging from adhesion to signaling to defense mechanisms [1]. These proteins consist of adjacent series of usually non-identical repeated amino acid sequences; in most cases, these repeated units fold cooperatively into either a solenoid-shaped or a toroid-shaped structure [24].…”
Section: Introductionmentioning
confidence: 99%
“…Widely used structural classifications such as CATH (35) and SCOP (36) also do not explicitly annotate repeats in protein structures, although it may be possible to leverage individual annotations to find similar repeats. Some databases exist for the detection of repeats from sequence (37–39), but usually these are limited to short tandem repeats and do not take into account divergent repeats, such as solenoids or toroids. The main domain sequence databases such as Pfam (40) and SMART (41) do not excel at the annotation of these repeat types either, as coverage is rather low and many repeat units go undetected.…”
Section: Introductionmentioning
confidence: 99%