2018
DOI: 10.1093/nar/gky1097
|View full text |Cite
|
Sign up to set email alerts
|

CATH: expanding the horizons of structure-based functional annotations for genome sequences

Abstract: This article provides an update of the latest data and developments within the CATH protein structure classification database (http://www.cathdb.info). The resource provides two levels of release: CATH-B, a daily snapshot of the latest structural domain boundaries and superfamily assignments, and CATH+, which adds layers of derived data, such as predicted sequence domains, functional annotations and functional clustering (known as Functional Families or FunFams). The most recent CATH+ release (version 4.2) pro… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
137
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
7
3

Relationship

2
8

Authors

Journals

citations
Cited by 138 publications
(137 citation statements)
references
References 23 publications
0
137
0
Order By: Relevance
“…Here, we provide a systematic overview of the current knowledge of enzymatic catalysis from the point of view of the catalytic residues. We use the Mechanism and Catalytic Site Atlas (M-CSA) 3 (15) database as a manually curated data set of catalytic residues and other mechanistic information. We start by considering the frequency of the 20 amino acid types as catalytic residues and whether they act through their side chain or backbone.…”
mentioning
confidence: 99%
“…Here, we provide a systematic overview of the current knowledge of enzymatic catalysis from the point of view of the catalytic residues. We use the Mechanism and Catalytic Site Atlas (M-CSA) 3 (15) database as a manually curated data set of catalytic residues and other mechanistic information. We start by considering the frequency of the 20 amino acid types as catalytic residues and whether they act through their side chain or backbone.…”
mentioning
confidence: 99%
“…As an extension of the approach described above using enzyme terminology, we have also systematically investigated the distribution of pseudoenzymes in the protein universe, using protein families from the CATH (class, architecture, topology, homology)-Gene3D resource (57,58), which links protein domain sequences to structures and experimental functions. CATH-Gene3D classifies ~435,000 domain structures and ~95 million protein domain sequences into ~6100 evolutionary superfamilies (57). These can then be subclassified into functional families (FunFams) that share highly similar structures and functions based on sequence patterns, specificity-determining positions, and other conserved positions (Fig.…”
Section: Identifying Pseudoenzymes In Proteomic Databases By Exploitimentioning
confidence: 99%
“…The pipeline was built using the Ruffus (Python) (Goodstadt, 2010) architecture that performs multiple tasks to generate models with substantial quality and reliability. In the modelling process, templates were selected from TOCCATA (Ochoa Montaño B, Blundell TL, unpublished) a database of consensus profiles built from CATH v4.2.0 and SCOPe v2.07 (62,63) based classification of protein structures (PDB files). PDBs within each profile are clustered based on sequence similarity using CD-HIT (64,65)) and structures are aligned using BATON, which was developed as a streamlined version of the in-house program COMPARER (66).…”
Section: Structure Modelling Of Selected Drug Targetsmentioning
confidence: 99%