2017
DOI: 10.1093/nar/gkx1069
|View full text |Cite|
|
Sign up to set email alerts
|

Gene3D: Extensive prediction of globular domains in proteins

Abstract: Gene3D (http://gene3d.biochem.ucl.ac.uk) is a database of globular domain annotations for millions of available protein sequences. Gene3D has previously featured in the Database issue of NAR and here we report a significant update to the Gene3D database. The current release, Gene3D v16, has significantly expanded its domain coverage over the previous version and now contains over 95 million domain assignments. We also report a new method for dealing with complex domain architectures that exist in Gene3D, arisi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
110
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
7
2

Relationship

1
8

Authors

Journals

citations
Cited by 129 publications
(110 citation statements)
references
References 17 publications
0
110
0
Order By: Relevance
“…1A, and Fig. S1) [29]. The result showed that most of the plant species had the co-occurrence of the RNHLS domain with a DNA polymerase domain, a nuclease, and nucleotide triphosphate hydrolase domain ( Fig.…”
Section: Species Selected To Cover the Characters Of Rnhls Proteins Imentioning
confidence: 89%
See 1 more Smart Citation
“…1A, and Fig. S1) [29]. The result showed that most of the plant species had the co-occurrence of the RNHLS domain with a DNA polymerase domain, a nuclease, and nucleotide triphosphate hydrolase domain ( Fig.…”
Section: Species Selected To Cover the Characters Of Rnhls Proteins Imentioning
confidence: 89%
“…The co-occurrence matrix of the RNHLS domain (3. 30.420.10) with other protein domains was downloaded from Gene3D [29]. The heatmap was processed by online software Morpheus (https://software.broadinstitute.org/morpheus/) using the default settings.…”
Section: Co-occurrence Of Rnhls Domain With Other Protein Domainsmentioning
confidence: 99%
“…As an extension of the approach described above using enzyme terminology, we have also systematically investigated the distribution of pseudoenzymes in the protein universe, using protein families from the CATH (class, architecture, topology, homology)-Gene3D resource (57,58), which links protein domain sequences to structures and experimental functions. CATH-Gene3D classifies ~435,000 domain structures and ~95 million protein domain sequences into ~6100 evolutionary superfamilies (57).…”
Section: Identifying Pseudoenzymes In Proteomic Databases By Exploitimentioning
confidence: 99%
“…A given protein structure, corresponding to a single distance matrix, can be formed by many different homologous sequences, and those sequences all satisfy the constraints imposed by the distance matrix. Such solutions to this constraint satisfaction problem (CSP) are given to us by evolution and are available in sequence repositories such as Pfam (22) or Gene3D (23). While the rules of this CSP for a specific protein fold can be found by comparing sequences from one of such repositories and can be captured as Hidden Markov Models (HMM) or position weight matrices (PWMs), often represented as sequence logos, it has thus far not been possible to deduce general rules-those that would connect any given protein fold or distance matrix with a set of sequences.…”
Section: Introductionmentioning
confidence: 99%