The eukaryotic linear motif (ELM) resource is a repository of manually curated experimentally validated short linear motifs (SLiMs). Since the initial release almost 20 years ago, ELM has become an indispensable resource for the molecular biology community for investigating functional regions in many proteins. In this update, we have added 21 novel motif classes, made major revisions to 12 motif classes and added >400 new instances mostly focused on DNA damage, the cytoskeleton, SH2-binding phosphotyrosine motifs and motif mimicry by pathogenic bacterial effector proteins. The current release of the ELM database contains 289 motif classes and 3523 individual protein motif instances manually curated from 3467 scientific publications. ELM is available at: http://elm.eu.org.
The Eukaryotic Linear Motif (ELM) resource (http://elm.eu.org) is a manually curated database of short linear motifs (SLiMs). In this update, we present the latest additions to this resource, along with more improvements to the web interface. ELM 2016 contains more than 240 different motif classes with over 2700 experimentally validated instances, manually curated from more than 2400 scientific publications. In addition, more data have been made available as individually searchable pages and are downloadable in various formats.
Short linear motifs (SLiMs) are protein binding modules that play major roles in almost all cellular processes. SLiMs are short, often highly degenerate, difficult to characterize and hard to detect. The eukaryotic linear motif (ELM) resource (elm.eu.org) is dedicated to SLiMs, consisting of a manually curated database of over 275 motif classes and over 3000 motif instances, and a pipeline to discover candidate SLiMs in protein sequences. For 15 years, ELM has been one of the major resources for motif research. In this database update, we present the latest additions to the database including 32 new motif classes, and new features including Uniprot and Reactome integration. Finally, to help provide cellular context, we present some biological insights about SLiMs in the cell cycle, as targets for bacterial pathogenicity and their functionality in the human kinome.
The CCCTC-binding factor (CTCF) is known to establish long-range DNA contacts that alter the three-dimensional architecture of chromatin, but how the presence of CTCF influences nearby gene expression is still poorly understood. Here, we analyze CTCF chromatin immunoprecipitation sequencing, RNA sequencing, and Hi-C data, together with genotypes from a healthy human cohort, and measure statistical associations between inter-individual variability in CTCF binding and alternative exon usage. We demonstrate that CTCF-mediated chromatin loops between promoters and intragenic regions are prevalent and that when exons are in physical proximity with their promoters, CTCF binding correlates with exon inclusion in spliced mRNA. Genome-wide, CTCF-bound exons are enriched for genes involved in signaling and cellular stress-response pathways. Structural analysis of three specific examples, checkpoint kinase 2 (CHK2), CDC-like kinase 3 (CLK3), and euchromatic histone-lysine N-methyltransferase (EHMT1), suggests that CTCF-mediated exon inclusion is likely to downregulate enzyme activity by disrupting annotated protein domains. In total, our study suggests that alternative exon usage is regulated by CTCF-dependent chromatin structure.
Background: The recently constructed river buffalo whole-genome radiation hybrid panel (BBURH5000) has already been used to generate preliminary radiation hybrid (RH) maps for several chromosomes, and buffalo-bovine comparative chromosome maps have been constructed. Here,
Degrons are the elements that are used by E3 ubiquitin ligases to target proteins for degradation. Most degrons are short linear motifs embedded within the sequences of modular proteins. As regulatory sites for protein abundance, they are important for many different cellular processes, such as progression through the cell cycle and monitoring cellular hypoxia. Degrons enable the elimination of proteins that are no longer required, preventing their possible dysfunction. Although the human genome encodes ~600 E3 ubiquitin ligases, only a fraction of these enzymes have well-defined target degrons. Thus, for most cellular proteins, the destruction mechanisms are poorly understood. This is important for many diseases, especially for cancer, a disease that involves the enhanced expression of oncogenes and the persistence of encoded oncoproteins coupled with reduced abundance of tumor suppressors. Loss-of-function mutations occur in the degrons of several oncoproteins, such as the transcription factors MYC and NRF2, and in various mitogenic receptors, such as NOTCH1 and several receptor tyrosine kinases. Mutations eliminating the function of the β-catenin degron are found in many cancers and are considered one of the most abundant mutations driving carcinogenesis. In this Review, we describe the current knowledge of degrons in cancer and suggest that increased research on the "dark degrome" (unknown degron-E3 relationships) would enhance progress in cancer research.
The first reported receptor for SARS-CoV-2 on host cells was the angiotensin-converting enzyme 2 (ACE2). However, the viral spike protein also has an RGD motif, suggesting that cell surface integrins may be co-receptors. We examined the sequences of ACE2 and integrins with the Eukaryotic Linear Motif (ELM) resource and identified candidate short linear motifs (SLiMs) in their short, unstructured, cytosolic tails with potential roles in endocytosis, membrane dynamics, autophagy, cytoskeleton, and cell signaling. These SLiM candidates are highly conserved in vertebrates and may interact with the μ2 subunit of the endocytosis-associated AP2 adaptor complex, as well as with various protein domains (namely, I-BAR, LC3, PDZ, PTB, and SH2) found in human signaling and regulatory proteins. Several motifs overlap in the tail sequences, suggesting that they may act as molecular switches, such as in response to tyrosine phosphorylation status. Candidate LC3-interacting region (LIR) motifs are present in the tails of integrin β3 and ACE2, suggesting that these proteins could directly recruit autophagy components. Our findings identify several molecular links and testable hypotheses that could uncover mechanisms of SARS-CoV-2 attachment, entry, and replication against which it may be possible to develop host-directed therapies that dampen viral infection and disease progression. Several of these SLiMs have now been validated to mediate the predicted peptide interactions.
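The kind of sequence scan described above can be illustrated with a minimal sketch. It assumes that a motif can be approximated by a plain regular expression; the sequence, the `MOTIFS` table, and the `find_motifs` helper are all hypothetical and are not taken from the ELM resource or its pipeline. Only the RGD tripeptide itself comes from the text.

```python
import re

# Hypothetical toy sequence (not a real protein), used only to show the scan.
TOY_SEQUENCE = "MKTAYRGDLLQESPRGDA"

# Illustrative pattern table. "RGD" is the tripeptide named in the text;
# writing it as a bare regex is an assumption, not an ELM class definition.
MOTIFS = {"RGD": r"RGD"}


def find_motifs(sequence, motifs):
    """Return (name, start, end, matched_text) tuples with 1-based, inclusive positions."""
    hits = []
    for name, pattern in motifs.items():
        # A lookahead wrapper lets overlapping matches be reported as well.
        for match in re.finditer(f"(?=({pattern}))", sequence):
            start = match.start(1)
            text = match.group(1)
            hits.append((name, start + 1, start + len(text), text))
    return sorted(hits, key=lambda hit: hit[1])


hits = find_motifs(TOY_SEQUENCE, MOTIFS)
for name, start, end, text in hits:
    print(f"{name}\t{start}-{end}\t{text}")
```

A pattern match of this kind only yields candidates. As the abstract notes, conservation, structural context, and experimental validation are still needed before a hit can be treated as a functional motif.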
The Protein Data Bank in Europe-Knowledge Base (PDBe-KB, https://pdbe-kb.org) is a community-driven, collaborative resource for literature-derived, manually curated and computationally predicted structural and functional annotations of macromolecular structure data, contained in the Protein Data Bank (PDB). The goal of PDBe-KB is two-fold: (i) to increase the visibility and reduce the fragmentation of annotations contributed by specialist data resources, and to make these data more findable, accessible, interoperable and reusable (FAIR) and (ii) to place macromolecular structure data in their biological context, thus facilitating their use by the broader scientific community in fundamental and applied research. Here, we describe the guidelines of this collaborative effort, the current status of contributed data, and the PDBe-KB infrastructure, which includes the data exchange format, the deposition system for added value annotations, the distributable database containing the assembled data, and programmatic access endpoints. We also describe a series of novel web-pages—the PDBe-KB aggregated views of structure data—which combine information on macromolecular structures from many PDB entries. We have recently released the first set of pages in this series, which provide an overview of available structural and functional information for a protein of interest, referenced by a UniProtKB accession.