SUMMARY Transcription factor (TF) DNA sequence preferences direct their regulatory activity, but are currently known for only ~1% of all eukaryotic TFs. Broadly sampling DNA-binding domain (DBD) types from multiple eukaryotic clades, we determined DNA sequence preferences for >1,000 TFs encompassing 54 different DBD classes from 131 diverse eukaryotes. We find that closely related DBDs almost always have very similar DNA sequence preferences, enabling inference of motifs for ~34% of the ~170,000 known or predicted eukaryotic TFs. Sequences matching both measured and inferred motifs are enriched in ChIP-seq peaks and upstream of transcription start sites in diverse eukaryotic lineages. SNPs defining expression quantitative trait loci in Arabidopsis promoters are also enriched for predicted TF binding sites. Importantly, our motif “library” (http://cisbp.ccbr.utoronto.ca) can be used to identify specific TFs whose binding may be altered by human disease risk alleles. These data present a powerful resource for mapping transcriptional networks across eukaryotes.
RNA-binding proteins are key regulators of gene expression, yet only a small fraction have been functionally characterized. Here we report a systematic analysis of the RNA motifs recognized by RNA-binding proteins, encompassing 205 distinct genes from 24 diverse eukaryotes. The sequence specificities of RNA-binding proteins display deep evolutionary conservation, and the recognition preferences for a large fraction of metazoan RNA-binding proteins can thus be inferred from their RNA-binding domain sequence. The motifs that we identify in vitro correlate well with in vivo RNA-binding data. Moreover, we can associate them with distinct functional roles in diverse types of post-transcriptional regulation, enabling new insights into the functions of RNA-binding proteins both in normal physiology and in human disease. These data provide an unprecedented overview of RNA-binding proteins and their targets, and constitute an invaluable resource for determining post-transcriptional regulatory mechanisms in eukaryotes.
unc-73 is required for cell migrations and axon guidance in C. elegans and encodes overlapping isoforms of 283 and 189 kDa that are closely related to the vertebrate Trio and Kalirin proteins, respectively. UNC-73A contains, in order, eight spectrin-like repeats, a Dbl/Pleckstrin homology (DH/PH) element, an SH3-like domain, a second DH/PH element, an immunoglobulin domain, and a fibronectin type III domain. UNC-73B terminates just downstream of the SH3-like domain. The first DH/PH element specifically activates the Rac GTPase in vitro and stimulates actin polymerization when expressed in Rat2 cells. Both functions are eliminated by introducing the S1216F mutation of unc-73(rh40) into this DH domain. Our results suggest that UNC-73 acts cell autonomously in a protein complex to regulate actin dynamics during cell and growth cone migrations.
UNC-6 netrin, a laminin-related protein secreted from neuroglia and neurons along the ventral midline, orients migrating cells and pioneering growth cones on the nematode epidermis. UNC-5, a cell surface protein expressed on motile cells and pioneer axons, orients movements away from UNC-6 sources. UNC-40, a homolog of the cell surface proteins DCC (Deleted in Colorectal Cancer) and neogenin, is also expressed on motile cells and pioneer neurons. UNC-40 acts cell autonomously to orient movement toward UNC-6 sources. For cells coexpressing UNC-5, it helps orient movement away from UNC-6 sources. Finally, UNC-40 helps determine the dorsoventral position of cells undergoing purely longitudinal migrations. Together with the recent report that DCC is a netrin receptor in vertebrates, our results suggest that UNC-40 is a component of UNC-6 receptors on motile cells.
Background In the last stage of the C. elegans equivalent of epiboly, an open pocket in the epidermis is closed by marginal epidermal P/pocket cells that express and require VAB-1/Eph and PLX-2/plexin receptors for migration toward and alignment with contralateral partners at the ventral midline. Cellular mechanisms affected by these signaling proteins remain unknown. Results A cellular bridge comprising four neuron cell bodies that spans the open pocket serves as a substratum for migration of contra-lateral P cell pair P9/10 to the midline which can facilitate similar migration of neighboring P cells. This bridge is formed by a stereotypical rearrangement of five sister pairs of PLX-2 and VAB-1 expressing cells, of which three pairs serve as a scaffold for bridge assembly and two pairs form the bridge. Bridge formation requires VAB-1 kinase-dependent extension of presumptive bridge cell protrusions toward the ventral midline. An unassembled mutant bridge obstructs but does not block P cell progression toward the midline, however, cell type-specific rescue experiments show that VAB-1 or a nearly complete cytoplasmic deletion of VAB-1 expressed by scaffold and bridge cells or by P9/10 can facilitate P cell progression to the midline. MAB-20/semaphorin and VAB-1 also exhibit complex redundancies to regulate adhesion and prevent gaps between sister bridge and scaffold forming cells that would otherwise completely block P cell migration. Conclusions The Eph receptor functions to mediate cell extensions required for bridge formation, independently facilitates P cell migration to the ventral midline, and acts redundantly with PLX-2/plexin to prevent gaps between sister plexin band cells that normally serve as a substratum for P9/10 cell migration.
We tested the general applicability of in situ proteolysis to form protein crystals suitable for structure determination by adding a protease (chymotrypsin or trypsin) digestion step to crystallization trials of 55 bacterial and 14 human proteins that had proven recalcitrant to our best efforts at crystallization or structure determination. This is a work in progress; so far we determined structures of 9 bacterial proteins and the human aminoimidazole ribonucleotide synthetase (AIRS) domain.
The unc-129 gene, like the unc-6 netrin gene, is required to guide pioneer motoraxons along the dorsoventral axis of Caenorhabditis elegans. unc-129 encodes a member of the transforming growth factor-beta (TGF-beta) superfamily of secreted signaling molecules and is expressed in dorsal, but not ventral, rows of body wall muscles. Ectopic expression of UNC-129 from ventral body wall muscle disrupts growth cone and cell migrations that normally occur along the dorsoventral axis. Thus, UNC-129 mediates expression of dorsoventral polarity information required for axon guidance and guided cell migrations in C. elegans.
Networks of coordinated alternative splicing (AS) events play critical roles in development and disease. However, a comprehensive knowledge of the factors that regulate these networks is lacking. We describe a high-throughput system for systematically linking trans-acting factors to endogenous RNA regulatory events. Using this system, we identify hundreds of factors associated with diverse regulatory layers that positively or negatively control AS events linked to cell fate. Remarkably, more than one-third of the regulators are transcription factors. Further analyses of the zinc finger protein Zfp871 and BTB/POZ domain transcription factor Nacc1, which regulate neural and stem cell AS programs, respectively, reveal roles in controlling the expression of specific splicing regulators. Surprisingly, these proteins also appear to regulate target AS programs via binding RNA. Our results thus uncover a large "missing cache" of splicing regulators among annotated transcription factors, some of which dually regulate AS through direct and indirect mechanisms.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.