RNA-binding proteins are key regulators of gene expression, yet only a small fraction have been functionally characterized. Here we report a systematic analysis of the RNA motifs recognized by RNA-binding proteins, encompassing 205 distinct genes from 24 diverse eukaryotes. The sequence specificities of RNA-binding proteins display deep evolutionary conservation, and the recognition preferences for a large fraction of metazoan RNA-binding proteins can thus be inferred from their RNA-binding domain sequence. The motifs that we identify in vitro correlate well with in vivo RNA-binding data. Moreover, we can associate them with distinct functional roles in diverse types of post-transcriptional regulation, enabling new insights into the functions of RNA-binding proteins both in normal physiology and in human disease. These data provide an unprecedented overview of RNA-binding proteins and their targets, and constitute an invaluable resource for determining post-transcriptional regulatory mechanisms in eukaryotes.
Summary: pybedtools is a flexible Python software library for manipulating and exploring genomic datasets in many common formats. It provides an intuitive Python interface that extends upon the popular BEDTools genome arithmetic tools. The library is well documented and efficient, and allows researchers to quickly develop simple, yet powerful scripts that enable complex genomic analyses.Availability: pybedtools is maintained under the GPL license. Stable versions of pybedtools as well as documentation are available on the Python Package Index at http://pypi.python.org/pypi/pybedtools.Contact: dalerr@niddk.nih.gov; arq5x@virginia.eduSupplementary Information: Supplementary data are available at Bioinformatics online.
CTCF sites are abundant in the genomes of diverse species but their function is enigmatic. We used chromosome conformation capture to determine long-range interactions among CTCF/cohesin sites over 2 Mb on human chromosome 11 encompassing the β-globin locus and flanking olfactory receptor genes. Although CTCF occupies these sites in both erythroid K562 cells and fibroblast 293T cells, the long-range interaction frequencies among the sites are highly cell type specific, revealing a more densely clustered organization in the absence of globin gene activity. Both CTCF and cohesins are required for the cell-type-specific chromatin conformation. Furthermore, loss of the organizational loops in K562 cells through reduction of CTCF with shRNA results in acquisition of repressive histone marks in the globin locus and reduces globin gene expression whereas silent flanking olfactory receptor genes are unaffected. These results support a genome-wide role for CTCF/cohesin sites through loop formation that both influences transcription and contributes to cell-type-specific chromatin organization and function.chromatin loops | insulator | CTCF | globin | transcription E ukaryotic chromatin is dynamically organized to form distinct transcriptionally active or silent domains (1). Chromatin insulators are proposed to play a role in the establishment of such domains within which proper enhancer-gene interactions occur and improper ones are excluded (2). Insulators can function as boundary elements between active and silent chromatin domains and can also interfere with enhancer-gene interaction when placed between them. Evidence also suggests that insulators are involved in higher-order chromatin organization by clustering together with other insulators. Such "insulator bodies" can be visualized at the nuclear periphery of certain cells in Drosophila and are thought to coalesce through the interaction of proteins such as Su(Hw) and Drosophila CTCF (3, 4).In mammalian cells, insulators are bound by CTCF, a protein with 11 zinc fingers through which it can bind to a range of DNA sequences, to itself, and to nuclear structural proteins such as nucleophosmin and matrix attachment regions (MARs) (2). Although insulators would be expected to reside primarily in intergenic regions where they could provide boundary or enhancer blocking activity, genome scans of CTCF enrichment show that CTCF sites are overrepresented in genes and promoter regions, with only 41-56% at intergenic locations (5). In Drosophila, between about 49% and 64% of insulator sites are located in intergenic regions (4). Thus, although CTCF sites are proposed to be insulators, their in vivo functions may not be limited to insulation. Recently, cohesin has been reported to colocalize at most sites of CTCF enrichment and to play a role in transcriptional insulation, but the significance of this joint occupancy to higher-order chromatin structures is unknown (6).In the Igf2/H19 locus, CTCF binding to the imprinting control region on the maternal allele is required to loop ...
Primary sex determination “switches” evolve rapidly, but Doublesex (DSX) related transcription factors (DMRTs) act downstream of these switches to control sexual development in most animal species. Drosophila dsx encodes female- and male-specific isoforms (DSXF and DSXM), but little is known about how dsx controls sexual development, whether DSXF and DSXM bind different targets, or how DSX proteins direct different outcomes in diverse tissues. We undertook genome-wide analyses to identify DSX targets using in vivo occupancy, binding site prediction, and evolutionary conservation. We find that DSXF and DSXM bind thousands of the same targets in multiple tissues in both sexes, yet these targets have sex- and tissue-specific functions. Interestingly, DSX targets show considerable overlap with targets identified for mouse DMRT1. DSX targets include transcription factors and signaling pathway components providing for direct and indirect regulation of sex-biased expression.
A major role of the RNAi pathway in Schizosaccharomyces pombe is to nucleate heterochromatin, but it remains unclear whether this mechanism is conserved. To address this question in Drosophila, we performed genomewide localization of Argonaute2 (AGO2) by chromatin immunoprecipitation (ChIP)-seq in two different embryonic cell lines and found that AGO2 localizes to euchromatin but not heterochromatin. This localization pattern is further supported by immunofluorescence staining of polytene chromosomes and cell lines, and these studies also indicate that a substantial fraction of AGO2 resides in the nucleus. Intriguingly, AGO2 colocalizes extensively with CTCF/CP190 chromatin insulators but not with genomic regions corresponding to endogenous siRNA production. Moreover, AGO2, but not its catalytic activity or Dicer-2, is required for CTCF/CP190-dependent Fab-8 insulator function. AGO2 interacts physically with CTCF and CP190, and depletion of either CTCF or CP190 results in genome-wide loss of AGO2 chromatin association. Finally, mutation of CTCF, CP190, or AGO2 leads to reduction of chromosomal looping interactions, thereby altering gene expression. We propose that RNAi-independent recruitment of AGO2 to chromatin by insulator proteins promotes the definition of transcriptional domains throughout the genome.
BackgroundGene dosage change is a mild perturbation that is a valuable tool for pathway reconstruction in Drosophila. While it is often assumed that reducing gene dose by half leads to two-fold less expression, there is partial autosomal dosage compensation in Drosophila, which may be mediated by feedback or buffering in expression networks.ResultsWe profiled expression in engineered flies where gene dose was reduced from two to one. While expression of most one-dose genes was reduced, the gene-specific dose responses were heterogeneous. Expression of two-dose genes that are first-degree neighbors of one-dose genes in novel network models also changed, and the directionality of change depended on the response of one-dose genes.ConclusionsOur data indicate that expression perturbation propagates in network space. Autosomal compensation, or the lack thereof, is a gene-specific response, largely mediated by interactions with the rest of the transcriptome.
Key Points Ldb1 complexes bind to and positively regulate the expression of a large number of erythroid genes including most known Gata1-regulated genes. Ldb1 complexes and Klf1 frequently bind together and coregulate erythroid gene expression.
Chromatin insulators organize the genome into distinct transcriptional domains and contribute to cell type–specific chromatin organization. However, factors regulating tissue-specific insulator function have not yet been discovered. Here we identify the RNA recognition motif-containing protein Shep as a direct interactor of two individual components of the gypsy insulator complex in Drosophila. Mutation of shep improves gypsy-dependent enhancer blocking, indicating a role as a negative regulator of insulator activity. Unlike ubiquitously expressed core gypsy insulator proteins, Shep is highly expressed in the central nervous system (CNS) with lower expression in other tissues. We developed a novel, quantitative tissue-specific barrier assay to demonstrate that Shep functions as a negative regulator of insulator activity in the CNS but not in muscle tissue. Additionally, mutation of shep alters insulator complex nuclear localization in the CNS but has no effect in other tissues. Consistent with negative regulatory activity, ChIP–seq analysis of Shep in a CNS-derived cell line indicates substantial genome-wide colocalization with a single gypsy insulator component but limited overlap with intact insulator complexes. Taken together, these data reveal a novel, tissue-specific mode of regulation of a chromatin insulator.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.