To gain insight into how genomic information is translated into cellular and developmental programs, the Drosophila model organism Encyclopedia of DNA Elements (modENCODE) project is comprehensively mapping transcripts, histone modifications, chromosomal proteins, transcription factors, replication proteins and intermediates, and nucleosome properties across a developmental time course and in multiple cell lines. We have generated more than 700 data sets and discovered protein-coding, noncoding, RNA regulatory, replication, and chromatin elements, more than tripling the annotated portion of the Drosophila genome. Correlated activity patterns of these elements reveal a functional regulatory network, which predicts putative new functions for genes, reveals stage- and tissue-specific regulators, and enables gene-expression prediction. Our results provide a foundation for directed experimental and computational studies in Drosophila and related species and also a model for systematic data integration toward comprehensive genomic and functional annotation.
Since the initial annotation of miRNAs from cloned short RNAs by the Ambros, Tuschl, and Bartel groups in 2001, more than a hundred studies have sought to identify additional miRNAs in various species. We report here a meta-analysis of short RNA data from Drosophila melanogaster, aggregating published libraries with 76 data sets that we generated for the modENCODE project. In total, we began with more than 1 billion raw reads from 187 libraries comprising diverse developmental stages, specific tissue- and cell-types, mutant conditions, and/or Argonaute immunoprecipitations. We elucidated several features of known miRNA loci, including multiple phased byproducts of cropping and dicing, abundant alternative 5′ termini of certain miRNAs, frequent 3′ untemplated additions, and potential editing events. We also identified 49 novel genomic locations of miRNA production, and 61 additional candidate loci with limited evidence for miRNA biogenesis. Although these loci broaden the Drosophila miRNA catalog, this work supports the notion that a restricted set of cellular transcripts is competent to be specifically processed by the Drosha/Dicer-1 pathway. Unexpectedly, we detected miRNA production from coding and untranslated regions of mRNAs and found the phenomenon of miRNA production from the antisense strand of known loci to be common. Altogether, this study lays a comprehensive foundation for the study of miRNA diversity and evolution in a complex animal model.
BackgroundThe identification of human disease-related microRNAs (disease miRNAs) is important for further investigating their involvement in the pathogenesis of diseases. More experimentally validated miRNA-disease associations have been accumulated recently. On the basis of these associations, it is essential to predict disease miRNAs for various human diseases. It is useful in providing reliable disease miRNA candidates for subsequent experimental studies.Methodology/Principal FindingsIt is known that miRNAs with similar functions are often associated with similar diseases and vice versa. Therefore, the functional similarity of two miRNAs has been successfully estimated by measuring the semantic similarity of their associated diseases. To effectively predict disease miRNAs, we calculated the functional similarity by incorporating the information content of disease terms and phenotype similarity between diseases. Furthermore, the members of miRNA family or cluster are assigned higher weight since they are more probably associated with similar diseases. A new prediction method, HDMP, based on weighted k most similar neighbors is presented for predicting disease miRNAs. Experiments validated that HDMP achieved significantly higher prediction performance than existing methods. In addition, the case studies examining prostatic neoplasms, breast neoplasms, and lung neoplasms, showed that HDMP can uncover potential disease miRNA candidates.ConclusionsThe superior performance of HDMP can be attributed to the accurate measurement of miRNA functional similarity, the weight assignment based on miRNA family or cluster, and the effective prediction based on weighted k most similar neighbors. The online prediction and analysis tool is freely available at http://nclab.hit.edu.cn/hdmpred.
We recently reported that Drosophila Insensitive (Insv) promotes sensory organ development and has activity as a nuclear corepressor for the Notch transcription factor Suppressor of Hairless [Su(H)]. Insv lacks domains of known biochemical function but contains a single BEN domain (i.e., a ''BEN-solo'' protein). Our chromatin immunoprecipitation (ChIP) sequencing (ChIP-seq) analysis confirmed binding of Insensitive to Su(H) target genes in the Enhancer of split gene complex [E(spl)-C]; however, de novo motif analysis revealed a novel site strongly enriched in Insv peaks (TCYAATHRGAA). We validate binding of endogenous Insv to genomic regions bearing such sites, whose associated genes are enriched for neural functions and are functionally repressed by Insv. Unexpectedly, we found that the Insv BEN domain binds specifically to this sequence motif and that Insv directly regulates transcription via this motif. We determined the crystal structure of the BEN-DNA target complex, revealing homodimeric binding of the BEN domain and extensive nucleotide contacts via a helices and a C-terminal loop. Point mutations in key DNA-contacting residues severely impair DNA binding in vitro and capacity for transcriptional regulation in vivo. We further demonstrate DNA-binding and repression activities by the mammalian neural BEN-solo protein BEND5. Altogether, we define novel DNA-binding activity in a conserved family of transcriptional repressors, opening a molecular window on this extensive gene family.
Recently, the BEN (BANP, E5R, and NAC1) domain was recognized as a new class of conserved DNA-binding domain. The fly genome encodes three proteins that bear only a single BEN domain (''BEN-solo'' factors); namely, Insensitive (Insv), Bsg25A (Elba1), and CG9883 (Elba2). Insv homodimers preferentially bind CCAATTGG palindromes throughout the genome to mediate transcriptional repression, whereas Bsg25A and Elba2 heterotrimerize with their obligate adaptor, Elba3 (i.e., the ELBA complex), to recognize a CCAATAAG motif in the Fab-7 insulator. While these data suggest distinct DNA-binding properties of BEN-solo proteins, we performed reporter assays that indicate that both Bsg25A and Elba2 can individually recognize Insv consensus sites efficiently. We confirmed this by solving the structure of Bsg25A complexed to the Insv site, which showed that key aspects of the BEN:DNA recognition strategy are similar between these proteins. We next show that both Insv and ELBA proteins are competent to mediate transcriptional repression via Insv consensus sequences but that the ELBA complex appears to be selective for the ELBA site. Reciprocally, genome-wide analysis reveals that Insv exhibits significant cobinding to class I insulator elements, indicating that it may also contribute to insulator function. Indeed, we observed abundant Insv binding within the Hox complexes with substantial overlaps with class I insulators, many of which bear Insv consensus sites. Moreover, Insv coimmunoprecipitates with the class I insulator factor CP190. Finally, we observed that Insv harbors exclusive activity among fly BEN-solo factors with respect to regulation of Notch-mediated cell fate choices in the peripheral nervous system. This in vivo activity is recapitulated by BEND6, a mammalian BEN-solo factor that conserves the Notch corepressor function of Insv but not its capacity to bind Insv consensus sites. Altogether, our data define an array of common and distinct biochemical and functional properties of this new family of transcription factors.[Keywords: BEN-solo; insulator; repressor; transcription factor] Supplemental material is available for this article. The BEN (BANP, E5R, and NAC1) domain was originally identified bioinformatically as a domain with a-helical character that is present in a variety of metazoan and viral proteins (Abhiman et al. 2008). Several BEN-containing proteins were characterized to have chromatin-related functions, including mammalian BANP/SMAR1 (KaulGhanekar et al. 2004;Rampalli et al. 2005), NAC1 (Korutla et al. 2005(Korutla et al. , 2007, BEND3 (Sathyan et al. 2011), and the C isoform of Drosophila mod(mdg4) (Gerasimova et al. 1995;Negre et al. 2010). Moreover, all of these proteins were linked to transcriptional silencing, albeit via different strategies such as interacting with matrix attachment sites (SMAR1), recruiting histone deacetylases or CoREST (NAC1), or interacting with insulator sites [mod(mdg4)] or heterochromatin (BEND3). Altogether, these collected findings indicate an intimate connection betw...
SUMMARYmicroRNAs (miRNAs) are endogenous short RNAs that mediate vast networks of post-transcriptional gene regulation. Although computational searches and experimental profiling provide evidence for hundreds of functional targets for individual miRNAs, such data rarely provide clear insight into the phenotypic consequences of manipulating miRNAs in vivo. We describe a genome-wide collection of 165 Drosophila miRNA transgenes and find that a majority induced specific developmental defects, including phenocopies of mutants in myriad cell-signaling and patterning genes. Such connections allowed us to validate several likely targets for miRNA-induced phenotypes. Importantly, few of these phenotypes could be predicted from computationally predicted target lists, thus highlighting the value of whole-animal readouts of miRNA activities. Finally, we provide an example of the relevance of these data to miRNA loss-of-function conditions. Whereas misexpression of several K box miRNAs inhibited Notch pathway activity, reciprocal genetic interaction tests with miRNA sponges demonstrated endogenous roles of the K box miRNA family in restricting Notch signaling. In summary, we provide extensive evidence that misexpression of individual miRNAs often induces specific mutant phenotypes that can guide their functional study. By extension, these data suggest that the deregulation of individual miRNAs in other animals may frequently yield relatively specific phenotypes during disease conditions.
Regulation of chromatin through histone acetylation is an important step in gene expression. The Gcn5 histone acetyltransferase is part of protein complexes, e.g., the SAGA complex, that interact with transcriptional activators, targeting the enzyme to specific promoters and assisting in recruitment of the basal RNA polymerase transcription machinery. The Ada2 protein directly binds to Gcn5 and stimulates its catalytic activity. Drosophila contains two Ada2 proteins, Drosophila Ada2a (dAda2a) and dAda2b. We have generated flies that lack dAda2b, which is part of a Drosophila SAGA-like complex. dAda2b is required for viability in Drosophila, and its deletion causes a reduction in histone H3 acetylation. A global hypoacetylation of chromatin was detected on polytene chromosomes in dAda2b mutants. This indicates that the dGcn5-dAda2b complex could have functions in addition to assisting in transcriptional activation through gene-specific acetylation. Although the Drosophila p53 protein was previously shown to interact with the SAGA-like complex in vitro, we find that p53 induction of reaper gene expression occurs normally in dAda2b mutants. Moreover, dAda2b mutant animals show excessive p53-dependent apoptosis in response to gamma radiation. Based on this result, we speculate that dAda2b may be necessary for efficient DNA repair or generation of a DNA damage signal. This could be an evolutionarily conserved function, since a yeast ada2 mutant is also sensitive to a genotoxic agent.The Ada2 protein was first described for yeast, in which its inactivation could relieve the toxicity caused by overexpression of the strong transcriptional activation domain in the viral VP16 protein (6). This provided one of the earliest pieces of evidence for the existence of transcriptional adapters or coactivators. Ada2 proteins were subsequently shown to exist in protein complexes containing the Gcn5 histone acetyltransferase (23). Histone acetylation has been linked to transcriptional activation for a long time, given the observation that actively transcribed regions tend to be hyperacetylated, whereas transcriptionally silent regions generally are hypoacetylated (3). With the identification of the first histone acetyltransferase (HAT) as the Tetrahymena homolog of Gcn5 (13) and the subsequent demonstration that other transcriptional regulators, such as the coactivators p300 and CBP, steroid receptor coactivators, and the TFIID subunit TAF1, also possess intrinsic HAT activity, the link between acetylation and transcription was firmly established (12). In addition, an important role for histone acetylation in DNA replication and repair is emerging (27,38).Gcn5 is evolutionarily conserved and is present in several multiprotein complexes, such as the SAGA and ADA complexes in yeast and the PCAF, STAGA, and TFTC complexes in mammals (17). In yeast, Gcn5 is contained in a trimeric subcomplex with the Ada2 and Ada3 proteins within SAGA and ADA complexes (21,24). Ada2 binds to Gcn5, and deletion of Ada2 results in a loss of Gcn5 from bot...
We expanded the knowledge base for Drosophila cell line transcriptomes by deeply sequencing their small RNAs. In total, we analyzed more than 1 billion raw reads from 53 libraries across 25 cell lines. We verify reproducibility of biological replicate data sets, determine common and distinct aspects of miRNA expression across cell lines, and infer the global impact of miRNAs on cell line transcriptomes. We next characterize their commonalities and differences in endo-siRNA populations. Interestingly, most cell lines exhibit enhanced TE-siRNA production relative to tissues, suggesting this as a common aspect of cell immortalization. We also broadly extend annotations of cis-NAT-siRNA loci, identifying ones with common expression across diverse cells and tissues, as well as cell-restricted loci. Finally, we characterize small RNAs in a set of ovary-derived cell lines, including somatic cells (OSS and OSC) and a mixed germline/somatic cell population (fGS/ OSS) that exhibits ping-pong piRNA signatures. Collectively, the ovary data reveal new genic piRNA loci, including unusual configurations of piRNA-generating regions. Together with the companion analysis of mRNAs described in a previous study, these small RNA data provide comprehensive information on the transcriptional landscape of diverse Drosophila cell lines. These data should encourage broader usage of fly cell lines, beyond the few that are presently in common usage.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.