Zhi John Lu scite author profile

We systematically generated large-scale data sets to improve genome annotation for the nematode Caenorhabditis elegans, a key model organism. These data sets include transcriptome profiling across a developmental time course, genome-wide identification of transcription factor–binding sites, and maps of chromatin organization. From this, we created more complete and accurate gene models, including alternative splice forms and candidate noncoding RNAs. We constructed hierarchical networks of transcription factor–binding and microRNA interactions and discovered chromosomal locations bound by an unusually large number of transcription factors. Different patterns of chromatin composition and histone modification were revealed between chromosome arms and centers, with similarly prominent differences between autosomes and the X chromosome. Integrating data types, we built statistical models relating chromatin, transcription factor binding, and gene expression. Overall, our analyses ascribed putative functions to most of the conserved genome.


A User's Guide to the Encyclopedia of DNA Elements (ENCODE)

Myers, Stamatoyannopoulos, Snyder et al. 2011. PLoS Biol



The mission of the Encyclopedia of DNA Elements (ENCODE) Project is to enable the scientific and medical communities to interpret the human genome sequence and apply it to understand human biology and improve health. The ENCODE Consortium is integrating multiple technologies and approaches in a collective effort to discover and define the functional elements encoded in the human genome, including genes, transcripts, and transcriptional regulatory regions, together with their attendant chromatin states and DNA methylation patterns. In the process, standards to ensure high-quality data have been implemented, and novel algorithms have been developed to facilitate analysis. Data and derived results are made available through a freely accessible database. Here we provide an overview of the project and the resources it is generating and illustrate the application of ENCODE data to interpret the human genome.


Divergent lncRNAs Regulate Gene Expression and Lineage Differentiation in Pluripotent Cells

et al. 2016


Divergent lncRNAs that are transcribed in the opposite direction to nearby protein-coding genes comprise a significant proportion (∼20%) of total lncRNAs in mammalian genomes. Through genome-wide analysis, we found that the distribution of this lncRNA class strongly correlates with essential developmental regulatory genes. In pluripotent cells, divergent lncRNAs regulate the transcription of nearby genes. As an example, the divergent lncRNA Evx1as promotes transcription of its neighbor gene, EVX1, and regulates mesendodermal differentiation. At a single-cell level, early broad expression of Evx1as is followed by a rapid, high-level transcription of EVX1, supporting the idea that Evx1as plays an upstream role to facilitate EVX1 transcription. Mechanistically, Evx1as RNA binds to regulatory sites on chromatin, promotes an active chromatin state, and interacts with Mediator. Based on our analyses, we propose that the biological function of thousands of uncharacterized lncRNAs of this class may be inferred from the role of their neighboring adjacent genes.


Diverse transcription factor binding features revealed by genome-wide ChIP-seq in C. elegans

et al. 2010


Regulation of gene expression by sequence-specific transcription factors is central to developmental programs and depends on the binding of transcription factors with target sites in the genome. To date, most such analyses in Caenorhabditis elegans have focused on the interactions between a single transcription factor with one or a few select target genes. As part of the modENCODE Consortium, we have used chromatin immunoprecipitation coupled with high-throughput DNA sequencing (ChIP-seq) to determine the genome-wide binding sites of 22 transcription factors (ALR-1, BLMP-1, CEH-14, CEH-30, EGL-27, EGL-5, ELT-3, EOR-1, GEI-11, HLH-1, LIN-11, LIN-13, LIN-15B, LIN-39, MAB-5, MDL-1, MEP-1, PES-1, PHA-4, PQM-1, SKN-1, and UNC-130) at diverse developmental stages. For each factor we determined candidate gene targets, both coding and non-coding. The typical binding sites of almost all factors are within a few hundred nucleotides of the transcript start site. Most factors target a mixture of coding and non-coding target genes, although one factor preferentially binds to non-coding RNA genes. We built a regulatory network among the 22 factors to determine their functional relationships to each other and found that some factors appear to act preferentially as regulators and others as target genes. Examination of the binding targets of three related HOX factors (LIN-39, MAB-5, and EGL-5) indicates that these factors regulate genes involved in cellular migration, neuronal function, and vulval differentiation, consistent with their known roles in these developmental processes. Ultimately, the comprehensive mapping of transcription factor binding sites will identify features of transcriptional networks that regulate C. elegans developmental processes.


Comparative analysis of the transcriptome across distant species

Gerstein, Rozowsky, Yan et al. 2014. Nature



Recurrently deregulated lncRNAs in hepatocellular carcinoma

et al. 2017


Hepatocellular carcinoma (HCC) cells often invade the portal venous system and subsequently develop into portal vein tumour thrombosis (PVTT). Long noncoding RNAs (lncRNAs) have been associated with HCC, but a comprehensive analysis of their specific association with HCC metastasis has not been conducted. Here, by analysing 60 clinical samples' RNA-seq data from 20 HCC patients, we have identified and characterized 8,603 candidate lncRNAs. The expression patterns of 917 recurrently deregulated lncRNAs are correlated with clinical data in a TCGA cohort and published liver cancer data. Matched array data from the 60 samples show that copy number variations (CNVs) and alterations in DNA methylation contribute to the observed recurrent deregulation of 235 lncRNAs. Many recurrently deregulated lncRNAs are enriched in co-expressed clusters of genes related to cell adhesion, immune response and metabolic processes. Candidate lncRNAs related to metastasis, such as HAND2-AS1, were further validated using RNAi-based loss-of-function assays. Thus, we provide a valuable resource of functional lncRNAs and biomarkers associated with HCC tumorigenesis and metastasis.


Pervasive Chromatin-RNA Binding Protein Interactions Enable RNA-Based Regulation of Transcription

Xiao, Chen, Liang et al. 2019. Cell



Increasing evidence suggests that transcriptional control and chromatin activities at large involve regulatory RNAs, which likely enlist specific RNA-binding proteins (RBPs). Although multiple RBPs have been implicated in transcription control, it has remained unclear how extensively RBPs directly act on chromatin. We embarked on a large-scale RBP ChIP-seq analysis, revealing widespread RBP presence in active chromatin regions in the human genome. Like transcription factors (TFs), RBPs also show strong preference for hotspots in the genome, particularly gene promoters, where their association is frequently linked to transcriptional output. Unsupervised clustering reveals extensive co-association between TFs and RBPs, as exemplified by YY1, a known RNA-dependent TF, and RBM25, an RBP involved in splicing regulation. Remarkably, RBM25 depletion attenuates all YY1-dependent activities, including chromatin binding, DNA looping, and transcription. We propose that various RBPs may enhance network interaction through harnessing regulatory RNAs to control transcription.


Genome-Wide Identification of Binding Sites Defines Distinct Functions for Caenorhabditis elegans PHA-4/FOXA in Development and Environmental Response

et al. 2010


Transcription factors are key components of regulatory networks that control development, as well as the response to environmental stimuli. We have established an experimental pipeline in Caenorhabditis elegans that permits global identification of the binding sites for transcription factors using chromatin immunoprecipitation and deep sequencing. We describe and validate this strategy, and apply it to the transcription factor PHA-4, which plays critical roles in organ development and other cellular processes. We identified thousands of binding sites for PHA-4 during formation of the embryonic pharynx, and also found a role for this factor during the starvation response. Many binding sites were found to shift dramatically between embryos and starved larvae, from developmentally regulated genes to genes involved in metabolism. These results indicate distinct roles for this regulator in two different biological processes and demonstrate the versatility of transcription factors in mediating diverse biological roles.



Integrative Analysis of the Caenorhabditis elegans Genome by the modENCODE Project
