The histone modification state of genomic regions is hypothesized to reflect the regulatory activity of the underlying genomic DNA. Based on this hypothesis, the ENCODE Project Consortium measured the status of multiple histone modifications across the genome in several cell types and used these data to segment the genome into regions with different predicted regulatory activities. We measured the cis-regulatory activity of more than 2000 of these predictions in the K562 leukemia cell line. We tested genomic segments predicted to be Enhancers, Weak Enhancers, or Repressed elements in K562 cells, along with other sequences predicted to be Enhancers specific to the H1 human embryonic stem cell line (H1-hESC). Both Enhancer and Weak Enhancer sequences in K562 cells were more active than negative controls, although surprisingly, Weak Enhancer segmentations drove expression higher than did Enhancer segmentations. Lower levels of the covalent histone modifications H3K36me3 and H3K27ac, thought to mark active enhancers and transcribed gene bodies, associate with higher expression and partly explain the higher activity of Weak Enhancers over Enhancer predictions. While DNase I hypersensitivity (HS) is a good predictor of active sequences in our assay, transcription factor (TF) binding models need to be included in order to accurately identify highly expressed sequences. Overall, our results show that a significant fraction (~26%) of the ENCODE enhancer predictions have regulatory activity, suggesting that histone modification states can reflect the cis-regulatory activity of sequences in the genome, but that specific sequence preferences, such as TF-binding sites, are the causal determinants of cis-regulatory activity.[Supplemental material is available for this article.]It is widely reported that specific combinations of covalent histone modifications reflect the regulatory function of underlying genomic DNA sequence (Strahl and Allis 2000). As part of the ENCODE Project, the genomic locations of a variety of covalent histone modifications were determined by chromatin immunoprecipitation sequencing (ChIP-seq) in a number of cell types and cell lines. Two studies used these data to train computational models that predict different functional regions of the human genome. These unsupervised learning algorithms, Segway (Hoffman et al. 2012) and ChromHMM Kellis 2010, 2012), take functional genomics data as input (DNase-seq; FAIRE-seq; and ChIP-seq of histone modifications, RNA polymerase II large subunit [POLR2A], and CTCF) and return segmentation classes, which are then assigned a hypothesized function using current knowledge of histone modification function. As part of the ENCODE Project, these two sets of predictions were consolidated to create a unified annotation of the entire human genome with seven functional classes in multiple cell types. These segmentations include Transcription Start Site, Promoter Flanking, Transcribed, CTCF-bound, Enhancer, Weak Enhancer, and Repressed or Inactive segments (The ENCODE Project ...
A gene's position in the genome can profoundly affect its expression because regional differences in chromatin modulate the activity of locally acting cis-regulatory sequences (CRSs). Here we study how CRSs and regional chromatin act in concert on a genome-wide scale. We present a massively parallel reporter gene assay that measures the activities of hundreds of different CRSs, each integrated at many specific genomic locations. Although genome location strongly affected CRS activity, the relative strengths of CRSs were maintained at all chromosomal locations. The intrinsic activities of CRSs also correlatedwith their activities in plasmid-based assays. We explain our data with a quantitative model in which expression levels are set by independent contributions from local CRSs and the regional chromatin environment, rather than by more complex sequenceor protein-specific interactions between these two factors. The methods we present will help investigators determine when regulatory information is integrated in a modular fashion and when regulatory sequences interact in more complex ways.Two types of cis-regulatory information control the production of mRNAs: local cisregulatory CRSs, such as promoters, enhancers and insulators 1-3 , and the regional chromatin surrounding a gene's location in the genome [4][5][6] . Although short CRSs are often sufficient to drive reporter genes in patterns that reflect their activity in the genome [7][8][9][10][11][12] , their activities are also heavily influenced by regional differences in chromatin state. Regional differences in chromatin underlie chromosome position effects, the phenomenon whereby the same gene is expressed at dramatically different levels depending on its location in the genome 6,13-25 .
RNA binding proteins (RBP) and microRNAs (miRNAs) often bind sequences in 3′ untranslated regions (UTRs) of mRNAs, and regulate stability and translation efficiency. With the identification of numerous RBPs and miRNAs, there is an urgent need for new technologies to dissect the function of the cis-acting elements of RBPs and miRNAs. We describe post-transcriptional regulatory element sequencing (PTRE-seq), a massively parallel method for assaying the target sequences of miRNAs and RBPs. We use PTRE-seq to dissect sequence preferences and interactions between miRNAs and RBPs. The binding sites for these effector molecules influenced different aspects of the RNA lifecycle: RNA stability, translation efficiency, and translation initiation. In some cases, post-transcriptional control is modular, with different factors acting independently of each other, while in other cases factors show specific epistatic interactions. The throughput, flexibility, and reproducibility of PTRE-seq make it a valuable tool to study post-transcriptional regulation by 3′UTR elements.
In the genome, most occurrences of transcription factor binding sites (TFBS) have no -regulatory activity, which suggests that flanking sequences contain information that distinguishes functional from nonfunctional TFBS. We interrogated the role of flanking sequences near Activator Protein 1 (AP-1) binding sites that reside in DNase I Hypersensitive Sites (DHS) and regions annotated as Enhancers. In these regions, we found that sequence features directly adjacent to the core motif distinguish high from low activity AP-1 sites. Some nearby features are motifs for other TFs that genetically interact with the AP-1 site. Other features are extensions of the AP-1 core motif, which cause the extended sites to match motifs of multiple AP-1 binding proteins. Computational models trained on these data distinguish between sequences with high and low activity AP-1 sites and also predict changes in-regulatory activity due to mutations in AP-1 core sites and their flanking sequences. Our results suggest that extended AP-1 binding sites, together with adjacent binding sites for additional TFs, encode part of the information that governs TFBS activity in the genome.
Oil palm (Elaeis guineensis) is the most productive oil bearing crop worldwide. It has three fruit forms, namely dura (thick-shelled), pisifera (shell-less) and tenera (thin-shelled), which are controlled by the SHELL gene. The fruit forms exhibit monogenic co-dominant inheritance, where tenera is a hybrid obtained by crossing maternal dura and paternal pisifera palms. Commercial palm oil production is based on planting thin-shelled tenera palms, which typically yield 30% more oil than dura palms, while pisifera palms are female-sterile and have little to no palm oil yield. It is clear that tenera hybrids produce more oil than either parent due to single gene heterosis. The unintentional planting of dura or pisifera palms reduces overall yield and impacts land utilization that would otherwise be devoted to more productive tenera palms. Here, we identify three additional novel mutant alleles of the SHELL gene, which encode a type II MADS-box transcription factor, and determine oil yield via control of shell fruit form phenotype in a manner similar to two previously identified mutant SHELL alleles. Assays encompassing all five mutations account for all dura and pisifera palms analyzed. By assaying for these variants in 10,224 mature palms or seedlings, we report the first large scale accurate genotype-based determination of the fruit forms in independent oil palm planting sites and in the nurseries that supply them throughout Malaysia. The measured non-tenera contamination rate (10.9% overall on a weighted average basis) underscores the importance of SHELL genetic testing of seedlings prior to planting in production fields. By eliminating non-tenera contamination, comprehensive SHELL genetic testing can improve sustainability by increasing yield on existing planted lands. In addition, economic modeling demonstrates that SHELL gene testing will confer substantial annual economic gains to the oil palm industry, to Malaysian gross national income and to Malaysian government tax receipts.
Acinetobacter calcoaceticus HIRFA32 from wheat rhizosphere produced catecholate type of siderophore with optimum siderophore (ca. 92 % siderophore units) in succinic acid medium without FeSO 4 at 28°C and 24 h of incubation. HPLC purified siderophore appeared as pale yellow crystals with molecular weight [M C NMR, HMQC, HMBC, NOESY and decoupling studies, revealed that siderophore composed of 2,3-dihydroxybenzoic acid with hydroxyhistamine and threonine as amino acid subunits. In vitro study demonstrated siderophore mediated mycelium growth inhibition (ca. 46.87 ± 0.5 %) of Fusarium oxysporum. This study accounts to first report on biosynthesis of acinetobactin-like siderophore by the rhizospheric strain of A. calcoaceticus and its significance in inhibition of F. oxysporum.
The ability to alter genomes specifically by CRISPR-Cas gene editing has revolutionized biological research, biotechnology, and medicine. Broad therapeutic application of this technology, however, will require thorough preclinical assessment of off-target editing by homology-based prediction coupled with reliable methods for detecting off-target editing. Several off-target site nomination assays exist, but careful comparison is needed to ascertain their relative strengths and weaknesses. In this study, HEK293T cells were treated with Streptococcus pyogenes Cas9 and eight guide RNAs with varying levels of predicted promiscuity in order to compare the performance of three homology-independent off-target nomination methods: the cell-based assay, GUIDE-seq, and the biochemical assays CIRCLE-seq and SITE-seq. The three methods were benchmarked by sequencing 75,000 homology-nominated sites using hybrid capture followed by high-throughput sequencing, providing the most comprehensive assessment of such methods to date. The three methods performed similarly in nominating sequence-confirmed off-target sites, but with large differences in the total number of sites nominated. When combined with homology-dependent nomination methods and confirmation by sequencing, all three off-target nomination methods provide a comprehensive assessment of off-target activity. GUIDE-seq's low false-positive rate and the high correlation of its signal with observed editing highlight its suitability for nominating off-target sites for ex vivo CRISPR-Cas therapies.
Supplementary data are available at Bioinformatics online.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.