DNA methylation loss occurs frequently in cancer genomes, primarily within lamina-associated, late-replicating regions termed Partially Methylated Domains (PMDs). We profiled 39 diverse primary tumors and 8 matched adjacent tissues using Whole-Genome Bisulfite Sequencing (WGBS), and analyzed them alongside 343 additional human and 206 mouse WGBS datasets. We identified a local CpG sequence context associated with preferential hypomethylation in PMDs. Analysis of CpGs in this context (“Solo-WCGWs”) revealed previously undetected PMD hypomethylation in almost all healthy tissue types. PMD hypomethylation increased with age, beginning during fetal development, and appeared to track the accumulation of cell divisions. In cancer, PMD hypomethylation depth correlated with somatic mutation density and cell-cycle gene expression, consistent with its reflection of mitotic history, and suggesting its application as a mitotic clock. We propose that late replication leads to lifelong progressive methylation loss, which acts as a biomarker for cellular aging and which may contribute to oncogenesis.
High-throughput RNA sequencing (RNA-seq) dramatically expands the potential for novel genomics discoveries, but the wide variety of platforms, protocols and performance has created the need for comprehensive reference data. Here we describe the Association of Biomolecular Resource Facilities next-generation sequencing (ABRF-NGS) study on RNA-seq. We tested replicate experiments across 15 laboratory sites using reference RNA standards to test four protocols (polyA-selected, ribo-depleted, size-selected and degraded) on five sequencing platforms (Illumina HiSeq, Life Technologies’ PGM and Proton, Pacific Biosciences RS and Roche’s 454). The results show high intra-platform and inter-platform concordance for expression measures across the deep-count platforms, but highly variable efficiency and cost for splice junction and variant detection between all platforms. These data also demonstrate that ribosomal RNA depletion can both enable effective analysis of degraded RNA samples and be readily compared to polyA-enriched fractions. This study provides a broad foundation for cross-platform standardization, evaluation and improvement of RNA-seq.
Single nucleotide polymorphisms (SNPs) are indispensable in such applications as association mapping and construction of high-density genetic maps. These applications usually require genotyping of thousands of SNPs in a large number of individuals. Although a number of SNP genotyping assays are available, most of them are designed for SNP genotyping in diploid individuals. Here, we demonstrate that the Illumina GoldenGate assay could be used for SNP genotyping of homozygous tetraploid and hexaploid wheat lines. Genotyping reactions could be carried out directly on genomic DNA without the necessity of preliminary PCR ampliWcation. A total of 53 tetraploid and 38 hexaploid homozygous wheat lines were genotyped at 96 SNP loci. The genotyping error rate estimated after removal of low-quality data was 0 and 1% for tetraploid and hexaploid wheat, respectively. Developed SNP genotyping assays were shown to be useful for genotyping wheat cultivars.This study demonstrated that the GoldenGate assay is a very eYcient tool for high-throughput genotyping of polyploid wheat, opening new possibilities for the analysis of genetic variation in wheat and dissection of genetic basis of complex traits using association mapping approach.
We have isolated a gene from the yeast Saccharomyces cerevisiae that encodes a 2.0-kilobase heat-inducible mRNA. This gene, which we have designated STII, for stress inducible, was also induced by the amino acid analog canavanine and showed a slight increase in expression as cells moved into stationary phase. The STII gene encodes a 66-kilodalton protein, as determined from the sequence of the longest open reading frame. The putative STII protein, as identified by two-dimensional gel electrophoresis, migrated in the region of 73 to 75 kilodaltons as a series of four isoforms with different isoelectric points. STII is not homologous to the other conserved HSP70 family members in yeasts, despite similarities in size and regulation. Cells carrying a disruption mutation of the STI) gene grew normally at 30°C but showed impaired growth at higher and lower temperatures. Overexpression of the STII gene resulted in substantial trans-activation of SSA4 promoterreporter gene fusions, indicating that STII may play a role in mediating the heat shock response of some HSP70 genes.Cells from early every organism thus far tested respond to an increase in temperature by inducing synthesis of a stereotypical set of proteins, the so-called heat shock proteins. Synthesis of these proteins is also induced by a number of other stimuli that stress the cell, indicating that the response is a general one to organismal (and cellular) trauma. The most abundant of the heat shock proteins found after temperature upshift is a 70-kilodalton (kDa) protein called hsp70. hsp70 is the most highly conserved of all heat shock proteins, showing between 60 and 78% identity among eucaryotes and 40 to 50% identity between Escherichia coli hsp70, the dnaK gene product, and the corresponding eucaryotic proteins (reviewed in reference 22).Among the best characterized of the HSP70 genes are those from the yeast Saccharomyces cerevisiae. Eight genes related to HSP70 have been isolated as recombinant DNA clones in our laboratory. These eight genes have been divided into four phenotypic subfamilies, SSA through SSD (SS denotes stress seventy related; A through D indicate the subfamilies). The expression patterns and functional interrelationships of members of the SSA subfamily (encompassing SSAJ through SSA4) have been the most thoroughly investigated. The SSAI gene is expressed at moderate levels at 23°C but is induced approximately 10-fold after a temperature shift to 37°C, whereas SSA2 is expressed at a high level at all temperatures. SSA3 and SSA4 are expressed at extremely low levels at 23°C, but after a temperature upshift the amounts of SSA3 and SSA4 mRNAs increase dramatically (9
Distinct epigenomic profiles of histone marks have been associated with gene expression, but questions regarding the causal relationship remain. Here we investigated the activity of a broad collection of genomically targeted epigenetic regulators that could write epigenetic marks associated with a repressed chromatin state (G9A, SUV39H1, Krüppel-associated box (KRAB), DNMT3A as well as the first targetable versions of Ezh2 and Friend of GATA-1 (FOG1)). dCas9 fusions produced target gene repression over a range of 0- to 10-fold that varied by locus and cell type. dCpf1 fusions were unable to repress gene expression. The most persistent gene repression required the action of several effector domains; however, KRAB-dCas9 did not contribute to persistence in contrast to previous reports. A ‘direct tethering’ strategy attaching the Ezh2 methyltransferase enzyme to dCas9, as well as a ‘recruitment’ strategy attaching the N-terminal 45 residues of FOG1 to dCas9 to recruit the endogenous nucleosome remodeling and deacetylase complex, were both successful in targeted deposition of H3K27me3. Surprisingly, however, repression was not correlated with deposition of either H3K9me3 or H3K27me3. Our results suggest that so-called repressive histone modifications are not sufficient for gene repression. The easily programmable dCas9 toolkit allowed precise control of epigenetic information and dissection of the relationship between the epigenome and gene regulation.
Next-generation sequencing is revolutionizing the identification of transcription factor binding sites throughout the human genome. However, the bioinformatics analysis of large datasets collected using chromatin immunoprecipitation and high-throughput sequencing is often a roadblock that impedes researchers in their attempts to gain biological insights from their experiments. We have developed integrated peak-calling and analysis software (Sole-Search) which is available through a user-friendly interface and (i) converts raw data into a format for visualization on a genome browser, (ii) outputs ranked peak locations using a statistically based method that overcomes the significant problem of false positives, (iii) identifies the gene nearest to each peak, (iv) classifies the location of each peak relative to gene structure, (v) provides information such as the number of binding sites per chromosome and per gene and (vi) allows the user to determine overlap between two different experiments. In addition, the program performs an analysis of amplified and deleted regions of the input genome. This software is web-based and automated, allowing easy and immediate access to all investigators. We demonstrate the utility of our software by collecting, analyzing and comparing ChIP-seq data for six different human transcription factors/cell line combinations.
The development and application of genomic tools to loblolly pine (Pinus taeda L.) offer promising insights into the organization and structure of conifer genomes. The application of a high-throughput genotyping assay across diverse forest tree species, however, is currently limited taxonomically. This is despite the ongoing development of genome-scale projects aiming at the construction of expressed sequence tag (EST) libraries and the resequencing of EST-derived unigenes for a diverse array of forest tree species. In this paper, we report on the application of Illumina's high-throughput GoldenGate™ SNP genotyping assay to a loblolly pine mapping population. Single nucleotide polymorphisms (SNPs) were identified through resequencing of previously identified wood quality, drought tolerance, and disease resistance candidate genes prior to genotyping. From that effort, a 384 multiplexed SNP assay was developed for high-throughput genotyping. Approximately 67% of the 384 SNPs queried converted into high-quality genotypes for the 48 progeny samples. Of those 257 successfully genotyped SNPs, 70 were segregating within the mapping population. A total of 27 candidate genes were subsequently mapped onto the existing loblolly pine consensus map, which consists of 12 linkage groups spanning a total map distance of 1,227.6 cM. The ability of SNPs to be mapped to the same position as fragment-based markers previously developed within the same candidate genes, as well as the pivotal role that SNPs currently play in the dissection of complex phenotypic traits, illustrate the usefulness of high-throughput SNP genotyping technologies to the continued development of pine genomics.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.