The majority of mammalian genomes are devoted to transposable elements (TEs). Whilst TEs are increasingly recognized for their important biological functions, they are a potential danger to genomic stability and are carefully regulated by the epigenetic system. However, the full complexity of this regulatory system is not understood. Here, using mouse embryonic stem cells, we show that TEs are suppressed by heterochromatic marks like H3K9me3, and are also labelled by all major types of chromatin modification in complex patterns, including bivalent activatory and repressive marks. We identified 29 epigenetic modifiers that significantly deregulated at least one type of TE. The loss of Setdb1, Ncor2, Rnf2, Kat5, Prmt5, Uhrf1, and Rrp8 caused widespread changes in TE expression and chromatin accessibility. These effects were context-specific, with different chromatin modifiers regulating the expression and chromatin accessibility of specific subsets of TEs. Our work reveals the complex patterns of epigenetic regulation of TEs.
The current classification of cells in an organism is largely based on their anatomic and developmental origin. Cells types and tissues are traditionally classified into those that arise from the three embryonic germ layers, the ectoderm, mesoderm and endoderm, but this model does not take into account the organization of cell type-specific patterns of gene expression. Here, we present computational models for cell type and tissue specification derived from a collection of 921 RNA-sequencing samples from 272 distinct mouse cell types or tissues. In an unbiased fashion, this analysis accurately predicts the three known germ layers. Unexpectedly, this analysis also suggests that in total there are eight major domains of cell type-specification, corresponding to the neurectoderm, neural crest, surface ectoderm, endoderm, mesoderm, blood mesoderm, germ cells and the embryonic domain. Further, we identify putative genes responsible for specifying the domain and the cell type. This model has implications for understanding trans-lineage differentiation for stem cells, developmental cell biology and regenerative medicine.
Somatic cell reprogramming by exogenous factors requires cooperation with transcriptional co-activators and co-repressors to effectively remodel the epigenetic environment. How this interplay is regulated remains poorly understood. Here, we demonstrate that NCoR/SMRT co-repressors bind to pluripotency loci to create a barrier to reprogramming with the four Yamanaka factors (OCT4, SOX2, KLF4 and c-MYC), and consequently, suppressing NCoR/SMRT significantly enhances reprogramming efficiency and kinetics. The core epigenetic subunit of the NCoR/SMRT complex, histone deacetylase 3 (HDAC3), contributes to the effects of NCoR/SMRT by inducing histone deacetylation at pluripotency loci. Among the Yamanaka factors, recruitment of NCoR/SMRT-HDAC3 to genomic loci is mostly facilitated by c-MYC. Hence, we describe how c-MYC is beneficial for the early phase of reprogramming but deleterious later. Overall, we uncover a role for NCoR/SMRT co-repressors in reprogramming and propose a dual function for c-MYC in this process.
Transposable elements (TEs) occupy nearly 40% of mammalian genomes and, whilst most are fragmentary and no longer capable of transposition, they can nevertheless contribute to cell function. TEs within genes transcribed by RNA polymerase II can be copied as parts of primary transcripts; however, their full contribution to mature transcript sequences remains unresolved. Here, using long and short read (LR and SR) RNA sequencing data, we show that 26% of coding and 65% of noncoding transcripts in human pluripotent stem cells (hPSCs) contain TE-derived sequences. Different TE families are incorporated into RNAs in unique patterns, with consequences to transcript structure and function. The presence of TE sequences within a transcript is correlated with TE-type specific changes in its subcellular distribution, alterations in steady-state levels and half-life, and differential association with RNA Binding Proteins (RBPs). We identify hPSC-specific incorporation of endogenous retroviruses (ERVs) and LINE:L1 into protein-coding mRNAs, which generate TE sequence-derived peptides. Finally, single cell RNA-seq reveals that hPSCs express ERV-containing transcripts, whilst differentiating subpopulations lack ERVs and express SINE and LINE-containing transcripts. Overall, our comprehensive analysis demonstrates that the incorporation of TE sequences into the RNAs of hPSCs is more widespread and has a greater impact than previously appreciated.
In this study, a continuous cell line (named as CPB) was established from Siniperca chuatsi brain and has been subcultured >140 times. CPB cell line predominantly consisted of fibroblast-like cells that could grow better in Leibovitz's L-15 supplemented with 10% foetal bovine serum at 28° C. Polymerase chain reaction amplification of 18s recombinant (r)RNA confirmed the origin of this cell line from S. chuatsi. The CPB cell line was cryopreserved at different passage levels and revived successfully with 80-90% survival. The cell line was further characterized by chromosome number and transfection. The CPB cells were highly susceptible to infectious spleen and kidney necrosis virus (ISKNV) with a titre of 6·58-6·62 log TCID50 ml(-1) and numerous ISKNV particles were observed in the cytoplasm by transmission electron microscopy. At the same time, ISKNV infection was confirmed by reverse transcriptase polymerase chain reaction, immunodot blot and individual challenge experiments. The development and characterization of a new brain cell line from S. chuatsi were described in this study and it could be used as an in vitro tool for propagation of ISKNV and gene expression studies.
Infectious spleen and kidney necrosis virus (ISKNV) is the causative agent of a disease causing high mortality in mandarin fish, Siniperca chuatsi. In this study, complete major capsid protein (MCP) genes of nine ISKNV isolates were sequenced and compared with other known megalocytiviruses to evaluate genetic variation and host range of the viruses. Comparison of nucleotide sequences of MCP gene revealed 92.6-100% identity among nine ISKNV isolates. A phylogenetic tree revealed that 33 megalocytiviruses were divided into three genotypes, and there was a strong host species signal in three genotypes: for genotype I, the host was mainly marine fish; for genotype II, the host was freshwater fish; and for genotype III, the host was mainly flatfish. Nine ISKNV isolates belonged to genotype I or genotype II, suggesting mandarin fish may be a mixing vessel host for megalocytivirus.
BackgroundFireflies are a family of insects within the beetle order Coleoptera, or winged beetles, and they are one of the most well-known and loved insect species because of their bioluminescence. However, the firefly is in danger of extinction because of the massive destruction of its living environment. In order to improve the understanding of fireflies and protect them effectively, we sequenced the whole genome of the terrestrial firefly Pyrocoelia pectoralis.FindingsHere, we developed a highly reliable genome resource for the terrestrial firefly Pyrocoelia pectoralis (E. Oliv., 1883; Coleoptera: Lampyridae) using single molecule real time (SMRT) sequencing on the PacBio Sequel platform. In total, 57.8 Gb of long reads were generated and assembled into a 760.4-Mb genome, which is close to the estimated genome size and covered 98.7% complete and 0.7% partial insect Benchmarking Universal Single-Copy Orthologs. The k-mer analysis showed that this genome is highly heterozygous. However, our long-read assembly demonstrates continuousness with a contig N50 length of 3.04 Mb and the longest contig length of 13.69 Mb. Furthermore, 135 589 SSRs and 341 Mb of repeat sequences were detected. A total of 23 092 genes were predicted; 88.44% of genes were annotated with one or more related functions.ConclusionsWe assembled a high-quality firefly genome, which will not only provide insights into the conservation and biodiversity of fireflies, but also provide a wealth of information to study the mechanisms of their sexual communication, bio-luminescence, and evolution.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.