Most of the human genome is non-coding. However, some of the non-coding part is transcriptionally active. In humans, the tandemly repeated (TR) pericentromeric non-coding DNA—human satellites 2 and 3 (HS2, HS3)—are transcribed in somatic cells. These transcripts are also found in pre- and post-implantation embryos. The aim of this study was to analyze HS2/HS3 transcription and cellular localization of transcripts in human maturating oocytes. The maternal HS2/HS3 TR transcripts transcribed from both strands were accumulated in the ooplasm in GV-MI oocytes as shown by DNA–RNA FISH (fluorescence in-situ hybridization). The transcripts’ content was higher in GV oocytes than in somatic cumulus cells according to real-time PCR. Using bioinformatics analysis, we demonstrated the presence of polyadenylated HS2 and HS3 RNAs in datasets of GV and MII oocyte transcriptomes. The transcripts shared a high degree of homology with HS2, HS3 transcripts previously observed in cancer cells. The HS2/HS3 transcripts were revealed by a combination of FISH and immunocytochemical staining within membraneless RNP structures that contained DEAD-box helicases DDX5 and DDX4. The RNP structures were closely associated with mitochondria, and are therefore similar to membraneless bodies described previously only in oogonia. These membraneless structures may be a site for spatial sequestration of RNAs and proteins in both maturating oocytes and cancer cells.
BackgroundChromocenters are defined as a punctate condensed blocks of chromatin in the interphase cell nuclei of certain cell types with unknown biological significance. In recent years a progress in revealing of chromocenters protein content has been made although the details of DNA content within constitutive heterochromatin still remain unclear. It is known that these regions are enriched in tandem repeats (TR) and transposable elements. Quick improvement of genome sequencing does not help to assemble the heterochromatic regions due to lack of appropriate bioinformatics techniques.ResultsChromocenters DNA have been isolated by a biochemical approach from mouse liver cells nuclei and sequenced on the Illumina MiSeq resulting in ChrmC dataset. Analysis of ChrmC dataset by the bioinformatics tools available revealed that the major component of chromocenter DNA are TRs: ~ 66% MaSat and ~ 4% MiSat. Other previously classified TR families constitute ~ 1% of ChrmC dataset. About 6% of chromocenters DNA are mostly unannotated sequences. In the contigs assembled with IDBA_UD there are many fragments of heterochromatic Y-chromosome, rDNA and other pseudo-genes and non-coding DNA. A protein coding sfi1 homolog gene fragment was also found in contigs. The Sfi1 homolog gene is located on the chromosome 11 in the reference genome very close to the Golden Pass Gap (a ~ 3 Mb empty region reserved to the pericentromeric region) and proves the purity of chromocenters isolation. The second major fraction are non-LTR retroposons (SINE and LINE) with overwhelming majority of LINE - ~ 11% of ChrmC. Most of the LINE fragments are from the ~ 2 kb region at the end of the 2nd ORF and its’ flanking region. The precise LINEs’ segment of ~ 2 kb is the necessary mouse constitutive heterohromatin component together with TR. The third most abundant fraction are ERVs. The ERV distribution in chromocenters differs from the whole genome: IAP (ERV2 class) is the most numerous in ChrmC while MaLR (ERV3 class) prevails in the reference genome. IAP and its LTR also prevail in TR containing contigs extracted from the WGS dataset. In silico prediction of IAP and LINE fragments in chromocenters was confirmed by direct fluorescent in situ hybridization (FISH).ConclusionOur data of chromocenters’ DNA (ChrmC) sequencing demonstrate that IAP with LTR and a precise ~ 2 kb fragment of LINE represent a substantial fraction of mouse chromocenters (constitutive heteroсhromatin) along with TRs.Electronic supplementary materialThe online version of this article (10.1186/s12864-018-4534-z) contains supplementary material, which is available to authorized users.
Centromeres (CEN), pericentromeric regions (periCEN), and subtelomeric regions (subTel) comprise the areas of constitutive heterochromatin (HChr). Tandem repeats (TRs or satellite DNA) are the main components of HChr forming no less than 10% of the mouse and human genome. HChr is assembled within distinct structures in the interphase nuclei of many species - chromocenters. In this review, the main classes of HChr repeat sequences are considered in the order of their number increase in the sequencing reads of the mouse chromocenters (ChrmC). TRs comprise ~70% of ChrmC occupying the first place. Non-LTR (-long terminal repeat) retroposons (mainly LINE, long interspersed nuclear element) are the next (~11%), and endogenous retroviruses (ERV; LTR-containing) are in the third position (~9%). HChr is not enriched with ERV in comparison with the whole genome, but there are differences in distribution of certain elements: while MaLR-like elements (ERV3) are dominant in the whole genome, intracisternal A-particles and corresponding LTR (ERV2) are prevalent in HChr. Most of LINE in ChrmC is represented by the 2-kb fragment at the end of the 2nd open reading frame and its flanking regions. Almost all tandem repeats classified as CEN or periCEN are contained in ChrmC. Our previous classification revealed 60 new mouse TR families with 29 of them being absent in ChrmC, which indicates their location on chromosome arms. TR transcription is necessary for maintenance of heterochromatic status of the HChr genome part. A burst of TR transcription is especially important in embryogenesis and other cases of radical changes in the cell program, including carcinogenesis. The recently discovered mechanism of epigenetic regulation with noncoding sequences transcripts, long noncoding RNA, and its role in embryogenesis and pluripotency maintenance is discussed.
Chromocenters are interphase nuclear landmark structures of constitutive heterochromatin. The tandem repeat (TR)-enriched parts of different chromosomes cluster together in chromocenters. There has been progress in recent years in determining the protein content of chromocenters, although it is not clear which DNA sequences underly constitutive heterochromatin apart from the TRs. The aim of the current work was to find out which DNA sequences besides TRs are involved in chromocenters' formation. Biochemically isolated chromocenters and microdissected centromeric regions were amplified by DOP-PCR, then cloned and sequenced. Alignment to Repbase, the mouse reference genome and WGS databases separated the sequences from both libraries into three groups: (1) sequences with similarity to pericentromere mouse major satellite; (2) sequences without similarity to any repetitive sequences; (3) sequences with similarity to long interspersed nuclear elements (LINEs). LINE-related sequences have a disperse pattern distribution on chromosomes predicted in silico. Selected clones were used for fluorescent in situ hybridization (FISH). The 10 clones tested hybridized to chromocenters and centromeric regions of metaphase chromosomes. These clones were used for double FISH with four known cloned TRs (satDNA, satellite DNA) and a probe specific for the sex chromosomes. The probes bind various chromocenters' regions without overlapping; so, FISH results reveal a complex chromocenter composition. We mapped 18 LINE-derived clones to the RepBase L1 records. Most of them grouped in a ∼2-kb region at the end of the second ORF and 3' untranslated region (UTR). So, even the limited number of the clones allows us to determine the region of the L1 element that is specific for heterochromatic regions. Although the L1 full-length probe did not hybridize at detectable levels to the heterochromatic region on any chromosome, the 2-kb fragment found is definitely a part of these regions. The precise LINE ∼2-kb fragment is the component of mouse and human constitutive heterochromatin enriched with TRs. The method used for amplification of the probes from two sources of the heterochromatic material uncovered the enrichment of a precise fragment of LINE within chromocenters.
Tandemly repeated sequences are unique to eukaryotes and make up tens of percent of the higher eukaryote genome. However, the evolution of this class of sequences is poorly studied. In our paper, 62 fam ilies of Mus musculus tandem repeats are analyzed by bioinformatic methods, while 7 of them are analyzed by fluorescence in situ hybridization. It is shown that the same tandem repeat sets occur together only in closely related species of mice. However, even in such species, we observe differences in localization on chromo somes and the number of individual tandem repeats. With increasing evolutionary distance, only some of the tandem repeat families remain common for different species. It is shown that the use of a combination of bio informatics and molecular biology techniques is very promising for further studies of tandem repeat evolu tion.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.