The identification of underlying differences is crucial for understanding the embryogenesis of species specificity. In this paper, a comparative transcriptome analysis of multi-species (CTAMS) was first developed to detect the genome activation differences of embryogenesis among human, mouse, and bovine. The annotation of intra-species and the detection of inter-species difference were separately to avoid potential batch effects. The profile clusters and Pearson correlation showed the human and bovine embryos displayed a high similarity in genome activation with a dramatic maternal to zygotic transition between 4-8 cell stages. But the mouse embryos maintained the high activity during the whole embryogenesis. Moreover, the stage-specific differentially expressed genes (DEGs) exhibited the specification of trophectoderm and inner cell mass may occur at blastocyst stage in human and bovine embryos, at morula stage in the mouse. The transcription and translation associated pathways were widely activated at morula stage in human embryos, 8-cell stage in bovine embryos and 2-cell stage in mouse embryos. The signal transduction pathways also displayed species-specific expression patterns. Finally, a probable pathway activated landscape of embryonic genome was given with the development axis.INDEX TERMS Embryogenesis, comparative transcriptome of multi-species, genome activation, underlying differences.
Human preimplantation development is a complex process involving dramatic changes in transcriptional architecture. For a better understanding of their time-spatial development, it is indispensable to identify key genes. Although the singlecell RNA sequencing (RNA-seq) techniques could provide detailed clustering signatures, the identification of decisive factors remains difficult. Additionally, it requires high experimental cost and a long experimental period. Thus, it is highly desired to develop computational methods for identifying effective genes of development signature. In this study, we first developed a predictor called EmPredictor to identify developmental stages of human preimplantation embryogenesis. First, we compared the F-score of feature selection algorithms with differential gene expression (DGE) analysis to find specific signatures of the development stage. In addition, by training the support vector machine (SVM), four types of signature subsets were comprehensively discussed. The prediction results showed that a feature subset with 1,881 genes from the F-score algorithm obtained the best predictive performance, which achieved the highest accuracy of 93.3% on the cross-validation set. Further function enrichment demonstrated that the gene set selected by the feature selection method was involved in more development-related pathways and cell fate determination biomarkers. This indicates that the F-score algorithm should be preferentially proposed for detecting key genes of multi-period data in mammalian early development.
Understanding early development offers a striking opportunity to investigate genetic disease, stem cell and assisted reproductive technology. Recent advances in high-throughput sequencing technology have led to the rising influx of omics data, which have rapidly boosted our understanding of mammalian developmental mechanisms. Here, we review the database EmExplorer (a database for exploring time activation of gene expression in mammalian embryos), which systematically organizes the genes from development-related pathways, and which we have already established and continue to update it. The current version of EmExplorer incorporates over 26 000 genes obtained from 306 functional pathways in five species. The function annotations of development-related genes were also integrated into EmExplorer. To facilitate data extraction, the database also contains the following information. (i) The dynamic expression values for each development stage are matched to the corresponding genes. (ii) A two-layer search tool which supports multi-option searching, such as by official symbol, pathway name and function annotation. The returned entries can directly link to the analysis results for the corresponding gene or pathway in the analysis module. (iii) The analysis module provides different gene comparisons at the multi-species level and functional pathway level, which shows the species specificity and stage specificity at the gene or pathway level. (iv) The analysis based on the hypergeometric distribution test reveals the enrichment of gene functions at a particular stage of one organism's pathway. (v) The browser is designed for users with ambiguous searching goals and greatly helps new users to get a general idea of the contents of the database. (vi) The experimentally validated pathways are manually curated and shown on the home page. EmExplorer will be helpful for elucidating early developmental mechanisms and exploring time activation genes. EmExplorer is freely available at http://bioinfor.imu.edu.cn/emexplorer .
The emerging importance of embryonic development research rapidly increases the volume for a professional resource related to multi-omics data. However, the lack of global embryogenesis repository and systematic analysis tools limits the preceding in stem cell research, human congenital diseases and assisted reproduction. Here, we developed the EmAtlas, which collects the most comprehensive multi-omics data and provides multi-scale tools to explore spatiotemporal activation during mammalian embryogenesis. EmAtlas contains data on multiple types of gene expression, chromatin accessibility, DNA methylation, nucleosome occupancy, histone modifications, and transcription factors, which displays the complete spatiotemporal landscape in mouse and human across several time points, involving gametogenesis, preimplantation, even fetus and neonate, and each tissue involves various cell types. To characterize signatures involved in the tissue, cell, genome, gene and protein levels during mammalian embryogenesis, analysis tools on these five scales were developed. Additionally, we proposed EmRanger to deliver extensive development-related biological background annotations. Users can utilize these tools to analyze, browse, visualize, and download data owing to the user-friendly interface. EmAtlas is freely accessible at http://bioinfor.imu.edu.cn/ematlas.
Many progresses have recently been achieved in animal somatic cell nuclear transfer (SCNT). However, embryos derived from SCNT rarely result in live births. Single‐cell RNA sequencing (scRNA‐seq) can be used to investigate the development details of SCNT embryos. Here, bovine fibroblasts and three factors bovine iPSCs (3F biPSCs) were used as donors for bovine nuclear transfer, and the single blastomere transcriptome was analysed by scRNA‐seq. Compared to in vitro fertilization (IVF) embryos, SCNT embryos exhibited many defects. Abnormally expressed genes were found at each stage of embryos, which enriched in metabolism, and epigenetic modification. The DEGs of the adjacent stage in SCNT embryos did not follow the temporal expression pattern similar to that of IVF embryos. Particularly, SCNT 8‐cell stage embryos showed failures in some gene activation, including ZSCAN4 , and defects in protein association networks which cored as POLR2K, GRO1, and ANKRD1. Some important signalling pathways also showed incomplete activation at SCNT zygote to morula stage. Interestingly, 3F biPSCNT embryos exhibited more dysregulated genes than SCNT embryos at zygote and 2‐cell stage, including genes in KDM family. Pseudotime analysis of 3F biPSCNT embryos showed the different developmental fate from SCNT and IVF embryos. These findings suggested partial reprogrammed 3F biPS cells as donors for bovine nuclear transfer hindered the reprogramming of nuclear transfer embryos. Our studies revealed the abnormal gene expression and pathway activation of SCNT embryos, which could increase our understanding of the development of SCNT embryos and give hints to improve the efficiency of nuclear transfer.
Developmental pluripotency-associated 2 (Dppa2) and developmental pluripotency-associated 4 (Dppa4) as positive drivers were helpful for transcriptional regulation of zygotic genome activation (ZGA). Here, we systematically assessed the cooperative interplay of Dppa2 and Dppa4 in regulating cell pluripotency and found that simultaneous overexpression of Dppa2/4 can make induced pluripotent stem cells closer to embryonic stem cells (ESCs). Compared with other pluripotency transcription factors, Dppa2/4 can regulate majorities of signaling pathways by binding on CG-rich region of proximal promoter (0–500 bp), of which 85% and 77% signaling pathways were significantly activated by Dppa2 and Dppa4, respectively. Notably, Dppa2/4 also can dramatically trigger the decisive signaling pathways for facilitating ZGA, including Hippo, MAPK and TGF-beta signaling pathways and so on. At last, we found alkaline phosphatase, placental-like 2 (Alppl2) was completely silenced when Dppa2 and 4 single- or double-knockout in ESC, which is consistent with Dux. Moreover, Alppl2 was significantly activated in mouse 2-cell embryos and 4–8 cells stage of human embryos, further predicted that Alppl2 was directly regulated by Dppa2/4 as a ZGA candidate driver to facilitate pre-embryonic development.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.