Here, we provide fundamental insights into early human development by single-cell RNA-sequencing of human and mouse preimplantation embryos. We elucidate conserved transcriptional programs along with those that are human specific. Importantly, we validate our RNA-sequencing findings at the protein level, which further reveals differences in human and mouse embryo gene expression. For example, we identify several genes exclusively expressed in the human pluripotent epiblast, including the transcription factor KLF17. Key components of the TGF-β signalling pathway, including NODAL, GDF3, TGFBR1/ALK5, LEFTY1, SMAD2, SMAD4 and TDGF1, are also enriched in the human epiblast. Intriguingly, inhibition of TGF-β signalling abrogates NANOG expression in human epiblast cells, consistent with a requirement for this pathway in pluripotency. Although the key trophectoderm factors Id2, Elf5 and Eomes are exclusively localized to this lineage in the mouse, the human orthologues are either absent or expressed in alternative lineages. Importantly, we also identify genes with conserved expression dynamics, including Foxa2/FOXA2, which we show is restricted to the primitive endoderm in both human and mouse embryos. Comparison of the human epiblast to existing embryonic stem cells (hESCs) reveals conservation of pluripotency but also additional pathways more enriched in hESCs. Our analysis highlights significant differences in human preimplantation development compared with mouse and provides a molecular blueprint to understand human embryogenesis and its relationship to stem cells.
SummaryDespite their fundamental biological and clinical importance, the molecular mechanisms that regulate the first cell fate decisions in the human embryo are not well understood. Here we use CRISPR–Cas9-mediated genome editing to investigate the function of the pluripotency transcription factor OCT4 during human embryogenesis. We identified an efficient OCT4-targeting guide RNA using an inducible human embryonic stem cell-based system and microinjection of mouse zygotes. Using these refined methods, we efficiently and specifically targeted the gene encoding OCT4 (POU5F1) in diploid human zygotes and found that blastocyst development was compromised. Transcriptomics analysis revealed that, in POU5F1-null cells, gene expression was downregulated not only for extra-embryonic trophectoderm genes, such as CDX2, but also for regulators of the pluripotent epiblast, including NANOG. By contrast, Pou5f1-null mouse embryos maintained the expression of orthologous genes, and blastocyst development was established, but maintenance was compromised. We conclude that CRISPR–Cas9-mediated genome editing is a powerful method for investigating gene function in the context of human development.
Mitochondrial DNA (mtDNA) mutations are maternally inherited and are associated with a broad range of debilitating and fatal diseases1. Reproductive technologies designed to uncouple the inheritance of mtDNA from nuclear DNA may enable affected women to have a genetically related child with a greatly reduced risk of mtDNA disease. Here we report the first preclinical studies on pronuclear transplantation (PNT). Surprisingly, techniques used in proof of concept studies involving abnormally fertilized human zygotes2 were not well tolerated by normally fertilized zygotes. We have therefore developed an alternative approach based on transplanting pronuclei shortly after completion of meiosis rather than shortly before the first mitotic division. This promotes efficient development to the blastocyst stage with no detectable effect on aneuploidy or gene expression. Following optimisation, mtDNA carryover was reduced to <2% in the majority (79%) of PNT blastocysts. The importance of reducing carryover to the lowest possible levels is highlighted by a progressive increase in heteroplasmy in a stem cell line derived from a PNT blastocyst with 4% mtDNA carryover. We conclude that PNT has the potential to reduce the risk of mtDNA disease, but it may not guarantee prevention.
Transcription factor-mediated reprograming is a powerful method to study cell fate changes. In this study, we demonstrate that the transcription factor Gata6 can initiate reprograming of multiple cell types to induced extraembryonic endoderm stem (iXEN) cells. Intriguingly, Gata6 is sufficient to drive iXEN cells from mouse pluripotent cells and differentiated neural cells. Furthermore, GATA6 induction in human embryonic stem (hES) cells also down-regulates pluripotency gene expression and up-regulates extraembryonic endoderm (ExEn) genes, revealing a conserved function in mediating this cell fate switch. Profiling transcriptional changes following Gata6 induction in mES cells reveals step-wise pluripotency factor disengagement, with initial repression of Nanog and Esrrb, then Sox2, and finally Oct4, alongside step-wise activation of ExEn genes. Chromatin immunoprecipitation and subsequent high-throughput sequencing analysis shows Gata6 enrichment near pluripotency and endoderm genes, suggesting that Gata6 functions as both a direct repressor and activator. Together, this demonstrates that Gata6 is a versatile and potent reprograming factor that can act alone to drive a cell fate switch from diverse cell types.
Excessive alcohol consumption is one of the main causes of death and disability worldwide. Alcohol consumption is a heritable complex trait. We conducted a meta-analysis of genome-wide association studies (GWAS) of gram/day (g/d) alcohol consumption in UK-Biobank, AlcGen and CHARGE+ consortia accumulating 480,842 people of European descent to decipher the genetic architecture of alcohol intake. We identified 46 novel, common loci, and investigated their potential functional significance using magnetic resonance imaging data and gene expression studies. Our results identify genetic pathways associated with alcohol consumption and suggest shared genetic mechanisms with neuropsychiatric disorders including schizophrenia.
Proteogenomics has the potential to advance genome annotation through high quality peptide identifications derived from mass spectrometry experiments, which demonstrate a given gene or isoform is expressed and translated at the protein level. This can advance our understanding of genome function, discovering novel genes and gene structure that have not yet been identified or validated. Because of the high-throughput shotgun nature of most proteomics experiments, it is essential to carefully control for false positives and prevent any potential misannotation. A number of statistical procedures to deal with this are in wide use in proteomics, calculating false discovery rate (FDR) and posterior error probability (PEP) values for groups and individual peptide spectrum matches (PSMs). These methods control for multiple testing and exploit decoy databases to estimate statistical significance. Here, we show that database choice has a major effect on these confidence estimates leading to significant differences in the number of PSMs reported. We note that standard target:decoy approaches using six-frame translations of nucleotide sequences, such as assembled transcriptome data, apparently underestimate the confidence assigned to the PSMs. The source of this error stems from the inflated and unusual nature of the six-frame database, where for every target sequence there exists five “incorrect” targets that are unlikely to code for protein. The attendant FDR and PEP estimates lead to fewer accepted PSMs at fixed thresholds, and we show that this effect is a product of the database and statistical modeling and not the search engine. A variety of approaches to limit database size and remove noncoding target sequences are examined and discussed in terms of the altered statistical estimates generated and PSMs reported. These results are of importance to groups carrying out proteogenomics, aiming to maximize the validation and discovery of gene structure in sequenced genomes, while still controlling for false positives.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.