The generation of a collection of human induced pluripotent stem cell (hiPSC) lines expressing endogenously GFP-tagged proteins using CRISPR/Cas9 methods is described. The methods used and the genomic and cell biological data validating the GFP-tagged hiPSC lines are also presented.
We present a CRISPR/Cas9 genome editing strategy to systematically tag endogenous proteins with fluorescent tags in human inducible pluripotent stem cells. To date we have generated multiple human iPSC lines with GFP tags for 10 proteins representing key cellular structures. The tagged proteins include alpha tubulin, beta actin, desmoplakin, fibrillarin, lamin B1, non-muscle myosin heavy chain IIB, paxillin, Sec61 beta, tight junction protein ZO1, and Tom20. Our genome editing methodology using Cas9 ribonuclear protein electroporation and fluorescence-based enrichment of edited cells resulted in <0.1-24% HDR across all experiments. Clones were generated from each edited population and screened for precise editing. ~25% of the clones contained precise mono-allelic edits at the targeted locus. Furthermore, 92% (36/39) of expanded clonal lines satisfied key quality control criteria including genomic stability, appropriate expression and localization of the tagged protein, and pluripotency. Final clonal lines corresponding to each of the 10 cellular structures are now available to the research community. The data described here, including our editing protocol, genetic screening, quality control assays, and imaging observations, can serve as an initial resource for genome editing in cell biology and stem cell research.
The discovery that the bacterial clustered regularly interspaced short palindromic repeats (CRISPR)-CRISPR-associated protein 9 (Cas9) acquired immune system can be utilized to create double-strand breaks (DSBs) in eukaryotic genomes has resulted in the ability to create genomic changes more easily than with other genome engineering techniques. While there is significant potential for the CRISPR-Cas9 system to advance basic and applied research, several unknowns remain, including the specificity of the RNA-directed DNA cleavage by the small targeting RNA, the CRISPR RNA (crRNA). Here we describe a novel synthetic RNA approach that allows for high-throughput gene editing experiments. This was used with a functional assay for protein disruption to perform high-throughput analysis of crRNA activity and specificity. We performed a comprehensive test of target cleavage using crRNAs that contain one and two nucleotide mismatches to the DNA target in the 20mer targeting region of the crRNA, allowing for the evaluation of hundreds of potential mismatched target sites without the requirement for the off-target sequences and their adjacent PAMs to be present in the genome. Our results demonstrate that while many crRNAs are functional, less than 5% of crRNAs with two mismatches to their target are effective in gene editing; this suggests an overall high level of functionality but low level of off-targeting.
Summary We describe a multistep method for endogenous tagging of transcriptionally silent genes in human induced pluripotent stem cells (hiPSCs). A monomeric EGFP (mEGFP) fusion tag and a constitutively expressed mCherry fluorescence selection cassette were delivered in tandem via homology-directed repair to five genes not expressed in hiPSCs but important for cardiomyocyte sarcomere function: TTN , MYL7 , MYL2 , TNNI1 , and ACTN2 . CRISPR/Cas9 was used to deliver the selection cassette and subsequently mediate its excision via microhomology-mediated end-joining and non-homologous end-joining. Most excised clones were effectively tagged, and all properly tagged clones expressed the mEGFP fusion protein upon differentiation into cardiomyocytes, allowing live visualization of these cardiac proteins at the sarcomere. This methodology provides a broadly applicable strategy for endogenously tagging transcriptionally silent genes in hiPSCs, potentially enabling their systematic and dynamic study during differentiation and morphogenesis.
Understanding how a subset of expressed genes dictates cellular phenotype is a considerable challenge owing to the large numbers of molecules involved, their combinatorics and the plethora of cellular behaviours that they determine1,2. Here we reduced this complexity by focusing on cellular organization—a key readout and driver of cell behaviour3,4—at the level of major cellular structures that represent distinct organelles and functional machines, and generated the WTC-11 hiPSC Single-Cell Image Dataset v1, which contains more than 200,000 live cells in 3D, spanning 25 key cellular structures. The scale and quality of this dataset permitted the creation of a generalizable analysis framework to convert raw image data of cells and their structures into dimensionally reduced, quantitative measurements that can be interpreted by humans, and to facilitate data exploration. This framework embraces the vast cell-to-cell variability that is observed within a normal population, facilitates the integration of cell-by-cell structural data and allows quantitative analyses of distinct, separable aspects of organization within and across different cell populations. We found that the integrated intracellular organization of interphase cells was robust to the wide range of variation in cell shape in the population; that the average locations of some structures became polarized in cells at the edges of colonies while maintaining the ‘wiring’ of their interactions with other structures; and that, by contrast, changes in the location of structures during early mitotic reorganization were accompanied by changes in their wiring.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.