The recent advent of methods for high-throughput single-cell molecular profiling has catalyzed a growing sense in the scientific community that the time is ripe to complete the 150-year-old effort to identify all cell types in the human body. The Human Cell Atlas Project is an international collaborative effort that aims to define all human cell types in terms of distinctive molecular profiles (such as gene expression profiles) and to connect this information with classical cellular descriptions (such as location and morphology). An open comprehensive reference map of the molecular state of cells in healthy human tissues would propel the systematic study of physiological states, developmental trajectories, regulatory circuitry and interactions of cells, and also provide a framework for understanding cellular dysregulation in human disease. Here we describe the idea, its potential utility, early proofs-of-concept, and some design considerations for the Human Cell Atlas, including a commitment to open data, code, and community.
The National Institutes of Health Mammalian Gene Collection (MGC) Program is a multiinstitutional effort to identify and sequence a cDNA clone containing a complete ORF for each human and mouse gene. ESTs were generated from libraries enriched for full-length cDNAs and analyzed to identify candidate full-ORF clones, which then were sequenced to high accuracy. The MGC has currently sequenced and verified the full ORF for a nonredundant set of >9,000 human and >6,000 mouse genes. Candidate full-ORF clones for an additional 7,800 human and 3,500 mouse genes also have been identified. All MGC sequences and clones are available without restriction through public databases and clone distribution networks (see http:͞͞mgc.nci.nih.gov).T he gene content of the mammalian genome is a topic of great interest. While draft sequences are now available for the human (1, 2), mouse (www.ensembl.org͞Mus musculus), and rat (http:͞͞hgsc.bcm.tmc.edu͞projects͞rat) genomes, the challenge remains to correctly identify all of the encoded genes. Difficulty in deciphering the anatomy of mammalian genes is due to several factors, including large amounts of intervening (noncoding) sequence, the imperfection of gene-prediction algorithms (3), and the incompleteness of cDNA-sequence resources, many of which consist of gene tags of variable length and quality. Full-length cDNA sequences are extremely useful for determining the genomic structure of genes, especially when analyzed within the context of genomic sequence. To facilitate geneidentification efforts and to catalyze experimental investigation, the National Institutes of Health (NIH) launched the Mammalian Gene Collection (MGC) program (4) with the aim of providing freely accessible, high-quality sequences for validated, complete ORF cDNA clones. In this article, we describe our progress toward the goal of identifying and accurately sequencing at least one full ORF-containing cDNA clone for each human and mouse gene, as well as making these fully sequenced clones available without restriction. Materials and MethodscDNA Library Production. MGC cDNA libraries were prepared from a diverse set of tissues and cell lines, in several different vector systems, by using a variety of methods. Vector maps and details of library construction are available at http:͞͞mgc. nci.nih.gov͞Info͞VectorMaps. The complete sequences for each of the MGC vectors can be found at http:͞͞image.llnl.gov͞ image͞html͞vectors.shtml. The catalog of MGC cDNA libraries can be accessed at http:͞͞mgc.nci.nih.gov.
The recent advent of methods for high-throughput single-cell molecular profiling has catalyzed a growing sense in the scientific community that the time is ripe to complete the 150-year-old effort to identify all cell types in the human body, by undertaking a Human Cell Atlas Project as an international collaborative effort. The aim would be to define all human cell types in terms of distinctive molecular profiles (e.g., gene expression) and connect this information with classical cellular descriptions (e.g., location and morphology). A comprehensive reference map of the molecular state of cells in healthy human tissues would propel the systematic study of physiological states, developmental trajectories, regulatory circuitry and interactions of cells, as well as provide a framework for understanding cellular dysregulation in human disease. Here we describe the idea, its potential utility, early proofs-of-concept, and some design considerations for the Human Cell Atlas.
Active transport of proteins into the nucleus is mediated by interaction between the classical nuclear localization signals (NLSs) of the targeted proteins and the NLS receptor (importin) complex. This nuclear transport system is highly regulated and conserved in eukaryotes and is essential for cell survival. Using a fragment of BRCA1 containing the two NLS motifs as a bait for yeast two-hybrid screening, we have isolated four clones, one of which is importin ␣. Here we characterize one of the other clones identified, BRAP2, which is a novel gene and expressed as a 2-kilobase mRNA in human mammary epithelial cells and some but not all tissues of mice. The isolated full-length cDNA encodes a novel protein containing 600 amino acid residues with pI 6.04. Characteristic motifs of C2H2 zinc fingers and leucine heptad repeats are present in the middle and C-terminal regions of the protein, respectively. BRAP2 also shares significant homology with a hypothetical protein from yeast Saccharomyces cerevisiae, especially in the zinc finger region. Antibodies prepared against the C-terminal region of BRAP2 fused to glutathione S-transferase specifically recognize a cellular protein with a molecular size of 68 kDa, consistent with the size of the in vitro translated protein. Cellular BRAP2 is mainly cytoplasmic and binds to the NLS motifs of BRCA1 with similar specificity to that of importin ␣ in both two-hybrid assays in yeast and glutathione S-transferase pull-down assays in vitro. Other motifs such as the SV40 large T antigen NLS motif and the bipartite NLS motif found in mitosin are also recognized by BRAP2. Similarly, the yeast homolog of BRAP2 also binds to these NLS motifs in vitro. These results imply that BRAP2 may function as a cytoplasmic retention protein and play a role in regulating transport of nuclear proteins.The passage of macromolecules between the nucleus and the cytoplasm occurs through nuclear pores. Small macromolecules can diffuse through the nuclear pores at a rate inversely proportional to their mass. Proteins with molecular masses greater than 40 -60 kDa are actively transported through the nuclear pores. To be transported into the nucleus, the protein must either contain a nuclear localization signal or, if not, be bound to another protein that does (1, 2). This process requires at least four different factors acting in two distinct steps. The first step is mediated by importin ␣ (also termed karyopherin ␣) and importin  (also termed karyopherin ). The ␣ subunit is primarily responsible for NLS 1 recognition, whereas the  subunit appears to mediate docking to the nuclear pore complex. The second translocation step requires the small G protein Ran/TC4 and an interacting partner, p15 (3-13).The presence of a nuclear localization signal may not be sufficient to direct nuclear import. The target efficiency of NLS motifs can be modified by the presence of multiple NLS motifs within a protein, by modifications of the flanking sequences, and by the accessibility of the NLSs to the import machinery (1...
Here we develop a high-throughput single-cell ATAC-seq (assay for transposition of accessible chromatin) method to measure physical access to DNA in whole cells. Our approach integrates fluorescence imaging and addressable reagent deposition across a massively parallel (5184) nano-well array, yielding a nearly 20-fold improvement in throughput (up to ~1800 cells/chip, 4–5 h on-chip processing time) and library preparation cost (~81¢ per cell) compared to prior microfluidic implementations. We apply this method to measure regulatory variation in peripheral blood mononuclear cells (PBMCs) and show robust, de novo clustering of single cells by hematopoietic cell type.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.