The proliferation of large-scale DNA-sequencing projects in recent years has driven a search for alternative methods to reduce time and cost. Here we describe a scalable, highly parallel sequencing system with raw throughput significantly greater than that of state-of-the-art capillary electrophoresis instruments. The apparatus uses a novel fibre-optic slide of individual wells and is able to sequence 25 million bases, at 99% or better accuracy, in one four-hour run. To achieve an approximately 100-fold increase in throughput over current Sanger sequencing technology, we have developed an emulsion method for DNA amplification and an instrument for sequencing by synthesis using a pyrosequencing protocol optimized for solid support and picolitre-scale volumes. Here we show the utility, throughput, accuracy and robustness of this system by shotgun sequencing and de novo assembly of the Mycoplasma genitalium genome with 96% coverage at 99.96% accuracy in one run of the machine.DNA sequencing has markedly changed the nature of biomedical research and medicine. Reductions in the cost, complexity and time required to sequence large amounts of DNA, including improvements in the ability to sequence bacterial and eukaryotic genomes, will have significant scientific, economic and cultural impact. Largescale sequencing projects, including whole-genome sequencing, have usually required the cloning of DNA fragments into bacterial vectors, amplification and purification of individual templates, followed by Sanger sequencing 1 using fluorescent chain-terminating nucleotide analogues 2 and either slab gel or capillary electrophoresis. Current estimates put the cost of sequencing a human genome between $10 million and $25 million 3 . Alternative sequencing methods have been described 4-8 ; however, no technology has displaced the use of bacterial vectors and Sanger sequencing as the main generators of sequence information.Here we describe an integrated system whose throughput routinely enables applications requiring millions of bases of sequence information, including whole-genome sequencing. Our focus has been on the co-development of an emulsion-based method 9-11 to isolate and amplify DNA fragments in vitro, and of a fabricated substrate and instrument that performs pyrophosphate-based sequencing (pyrosequencing 5,12 ) in picolitre-sized wells.In a typical run we generate over 25 million bases with a Phred quality score of 20 or better (predicted to have an accuracy of 99% or higher). Although this Phred 20 quality throughput is significantly higher than that of Sanger sequencing by capillary electrophoresis, it is currently at the cost of substantially shorter reads and lower average individual read accuracy. Sanger-based capillary electrophoresis sequencing systems produce up to 700 bases of sequence information from each of 96 DNA templates at an average read accuracy of 99.4% in 1 h, or 67,000 bases per hour, with substantially all of the bases having Phred 20 or better quality 23 . We further characterize the performance ...
Two large-scale yeast two-hybrid screens were undertaken to identify protein-protein interactions between full-length open reading frames predicted from the Saccharomyces cerevisiae genome sequence. In one approach, we constructed a protein array of about 6,000 yeast transformants, with each transformant expressing one of the open reading frames as a fusion to an activation domain. This array was screened by a simple and automated procedure for 192 yeast proteins, with positive responses identified by their positions in the array. In a second approach, we pooled cells expressing one of about 6,000 activation domain fusions to generate a library. We used a high-throughput screening procedure to screen nearly all of the 6,000 predicted yeast proteins, expressed as Gal4 DNA-binding domain fusion proteins, against the library, and characterized positives by sequence analysis. These approaches resulted in the detection of 957 putative interactions involving 1,004 S. cerevisiae proteins. These data reveal interactions that place functionally unclassified proteins in a biological context, interactions between proteins involved in the same biological function, and interactions that link biological functions together into larger cellular processes. The results of these screens are shown here.
The association of genetic variation with disease and drug response, and improvements in nucleic acid technologies, have given great optimism for the impact of 'genomic medicine'. However, the formidable size of the diploid human genome, approximately 6 gigabases, has prevented the routine application of sequencing methods to deciphering complete individual human genomes. To realize the full potential of genomics for human health, this limitation must be overcome. Here we report the DNA sequence of a diploid genome of a single individual, James D. Watson, sequenced to 7.4-fold redundancy in two months using massively parallel sequencing in picolitre-size reaction vessels. This sequence was completed in two months at approximately one-hundredth of the cost of traditional capillary electrophoresis methods. Comparison of the sequence to the reference genome led to the identification of 3.3 million single nucleotide polymorphisms, of which 10,654 cause amino-acid substitution within the coding sequence. In addition, we accurately identified small-scale (2-40,000 base pair (bp)) insertion and deletion polymorphism as well as copy number variation resulting in the large-scale gain and loss of chromosomal segments ranging from 26,000 to 1.5 million base pairs. Overall, these results agree well with recent results of sequencing of a single individual by traditional methods. However, in addition to being faster and significantly less expensive, this sequencing technology avoids the arbitrary loss of genomic sequences inherent in random shotgun sequencing by bacterial cloning because it amplifies DNA in a cell-free system. As a result, we further demonstrate the acquisition of novel human sequence, including novel genes not previously identified by traditional genomic sequencing. This is the first genome sequenced by next-generation technologies. Therefore it is a pilot for the future challenges of 'personalized genome sequencing'.
We have identified a new protein, Tim54p, located in the yeast mitochondrial inner membrane. Tim54p is an essential import component, required for the insertion of at least two polytopic proteins into the inner membrane, but not for the translocation of precursors into the matrix. Several observations suggest that Tim54p and Tim22p are part of a protein complex in the inner membrane distinct from the previously characterized Tim23p-Tim17p complex. First, multiple copies of the TIM22 gene, but not TIM23 or TIM17, suppress the growth defect of a tim54-1 temperature-sensitive mutant. Second, Tim22p can be coprecipitated with Tim54p from detergent-solubilized mitochondria, but Tim54p and Tim22p do not interact with either Tim23p or Tim17p. Finally, the tim54-1 mutation destabilizes the Tim22 protein, but not Tim23p or Tim17p. Our results support the idea that the mitochondrial inner membrane carries two independent import complexes: one required for the translocation of proteins across the inner membrane (Tim23p–Tim17p), and the other required for the insertion of proteins into the inner membrane (Tim54p–Tim22p).
In the yeast Saccharomyces cerevisiae, mitochondria form a branched, tubular reticulum in the periphery of the cell. Mmm1p is required to maintain normal mitochondrial shape and in mmm1 mutants mitochondria form large, spherical organelles. To further explore Mmm1p function, we examined the localization of a Mmm1p–green fluorescent protein (GFP) fusion in living cells. We found that Mmm1p-GFP is located in small, punctate structures on the mitochondrial outer membrane, adjacent to a subset of matrix-localized mitochondrial DNA nucleoids. We also found that the temperature-sensitive mmm1-1 mutant was defective in transmission of mitochondrial DNA to daughter cells immediately after the shift to restrictive temperature. Normal mitochondrial nucleoid structure also collapsed at the nonpermissive temperature with similar kinetics. Moreover, we found that mitochondrial inner membrane structure is dramatically disorganized in mmm1 disruption strains. We propose that Mmm1p is part of a connection between the mitochondrial outer and inner membranes, anchoring mitochondrial DNA nucleoids in the matrix.
The mitochondrial outer membrane protein, Mmm1p, is required for normal mitochondrial shape in yeast. To identify new morphology proteins, we isolated mutations incompatible with the mmm1-1 mutant. One of these mutants, mmm2-1, is defective in a novel outer membrane protein. Lack of Mmm2p causes a defect in mitochondrial shape and loss of mitochondrial DNA (mtDNA) nucleoids. Like the Mmm1 protein (Aiken Hobbs, A.E., M. Srinivasan, J.M. McCaffery, and R.E. Jensen. 2001. J. Cell Biol. 152:401–410.), Mmm2p is located in dot-like particles on the mitochondrial surface, many of which are adjacent to mtDNA nucleoids. While some of the Mmm2p-containing spots colocalize with those containing Mmm1p, at least some of Mmm2p is separate from Mmm1p. Moreover, while Mmm2p and Mmm1p both appear to be part of large complexes, we find that Mmm2p and Mmm1p do not stably interact and appear to be members of two different structures. We speculate that Mmm2p and Mmm1p are components of independent machinery, whose dynamic interactions are required to maintain mitochondrial shape and mtDNA structure.
Single cell RNA sequencing has emerged as a powerful tool for resolving transcriptional diversity in tumors, but is limited by throughput, cost and the ability to process archival frozen tissue samples. Here we develop a high-throughput 3′ single-nucleus RNA sequencing approach that combines nanogrid technology, automated imaging, and cell selection to sequence up to ~1800 single nuclei in parallel. We compare the transcriptomes of 485 single nuclei to 424 single cells in a breast cancer cell line, which shows a high concordance (93.34%) in gene levels and abundance. We also analyze 416 nuclei from a frozen breast tumor sample and 380 nuclei from normal breast tissue. These data reveal heterogeneity in cancer cell phenotypes, including angiogenesis, proliferation, and stemness, and a minor subpopulation (19%) with many overexpressed cancer genes. Our studies demonstrate the utility of nanogrid single-nucleus RNA sequencing for studying the transcriptional programs of tumor nuclei in frozen archival tissue samples.
BackgroundTechnological advances have enabled transcriptome characterization of cell types at the single-cell level providing new biological insights. New methods that enable simple yet high-throughput single-cell expression profiling are highly desirable.ResultsHere we report a novel nanowell-based single-cell RNA sequencing system, ICELL8, which enables processing of thousands of cells per sample. The system employs a 5,184-nanowell-containing microchip to capture ~1,300 single cells and process them. Each nanowell contains preprinted oligonucleotides encoding poly-d(T), a unique well barcode, and a unique molecular identifier. The ICELL8 system uses imaging software to identify nanowells containing viable single cells and only wells with single cells are processed into sequencing libraries. Here, we report the performance and utility of ICELL8 using samples of increasing complexity from cultured cells to mouse solid tissue samples. Our assessment of the system to discriminate between mixed human and mouse cells showed that ICELL8 has a low cell multiplet rate (< 3%) and low cross-cell contamination. We characterized single-cell transcriptomes of more than a thousand cultured human and mouse cells as well as 468 mouse pancreatic islets cells. We were able to identify distinct cell types in pancreatic islets, including alpha, beta, delta and gamma cells.ConclusionsOverall, ICELL8 provides efficient and cost-effective single-cell expression profiling of thousands of cells, allowing researchers to decipher single-cell transcriptomes within complex biological samples.Electronic supplementary materialThe online version of this article (doi:10.1186/s12864-017-3893-1) contains supplementary material, which is available to authorized users.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
334 Leonard St
Brooklyn, NY 11211
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.