The proliferation of large-scale DNA-sequencing projects in recent years has driven a search for alternative methods to reduce time and cost. Here we describe a scalable, highly parallel sequencing system with raw throughput significantly greater than that of state-of-the-art capillary electrophoresis instruments. The apparatus uses a novel fibre-optic slide of individual wells and is able to sequence 25 million bases, at 99% or better accuracy, in one four-hour run. To achieve an approximately 100-fold increase in throughput over current Sanger sequencing technology, we have developed an emulsion method for DNA amplification and an instrument for sequencing by synthesis using a pyrosequencing protocol optimized for solid support and picolitre-scale volumes. Here we show the utility, throughput, accuracy and robustness of this system by shotgun sequencing and de novo assembly of the Mycoplasma genitalium genome with 96% coverage at 99.96% accuracy in one run of the machine.DNA sequencing has markedly changed the nature of biomedical research and medicine. Reductions in the cost, complexity and time required to sequence large amounts of DNA, including improvements in the ability to sequence bacterial and eukaryotic genomes, will have significant scientific, economic and cultural impact. Largescale sequencing projects, including whole-genome sequencing, have usually required the cloning of DNA fragments into bacterial vectors, amplification and purification of individual templates, followed by Sanger sequencing 1 using fluorescent chain-terminating nucleotide analogues 2 and either slab gel or capillary electrophoresis. Current estimates put the cost of sequencing a human genome between $10 million and $25 million 3 . Alternative sequencing methods have been described 4-8 ; however, no technology has displaced the use of bacterial vectors and Sanger sequencing as the main generators of sequence information.Here we describe an integrated system whose throughput routinely enables applications requiring millions of bases of sequence information, including whole-genome sequencing. Our focus has been on the co-development of an emulsion-based method 9-11 to isolate and amplify DNA fragments in vitro, and of a fabricated substrate and instrument that performs pyrophosphate-based sequencing (pyrosequencing 5,12 ) in picolitre-sized wells.In a typical run we generate over 25 million bases with a Phred quality score of 20 or better (predicted to have an accuracy of 99% or higher). Although this Phred 20 quality throughput is significantly higher than that of Sanger sequencing by capillary electrophoresis, it is currently at the cost of substantially shorter reads and lower average individual read accuracy. Sanger-based capillary electrophoresis sequencing systems produce up to 700 bases of sequence information from each of 96 DNA templates at an average read accuracy of 99.4% in 1 h, or 67,000 bases per hour, with substantially all of the bases having Phred 20 or better quality 23 . We further characterize the performance ...
The association of genetic variation with disease and drug response, and improvements in nucleic acid technologies, have given great optimism for the impact of 'genomic medicine'. However, the formidable size of the diploid human genome, approximately 6 gigabases, has prevented the routine application of sequencing methods to deciphering complete individual human genomes. To realize the full potential of genomics for human health, this limitation must be overcome. Here we report the DNA sequence of a diploid genome of a single individual, James D. Watson, sequenced to 7.4-fold redundancy in two months using massively parallel sequencing in picolitre-size reaction vessels. This sequence was completed in two months at approximately one-hundredth of the cost of traditional capillary electrophoresis methods. Comparison of the sequence to the reference genome led to the identification of 3.3 million single nucleotide polymorphisms, of which 10,654 cause amino-acid substitution within the coding sequence. In addition, we accurately identified small-scale (2-40,000 base pair (bp)) insertion and deletion polymorphism as well as copy number variation resulting in the large-scale gain and loss of chromosomal segments ranging from 26,000 to 1.5 million base pairs. Overall, these results agree well with recent results of sequencing of a single individual by traditional methods. However, in addition to being faster and significantly less expensive, this sequencing technology avoids the arbitrary loss of genomic sequences inherent in random shotgun sequencing by bacterial cloning because it amplifies DNA in a cell-free system. As a result, we further demonstrate the acquisition of novel human sequence, including novel genes not previously identified by traditional genomic sequencing. This is the first genome sequenced by next-generation technologies. Therefore it is a pilot for the future challenges of 'personalized genome sequencing'.
The importance of the interdomain connector loop and of the carboxy-terminal domain of Saccharomyces cerevisiae proliferating cell nuclear antigen (PCNA) for functional interaction with DNA polymerases ␦ (Pol␦) and (Pol) was investigated by site-directed mutagenesis. Two alleles, pol30-79 (IL126,128AA) in the interdomain connector loop and pol30-90 (PK252,253AA) near the carboxy terminus, caused growth defects and elevated sensitivity to DNA-damaging agents. These two mutants also had elevated rates of spontaneous mutations. The mutator phenotype of pol30-90 was due to partially defective mismatch repair in the mutant. In vitro, the mutant PCNAs showed defects in DNA synthesis. Interestingly, the pol30-79 mutant PCNA (pcna-79) was most defective in replication with Pol␦, whereas pcna-90 was defective in replication with Pol. Protein-protein interaction studies showed that pcna-79 and pcna-90 failed to interact with Pol␦ and Pol, respectively. In addition, pcna-90 was defective in interaction with the FEN-1 endo-exonuclease (RTH1 product). A loss of interaction between pcna-79 and the smallest subunit of Pol␦, the POL32 gene product, implicates this interaction in the observed defect with the polymerase. Neither PCNA mutant showed a defect in the interaction with replication factor C or in loading by this complex. Processivity of DNA synthesis by the mutant holoenzyme containing pcna-79 was unaffected on poly(dA) ⅐ oligo(dT) but was dramatically reduced on a natural template with secondary structure. A stem-loop structure with a 20-bp stem formed a virtually complete block for the holoenzyme containing pcna-79 but posed only a minor pause site for wild-type holoenzyme, indicating a function of the POL32 gene product in allowing replication past structural blocks.
In the presence of proliferating cell nuclear antigen, yeast DNA polymerase ␦ (Pol ␦) replicated DNA at a rate of 40 -60 nt/s. When downstream double-stranded DNA was encountered, Pol ␦ paused, but most replication complexes proceeded to carry out strand-displacement synthesis at a rate of 1.5 nt/s. In the presence of the flap endonuclease FEN1 (Rad27), the complex carried out nick translation (1.7 nt/s). The Dna2 nuclease/helicase alone did not efficiently promote nick translation, nor did it affect nick translation with FEN1. Maturation in the presence of DNA ligase was studied with various downstream primers. Downstream DNA primers, RNA primers, and small 5-flaps were efficiently matured by Pol ␦ and FEN1, and Dna2 did not stimulate maturation. However, maturation of long 5-flaps to which replication protein A can bind required both DNA2 and FEN1. The maturation kinetics were optimal with a slight molar excess over DNA of Pol ␦, FEN1, and proliferating cell nuclear antigen. A large molar excess of DNA ligase substantially enhanced the rate of maturation and shortened the nick-translation patch (nucleotides excised past the RNA/DNA junction before ligation) to 4 -6 nt from 8 -12 nt with equimolar ligase. These results suggest that FEN1, but not DNA ligase, is a stable component of the maturation complex.In eukaryotic cells, Okazaki fragments are efficiently matured during elongation of DNA replication. Earlier models of this process, based on in vitro replication studies of simian virus 40 DNA, implicated the FEN1 5Ј-FLAP-exo/endonuclease and RNase H1 as the main degradative enzymes to remove the RNA moiety of the Okazaki fragment and provide an appropriate gap for filling by a DNA polymerase and sealing by DNA ligase I (for reviews, see Refs. 1 and 2). Recent studies of the mutator phenotypes of RAD27 (encoding FEN1), RNH35 (encoding RNase H1), and the double mutants suggest that the two enzymes function in separate pathways (3). Rather, genetic studies have suggested that an essential nuclease/helicase, Dna2, may be an important component of the lagging-strand replication apparatus based on several criteria, including synthetic lethality of temperature-sensitive mutations in DNA2 with a deletion mutation of the RAD27 gene (4, 5). DNA2 shows genetic interactions with DNA polymerase alpha and alphaaccessory proteins (6), and the temperature sensitivity of S. pombe DNA2 mutants is suppressed by overexpression of each of several genes playing a role in the elongation and maturation of Okazaki fragments, including those encoding FEN1, DNA ligase, and DNA polymerase ␦ (Pol ␦) 1 (7). Subsequent to the demonstration of a rather inefficient helicase activity in Dna2, a 5Ј 3 3Ј-endonuclease activity was characterized, and it is the nuclease rather than the helicase that confers the essential phenotype (8 -10). In addition, DNA2 is required for the proper maintenance of telomeres (11,12).Beyond its demonstrated function in Okazaki-fragment maturation, FEN1 plays an important role in several other DNA metabolic processe...
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.