Genome sequencing of large numbers of individuals promises to advance the understanding, treatment, and prevention of human diseases, among other applications. We describe a genome sequencing platform that achieves efficient imaging and low reagent consumption with combinatorial probe anchor ligation chemistry to independently assay each base from patterned nanoarrays of self-assembling DNA nanoballs. We sequenced three human genomes with this platform, generating an average of 45- to 87-fold coverage per genome and identifying 3.2 to 4.5 million sequence variants per genome. Validation of one genome data set demonstrates a sequence accuracy of about 1 false variant per 100 kilobases. The high accuracy, affordable cost of $4400 for sequencing consumables, and scalability of this platform enable complete human genome sequencing for the detection of rare variants in large-scale genetic studies.
Recent advances in whole genome sequencing have brought the vision of personal genomics and genomic medicine closer to reality. However, current methods lack clinical accuracy and the ability to describe the context (haplotypes) in which genome variants co-occur in a cost-effective manner. Here we describe a low-cost DNA sequencing and haplotyping process, Long Fragment Read (LFR) technology, similar to sequencing long single DNA molecules without cloning or separation of metaphase chromosomes. In this study, ten LFR libraries were made using only ~100 pg of human DNA per sample. Up to 97% of the heterozygous single nucleotide variants (SNVs) were assembled into long haplotype contigs. Removal of false positive SNVs not phased by multiple LFR haplotypes resulted in a final genome error rate of 1 in 10 Mb. Cost-effective and accurate genome sequencing and haplotyping from 10-20 human cells, as demonstrated here, will enable comprehensive genetic studies and diverse clinical applications.
The human ecto-apyrase gene family consists of five reported members (CD39, CD39-L1, CD39-L2, CD39-L3, and CD39-L4). The family can be subdivided into two groups by conservation of proposed structural domains. The CD39, CD39-L1, and CD39-L3 genes all encode hydrophobic portions in their carboxy and amino termini, serving as transmembrane domains for CD39 and potentially for the other two members. CD39-L2 and CD39-L4 genes encode hydrophobic portions in their amino termini, suggesting that they might encode secreted apyrases. We demonstrate that the CD39-L4 gene encodes the first reported human secreted ecto-apyrase. COS-7 cells transfected with a CD39-L4 expression construct utilizing the naturally occurring leader peptide express recombinant protein outside of the cells. This expression can be blocked by brefeldin A, a chemical that inhibits a step in mammalian secretory pathways. We also demonstrate expression of CD39-L4 message in macrophages, suggesting that the protein is present in the circulation. Furthermore, we show that CD39-L4 is an E-type apyrase, is dependent on calcium and magnesium cations, and has high degree of specificity for NDPs over NTPs as enzymatic substrates. A potential physiological role in hemostasis and platelet aggregation is presented.
E-NTPDases are extracellular enzymes that hydrolyze nucleotides. The human E-NTPDase gene family currently consists of five reported members (CD39, CD39L1, CD39L2, CD39L3, and CD39L4). Both membrane-bound and secreted family members have been predicted by encoded transmembrane and leader peptide motifs. In this report, we demonstrate that the human CD39L2 gene is expressed predominantly in the heart. In situ hybridization results from heart indicate that the CD39L2 message is expressed in muscle and capillary endothelial cells. We also show that the CD39L2 gene encodes an extracellular E-NTPDase. Flow cytometric experiments show that transiently expressed CD39L2 is present on the surface of COS-7 cells. Transfected cells also produce recombinant glycosylated protein in the medium, and this process can be blocked by brefeldin A, an inhibitor of the mammalian secretory pathway. The enzymology of CD39L2 shows characteristic features of a typical E-NTPDase, but with a much higher degree of specificity for NDPs over NTPs as enzymatic substrates. The kinetics of the ADPase activity exhibit positive cooperativity. The predominance of CD39L2 expression in the heart supports a functional role in regulating platelet activation and recruitment in this organ.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.