The 1000 Genomes Project set out to provide a comprehensive description of common human genetic variation by applying whole-genome sequencing to a diverse set of individuals from multiple populations. Here we report completion of the project, having reconstructed the genomes of 2,504 individuals from 26 populations using a combination of low-coverage whole-genome sequencing, deep exome sequencing, and dense microarray genotyping. We characterized a broad spectrum of genetic variation, in total over 88 million variants (84.7 million single nucleotide polymorphisms (SNPs), 3.6 million short insertions/deletions (indels), and 60,000 structural variants), all phased onto high-quality haplotypes. This resource includes >99% of SNP variants with a frequency of >1% for a variety of ancestries. We describe the distribution of genetic variation across the global sample, and discuss the implications for common disease studies.
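As a quick consistency check on the counts quoted in this abstract, the three variant classes can be tallied directly. This is only an arithmetic sketch using the rounded figures given above; the dictionary keys and variable names are illustrative, not taken from the project's data files.

```python
# Rounded variant counts as quoted in the abstract (not exact project totals).
variant_counts = {
    "snp": 84_700_000,
    "indel": 3_600_000,
    "structural_variant": 60_000,
}

total = sum(variant_counts.values())

# Fraction of each class in the total, expressed as a percentage.
shares = {name: 100.0 * count / total for name, count in variant_counts.items()}

print(f"total variants: {total:,}")
for name, share in shares.items():
    print(f"{name}: {share:.2f}%")
```

The sum of the rounded figures is consistent with the "over 88 million variants" stated in the abstract, and SNPs make up the large majority of the total.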
To understand the impact of gut microbes on human health and well-being it is crucial to assess their genetic potential. Here we describe the Illumina-based metagenomic sequencing, assembly and characterization of 3.3 million non-redundant microbial genes, derived from 576.7 gigabases of sequence, from faecal samples of 124 European individuals. The gene set, approximately 150 times larger than the human gene complement, contains an overwhelming majority of the prevalent (more frequent) microbial genes of the cohort and probably includes a large proportion of the prevalent human intestinal microbial genes. The genes are largely shared among individuals of the cohort. Over 99% of the genes are bacterial, indicating that the entire cohort harbours between 1,000 and 1,150 prevalent bacterial species and each individual at least 160 such species, which are also largely shared. We define and describe the minimal gut metagenome and the minimal gut bacterial genome in terms of functions present in all individuals and most bacteria, respectively.
Next-generation massively parallel DNA sequencing technologies provide ultrahigh throughput at a substantially lower unit data cost; however, the data are very short read length sequences, making de novo assembly extremely challenging. Here, we describe a novel method for de novo assembly of large genomes from short read sequences. We successfully assembled both the Asian and African human genome sequences, achieving an N50 contig size of 7.4 and 5.9 kilobases (kb) and scaffold of 446.3 and 61.9 kb, respectively. The development of this de novo short read assembly method creates new opportunities for building reference sequences and carrying out accurate analyses of unexplored genomes in a cost-effective way.
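The N50 statistic quoted above is commonly defined as the length L such that contigs (or scaffolds) of length at least L together contain at least half of the total assembled bases. The sketch below follows that common definition; the function name `n50` and the toy lengths are assumptions for illustration, and the abstract does not state which exact variant of the definition the authors used.

```python
def n50(lengths):
    """Return the N50 of a collection of contig or scaffold lengths.

    Uses the common definition: the length of the shortest sequence in the
    smallest set of longest sequences that covers at least half of the
    total assembled bases.
    """
    if not lengths:
        raise ValueError("lengths must not be empty")
    ordered = sorted(lengths, reverse=True)  # longest first
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length


# Toy example (made-up lengths, not data from the paper).
toy_contigs = [2, 2, 2, 3, 3, 4, 8, 8]
print(n50(toy_contigs))
```

In the toy example the total is 32 bases, and the two longest contigs (8 + 8) already reach half of that, so the N50 is 8.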
Cucumber is an economically important crop as well as a model system for sex determination studies and plant vascular biology. Here we report the draft genome sequence of Cucumis sativus var. sativus L., assembled using a novel combination of traditional Sanger and next-generation Illumina GA sequencing technologies to obtain 72.2-fold genome coverage. The absence of recent whole-genome duplication, along with the presence of few tandem duplications, explains the small number of genes in the cucumber. Our study establishes that five of the cucumber's seven chromosomes arose from fusions of ten ancestral chromosomes after divergence from Cucumis melo. The sequenced cucumber genome affords insight into traits such as its sex expression, disease resistance, biosynthesis of cucurbitacin and 'fresh green' odor. We also identify 686 gene clusters related to phloem function. The cucumber genome provides a valuable resource for developing elite cultivars and for studying the evolution and function of the plant vascular system.
Using next-generation sequencing technology alone, we have successfully generated and assembled a draft sequence of the giant panda genome. The assembled contigs (2.25 gigabases (Gb)) cover approximately 94% of the whole genome, and the remaining gaps (0.05 Gb) seem to contain carnivore-specific repeats and tandem repeats. Comparisons with the dog and human showed that the panda genome has a lower divergence rate. The assessment of panda genes potentially underlying some of its unique traits indicated that its bamboo diet might be more dependent on its gut microbiome than its own genetic composition. We also identified more than 2.7 million heterozygous single nucleotide polymorphisms in the diploid genome. Our data and analyses provide a foundation for promoting mammalian genetic research, and demonstrate the feasibility of using next-generation sequencing technologies for accurate, cost-effective and rapid de novo assembly of large eukaryotic genomes.
Here we present the first diploid genome sequence of an Asian individual. The genome was sequenced to 36-fold average coverage using massively parallel sequencing technology. We aligned the short reads onto the NCBI human reference genome to 99.97% coverage, and guided by the reference genome, we used uniquely mapped reads to assemble a high-quality consensus sequence for 92% of the Asian individual's genome. We identified approximately 3 million single-nucleotide polymorphisms (SNPs) inside this region, of which 13.6% were not in the dbSNP database. Genotyping analysis showed that SNP identification had high accuracy and consistency, indicating the high sequence quality of this assembly. We also carried out heterozygote phasing and haplotype prediction against HapMap CHB and JPT haplotypes (Chinese and Japanese, respectively), sequence comparison with the two available individual genomes (J. D. Watson and J. C. Venter), and structural variation identification. These variations were considered for their potential biological impact. Our sequence data and analyses demonstrate the potential usefulness of next-generation sequencing technologies for personal genomics.
Recent studies have underscored questions about the balance of risk and benefit of RBC transfusion. A better understanding of the nature and timing of molecular and functional changes in stored RBCs may provide strategies to improve the balance of benefit and risk of RBC transfusion. We analyzed changes occurring during RBC storage focusing on RBC deformability, RBC-dependent vasoregulatory function, and S-nitrosohemoglobin (SNO-Hb), through which hemoglobin (Hb) O2 desaturation is coupled to regional increases in blood flow in vivo (hypoxic vasodilation). Five hundred ml of blood from each of 15 healthy volunteers was processed into leukofiltered, additive solution 3-exposed RBCs and stored at 1-6°C according to AABB standards. Blood was subjected to 26 assays at 0, 3, 8, 24 and 96 h, and at 1, 2, 3, 4, and 6 weeks. RBC SNO-Hb decreased rapidly [1.2 × 10⁻⁴ at 3 h vs. 6.5 × 10⁻⁴ (fresh) mol S-nitrosothiol (SNO)/mol Hb tetramer; P = 0.032, mercuric-displaced photolysis-chemiluminescence assay] and remained low over the 42-day period. The decline was corroborated by using the carbon monoxide-saturated copper-cysteine assay [3.0 × 10⁻⁵ at 3 h vs. 9.0 × 10⁻⁵ (fresh) mol SNO/mol Hb]. In parallel, vasodilation by stored RBCs was significantly depressed. RBC deformability assayed at a physiological shear stress decreased gradually over the 42-day period (P < 0.001). Time courses vary for several storage-induced defects that might account for recent observations linking blood transfusion with adverse outcomes. Of clinical concern is that SNO levels, and their physiological correlate, RBC-dependent vasodilation, become depressed soon after collection, suggesting that even "fresh" blood may have developed adverse biological characteristics. Keywords: adenosine triphosphate, hemoglobin, nitric oxide, S-nitrosothiols, transfusion
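To put the two SNO-Hb measurements in this abstract on a common scale, the drop from fresh blood to the 3 h time point can be expressed as a percentage decline. This is only an arithmetic sketch using the values quoted above (6.5 × 10⁻⁴ to 1.2 × 10⁻⁴ and 9.0 × 10⁻⁵ to 3.0 × 10⁻⁵ mol SNO/mol Hb); the helper `percent_decline` and the assay labels are illustrative names, not part of the study.

```python
def percent_decline(fresh, stored):
    """Percentage drop from the fresh value to the stored value."""
    if fresh <= 0:
        raise ValueError("fresh value must be positive")
    return (fresh - stored) / fresh * 100.0


# (fresh, 3 h) values in mol SNO per mol Hb, as quoted in the abstract.
assays = {
    "photolysis-chemiluminescence": (6.5e-4, 1.2e-4),
    "copper-cysteine": (9.0e-5, 3.0e-5),
}

declines = {name: percent_decline(f, s) for name, (f, s) in assays.items()}
for name, decline in declines.items():
    print(f"{name}: {decline:.1f}% decline at 3 h")
```

The two assays report different absolute levels, but on the quoted figures both show a drop of roughly two-thirds or more within 3 h of collection.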
We report the sequencing at 131× coverage, de novo assembly and analyses of the genome of a female Tibetan wild boar. We also resequenced the whole genomes of 30 Tibetan wild boars from six major distributed locations and 18 geographically related pigs in China. We characterized genetic diversity, population structure and patterns of evolution. We searched for genomic regions under selection, which include genes that are involved in hypoxia, olfaction, energy metabolism and drug response. Comparing the genome of Tibetan wild boar with those of neighboring Chinese domestic pigs further showed the impact of thousands of years of artificial selection and different signatures of selection in wild boar and domestic pig. We also report genetic adaptations in Tibetan wild boar that are associated with high altitudes and characterize the genetic basis of increased salivation in domestic pig.