The 1000 Genomes Project set out to provide a comprehensive description of common human genetic variation by applying whole-genome sequencing to a diverse set of individuals from multiple populations. Here we report completion of the project, having reconstructed the genomes of 2,504 individuals from 26 populations using a combination of low-coverage whole-genome sequencing, deep exome sequencing, and dense microarray genotyping. We characterized a broad spectrum of genetic variation, in total over 88 million variants (84.7 million single nucleotide polymorphisms (SNPs), 3.6 million short insertions/deletions (indels), and 60,000 structural variants), all phased onto high-quality haplotypes. This resource includes >99% of SNP variants with a frequency of >1% for a variety of ancestries. We describe the distribution of genetic variation across the global sample, and discuss the implications for common disease studies.
Rice is a staple crop that has undergone substantial phenotypic and physiological changes during domestication. Here we resequenced the genomes of 40 cultivated accessions selected from the major groups of rice and 10 accessions of their wild progenitors (Oryza rufipogon and Oryza nivara) to >15 × raw data coverage. We investigated genome-wide variation patterns in rice and obtained 6.5 million high-quality single nucleotide polymorphisms (SNPs) after excluding sites with missing data in any accession. Using these population SNP data, we identified thousands of genes with significantly lower diversity in cultivated but not wild rice, which represent candidate regions selected during domestication. Some of these variants are associated with important biological features, whereas others have yet to be functionally characterized. The molecular markers we have identified should be valuable for breeding and for identifying agronomically important genes in rice.
Transitional cell carcinoma (TCC) is the most common type of bladder cancer. Here we sequenced the exomes of nine individuals with TCC and screened all the somatically mutated genes in a prevalence set of 88 additional individuals with TCC with different tumor stages and grades. In our study, we discovered a variety of genes previously unknown to be mutated in TCC. Notably, we identified genetic aberrations of the chromatin remodeling genes (UTX, MLL-MLL3, CREBBP-EP300, NCOR1, ARID1A and CHD6) in 59% of our 97 subjects with TCC. Of these genes, we showed UTX to be altered substantially more frequently in tumors of low stages and grades, highlighting its potential role in the classification and diagnosis of bladder cancer. Our results provide an overview of the genetic basis of TCC and suggest that aberration of chromatin regulation might be a hallmark of bladder cancer.
Tumor heterogeneity presents a challenge for inferring clonal evolution and driver gene identification. Here, we describe a method for analyzing the cancer genome at a single-cell nucleotide level. To perform our analyses, we first devised and validated a high-throughput whole-genome single-cell sequencing method using two lymphoblastoid cell line single cells. We then carried out whole-exome single-cell sequencing of 90 cells from a JAK2-negative myeloproliferative neoplasm patient. The sequencing data from 58 cells passed our quality control criteria, and these data indicated that this neoplasm represented a monoclonal evolution. We further identified essential thrombocythemia (ET)-related candidate mutations such as SESN2 and NTRK1, which may be involved in neoplasm progression. This pilot study allowed the initial characterization of the disease-related genetic architecture at the single-cell nucleotide level. Further, we established a single-cell sequencing method that opens the way for detailed analyses of a variety of tumor types, including those with high genetic complex between patients.
l e t t e r sHow an insect evolves to become a successful herbivore is of profound biological and practical importance. Herbivores are often adapted to feed on a specific group of evolutionarily and biochemically related host plants 1 , but the genetic and molecular bases for adaptation to plant defense compounds remain poorly understood 2 . We report the first whole-genome sequence of a basal lepidopteran species, Plutella xylostella, which contains 18,071 protein-coding and 1,412 unique genes with an expansion of gene families associated with perception and the detoxification of plant defense compounds. A recent expansion of retrotransposons near detoxification-related genes and a wider system used in the metabolism of plant defense compounds are shown to also be involved in the development of insecticide resistance. This work shows the genetic and molecular bases for the evolutionary success of this worldwide herbivore and offers wider insights into insect adaptation to plant feeding, as well as opening avenues for more sustainable pest management.The global pest P. xylostella (Lepidoptera: Yponomeutidae) is thought to have coevolved with the crucifer plant family 3 ( Supplementary Fig. 1) and has become the most destructive pest of economically important food crops, including rapeseed, cauliflower and cabbage 4 . Recently, the total cost of damage and management worldwide was estimated at $4-5 billion per annum 5,6 . This insect is the first species to have evolved resistance to dichlorodiphenyltrichloroethane (DDT) in the 1950s 7 and to Bacillus thuringiensis (Bt) toxins in the 1990s 8 and has developed resistance to all classes of insecticide, making it increasingly difficult to control 9,10 . P. xylostella provides an exceptional system for understanding the genetic and molecular bases of how insect herbivores cope with the broad range of plant defenses and chemicals encountered in the environment (Supplementary Fig. 2).We used a P. xylostella strain (Fuzhou-S) collected from a field in Fuzhou in southeastern China (26.08 °N, 119.28 °E) for sequencing ( Supplementary Fig. 1). Whole-genome shotgun-based Illumina sequencing of single individuals (Supplementary Table 1), even after ten generations of laboratory inbreeding, resulted in a poor initial assembly (N50 = 2.4 kb, representing the size above which 50% of the total length of the sequences is included), owing to high levels of heterozygosity ( Supplementary Figs. 3 and 4 and Supplementary Table 2). Subsequently, we sequenced 100,800 fosmid clones (comprising ~10× the genome length) to a depth of 200× ( Supplementary Fig. 5 and Supplementary Tables 3-5), assembling the resulting sequence data into 1,819 scaffolds, with an N50 of 737 kb, spanning ~394 Mb of the genome sequence (version 1; Supplementary Fig. 6 and Supplementary Table 6). The assembly covered 85.5% of a set of protein-coding ESTs (Supplementary Tables 7 and 8) generated by transcriptome sequencing 11 . Alignment of a subject scaffold against a 126-kb BAC (GenBank GU058050) from an altern...
A novel coronavirus, the severe acute respiratory syndrome (SARS) coronavirus (SARS-CoV), was identified as the causative agent of SARS. The profile of specific antibodies to individual proteins of the virus is critical to the development of vaccine and diagnostic tools. In this study, 13 recombinant proteins associated with four structural proteins (S, E, M and N) and five putative uncharacterized proteins (3a, 3b, 6, 7a and 9b) of the SARS-CoV were prepared and used for screening and monitoring their specific IgG antibodies in SARS patient sera by protein microarray. Antibodies to proteins S, 3a, N and 9b were detected in the sera from convalescent-phase SARS patients, whereas those to proteins E, M, 3b, 6 and 7a were undetected. In the detectable specific antibodies, anti-S and anti-N were dominant and could persist in the sera of SARS patients until week 30. Among the rabbit antisera to recombinant proteins S3, N, 3a and 9b, only anti-S3 serum showed significant neutralizing activity to the SARS-CoV infection in Vero E6 cells. The results suggest (1) that anti-S and anti-N antibodies are diagnostic markers and in particular that S3 is immunogenic and therefore is a good candidate as a subunit vaccine antigen; and (2) that, from a virus structure viewpoint, the presence in some human sera of antibodies reacting with two recombinant polypeptides, 3a and 9b, supports the hypothesis that they are synthesized during the virus cycle.
BackgroundCancers arise through an evolutionary process in which cell populations are subjected to selection; however, to date, the process of bladder cancer, which is one of the most common cancers in the world, remains unknown at a single-cell level.ResultsWe carried out single-cell exome sequencing of 66 individual tumor cells from a muscle-invasive bladder transitional cell carcinoma (TCC). Analyses of the somatic mutant allele frequency spectrum and clonal structure revealed that the tumor cells were derived from a single ancestral cell, but that subsequent evolution occurred, leading to two distinct tumor cell subpopulations. By analyzing recurrently mutant genes in an additional cohort of 99 TCC tumors, we identified genes that might play roles in the maintenance of the ancestral clone and in the muscle-invasive capability of subclones of this bladder cancer, respectively.ConclusionsThis work provides a new approach of investigating the genetic details of bladder tumoral changes at the single-cell level and a new method for assessing bladder cancer evolution at a cell-population level.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.