The 1000 Genomes Project set out to provide a comprehensive description of common human genetic variation by applying whole-genome sequencing to a diverse set of individuals from multiple populations. Here we report completion of the project, having reconstructed the genomes of 2,504 individuals from 26 populations using a combination of low-coverage whole-genome sequencing, deep exome sequencing, and dense microarray genotyping. We characterized a broad spectrum of genetic variation, in total over 88 million variants (84.7 million single nucleotide polymorphisms (SNPs), 3.6 million short insertions/deletions (indels), and 60,000 structural variants), all phased onto high-quality haplotypes. This resource includes >99% of SNP variants with a frequency of >1% for a variety of ancestries. We describe the distribution of genetic variation across the global sample, and discuss the implications for common disease studies.
Obesity is a heritable trait and a risk factor for many common diseases such as type 2 diabetes, heart disease, and hypertension. We used a dense whole-genome scan of DNA samples from the Framingham Heart Study participants to identify a common genetic variant near the INSIG2 gene associated with obesity. We have replicated the finding in four separate samples composed of individuals of Western European ancestry, African Americans, and children. The obesity-predisposing genotype is present in 10% of individuals. Our study suggests that common genetic polymorphisms are important determinants of obesity.
Lung cancer is the leading cause of death from cancer in the US and the world. The high mortality rate (80-85% within 5 years) results, in part, from a lack of effective tools to diagnose the disease at an early stage. Given that cigarette smoke creates a field of injury throughout the airway, we sought to determine if gene expression in histologically normal large-airway epithelial cells obtained at bronchoscopy from smokers with suspicion of lung cancer could be used as a lung cancer biomarker. Using a training set (n = 77) and gene-expression profiles from Affymetrix HG-U133A microarrays, we identified an 80-gene biomarker that distinguishes smokers with and without lung cancer. We tested the biomarker on an independent test set (n = 52), with an accuracy of 83% (80% sensitive, 84% specific), and on an additional validation set independently obtained from five medical centers (n = 35). Our biomarker had approximately 90% sensitivity for stage 1 cancer across all subjects. Combining cytopathology of lower airway cells obtained at bronchoscopy with the biomarker yielded 95% sensitivity and a 95% negative predictive value. These findings indicate that gene expression in cytologically normal large-airway epithelial cells can serve as a lung cancer biomarker, potentially owing to a cancer-specific airway-wide response to cigarette smoke.
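The accuracy, sensitivity, specificity, and negative predictive value quoted above all derive from the four cells of a confusion matrix. A minimal sketch of that arithmetic follows. The function name and the counts are hypothetical: the counts were chosen only to be consistent with the rounded test-set percentages (n = 52) and are not taken from the study.

```python
def diagnostic_metrics(tp, fp, tn, fn):
    """Compute standard diagnostic metrics from confusion-matrix counts.

    tp/fn: subjects with cancer called positive/negative by the biomarker.
    tn/fp: subjects without cancer called negative/positive by the biomarker.
    """
    total = tp + fp + tn + fn
    return {
        "sensitivity": tp / (tp + fn),   # fraction of cancers detected
        "specificity": tn / (tn + fp),   # fraction of non-cancers cleared
        "accuracy": (tp + tn) / total,   # fraction of all calls that are correct
        "npv": tn / (tn + fn),           # chance a negative call is truly negative
    }


# Hypothetical counts (assumption, for illustration only; they sum to 52).
metrics = diagnostic_metrics(tp=16, fp=5, tn=27, fn=4)
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

Unlike sensitivity and specificity, the negative predictive value depends on how common cancer is in the tested group, so a value from one cohort may not transfer to a cohort with a different prevalence.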
The evidence for the existence of genetic susceptibility variants for the common form of hypertension (“essential hypertension”) remains weak and inconsistent. We sought genetic variants underlying blood pressure (BP) by conducting a genome-wide association study (GWAS) among African Americans, a population group in the United States that is disproportionately affected by hypertension and associated complications, including stroke and kidney diseases. Using a dense panel of over 800,000 SNPs in a discovery sample of 1,017 African Americans from the Washington, D.C., metropolitan region, we identified multiple SNPs reaching genome-wide significance for systolic BP in or near the genes PMS1, SLC24A4, YWHA7, IPO7, and CACNA1H. Two of these genes, SLC24A4 (a sodium/potassium/calcium exchanger) and CACNA1H (a voltage-dependent calcium channel), are potential candidate genes for BP regulation, and the latter is a drug target for a class of calcium channel blockers. No variant reached genome-wide significance for association with diastolic BP (top-scoring SNP rs1867226, p = 5.8 × 10⁻⁷) or with hypertension as a binary trait (top-scoring SNP rs9791170, p = 5.1 × 10⁻⁷). We replicated some of the significant SNPs in a sample of West Africans. Pathway analysis revealed that genes harboring top-scoring variants cluster in pathways and networks of biologic relevance to hypertension and BP regulation. This is the first GWAS for hypertension and BP in an African American population. The findings suggest that, in addition to or in lieu of relying solely on replicated variants of moderate-to-large effect reaching genome-wide significance, pathway and network approaches may be useful in identifying and prioritizing candidate genes/loci for further experiments.
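The abstract does not say which significance threshold the authors applied. As a rough sketch, assuming a Bonferroni correction at a family-wise alpha of 0.05 over approximately 800,000 tested SNPs (both values are assumptions, as is the helper name), the two reported top p-values for diastolic BP and hypertension would fall short of genome-wide significance, in line with the text:

```python
def bonferroni_threshold(alpha, n_tests):
    """Per-test p-value cutoff that controls the family-wise error rate at alpha."""
    if n_tests <= 0:
        raise ValueError("n_tests must be positive")
    return alpha / n_tests


# Assumptions: alpha = 0.05 and roughly 800,000 independent tests.
N_SNPS = 800_000
threshold = bonferroni_threshold(0.05, N_SNPS)

# Top-scoring p-values as reported in the abstract.
top_hits = {
    "rs1867226 (diastolic BP)": 5.8e-7,
    "rs9791170 (hypertension)": 5.1e-7,
}

# True only where the p-value is below the assumed cutoff.
significant = {snp: p < threshold for snp, p in top_hits.items()}

print(f"threshold = {threshold:.3g}")
for snp, is_sig in significant.items():
    print(f"{snp}: genome-wide significant = {is_sig}")
```

Bonferroni treats every SNP as an independent test, which is conservative when markers are correlated through linkage disequilibrium. This is one reason a fixed conventional cutoff is often used instead.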
Anatomically separate fat depots differ in size, function, and contribution to pathological states, such as the metabolic syndrome. We isolated preadipocytes from different human fat depots to determine whether the basis for this variation is partly attributable to differences in inherent properties of fat cell progenitors. We found that genome-wide expression profiles of primary preadipocytes cultured in parallel from abdominal subcutaneous, mesenteric, and omental fat depots were distinct. Interestingly, visceral fat was not homogeneous. Preadipocytes from one of the two main visceral depots, mesenteric fat, had an expression profile closer to that of subcutaneous than omental preadipocytes, the other main visceral depot. Expression of genes that regulate early development, including homeotic genes, differed extensively among undifferentiated preadipocytes isolated from different fat depots. These profiles were confirmed by real-time PCR analysis of preadipocytes from additional lean and obese male and female subjects. We made preadipocyte strains from single abdominal subcutaneous and omental preadipocytes by expressing telomerase. Depot-specific developmental gene expression profiles persisted for 40 population doublings in these strains. Thus, human fat cell progenitors from different regions are effectively distinct, consistent with different fat depots being separate mini-organs.
Background: Renal cell carcinoma is a common malignancy that often presents as a metastatic disease for which there are no effective treatments. To gain insights into the mechanism of renal cell carcinogenesis, a number of genome-wide expression profiling studies have been performed. Surprisingly, there is very poor agreement among these studies as to which genes are differentially regulated. To better understand this lack of agreement, we profiled renal cell tumor gene expression using genome-wide microarrays (45,000 probe sets) and compared our analysis to previous microarray studies.
Normal‐appearing epithelium of cancer patients can harbor occult genetic abnormalities. Data comprehensively comparing gene expression between histologically normal breast epithelium of breast cancer patients and cancer‐free controls are limited. The present study compares global gene expression between these groups. We performed microarrays using RNA from microdissected histologically normal terminal ductal‐lobular units (TDLU) from 2 groups: (i) cancer normal (CN) (TDLUs adjacent to untreated ER+ breast cancers (n = 14)) and (ii) reduction mammoplasty (RM) (TDLUs of age‐matched women without breast disease (n = 15)). Cyber‐T identified differentially expressed genes. Quantitative RT‐PCR (qRT‐PCR), immunohistochemistry (IHC), and comparison to independent microarray data including 6 carcinomas in situ (CIS), validated the results. Gene ontology (GO), UniProt and published literature evaluated gene function. About 127 probesets, corresponding to 105 genes, were differentially expressed between CN and RM (p < 0.0009, corresponding to FDR <0.10). 104/127 (82%) probesets were also differentially expressed between CIS and RM, nearly always (102/104 (98%)) in the same direction as in CN vs. RM. Two‐thirds of the 105 genes were implicated previously in carcinogenesis. Overrepresented functional groups included transcription, G‐protein coupled and chemokine receptor activity, the MAPK cascade and immediate early genes. Most genes in these categories were under‐expressed in CN vs. RM. We conclude that global gene expression abnormalities exist in normal epithelium of breast cancer patients and are also present in early cancers. Thus, cancer‐related pathways may be perturbed in normal epithelium. These abnormalities could be markers of disease risk, occult disease, or the tissue's response to an existing tumor. © 2007 Wiley‐Liss, Inc.