The complete nucleotide sequence of two human T-cell leukaemia type III (HTLV-III) proviral DNAs each have four long open reading frames, the first two corresponding to the gag and pol genes. The fourth open reading frame encodes two functional polypeptides, a large precursor of the major envelope glycoprotein and a smaller protein derived from the 3'-terminus long open reading frame analogous to the long open reading frame (lor) product of HTLV-I and -II.
Admixture mapping (also known as "mapping by admixture linkage disequilibrium," or MALD) provides a way of localizing genes that cause disease, in admixed ethnic groups such as African Americans, with approximately 100 times fewer markers than are required for whole-genome haplotype scans. However, it has not been possible to perform powerful scans with admixture mapping because the method requires a dense map of validated markers known to have large frequency differences between Europeans and Africans. To create such a map, we screened through databases containing approximately 450000 single-nucleotide polymorphisms (SNPs) for which frequencies had been estimated in African and European population samples. We experimentally confirmed the frequencies of the most promising SNPs in a multiethnic panel of unrelated samples and identified 3011 as a MALD map (1.2 cM average spacing). We estimate that this map is approximately 70% informative in differentiating African versus European origins of chromosomal segments. This map provides a practical and powerful tool, which is freely available without restriction, for screening for disease genes in African American patient cohorts. The map is especially appropriate for those diseases that differ in incidence between the parental African and European populations.
We performed a three-phase genome-wide association study (GWAS) using cases and controls from a genetically isolated population, Ashkenazi Jews (AJ), to identify loci associated with breast cancer risk. In the first phase, we compared allele frequencies of 150,080 SNPs in 249 high-risk, BRCA1/2 mutation-negative AJ familial cases and 299 cancer-free AJ controls using 2 and the Cochran-Armitage trend tests. In the second phase, we genotyped 343 SNPs from 123 regions most significantly associated from stage 1, including 4 SNPs from the FGFR2 region, in 950 consecutive AJ breast cancer cases and 979 age-matched AJ controls. We replicated major associations in a third independent set of 243 AJ cases and 187 controls. We obtained a significant allele P value of association with AJ breast cancer in the FGFR2 region (P ؍ 1.5 ؋ 10 ؊5 , odds ratio (OR) 1.26, 95% confidence interval (CI) 1.13-1.40 at rs1078806 for all phases combined). In addition, we found a risk locus in a region of chromosome 6q22.33 (P ؍ 2.9 ؋ 10 ؊8 , OR 1.41, 95% CI 1.25-1.59 at rs2180341). Using several SNPs at each implicated locus, we were able to verify associations and impute haplotypes. The major haplotype at the 6q22.33 locus conferred protection from disease, whereas the minor haplotype conferred risk. Candidate genes in the 6q22.33 region include ECHDC1, which encodes a protein involved in mitochondrial fatty acid oxidation, and also RNF146, which encodes a ubiquitin protein ligase, both known pathways in breast cancer pathogenesis.genomics ͉ mapping ͉ disease ͉ predisposition ͉ SNP C ohort and twin studies have indicated that 5-15% of incident breast cancer cases result from autosomal-dominant cancer susceptibility (1-5). However, only Ϸ40% of the familial aggregation of breast cancers can be explained by mutations in BRCA1, BRCA2, or other identified cancer susceptibility genes (6). Attempts to use linkage strategies to localize other genes associated with an inherited predisposition to cancer have been hampered by genetic heterogeneity, decreased penetrance, and chance clustering (7-12). Candidate gene studies in multiplex kindreds affected by breast cancer have implicated rare variants of CHEK2, ATM, BRIP1, and PALB2 in the subset of families lacking BRCA mutations, but in most cases, the rarity and small effect sizes of these associations have precluded clinical application (13). Association studies of biologically plausible candidate genes have identified low-penetrance susceptibility alleles in pathways of carcinogen metabolism, inflammation and immune response, DNA metabolism and DNA repair as well as other known oncogenes and tumor suppressor genes (14-17). Most recently, two groups have carried out genome-wide association studies (GWAS) of selected kindreds and unselected individuals affected by breast cancer (18,19). These studies have implicated a locus near FGFR2 as associated with an Ϸ1.2-fold increased risk of the disease. To add to the potential power of the GWAS approach, we have proposed and validated the use of a genet...
BackgroundAs we enter an era when testing millions of SNPs in a single gene association study will become the standard, consideration of multiple comparisons is an essential part of determining statistical significance. Bonferroni adjustments can be made but are conservative due to the preponderance of linkage disequilibrium (LD) between genetic markers, and permutation testing is not always a viable option. Three major classes of corrections have been proposed to correct the dependent nature of genetic data in Bonferroni adjustments: permutation testing and related alternatives, principal components analysis (PCA), and analysis of blocks of LD across the genome. We consider seven implementations of these commonly used methods using data from 1514 European American participants genotyped for 700,078 SNPs in a GWAS for AIDS.ResultsA Bonferroni correction using the number of LD blocks found by the three algorithms implemented by Haploview resulted in an insufficiently conservative threshold, corresponding to a genome-wide significance level of α = 0.15 - 0.20. We observed a moderate increase in power when using PRESTO, SLIDE, and simpleℳ when compared with traditional Bonferroni methods for population data genotyped on the Affymetrix 6.0 platform in European Americans (α = 0.05 thresholds between 1 × 10-7 and 7 × 10-8).ConclusionsCorrecting for the number of LD blocks resulted in an anti-conservative Bonferroni adjustment. SLIDE and simpleℳ are particularly useful when using a statistical test not handled in optimized permutation testing packages, and genome-wide corrected p-values using SLIDE, are much easier to interpret for consumers of GWAS studies.
Polyadenylated RNAs of certain human tumour cell lines are shown to contain transcripts related to the cell-derived transforming onc genes of molecularly cloned primate, murine or avian transforming retrovirus genomes. Thus, analogues of retroviral transforming genes are both present and frequently expressed in human neoplastic cells.
Cellular ets sequences homologous to v-ets of the avian leukemia virus E26 are highly conserved. In mammals the ets sequences are dispersed on two separate chromosomal loci, called ets4 and ets-2. To determine the structure of these two genes and identify the open reading frames that code for the putative proteins, we have sequenced human ets-) cDNAs and ets-2 cDNA clones obtained from both human and mouse. The human ETS) gene is capable of encoding a protein of 441 amino acids. This protein is >95% identical to the chicken c-ets-1 gene product. Thus, the human ETS) gene is homologous to the chicken c-ets-1 gene, the protooncogene that the E26 virus transduced. Human and mouse ets-2 cDNA clones are closely related and contain open reading frames capable of encoding proteins of 469 and 468 residues, respectively. Direct comparison of these data with previously published findings indicates that ets is a family of genes whose members share distinct domains.
Nasopharyngeal carcinoma (NPC) is an epithelial malignancy facilitated by Epstein-Barr Virus infection. Here we resolve the major genetic influences for NPC incidence using a genome-wide association study (GWAS), independent cohort replication, and high-resolution molecular HLA class I gene typing including 4,055 study participants from the Guangxi Zhuang Autonomous Region and Guangdong province of southern China. We detect and replicate strong association signals involving SNPs, HLA alleles, and amino acid (aa) variants across the major histocompatibility complex-HLA-A, HLA –B, and HLA -C class I genes (PHLA-A-aa-site-62 = 7.4×10−29; P HLA-B-aa-site-116 = 6.5×10−19; P HLA-C-aa-site-156 = 6.8×10−8 respectively). Over 250 NPC-HLA associated variants within HLA were analyzed in concert to resolve separate and largely independent HLA-A, -B, and -C gene influences. Multivariate logistical regression analysis collapsed significant associations in adjacent genes spanning 500 kb (OR2H1, GABBR1, HLA-F, and HCG9) as proxies for peptide binding motifs carried by HLA- A*11:01. A similar analysis resolved an independent association signal driven by HLA-B*13:01, B*38:02, and B*55:02 alleles together. NPC resistance alleles carrying the strongly associated amino acid variants implicate specific class I peptide recognition motifs in HLA-A and -B peptide binding groove as conferring strong genetic influence on the development of NPC in China.
Population linkage disequilibrium occurs as a consequence of mutation, selection, genetic drift, and population substructure produced by admixture of genetically distinct ethnic populations. African American and Hispanic ethnic groups have a history of significant gene flow among parent groups, which can be of value in affecting genome scans for disease-gene discovery in the case-control and transmission/disequilibrium test designs. Disease-gene discovery using mapping by admixture linkage disequilibrium (MALD) requires a map of polymorphic markers that differentiate between the founding populations, along with differences in disease-gene allele frequencies. We describe markers appropriate for MALD mapping by assessing allele frequencies of 744 short tandem repeats (STRs) in African Americans, Hispanics, European Americans, and Asians, by choosing STR markers that have large differences in composite delta, log-likelihood ratios, and/or I*(2) for MALD. Additional markers can be added to this MALD map by utilization of the rapidly growing single-nucleotide-polymorphism databases and the literature, to achieve a 3-10-cM scanning scale. The map will be useful for studies of diseases, including prostate and breast cancer, diabetes, hypertension, and end-stage renal disease, that have large differences in incidence between the founding populations of either Hispanics or African Americans.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
334 Leonard St
Brooklyn, NY 11211
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.