Prostate cancer (PrCa) is the most frequently diagnosed male cancer in developed countries. To identify common PrCa susceptibility alleles, we have previously conducted a genome-wide association study in which 541, 129 SNPs were genotyped in 1,854 PrCa cases with clinically detected disease and 1,894 controls. We have now evaluated promising associations in a second stage, in which we genotyped 43,671 SNPs in 3,650 PrCa cases and 3,940 controls, and a third stage, involving an additional 16,229 cases and 14,821 controls from 21 studies. In addition to previously identified loci, we identified a further seven new prostate cancer susceptibility loci on chromosomes 2, 4, 8, 11, and 22 (P=1.6×10 −8 to P=2.7×10 −33 ).Genome-wide association studies (GWAS) provide a powerful approach to identify common disease alleles. We previously conducted a GWAS 1 , based on genotyping of 541, 129 SNPs in 1,854 clinically detected PrCa cases and 1,894 controls (see Figure 1, stage 1). Follow-up genotyping of SNPs exhibiting strong evidence of association (P<10 −6 ), in a further 3,268 cases and 3,366 controls, allowed us to identify SNPs at 7 susceptibility loci associated with the disease at genome-wide levels of significance 1 . Other studies have identified an additional 8 loci [2][3][4][5][6][7][8][9] . These loci, however, explain only a small fraction of the familial risk of PrCa. Moreover, the strength of the associations that have been detected are generally small (perallele odds ratios, OR, 1.1-1.2), and the power of the existing studies to detect many of the susceptibility alleles has been limited. It is highly likely, therefore, that other PrCa predisposition loci exist, and that such loci should be detectable by studies with larger sample sizes.In an attempt to identify further susceptibility loci, we conducted a more extensive follow-up of SNPs showing evidence of association in stage 1 of our GWAS. We designed a panel of 47,120 SNPs, aiming to include all SNPs with a significant association in stage 1 at P-trend (1df)<.05 or P(2df)<.01 (see Online Methods). These SNPs were genotyped using the Illumina iSELECT platform in 3,894 PrCa cases and 4,055 controls from the United Kingdom (UK) and Australia ( Figure 1, stage 2). After quality control (QC) exclusions (as described in Online Methods), we utilised data from 43,671 SNPs in 3,650 PrCa cases and 3,940 controls. NIH-PA Author ManuscriptNIH-PA Author Manuscript NIH-PA Author ManuscriptGenotype frequencies in cases and controls were compared using a 1 degree of freedom (df) Cochran-Armitage trend test (for QQ plots see Supplementary Figure 1). There was little evidence of inflation in the test statistics in the UK samples (estimated inflation factor λ=1.08), but there was more marked inflation in those from Australia (λ=1.23; λ=1.19 for stage 2 overall), suggestive of some population substructure. The Australian samples were selected from three studies (MCCS, RFPCS and EOPCS; see Supplementary Note for cohort descriptions), and further analysis revealed that ...
A large proportion of functional sequence within mammalian genomes falls outside protein-coding exons and can be transcribed into long RNAs. However, the roles in mammalian biology of long noncoding RNA (lncRNA) are not well understood. Few lncRNAs have experimentally determined roles, with some of these being lineage-specific. Determining the extent by which transcription of lncRNA loci is retained or lost across multiple evolutionary lineages is essential if we are to understand their contribution to mammalian biology and to lineage-specific traits. Here, we experimentally investigated the conservation of lncRNA expression among closely related rodent species, allowing the evolution of DNA sequence to be uncoupled from evolution of transcript expression. We generated total RNA (RNAseq) and H3K4me3-bound (ChIPseq) DNA data, and combined both to construct catalogues of transcripts expressed in the adult liver of Mus musculus domesticus (C57BL/6J), Mus musculus castaneus, and Rattus norvegicus. We estimated the rate of transcriptional turnover of lncRNAs and investigated the effects of their lineage-specific birth or death. LncRNA transcription showed considerably greater gain and loss during rodent evolution, compared with protein-coding genes. Nucleotide substitution rates were found to mirror the in vivo transcriptional conservation of intergenic lncRNAs between rodents: only the sequences of noncoding loci with conserved transcription were constrained. Finally, we found that lineage-specific intergenic lncRNAs appear to be associated with modestly elevated expression of genomically neighbouring protein-coding genes. Our findings show that nearly half of intergenic lncRNA loci have been gained or lost since the last common ancestor of mouse and rat, and they predict that such rapid transcriptional turnover contributes to the evolution of tissue- and lineage-specific gene expression.
Gene expression levels are thought to diverge primarily via regulatory mutations in trans within species, and in cis between species. To test this hypothesis in mammals we used RNA-sequencing to measure gene expression divergence between C57BL/6J and CAST/EiJ mouse strains and allele-specific expression in their F1 progeny. We identified 535 genes with parent-of-origin specific expression patterns, although few of these showed full allelic silencing. This suggests that the number of imprinted genes in a typical mouse somatic tissue is relatively small. In the set of nonimprinted genes, 32% showed evidence of divergent expression between the two strains. Of these, 2% could be attributed purely to variants acting in trans, while 43% were attributable only to variants acting in cis. The genes with expression divergence driven by changes in trans showed significantly higher sequence constraint than genes where the divergence was explained by variants acting in cis. The remaining genes with divergent patterns of expression (55%) were regulated by a combination of variants acting in cis and variants acting in trans. Intriguingly, the changes in expression induced by the cis and trans variants were in opposite directions more frequently than expected by chance, implying that compensatory regulation to stabilize gene expression levels is widespread. We propose that expression levels of genes regulated by this mechanism are fine-tuned by cis variants that arise following regulatory changes in trans, suggesting that many cis variants are not the primary targets of natural selection.
SummaryTo mechanistically characterize the microevolutionary processes active in altering transcription factor (TF) binding among closely related mammals, we compared the genome-wide binding of three tissue-specific TFs that control liver gene expression in six rodents. Despite an overall fast turnover of TF binding locations between species, we identified thousands of TF regions of highly constrained TF binding intensity. Although individual mutations in bound sequence motifs can influence TF binding, most binding differences occur in the absence of nearby sequence variations. Instead, combinatorial binding was found to be significant for genetic and evolutionary stability; cobound TFs tend to disappear in concert and were sensitive to genetic knockout of partner TFs. The large, qualitative differences in genomic regions bound between closely related mammals, when contrasted with the smaller, quantitative TF binding differences among Drosophila species, illustrate how genome structure and population genetics together shape regulatory evolution.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.