Summary: While many disease-associated variants have been identified through genome-wide association studies, their downstream molecular consequences remain unclear. To identify these effects, we performed cis- and trans-expression quantitative trait locus (eQTL) analysis in blood from 31,684 individuals through the eQTLGen Consortium. We observed that cis-eQTLs can be detected for 88% of the studied genes, but that they have a different genetic architecture compared to disease-associated variants, limiting our ability to use cis-eQTLs to pinpoint causal genes within susceptibility loci. In contrast, trans-eQTLs (detected for 37% of 10,317 studied trait-associated variants) were more informative. Multiple unlinked variants associated with the same complex trait often converged on trans-genes that are known to play central roles in disease etiology. We observed the same when ascertaining the effect of polygenic scores calculated for 1,263 genome-wide association study (GWAS) traits. Expression levels of 13% of the studied genes correlated with polygenic scores, and many resulting genes are known to drive these traits.
Genome-wide association studies (GWAS) have identified thousands of variants associated with complex traits, but their biological interpretation often remains unclear. Most of these variants overlap with expression QTLs, indicating their potential involvement in regulation of gene expression. Here, we propose a transcriptome-wide summary statistics-based Mendelian Randomization approach (TWMR) that uses multiple SNPs as instruments and multiple gene expression traits as exposures, simultaneously. Applied to 43 human phenotypes, it uncovers 3,913 putatively causal gene–trait associations, 36% of which have no genome-wide significant SNP nearby in previous GWAS. Using independent association summary statistics, we find that the majority of these loci were missed by GWAS due to power issues. Noteworthy among these links is educational attainment-associated BSCL2, known to carry mutations leading to a Mendelian form of encephalopathy. We also find pleiotropic causal effects suggestive of mechanistic connections. TWMR better accounts for pleiotropy and has the potential to identify biological mechanisms underlying complex traits.
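The idea of using several SNPs as instruments and several gene expression traits as exposures at once can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes the instruments are mutually independent (no linkage disequilibrium correction), ignores standard errors, and simply regresses SNP-trait effects on SNP-expression effects. The function names `solve` and `multivariable_mr` and all numbers are hypothetical.

```python
def solve(a, b):
    """Solve the linear system a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    # Augmented matrix [a | b]; copy rows so the inputs are not modified.
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < 1e-12:
            raise ValueError("singular system: exposures are collinear")
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    # Back substitution.
    x = [0.0] * n
    for i in reversed(range(n)):
        s = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - s) / m[i][i]
    return x


def multivariable_mr(E, G):
    """Estimate causal effects of k exposures on a trait from summary statistics.

    E: list of n rows, each holding the effects of one SNP on the k gene
       expression traits (eQTL effect sizes).
    G: list of n effects of the same SNPs on the trait (GWAS effect sizes).
    Assumption: SNPs are independent, so the estimate reduces to
    (E^T E)^{-1} E^T G, an ordinary least-squares fit across SNPs.
    """
    k = len(E[0])
    EtE = [[sum(row[i] * row[j] for row in E) for j in range(k)] for i in range(k)]
    EtG = [sum(row[i] * g for row, g in zip(E, G)) for i in range(k)]
    return solve(EtE, EtG)


# Made-up example: 4 SNPs, 2 genes, trait effects generated without noise
# from causal effects 0.2 (gene 1) and -0.1 (gene 2).
E = [[0.5, 0.0], [0.0, 0.4], [0.3, 0.2], [0.1, -0.3]]
G = [0.5 * 0.2, 0.4 * -0.1, 0.3 * 0.2 + 0.2 * -0.1, 0.1 * 0.2 + -0.3 * -0.1]
alpha = multivariable_mr(E, G)
```

Fitting all exposures jointly is what lets the estimate for one gene be adjusted for the SNPs' effects on the other genes; a real analysis would additionally account for correlation between instruments and for estimation uncertainty.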
Age at first sexual intercourse (AFS) and age at first birth (AFB) have implications for health and evolutionary fitness. In this genome-wide association study (AFS, N=387,338; AFB, N=542,901), we identify 371 SNPs, 11 sex-specific, with a 5-6% polygenic score (PGS) prediction. Heritability of AFB shifted from 9% [CI=4-14] for women born in 1940 to 22% [CI=19-25] in 1965. Signals are driven by the genetics of reproductive biology and externalising behaviour, with key genes related to follicle stimulating hormone (FSHB), implantation (ESR1), infertility, and spermatid differentiation. Our findings suggest that Polycystic Ovarian Syndrome may lead to later AFB, linking with infertility. Late AFB is associated with parental longevity, and reduced incidence of Type 2 Diabetes (T2D) and Cardiovascular disease (CAD). Individuals from higher childhood socioeconomic circumstances and those in the highest PGS decile (90%+) experience markedly later reproductive onset. Results are relevant for improving teenage and late-life health, for understanding longevity, and guiding experimentation into mechanisms of infertility.
Root-knot nematodes secrete proteinaceous effectors into plant tissues to facilitate infection by suppressing host defences and reprogramming the host metabolism to their benefit. Meloidogyne graminicola is a major pest of rice (Oryza sativa) in Asia and Latin America, causing important crop losses. The goal of this study was to identify M. graminicola pathogenicity genes expressed during the plant-nematode interaction. Using the dual RNA-sequencing (RNA-seq) strategy, we generated transcriptomic data of M. graminicola samples covering the pre-parasitic J2 stage and five parasitic stages in rice plants, from the parasitic J2 to the adult female. In the absence of a reference genome, a de novo M. graminicola transcriptome of 66 396 contigs was obtained from those reads that were not mapped on the rice genome. Gene expression profiling across the M. graminicola life cycle revealed key genes involved in nematode development and provided insights into the genes putatively associated with parasitism. The development of a 'secreted protein prediction' pipeline revealed a typical set of proteins secreted by nematodes, as well as a large number of cysteine-rich proteins and putative nuclear proteins. Combined with expression data, this pipeline enabled the identification of 15 putative effector genes, including two homologues of well-characterized effectors from cyst nematodes (CLE-like and VAP1) and a metallothionein. The localization of gene expression was assessed by in situ hybridization for a subset of candidates. All of these data represent important molecular resources for the elucidation of M. graminicola biology and for the selection of potential targets for the development of novel control strategies for this nematode species.