Genome-wide association studies (GWAS) have become indispensable in human medicine and genomics, but very few have been carried out on bacteria. Here we introduce Scoary, an ultra-fast, easy-to-use, and widely applicable software tool that scores the components of the pan-genome for associations to observed phenotypic traits while accounting for population stratification, with minimal assumptions about evolutionary processes. We call our approach pan-GWAS to distinguish it from traditional, single nucleotide polymorphism (SNP)-based GWAS. Scoary is implemented in Python and is available under an open source GPLv3 license at https://github.com/AdmiralenOla/Scoary.Electronic supplementary materialThe online version of this article (doi:10.1186/s13059-016-1108-8) contains supplementary material, which is available to authorized users.
Elastic net type regression methods have become very popular for prediction of certain outcomes in epigenome-wide association studies (EWAS). The methods considered accept biased coefficient estimates in return for lower variance thus obtaining improved prediction accuracy. We provide guidelines on how to obtain parsimonious models with low mean squared error and include easy to follow walk-through examples for each step in R. Electronic supplementary material The online version of this article (10.1186/s13148-019-0730-1) contains supplementary material, which is available to authorized users.
Folate is vital for fetal development. Periconceptional folic acid supplementation and food fortification are recommended to prevent neural tube defects. Mechanisms whereby periconceptional folate influences normal development and disease are poorly understood: epigenetics may be involved. We examine the association between maternal plasma folate during pregnancy and epigenome-wide DNA methylation using Illumina's HumanMethyl450 Beadchip in 1,988 newborns from two European cohorts. Here we report the combined covariate-adjusted results using meta-analysis and employ pathway and gene expression analyses. Four-hundred forty-three CpGs (320 genes) are significantly associated with maternal plasma folate levels during pregnancy (false discovery rate 5%); 48 are significant after Bonferroni correction. Most genes are not known for folate biology, including APC2, GRM8, SLC16A12, OPCML, PRPH, LHX1, KLK4 and PRSS21. Some relate to birth defects other than neural tube defects, neurological functions or varied aspects of embryonic development. These findings may inform how maternal folate impacts the developing epigenome and health outcomes in offspring.
Repeated emergence, not international dissemination, is behind the rise of multidrug-resistant lineage 4 tuberculosis.
BackgroundWe explored the association between gestational age and cord blood DNA methylation at birth and whether DNA methylation could be effective in predicting gestational age due to limitations with the presently used methods. We used data from the Norwegian Mother and Child Birth Cohort study (MoBa) with Illumina HumanMethylation450 data measured for 1753 newborns in two batches: MoBa 1, n = 1068; and MoBa 2, n = 685. Gestational age was computed using both ultrasound and the last menstrual period. We evaluated associations between DNA methylation and gestational age and developed a statistical model for predicting gestational age using MoBa 1 for training and MoBa 2 for predictions. The prediction model was additionally used to compare ultrasound and last menstrual period-based gestational age predictions. Furthermore, both CpGs and associated genes detected in the training models were compared to those detected in a published prediction model for chronological age.ResultsThere were 5474 CpGs associated with ultrasound gestational age after adjustment for a set of covariates, including estimated cell type proportions, and Bonferroni-correction for multiple testing. Our model predicted ultrasound gestational age more accurately than it predicted last menstrual period gestational age.ConclusionsDNA methylation at birth appears to be a good predictor of gestational age. Ultrasound gestational age is more strongly associated with methylation than last menstrual period gestational age. The CpGs linked with our gestational age prediction model, and their associated genes, differed substantially from the corresponding CpGs and genes associated with a chronological age prediction model.Electronic supplementary materialThe online version of this article (doi:10.1186/s13059-016-1063-4) contains supplementary material, which is available to authorized users.
Pre-pregnancy maternal obesity is associated with adverse offspring outcomes at birth and later in life. Individual studies have shown that epigenetic modifications such as DNA methylation could contribute. Within the Pregnancy and Childhood Epigenetics (PACE) Consortium, we meta-analysed the association between pre-pregnancy maternal BMI and methylation at over 450,000 sites in newborn blood DNA, across 19 cohorts (9,340 mother-newborn pairs). We attempted to infer causality by comparing the effects of maternal versus paternal BMI and incorporating genetic variation. In four additional cohorts (1,817 mother-child pairs), we meta-analysed the association between maternal BMI at the start of pregnancy and blood methylation in adolescents. In newborns, maternal BMI was associated with small (<0.2% per BMI unit (1 kg/m2), P < 1.06 × 10−7) methylation variation at 9,044 sites throughout the genome. Adjustment for estimated cell proportions greatly attenuated the number of significant CpGs to 104, including 86 sites common to the unadjusted model. At 72/86 sites, the direction of the association was the same in newborns and adolescents, suggesting persistence of signals. However, we found evidence for a6causal intrauterine effect of maternal BMI on newborn methylation at just 8/86 sites. In conclusion, this well-powered analysis identified robust associations between maternal adiposity and variations in newborn blood DNA methylation, but these small effects may be better explained by genetic or lifestyle factors than a causal intrauterine mechanism. This highlights the need for large-scale collaborative approaches and the application of causal inference techniques in epigenetic epidemiology.
RNA viruses are abundant infectious agents and present in all domains of life. Arthropods, including ticks, are well known as vectors of many viruses of concern for human and animal health. Despite their obvious importance, the extent and structure of viral diversity in ticks is still poorly understood, particularly in Europe. Using a bulk RNA-sequencing approach that captures the complete transcriptome, we analysed the virome of the most common tick in Europe – Ixodes ricinus. In total, RNA sequencing was performed on six libraries consisting of 33 I. ricinus nymphs and adults sampled in Norway. Despite the small number of animals surveyed, our virus identification pipeline revealed nine diverse and novel viral species, phylogenetically positioned within four different viral groups – bunyaviruses, luteoviruses, mononegavirales and partitiviruses – and sometimes characterized by extensive genetic diversity including a potentially novel genus of bunyaviruses. This work sheds new light on the virus diversity in I. ricinus, expands our knowledge of potential host/vector-associations and tick-transmitted viruses within several viral groups, and pushes the latitudinal limit where it is likely to find tick-associated viruses. Notably, our phylogenetic analysis revealed the presence of tick-specific virus clades that span multiple continents, highlighting the role of ticks as important virus reservoirs.
BackgroundThe core genome consists of genes shared by the vast majority of a species and is therefore assumed to have been subjected to substantially stronger purifying selection than the more mobile elements of the genome, also known as the accessory genome. Here we examine intragenic base composition differences in core genomes and corresponding accessory genomes in 36 species, represented by the genomes of 731 bacterial strains, to assess the impact of selective forces on base composition in microbes. We also explore, in turn, how these results compare with findings for whole genome intragenic regions.ResultsWe found that GC content in coding regions is significantly higher in core genomes than accessory genomes and whole genomes. Likewise, GC content variation within coding regions was significantly lower in core genomes than in accessory genomes and whole genomes. Relative entropy in coding regions, measured as the difference between observed and expected trinucleotide frequencies estimated from mononucleotide frequencies, was significantly higher in the core genomes than in accessory and whole genomes. Relative entropy was positively associated with coding region GC content within the accessory genomes, but not within the corresponding coding regions of core or whole genomes.ConclusionThe higher intragenic GC content and relative entropy, as well as the lower GC content variation, observed in the core genomes is most likely associated with selective constraints. It is unclear whether the positive association between GC content and relative entropy in the more mobile accessory genomes constitutes signatures of selection or selective neutral processes.Electronic supplementary materialThe online version of this article (doi:10.1186/s12864-017-3543-7) contains supplementary material, which is available to authorized users.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
334 Leonard St
Brooklyn, NY 11211
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.