Regulatory SNPs (rSNPs) are a special class of SNPs which have a high potential to affect the phenotype due to their impact on DNA-binding of transcription factors (TFs). Thus, the knowledge about such rSNPs and TFs could provide essential information regarding different genetic programs, such as tissue development or environmental stress responses. In this study, we use a multi-omics approach by combining genomics, transcriptomics, and proteomics data of two different Brassica napus L. cultivars, namely Zhongshuang11 (ZS11) and Zhongyou821 (ZY821), with high and low oil content, respectively, to monitor the regulatory interplay between rSNPs, TFs and their corresponding genes in the tissues flower, leaf, stem, and root. By predicting the effect of rSNPs on TF-binding and by measuring their association with the cultivars, we identified a total of 41,117 rSNPs, of which 1141 are significantly associated with oil content. We revealed several enriched members of the TF families DOF, MYB, NAC, or TCP, which are important for directing transcriptional programs regulating differential expression of genes within the tissues. In this work, we provide the first genome-wide collection of rSNPs for B. napus and their impact on the regulation of gene expression in vegetative and floral tissues, which will be highly valuable for future studies on rSNPs and gene regulation.
Genome wide association studies (GWAS) are a well established methodology to identify genomic variants and genes that are responsible for traits of interest in all branches of the life sciences. Despite the long time this methodology has had to mature the reliable detection of genotype–phenotype associations is still a challenge for many quantitative traits mainly because of the large number of genomic loci with weak individual effects on the trait under investigation. Thus, it can be hypothesized that many genomic variants that have a small, however real, effect remain unnoticed in many GWAS approaches. Here, we propose a two-step procedure to address this problem. In a first step, cubic splines are fitted to the test statistic values and genomic regions with spline-peaks that are higher than expected by chance are considered as quantitative trait loci (QTL). Then the SNPs in these QTLs are prioritized with respect to the strength of their association with the phenotype using a Random Forests approach. As a case study, we apply our procedure to real data sets and find trustworthy numbers of, partially novel, genomic variants and genes involved in various egg quality traits.
Transcription factors (TFs) and their complex interplay are essential for directing specific genetic programs, such as responses to environmental stresses, tissue development, or cell differentiation by regulating gene expression. Knowledge regarding TF–TF cooperations could be promising in gaining insight into the developmental switches between the cultivars of Brassica napus L., namely Zhongshuang11 (ZS11), a double-low accession with high-oil- content, and Zhongyou821 (ZY821), a double-high accession with low-oil-content. In this regard, we analysed a time series RNA-seq data set of seed tissue from both of the cultivars by mainly focusing on the monotonically expressed genes (MEGs). The consideration of the MEGs enables the capturing of multi-stage progression processes that are orchestrated by the cooperative TFs and, thus, facilitates the understanding of the molecular mechanisms determining seed oil content. Our findings show that TF families, such as NAC, MYB, DOF, GATA, and HD-ZIP are highly involved in the seed developmental process. Particularly, their preferential partner choices as well as changes in their gene expression profiles seem to be strongly associated with the differentiation of the oil content between the two cultivars. These findings are essential in enhancing our understanding of the genetic programs in both cultivars and developing novel hypotheses for further experimental studies.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.