Metabolomics, the large-scale study of the metabolic complement of the cell [1][2][3] , is a mature science that has been practiced for over 20 years 4 . Indeed, it is now a commonly used experimental systems biology tool with demonstrated utility in both fundamental and applied aspects of plant, microbial and mammalian research [5][6][7][8][9][10][11][12][13][14][15] . Among the many thousands of studies published in this area over the last 20 years, notable highlights [5][6][7][8]10,11,16 are briefly described in Supplementary Note 1.Despite the insight afforded by such studies, the nature of metabolites, particularly their diversity (in both chemical structure and dynamic range of abundance 9,12 ), remains a major challenge with regard to the ability to provide adequate coverage of the metabolome that can complement that achieved for the genome, transcriptome and proteome. Despite these comparative limitations, enormous advances have been made with regard to the number of analytes about which accurate quantitative information can be acquired, and a vast number of studies have yielded important biological information and biologically active metabolites across the kingdoms of life 14 . We have previously estimated that upwards of 1 million different metabolites occur across the tree of life, with between 1,000 and 40,000 estimated to occur in a single species 4 .
Plants produce a variety of metabolites that have a critical role in growth and development. Here we present a comprehensive study of maize metabolism, combining genetic, metabolite and expression profiling methodologies to dissect the genetic basis of metabolic diversity in maize kernels. We quantify 983 metabolite features in 702 maize genotypes planted at multiple locations. We identify 1,459 significant locus–trait associations (P≤1.8 × 10−6) across three environments through metabolite-based genome-wide association mapping. Most (58.5%) of the identified loci are supported by expression QTLs, and some (14.7%) are validated through linkage mapping. Re-sequencing and candidate gene association analysis identifies potential causal variants for five candidate genes involved in metabolic traits. Two of these genes were further validated by mutant and transgenic analysis. Metabolite features associated with kernel weight could be used as biomarkers to facilitate genetic improvement of maize.
Association mapping is a powerful approach for dissecting the genetic architecture of complex quantitative traits using high-density SNP markers in maize. Here, we expanded our association panel size from 368 to 513 inbred lines with 0.5 million high quality SNPs using a two-step data-imputation method which combines identity by descent (IBD) based projection and k-nearest neighbor (KNN) algorithm. Genome-wide association studies (GWAS) were carried out for 17 agronomic traits with a panel of 513 inbred lines applying both mixed linear model (MLM) and a new method, the Anderson-Darling (A-D) test. Ten loci for five traits were identified using the MLM method at the Bonferroni-corrected threshold −log10 (P) >5.74 (α = 1). Many loci ranging from one to 34 loci (107 loci for plant height) were identified for 17 traits using the A-D test at the Bonferroni-corrected threshold −log10 (P) >7.05 (α = 0.05) using 556809 SNPs. Many known loci and new candidate loci were only observed by the A-D test, a few of which were also detected in independent linkage analysis. This study indicates that combining IBD based projection and KNN algorithm is an efficient imputation method for inferring large missing genotype segments. In addition, we showed that the A-D test is a useful complement for GWAS analysis of complex quantitative traits. Especially for traits with abnormal phenotype distribution, controlled by moderate effect loci or rare variations, the A-D test balances false positives and statistical power. The candidate SNPs and associated genes also provide a rich resource for maize genetics and breeding.
Deciphering the influence of genetics on primary metabolism in plants will provide insights useful for genetic improvement and enhance our fundamental understanding of plant growth and development. Although maize (Zea mays) is a major crop for food and feed worldwide, the genetic architecture of its primary metabolism is largely unknown. Here, we use high-density linkage mapping to dissect large-scale metabolic traits measured in three different tissues (leaf at seedling stage, leaf at reproductive stage, and kernel at 15 d after pollination [DAP]) of a maize recombinant inbred line population. We identify 297 quantitative trait loci (QTLs) with moderate (86.2% of the mapped QTL, R 2 = 2.4 to 15%) to major effects (13.8% of the mapped QTL, R 2 >15%) for 79 primary metabolites across three tissues. Pairwise epistatic interactions between these identified loci are detected for more than 25.9% metabolites explaining 6.6% of the phenotypic variance on average (ranging between 1.7 and 16.6%), which implies that epistasis may play an important role for some metabolites. Key candidate genes are highlighted and mapped to carbohydrate metabolism, the tricarboxylic acid cycle, and several important amino acid biosynthetic and catabolic pathways, with two of them being further validated using candidate gene association and expression profiling analysis. Our results reveal a metabolite-metabolite-agronomic trait network that, together with the genetic determinants of maize primary metabolism identified herein, promotes efficient utilization of metabolites in maize improvement.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.