Alfalfa (Medicago sativa L.) is the most important legume forage crop worldwide with high nutritional value and yield. For a long time, the breeding of alfalfa was hampered by lacking reliable information on the autotetraploid genome and molecular markers linked to important agronomic traits. We herein reported the de novo assembly of the allele-aware chromosome-level genome of Zhongmu-4, a cultivar widely cultivated in China, and a comprehensive database of genomic variations based on resequencing of 220 germplasms. Approximate 2.74 Gb contigs (N50 of 2.06 Mb), accounting for 88.39% of the estimated genome, were assembled, and 2.56 Gb contigs were anchored to 32 pseudo-chromosomes. A total of 34,922 allelic genes were identified from the allele-aware genome. We observed the expansion of gene families, especially those related to the nitrogen metabolism, and the increase of repetitive elements including transposable elements, which probably resulted in the increase of Zhongmu-4 genome compared with Medicago truncatula. Population structure analysis revealed that the accessions from Asia and South America had relatively lower genetic diversity than those from Europe, suggesting that geography may influence alfalfa genetic divergence during local adaption. Genome-wide association studies identified 101 single nucleotide polymorphisms (SNPs) associated with 27 agronomic traits. Two candidate genes were predicted to be correlated with fall dormancy and salt response. We believe that the allele-aware chromosome-level genome sequence of Zhongmu-4 combined with the resequencing data of the diverse alfalfa germplasms will facilitate genetic research and genomics-assisted breeding in variety improvement of alfalfa.
Fall dormancy (FD) is an essential trait to overcome winter damage and for alfalfa (Medicago sativa) cultivar selection. The plant regrowth height (PRH) after autumn clipping is an indirect way to evaluate FD. Transcriptomics, proteomics, and QTL mapping have revealed crucial genes correlated with FD, however, these genes can’t predict alfalfa FD very well. Here, we conducted genomic prediction of FD using whole genome SNP markers based on machine learning-related methods support vector machines (SVM) regression and regularization-related methods, such as lasso and ridge regression. The results showed that using SVM regression with linear kernel and the top 3,000 GWAS-associated markers achieved the highest prediction accuracy for FD of 64.1%. For RPH, the prediction accuracy was 59.0% using the 3,000 GWAS-associated markers and the SVM linear model. It is better than that using whole-genome markers (25.0%). Therefore, the method we explored for alfalfa FD prediction outperformed the other models, such as lasso and ElastNet. The study suggests the feasibility of using machine learning to predict FD with GWAS-associated markers, and the GWAS-associated markers combined with machine learning would benefit FD-related traits as well. Application of the methodology may provide potential targets for FD selection, which would accelerate genetic research and molecular breeding of alfalfa with optimized FD.
Forage quality determined mainly by protein content and fiber composition has a crucial influence on digestibility and nutrition intake for animal feeding. To explore the genetic basis of quality traits, we conducted QTL mapping based on the phenotypic data of crude protein (CP), neutral detergent fiber (NDF), acid detergent fiber (ADF), and lignin of an F1 alfalfa population generated by crossing of two alfalfa parents with significant difference in quality. In total, 83 QTLs were identified with contribution to the phenotypic variation (PVE) ranging from 1.45 to 14.35%. Among them, 47 QTLs interacted significantly with environment and 12 QTLs were associated with more than one trait. Epistatic effect was also detected for 73 pairs of QTLs with PVE of 1.08–14.06%. The results suggested that the inheritance of quality-related traits was jointly affected by additive, epistasis and environment. In addition, 83.33% of the co-localized QTLs were shared by ADF and NDF with the same genetic direction, while the additive effect of crude protein-associated QTLs was opposite to that fiber composition on the same locus, suggesting that the loci may antagonistically contribute to protein content and fiber composition. Further analysis of a QTL related to all the three traits of fiber composition (qNDF1C, qADF1C-2, and qlignin1C-2) showed that five candidate genes were homologs of cellulose synthase-like protein A1 in Medicago truncatula, indicating the potential role in fiber synthesis. For the protein-associated loci we identified, qCP4C-1 was located in the shortest region (chr 4.3 39.3–39.4 Mb), and two of the seven corresponding genes in this region were predicted to be E3 ubiquitin-protein ligase in protein metabolism. Therefore, our results provide some reliable regions significantly associated with alfalfa quality, and identification of the key genes would facilitate marker-assisted selection for favorable alleles in breeding program of alfalfa quality improvement.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.