The ancestors of Gossypium arboreum and Gossypium herbaceum provided the A subgenome for the modern cultivated allotetraploid cotton. Here, we upgraded the G. arboreum genome assembly by integrating different technologies. We resequenced 243 G. arboreum and G. herbaceum accessions to generate a map of genome variations and found that they are equally diverged from Gossypium raimondii. Independent analysis suggested that Chinese G. arboreum originated in South China and was subsequently introduced to the Yangtze and Yellow River regions. Most accessions with domestication-related traits experienced geographic isolation. Genome-wide association study (GWAS) identified 98 significant peak associations for 11 agronomically important traits in G. arboreum. A nonsynonymous substitution (cysteine-to-arginine substitution) of GaKASIII seems to confer substantial fatty acid composition (C16:0 and C16:1) changes in cotton seeds. Resistance to fusarium wilt disease is associated with activation of GaGSTF9 expression. Our work represents a major step toward understanding the evolution of the A genome of cotton.
Identification of stable quantitative trait loci (QTLs) across different environments and mapping populations is a prerequisite for marker-assisted selection (MAS) for cotton yield and fiber quality. To construct a genetic linkage map and to identify QTLs for fiber quality and yield traits, a backcross inbred line (BIL) population of 146 lines was developed from a cross between Upland cotton (Gossypium hirsutum) and Egyptian cotton (Gossypium barbadense) through two generations of backcrossing, using Upland cotton as the recurrent parent, followed by four generations of self-pollination. The BIL population, together with its two parents, was tested in five environments representing three major cotton production regions in China. The genetic map spanned a total genetic distance of 2,895 cM and contained 392 polymorphic SSR loci, with an average genetic distance of 7.4 cM per marker. A total of 67 QTLs, including 28 for fiber quality and 39 for yield and its components, were detected on 23 chromosomes, with each QTL explaining 6.65-25.27% of the phenotypic variation. Twenty-nine QTLs were located on the At subgenome, which originated from a cultivated diploid cotton, while 38 were on the Dt subgenome, which came from an ancestor that does not produce spinnable fibers. Of the eight common QTLs (12%) detected in more than two environments, two were for fiber quality traits (one for fiber strength and one for uniformity) and six were for yield and its components (three for lint yield, one for seed cotton yield, one for lint percentage, and one for boll weight). QTL clusters for the same traits or different traits were also identified. This research represents one of the first reports using a permanent advanced backcross inbred population of an interspecific hybrid population to identify QTLs for fiber quality and yield traits in cotton across diverse environments. It provides useful information for transferring desirable genes from G. barbadense to G. hirsutum using MAS.
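The summary figures quoted in the abstract above can be cross-checked with a few lines of arithmetic. The following is a minimal sketch, not the authors' analysis: the variable names are illustrative, and it assumes the reported average spacing is simply total map length divided by the number of loci (the abstract does not state how the average was computed).

```python
# Figures quoted in the abstract above; all variable names are illustrative.
map_length_cm = 2895   # total genetic distance of the linkage map (cM)
n_loci = 392           # polymorphic SSR loci on the map

qtl_total = 67
qtl_fiber_quality = 28
qtl_yield = 39
qtl_at_subgenome = 29
qtl_dt_subgenome = 38
qtl_common = 8         # QTLs the abstract calls "common" across environments

# Assumption: average spacing = total map length / number of loci.
avg_spacing_cm = map_length_cm / n_loci

# Share of all detected QTLs that were common across environments.
common_share_pct = 100 * qtl_common / qtl_total

print(f"average spacing: {avg_spacing_cm:.1f} cM per marker")
print(f"common QTLs: {common_share_pct:.0f}% of all QTLs")
print("trait split sums to total:", qtl_fiber_quality + qtl_yield == qtl_total)
print("subgenome split sums to total:", qtl_at_subgenome + qtl_dt_subgenome == qtl_total)
```

Under that assumption the computed values round to the 7.4 cM per marker and 12% given in the abstract, and both the trait split and the subgenome split sum to the 67 QTLs reported.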
Cotton is one of the most economically important fiber crop plants worldwide. The genus Gossypium contains a single allotetraploid group (AD) and eight diploid genome groups (A–G and K). However, the evolution of repeat sequences in the chloroplast genomes and the phylogenetic relationships of Gossypium species are unclear. Thus, we determined the variations in the repeat sequences and the evolutionary relationships of 40 cotton chloroplast genomes, which represented the most diverse sampling of the genus, including five newly sequenced diploid species, i.e., G. nandewarense (C1-n), G. armourianum (D2-1), G. lobatum (D7), G. trilobum (D8), and G. schwendimanii (D11), and an important semi-wild race of upland cotton, G. hirsutum race latifolium (AD1). The genome structure, gene order, and GC content of cotton species were similar to those of other higher plant plastid genomes. In total, 2860 long sequence repeats (>10 bp in length) were identified; the F-genome species had the largest number of repeats (G. longicalyx F1: 108) and the E-genome species had the smallest (G. stocksii E1: 53). Large-scale repeat sequences possibly enrich the genetic information and maintain genome stability in cotton species. We also identified 10 divergence hotspot regions, i.e., rpl33-rps18, psbZ-trnG (GCC), rps4-trnT (UGU), trnL (UAG)-rpl32, trnE (UUC)-trnT (GGU), atpE, ndhI, rps2, ycf1, and ndhF, which could be useful molecular genetic markers for future population genetics and phylogenetic studies. Site-specific selection analysis showed that some of the coding sites of 10 chloroplast genes (atpB, atpE, rps2, rps3, petB, petD, ccsA, cemA, ycf1, and rbcL) were under protein sequence evolution. Phylogenetic analysis based on the whole plastomes suggested that the Gossypium species grouped into six previously identified genetic clades. Interestingly, all 13 D-genome species clustered into a strong monophyletic clade. Unexpectedly, the cotton species with C, G, and K genomes were admixed and nested in a large clade, which could have been due to their recent radiation, incomplete lineage sorting, and introgressive hybridization among different cotton lineages. In conclusion, the results of this study provide new insights into the evolution of repeat sequences in chloroplast genomes and interspecific relationships in the genus Gossypium.