Allotetraploid cotton is an economically important natural-fiber-producing crop worldwide. After polyploidization, Gossypium hirsutum L. evolved to produce a higher fiber yield and to better survive harsh environments than Gossypium barbadense, which produces superior-quality fibers. The global genetic and molecular bases for these interspecies divergences were unknown. Here we report high-quality de novo-assembled genomes for these two cultivated allotetraploid species with pronounced improvement in repetitive-DNA-enriched centromeric regions. Whole-genome comparative analyses revealed that speciesspecific alterations in gene expression, structural variations and expanded gene families were responsible for speciation and the evolutionary history of these species. These findings help to elucidate the evolution of cotton genomes and their domestication history. The information generated not only should enable breeders to improve fiber quality and resilience to ever-changing environmental conditions but also can be translated to other crops for better understanding of their domestication history and use in improvement.
To increase the numbers of microsatellites available for use in constructing a genetic map, and facilitate the use of functional genomics to elucidate fiber development and breeding in cotton, we sampled microsatellite sequences from expressed sequence tags (ESTs) transcribed during fiber elongation in the A-genome species Gossypium arboreum to evaluate their frequency of occurrence, level of polymorphism and distribution in the At and Dt subgenomes of tetraploid cotton. From among ESTs derived from G. arboreum fibers at 7-10 days post anthesis (dpa), 931 ESTs were found to contain simple sequence repeats (SSRs); 544 (58.4%) EST-SSR primer pairs were developed, and 468 (86%) amplified PCR products from allotetraploid cotton (G. hirsutumcv. TM-1 and G. barbadense cv. Hai7124). However, only 99 (18.2%) of these were found to be polymorphic and segregating in our interspecific BC1 mapping population [(TM-1xHai7124)xTM-1]. In these amplified and informative EST-SSRs, hexa- and tri-nucleotide repeat motifs were the most frequent, representing 40.1 and 30%, respectively, of the total. A total of 111 loci detected with these 99 EST-SSRs were integrated into our backbone map including 511 SSR loci. The distribution of the EST-SSRs appeared to be non-random, since 72 loci were anchored to the At and 37 to the Dt subgenome of allotetraploid cotton based on linkage tests. Interestingly, out of the 10 pairs of duplicate loci amplified, seven were mapped to the corresponding homologous linkage groups and/or chromosomes. BLASTX analysis revealed that 69 of the 99 ESTs showed significant similarities to known genes. Some genes important for fiber development, such as sucrose synthase, were mapped to corresponding chromosomes. These EST-SSRs provide structural and functional genomic information that will be useful for understanding cotton fiber development.
We characterize GoSP genes underlying the development of cotton plants with short branches and clustered bolls, a phenotype that allows higher planting density and promotes increased fiber yield per acre.
Interspecific genomic variation can provide a genetic basis for local adaptation and domestication. A series of studies have presented its role of interspecific haplotypes and introgressions in adaptive traits, but few studies have addressed their role in improving agronomic character. Two allotetraploid Gossypium species, Gossypium barbadense (Gb) and G. hirsutum (Gh) originating from the Americas, are cultivated independently. Here, through sequencing and the comparison of one GWAS panel in 229 Gb accessions and two GWAS panels in 491 Gh accessions, we found that most associated loci or functional haplotypes for agronomic traits were highly divergent, representing the strong divergent improvement between Gb and Gh. Using a comprehensive interspecific haplotype map, we revealed that six interspecific introgressions from Gh to Gb were significantly associated with the phenotypic performance of Gb, which could explain 5%-40% of phenotypic variation in yield and fibre qualities. In addition, three introgressions overlapped with six associated loci in Gb, indicating that these introgression regions were under further selection and stabilized during improvement. A single interspecific introgression often possessed yield-increasing potential but decreased fibre qualities, or the opposite, making it difficult to simultaneously improve yield and fibre qualities. Our study not only has proved the importance of interspecific functional haplotypes or introgressions in the divergent improvement of Gb and Gh, but also supports their potential value in further humanmediated hybridization or precision breeding.
Fiber quality is an important economic index and a major breeding goal in cotton, but direct phenotypic selection is often hindered due to environmental influences and linkage with yield traits. A genome-wide association study (GWAS) is a powerful tool to identify genes associated with phenotypic traits. In this study, we identified fiber quality genes in upland cotton (Gossypium hirsutum L.) using GWAS based on a high-density CottonSNP80K array and multiple environment tests. A total of 30 and 23 significant single nucleotide polymorphisms (SNPs) associated with five fiber quality traits were identified across the 408 cotton accessions in six environments and the best linear unbiased predictions, respectively. Among these SNPs, seven loci were the same, and 128 candidate genes were predicted in a 1-Mb region (±500 kb of the peak SNP). Furthermore, two major genome regions (GR1 and GR2) associated with multiple fiber qualities in multiple environments on chromosomes A07 and A13 were identified, and within them, 22 candidate genes were annotated. Of these, 11 genes were expressed [log2(1 + FPKM)>1] in the fiber development stages (5, 10, 20, and 25 dpa) using RNA-Seq. This study provides fundamental insight relevant to identification of genes associated with fiber quality and will accelerate future efforts toward improving fiber quality of upland cotton.
Summary Xinjiang has been the largest and highest yield cotton production region not only in China, but also in the world. Improvements in Upland cotton cultivars in Xinjiang have occurred via pedigree selection and/or crossing of elite alleles from the former Soviet Union and other cotton producing regions of China. But it is unclear how genomic constitutions from foundation parents have been selected and inherited. Here, we deep‐sequenced seven historic foundation parents, comprising four cultivars introduced from the former Soviet Union (108Ф, C1470, 611Б and KK1543) and three from United States and Africa (DPL15, STV2B and UGDM), and re‐sequenced sixty‐nine Xinjiang modern cultivars. Phylogenetic analysis of more than 2 million high‐quality single nucleotide polymorphisms allowed their classification two groups, suggesting that Xinjiang Upland cotton cultivars were not only spawned from 108Ф, C1470, 611Б and KK1543, but also had a close kinship with DPL15, STV2B and UGDM. Notably, identity‐by‐descent (IBD) tracking demonstrated that the former Soviet Union cultivars have made a huge contribution to modern cultivar improvement in Xinjiang. A total of 156 selective sweeps were identified. Among them, apoptosis‐antagonizing transcription factor gene (GhAATF1) and mitochondrial transcription termination factor family protein gene (GhmTERF1) were highly involved in the determination of lint percentage. Additionally, the auxin response factor gene (GhARF3) located in inherited IBD segments from 108Ф and 611Б was highly correlated with fibre quality. These results provide an insight into the genomics of artificial selection for improving cotton production and facilitate next‐generation precision breeding of cotton and other crops.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
334 Leonard St
Brooklyn, NY 11211
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.