Upland cotton (Gossypium hirsutum) is the most important natural fiber crop in the world. The overall genetic diversity among cultivated species of cotton and the genetic changes that occurred during their improvement are poorly understood. Here we report a comprehensive genomic assessment of modern improved upland cotton based on the genome-wide resequencing of 318 landraces and modern improved cultivars or lines. We detected more associated loci for lint yield than for fiber quality, which suggests that lint yield has stronger selection signatures than other traits. We found that two ethylene-pathway-related genes were associated with increased lint yield in improved cultivars. We evaluated the population frequency of each elite allele in historically released cultivar groups and found that 54.8% of the elite genome-wide association study (GWAS) alleles detected were transferred from three founder landraces: Deltapine 15, Stoneville 2B and Uganda Mian. Our results provide a genomic basis for improving cotton cultivars and for further evolutionary analysis of polyploid crops.
The mapping of functional genes plays an important role in studies of genome structure, function, and evolution, as well as allowing gene cloning and marker-assisted selection to improve agriculturally important traits. Simple sequence repeats (SSRs) developed from expressed sequence tags (ESTs), EST-SSR (eSSR), can be employed as putative functional marker loci to easily tag corresponding functional genes. In this paper, 2218 eSSRs, 1554 from G. raimondii-derived and 754 from G. hirsutum-derived ESTs, were developed and used to screen polymorphisms to enhance our backbone genetic map in allotetraploid cotton. Of the 1554 G. raimondii-derived eSSRs, 744 eSSRs were able to successfully amplify polymorphisms between our two mapping parents, TM-1 and Hai7124, presenting a polymorphic rate of 47.9%. However, only a 23.9% (159/754) polymorphic rate was produced from G. hirsutum-derived eSSRs. No relationship was observed between the level of polymorphism, motif type, and tissue origin, but the polymorphism appeared to be correlated with repeat type. After integrating these new eSSRs, our enhanced genetic map consists of 1790 loci in 26 linkage groups and covers 3425.8 cM with an average intermarker distance of 1.91 cM. This microsatellite-based, gene-rich linkage map contains 71.96% functional marker loci, of which 87.11% are eSSR loci. There were 132 duplicated loci bridging 13 homeologous At/Dt chromosome pairs. Two reciprocal translocations after polyploidization between A2 and A3, and between A4 and A5, chromosomes were further confirmed. A functional analysis of 975 ESTs producing 1122 eSSR loci tagged in the map revealed that 60% had clear BLASTX hits (,1e À10 ) to the Uniprot database and that 475 were associated mainly with genes belonging to the three major gene ontology categories of biological process, cellular component, and molecular function; many of the ESTs were associated with two or more category functions. The results presented here will provide new insights for future investigations of functional and evolutionary genomics, especially those associated with cotton fiber improvement.
Despite rapidly decreasing costs and innovative technologies, sequencing of angiosperm genomes is not yet undertaken lightly. Generating larger amounts of sequence data more quickly does not address the difficulties of sequencing and assembling complex genomes de novo. The cotton (Gossypium spp.) genomes represent a challenging case. To this end, a coalition of cotton genome scientists has developed a strategy for sequencing the cotton genomes, which will vastly expand opportunities for cotton research and improvement worldwide.
In order to construct a saturated genetic map and facilitate marker-assisted selection (MAS) breeding, it is necessary to enhance the current reservoir of known molecular markers in Gossypium. Microsatellites or simple sequence repeats (SSRs) occur in expressed sequence tags (EST) in plants. Many ESTs are publicly available now and represent a good tool in developing EST-SSRs. From 13,505 ESTs developed from our two cotton fiber/ovule cDNA libraries constructed for Upland cotton, 966 (7.15%) contained one or more SSRs and from them, 489 EST-SSR primer pairs were developed. Among the EST-SSRs, 59.1% are trinucleotides, followed by dinucleotides (30%), tetranucleotides (6.4%), pentanucleotides (1.8%), and hexanucleotides (2.7%). AT/TA (18.4%) is the most frequent repeat, followed by CTT/GAA (5.3%), AG/TC (5.1%), AGA/TCT (4.9%), AGT/TCA (4.5%), and AAG/TTC (4.5%). One hundred and thirty EST-SSR loci were produced from 114 informative EST-SSR primer pairs, which generated polymorphism between our two mapping parents. Of these, 123 were integrated on our allotetraploid cotton genetic map, based on the cross [(TM-1xHai7124)TM-1]. EST-SSR markers were distributed over 20 chromosomes and 6 linkage groups in the map. These EST-SSR markers can be used in genetic mapping, identification of quantitative trait loci (QTLs), and comparative genomics studies of cotton.
The cytochrome P450 (CYP) superfamily is the largest enzymatic protein family in plants, and it also widely exists in mammals, fungi, bacteria, insects and so on. Members of this superfamily are involved in multiple metabolic pathways with distinct and complex functions, playing important roles in a vast array of reactions. As a result, numerous secondary metabolites are synthesized that function as growth and developmental signals or protect plants from various biotic and abiotic stresses. Here, we summarize the characterization of CYPs, as well as their phylogenetic classification. We also focus on recent advances in elucidating the roles of CYPs in mediating plant growth and development as well as biotic and abiotic stresses responses, providing insights into their potential utilization in plant breeding.
SummaryNatural antisense transcripts (NATs) are commonly observed in eukaryotic genomes, but only a limited number of such genes have been identified as being involved in gene regulation in plants. In this research, we investigated the function of small RNA derived from a NAT in fiber cell development.Using a map-based cloning strategy for the first time in tetraploid cotton, we cloned a naked seed mutant gene (N 1 ) encoding a MYBMIXTA-like transcription factor 3 (MML3)/GhMYB25-like in chromosome A12, GhMML3_A12, that is associated with fuzz fiber development.The extremely low expression of GhMML3_A12 in N 1 is associated with NAT production, driven by its 3 0 antisense promoter, as indicated by the promoter-driven histochemical staining assay. In addition, small RNA deep sequencing analysis suggested that the bidirectional transcriptions of GhMML3_A12 form double-stranded RNAs and generate 21-22 nt small RNAs. Therefore, in a fiber-specific manner, small RNA derived from the GhMML3_A12 locus can mediate GhMML3_A12 mRNA self-cleavage and result in the production of naked seeds followed by lint fiber inhibition in N 1 plants. The present research reports the first observation of gene-mediated NATs and siRNA directly controlling fiber development in cotton.
Background: High-throughput genotyping platforms play important roles in plant genomic studies. Cotton (Gossypium spp.) is the world's important natural textile fiber and oil crop. Upland cotton accounts for more than 90% of the world's cotton production, however, modern upland cotton cultivars have narrow genetic diversity. The amounts of genomic sequencing and re-sequencing data released make it possible to develop a high-quality single nucleotide polymorphism (SNP) array for intraspecific genotyping detection in cotton.Results: Here we report a high-throughput CottonSNP80K array and its utilization in genotyping detection in different cotton accessions. 82,259 SNP markers were selected from the re-sequencing data of 100 cotton cultivars and used to produce the array on the Illumina Infinium platform. 77,774 SNP loci (94.55%) were successfully synthesized on the array. Of them, 77,252 (99.33%) had call rates of >95% in 352 cotton accessions and 59,502 (76.51%) were polymorphic loci. Application tests using 22 cotton accessions with parent/F 1 combinations or with similar genetic backgrounds showed that CottonSNP80K array had high genotyping accuracy, good repeatability, and wide applicability. Phylogenetic analysis of 312 cotton cultivars and landraces with wide geographical distribution showed that they could be classified into ten groups, irrelevant of their origins. We found that the different landraces were clustered in different subgroups, indicating that these landraces were major contributors to the development of different breeding populations of modern G. hirsutum cultivars in China. We integrated a total of 54,588 SNPs (MAFs >0.05) associated with 10 salt stress traits into 288 G. hirsutum accessions for genome-wide association studies (GWAS), and eight significant SNPs associated with three salt stress traits were detected. Conclusions: We developed CottonSNP80K array with high polymorphism to distinguish upland cotton accessions. Diverse application tests indicated that the CottonSNP80K play important roles in germplasm genotyping, variety verification, functional genomics studies, and molecular breeding in cotton.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.