Gossypium hirsutum has proven difficult to sequence owing to its complex allotetraploid (AtDt) genome. Here we produce a draft genome using 181-fold paired-end sequences assisted by fivefold BAC-to-BAC sequences and a high-resolution genetic map. In our assembly 88.5% of the 2,173-Mb scaffolds, which cover 89.6%∼96.7% of the AtDt genome, are anchored and oriented to 26 pseudochromosomes. Comparison of this G. hirsutum AtDt genome with the already sequenced diploid Gossypium arboreum (AA) and Gossypium raimondii (DD) genomes revealed conserved gene order. Repeated sequences account for 67.2% of the AtDt genome, and transposable elements (TEs) originating from Dt seem more active than from At. Reduction in the AtDt genome size occurred after allopolyploidization. The A or At genome may have undergone positive selection for fiber traits. Concerted evolution of different regulatory mechanisms for Cellulose synthase (CesA) and 1-Aminocyclopropane-1-carboxylic acid oxidase1 and 3 (ACO1,3) may be important for enhanced fiber production in G. hirsutum.
Cotton is one of the most economically important crop plants worldwide. Its fiber, commonly known as cotton lint, is the principal natural source for the textile industry. Approximately 33 million ha (5% of the world's arable land) is used for cotton planting 1 , with an annual global market value of textile mills of approximately $630.6 billion in 2011 (MarketPublishers; see URLs). Apart from its economic value, cotton is also an excellent model system for studying polyploidization, cell elongation and cell wall biosynthesis 2-5 .The Gossypium genus contains 5 tetraploid (AD 1 to AD 5 , 2n = 4×) and over 45 diploid (2n = 2×) species (where n is the number of chromosomes in the gamete of an individual), which are believed to have originated from a common ancestor approximately 5-10 million years ago 6 . Eight diploid subgenomes, designated as A to G and K, have been found across North America, Africa, Asia and Australia. The haploid genome size of diploid cottons (2n = 2× = 26) varies from about 880 Mb (G. raimondii Ulbrich) in the D genome to 2,500 Mb in the K genome 7,8 . Diploid cotton species share a common chromosome number (n = 13), and high levels of synteny or colinearity are observed among them 9-12 . The tetraploid cotton species (2n = 4× = 52), such as G. hirsutum L. and Gossypium barbadense L., are thought to have formed by an allopolyploidization event that occurred approximately 1-2 million years ago, which involved a D-genome species as the pollen-providing parent and an A-genome species as the maternal parent 13,14 . To gain insights into the cultivated polyploid genomes-how they have evolved and how their subgenomes interact-it is first necessary to have a basic knowledge of the structure of the component genomes. Therefore, we have created a draft sequence of the putative D-genome parent, G. raimondii, using DNA samples prepared from Cotton Microsatellite Database (CMD) 10 (refs. 15,16), a genetic standard originated from a single seed (accession D 5 -3) in 2004 and brought to near homozygosity by six successive generations of self-fertilization. We believe that sequencing of the G. raimondii genome will not only provide a major source of candidate genes important for the genetic improvement of cotton quality and productivity, but it may also serve as a reference for the assembly of the tetraploid G. hirsutum genome. RESULTS Sequencing and assemblyA whole-genome shotgun strategy was used to sequence and assemble the G. raimondii genome. A total of 78.7 Gb of next-generation Illumina paired-end 50-bp, 100-bp and 150-bp reads was generated by sequencing genome shotgun libraries of different fragment lengths (170 bp, 250 bp, 500 bp, 800 bp, 2 kb, 5 kb, 10 kb, 20 kb and 40 kb) that covered 103.6-fold of the 775.2-Mb assembled G. raimondii genome (Supplementary Table 1). The resulting assembly appeared to cover a very large proportion of the euchromatin of the G. raimondii genome. The unassembled genomic regions are likely to contain heterochromatic satellites, large repetitive sequences or ribosoma...
The ancestors of Gossypium arboreum and Gossypium herbaceum provided the A subgenome for the modern cultivated allotetraploid cotton. Here, we upgraded the G. arboreum genome assembly by integrating different technologies. We resequenced 243 G. arboreum and G. herbaceum accessions to generate a map of genome variations and found that they are equally diverged from Gossypium raimondii. Independent analysis suggested that Chinese G. arboreum originated in South China and was subsequently introduced to the Yangtze and Yellow River regions. Most accessions with domestication-related traits experienced geographic isolation. Genome-wide association study (GWAS) identified 98 significant peak associations for 11 agronomically important traits in G. arboreum. A nonsynonymous substitution (cysteine-to-arginine substitution) of GaKASIII seems to confer substantial fatty acid composition (C16:0 and C16:1) changes in cotton seeds. Resistance to fusarium wilt disease is associated with activation of GaGSTF9 expression. Our work represents a major step toward understanding the evolution of the A genome of cotton.
Resurrection plants differ from other species in their unique ability to survive desiccation. In order to understand the mechanisms of desiccation tolerance, proteome studies were carried out using leaves of the resurrection plant Boea hygrometrica to reveal proteins that were differentially expressed in response to changes in relative water content. This opportunity was afforded by the rare ability of excised B. hygrometrica leaves to survive and resume metabolism following desiccation in a manner similar to intact plants. From a total of 223 proteins that were reproducibly detected and analyzed, 35% showed increased abundance in dehydrated leaves, 5% were induced in rehydrated leaves and 60% showed decreased or unchanged abundance in dehydrated and rehydrated leaves. Since the induction kinetics fall into clearly defined patterns, we suggest that programmed regulation of protein expression triggered by changes of water status. Fourteen dehydration responsive proteins were analyzed by mass spectrometry. Eight proteins were classified as playing a role in reactive oxygen species scavenging, photosynthesis and energy metabolism. In agreement with these findings, glutathione content and polyphenol oxidase activity were found to increase upon dehydration and rapid recovery of photosynthesis was observed.
BackgroundThe identification of quantitative trait loci (QTLs) that are stable and consistent across multiple environments and populations plays an essential role in marker-assisted selection (MAS). In the present study, we used 28,861 simple sequence repeat (SSR) markers, which included 12,560 Gossypium raimondii (D genome) sequence-based SSR markers to identify polymorphism between two upland cotton strains 0–153 and sGK9708. A total of 851 polymorphic primers were finally selected and used to genotype 196 recombinant inbred lines (RIL) derived from a cross between 0 and 153 and sGK9708 and used to construct a linkage map. The RIL population was evaluated for fiber quality traits in six locations in China for five years. Stable QTLs identified in this intraspecific cross could be used in future cotton breeding program and with fewer obstacles.ResultsThe map covered a distance of 4,110 cM, which represents about 93.2 % of the upland cotton genome, and with an average distance of 5.2 cM between adjacent markers. We identified 165 QTLs for fiber quality traits, of which 47 QTLs were determined to be stable across multiple environments. Most of these QTLs aggregated into clusters with two or more traits. A total of 30 QTL clusters were identified which consisted of 103 QTLs. Sixteen clusters in the At sub-genome comprised 44 QTLs, whereas 14 clusters in the Dt sub-genome that included 59 QTLs for fiber quality were identified. Four chromosomes, including chromosome 4 (c4), c7, c14, and c25 were rich in clusters harboring 5, 4, 5, and 6 clusters respectively. A meta-analysis was performed using Biomercator V4.2 to integrate QTLs from 11 environmental datasets on the RIL populations of the above mentioned parents and previous QTL reports. Among the 165 identified QTLs, 90 were identified as common QTLs, whereas the remaining 75 QTLs were determined to be novel QTLs. The broad sense heritability estimates of fiber quality traits were high for fiber length (0.93), fiber strength (0.92), fiber micronaire (0.85), and fiber uniformity (0.80), but low for fiber elongation (0.27). Meta-clusters on c4, c7, c14 and c25 were identified as stable QTL clusters and were considered more valuable in MAS for the improvement of fiber quality of upland cotton.ConclusionMultiple environmental evaluations of an intraspecific RIL population were conducted to identify stable QTLs. Meta-QTL analyses identified a common chromosomal region that plays an important role in fiber development. Therefore, QTLs identified in the present study are an ideal candidate for MAS in cotton breeding programs to improve fiber quality.Electronic supplementary materialThe online version of this article (doi:10.1186/s12864-016-2560-2) contains supplementary material, which is available to authorized users.
BackgroundUpland Cotton (Gossypium hirsutum) is one of the most important worldwide crops it provides natural high-quality fiber for the industrial production and everyday use. Next-generation sequencing is a powerful method to identify single nucleotide polymorphism markers on a large scale for the construction of a high-density genetic map for quantitative trait loci mapping.ResultsIn this research, a recombinant inbred lines population developed from two upland cotton cultivars 0–153 and sGK9708 was used to construct a high-density genetic map through the specific locus amplified fragment sequencing method. The high-density genetic map harbored 5521 single nucleotide polymorphism markers which covered a total distance of 3259.37 cM with an average marker interval of 0.78 cM without gaps larger than 10 cM. In total 18 quantitative trait loci of boll weight were identified as stable quantitative trait loci and were detected in at least three out of 11 environments and explained 4.15–16.70 % of the observed phenotypic variation. In total, 344 candidate genes were identified within the confidence intervals of these stable quantitative trait loci based on the cotton genome sequence. These genes were categorized based on their function through gene ontology analysis, Kyoto Encyclopedia of Genes and Genomes analysis and eukaryotic orthologous groups analysis.ConclusionsThis research reported the first high-density genetic map for Upland Cotton (Gossypium hirsutum) with a recombinant inbred line population using single nucleotide polymorphism markers developed by specific locus amplified fragment sequencing. We also identified quantitative trait loci of boll weight across 11 environments and identified candidate genes within the quantitative trait loci confidence intervals. The results of this research would provide useful information for the next-step work including fine mapping, gene functional analysis, pyramiding breeding of functional genes as well as marker-assisted selection.Electronic supplementary materialThe online version of this article (doi:10.1186/s12870-016-0741-4) contains supplementary material, which is available to authorized users.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.