Linkage maps enable the study of important biological questions. The construction of high-density linkage maps appears more feasible since the advent of next-generation sequencing (NGS), which eases SNP discovery and high-throughput genotyping of large population. However, the marker number explosion and genotyping errors from NGS data challenge the computational efficiency and linkage map quality of linkage study methods. Here we report the HighMap method for constructing high-density linkage maps from NGS data. HighMap employs an iterative ordering and error correction strategy based on a k-nearest neighbor algorithm and a Monte Carlo multipoint maximum likelihood algorithm. Simulation study shows HighMap can create a linkage map with three times as many markers as ordering-only methods while offering more accurate marker orders and stable genetic distances. Using HighMap, we constructed a common carp linkage map with 10,004 markers. The singleton rate was less than one-ninth of that generated by JoinMap4.1. Its total map distance was 5,908 cM, consistent with reports on low-density maps. HighMap is an efficient method for constructing high-density, high-quality linkage maps from high-throughput population NGS data. It will facilitate genome assembling, comparative genomic analysis, and QTL studies. HighMap is available at http://highmap.biomarker.com.cn/.
BackgroundThe genetics and molecular biology of sesame has only recently begun to be studied even though sesame is an important oil seed crop. A high-density genetic map for sesame has not been published yet due to a lack of sufficient molecular markers. Specific length amplified fragment sequencing (SLAF-seq) is a recently developed high-resolution strategy for large-scale de novo SNP discovery and genotyping. SLAF-seq was employed in this study to obtain sufficient markers to construct a high-density genetic map for sesame.ResultsIn total, 28.21 Gb of data containing 201,488,285 pair-end reads was obtained after sequencing. The average coverage for each SLAF marker was 23.48-fold in the male parent, 23.38-fold in the female parent, and 14.46-fold average in each F2 individual. In total, 71,793 high-quality SLAFs were detected of which 3,673 SLAFs were polymorphic and 1,272 of the polymorphic markers met the requirements for use in the construction of a genetic map. The final map included 1,233 markers on the 15 linkage groups (LGs) and was 1,474.87 cM in length with an average distance of 1.20 cM between adjacent markers. To our knowledge, this map is the densest genetic linkage map to date for sesame. 'SNP_only’ markers accounted for 87.51% of the markers on the map. A total of 205 markers on the map showed significant (P < 0.05) segregation distortion.ConclusionsWe report here the first high-density genetic map for sesame. The map was constructed using an F2 population and the SLAF-seq approach, which allowed the efficient development of a large number of polymorphic markers in a short time. Results of this study will not only provide a platform for gene/QTL fine mapping, map-based gene isolation, and molecular breeding for sesame, but will also serve as a reference for positioning sequence scaffolds on a physical map, to assist in the process of assembling the sesame genome sequence.
Olive (Olea europaea L.) is internationally renowned for its high-end product, extra virgin olive oil. An incomplete genome of O. europaea was previously obtained using shotgun sequencing in 2016. To further explore the genetic and breeding utilization of olive, an updated draft genome of olive was obtained using Oxford Nanopore third-generation sequencing and Hi-C technology. Seven different assembly strategies were used to assemble the final genome of 1.30 Gb, with contig and scaffold N50 sizes of 4.67 Mb and 42.60 Mb, respectively. This greatly increased the quality of the olive genome. We assembled 1.1 Gb of sequences of the total olive genome to 23 pseudochromosomes by Hi-C, and 53,518 protein-coding genes were predicted in the current assembly. Comparative genomics analyses, including gene family expansion and contraction, whole-genome replication, phylogenetic analysis, and positive selection, were performed. Based on the obtained high-quality olive genome, a total of nine gene families with 202 genes were identified in the oleuropein biosynthesis pathway, which is twice the number of genes identified from the previous data. This new accession of the olive genome is of sufficient quality for genome-wide studies on gene function in olive and has provided a foundation for the molecular breeding of olive species.
Cultivated chrysanthemum (Chrysanthemum × morifolium Ramat.) is a beloved ornamental crop due to the diverse capitula types among varieties, but the molecular mechanism of capitulum development remains unclear. Here, we report a 2.60 Gb chromosome-scale reference genome of C. lavandulifolium, a wild Chrysanthemum species found in China, Korea and Japan. The evolutionary analysis of the genome revealed that only recent tandem duplications occurred in the C. lavandulifolium genome after the shared whole genome triplication (WGT) in Asteraceae. Based on the transcriptomic profiling of six important developmental stages of the radiate capitulum in C. lavandulifolium, we found genes in the MADS-box, TCP, NAC and LOB gene families that were involved in disc and ray floret primordia differentiation. Notably, NAM and LOB30 homologs were specifically expressed in the radiate capitulum, suggesting their pivotal roles in the genetic network of disc and ray floret primordia differentiation in chrysanthemum. The present study not only provides a high-quality reference genome of chrysanthemum but also provides insight into the molecular mechanism underlying the diverse capitulum types in chrysanthemum.
Flower type is an important and extremely complicated trait of chrysanthemum. The corolla tube merged degree (CTMD) and the relative number of ray florets (RNRF) are the two key factors affecting chrysanthemum flower type. However, few reports have clarified the inheritance of these two complex traits, which limits directed breeding for flower-type improvement. In this study, 305 F 1 hybrids were obtained from two parents with obvious differences in CTMD and RNRF performance. Using specific-locus amplified fragment sequencing (SLAF-seq) technology, we constructed a high-density genetic linkage map with an average map distance of 0.76 cM. Three major QTLs controlling CTMD and four major QTLs underlying RNRF were repeatedly detected in the 2 years. Moreover, the synteny between the genetic map and other Compositae species was investigated, and weak collinearity was observed. In QTL regions with a high degree of genomic collinearity, eight annotated genes were probed in the Helianthus annuus L. and Lactuca sativa L. var. ramosa Hort. genomes. Furthermore, 20 and 11 unigenes were identified via BLAST searches between the SNP markers of the QTL regions and the C. vestitum and C. lavandulifolium transcriptomes, respectively. These results lay a foundation for molecular marker-assisted breeding and candidate gene exploration in chrysanthemum without a reference assembly.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.