2019
DOI: 10.1186/s12864-019-5642-0
|View full text |Cite
|
Sign up to set email alerts
|

A hybrid de novo genome assembly of the honeybee, Apis mellifera, with chromosome-length scaffolds

Abstract: Background The ability to generate long sequencing reads and access long-range linkage information is revolutionizing the quality and completeness of genome assemblies. Here we use a hybrid approach that combines data from four genome sequencing and mapping technologies to generate a new genome assembly of the honeybee Apis mellifera . We first generated contigs based on PacBio sequencing libraries, which were then merged with linked-read 10x Chromium data followed by sc… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

13
163
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
7

Relationship

0
7

Authors

Journals

citations
Cited by 168 publications
(179 citation statements)
references
References 74 publications
13
163
0
Order By: Relevance
“…These repeat sequences include long interspersed nuclear elements (LINEs) (0.04%), long terminal repeats (LTRs) (0.20%), DNA elements (0.65%), small RNAs (0.01%), simple repeats (4.03%), low complexity sequences (1.06%), and unclassified repeat sequences (3.16%). Of them, simple repeats with total length of 8,693,708 bp are the richest type (Table 3), which is consistent with the previous reports (Park et al, 2015;Diao et al, 2018) and is the same as in A. mellifera (Wallberg et al, 2019), suggesting that simple repeats are the major type of repeat sequences in honeybees.…”
Section: Genome Annotation and Evaluationsupporting
confidence: 90%
See 1 more Smart Citation
“…These repeat sequences include long interspersed nuclear elements (LINEs) (0.04%), long terminal repeats (LTRs) (0.20%), DNA elements (0.65%), small RNAs (0.01%), simple repeats (4.03%), low complexity sequences (1.06%), and unclassified repeat sequences (3.16%). Of them, simple repeats with total length of 8,693,708 bp are the richest type (Table 3), which is consistent with the previous reports (Park et al, 2015;Diao et al, 2018) and is the same as in A. mellifera (Wallberg et al, 2019), suggesting that simple repeats are the major type of repeat sequences in honeybees.…”
Section: Genome Annotation and Evaluationsupporting
confidence: 90%
“…Compared with the second generation sequencing technologies, the TGS technologies also maintain the advantages of high throughput and low cost. Typical TGS technologies are single-molecule real-time (SMRT) sequencing technology developed by Pacific Biosciences company and Nanopore sequencing technology launched by Oxford Nanopore company, which are now widely used in genome sequencing of animals and plants (Jiang et al, 2014;Jiao et al, 2017;Gong et al, 2018;Kronenberg et al, 2018;Ghurye et al, 2019;Low et al, 2019;Wallberg et al, 2019;Zhang et al, 2019).…”
Section: Introductionmentioning
confidence: 99%
“…Interestingly, we find that in CSP genes such as GB19453, one intron is inserted only a few nucleotides after the start codon encoding the amino acid methionine (Figure 2A). The same observation (intron1 inserted shortly after the start of the signal peptide) was made in AAJJ1196A and BmorCSP19 ( Figure S1) [14,38,[41][42][43]. In the case of these genes, the intron is inserted after the third base and therefore does not cause codon disruption (phase 0 intron).…”
Section: Genome-wide Identification Comparative Genomics and Evolutisupporting
confidence: 57%
“…In the initial analysis of the honeybee genome, the existence of six CSPs has been reported in A. mellifera: six single-intron structures [37,39,40]. Analyzing a new assembly (sequence update) of the honeybee genome [41], we confirm the existence of only six CSPs in bees (AmelASP3c, AmelGB10389, AmelGB13325, AmelGB17875, AmelGB19242 and AmelGB19453) and localize them on specific chromosomes (Figure 2A, Table S1). However, we find that there is an additional intron inserted in the signal peptide of GB19453.…”
Section: Genome-wide Identification Comparative Genomics and Evolutisupporting
confidence: 54%
See 1 more Smart Citation