Episyrphus balteatus can provide dual ecosystem services including pest control and pollination, which the larvae are excellent predators of aphid pest whereas adults are efficient pollinator. In this study, we assembled a high-quality genome of E. balteatus from northern China geographical population at the chromosome level by using Illumina, PacBio long reads, and Hi-C technologies. The 467.42 Mb genome was obtained from 723 contigs, with a contig N50 of 9.16 Mb and Scaffold N50 of 118.85 Mb, and 90.25% (431.75 Mb) of the assembly was anchored to 4 pseudo-autosomes and one pseudoheterosome. In total, 14,848 protein-coding genes were annotated, and 95.14% of genes were fully represented in NR, GO, KEGG databases. Besides, we also obtained the mitochondrial genome of E. balteatus of 16, 837 bp in length with 37 typical mitochondrial genes. Overall, this high-quality genome is valuable for evolutionary and genetic studies of E. balteatus and other Syrphidae hoverfly species.
Background & SummaryEpisyrphus balteatus, also known as the marmalade hoverfly, acts as pollinator and predator, which the larvae can control aphid pests and adults feeding on pollen and nectar, can be used as pollinators in plants (Fig. 1a). As a long-distance migratory insect, it travels high above between high-and low-latitude regions seasonally each year, transports billions of pollen grains, consumes trillions of aphids, and make billions of flower visits on the annual fluxes 1 . Considering that the populations of many pollinator insects, especially bees (Fig. 1b), are seriously declining 2 . Hoverflies including E. balteatus are becoming more important in the ecosystem 3 . To date, plentiful studies about genome, reproduction, behavior, and phylogenetic relationship have been performed on various hoverfly species. However, the lack of high-quality reference genomes has hindered deeper gene function exploration of this species.In this study, we propose a high-quality genome-assembly at chromosome level and conduct a whole life cycle transcriptome of E. balteatus using a combination of Illumina short-read sequencing, PacBio continuous long read (CLR), and chromosome conformation capture (Hi-C) techniques (Table 1). Through CLR sequencing, 8,740, 850 continuous long reads with N50 of 39.791 kb were obtained, and then a total of 517.92 Mb genome was assembled from 729 contigs with a contig N50 of 9.16 Mb (Tables 2, 3). In Hi-C strategy, 90.25% (467.42 Mb) of the assembly was anchored to 4 chromosomes (Table 4) with a scaffold N50 of 118.85 Mb (Tables 2, 3), while Contig 1996 was proved as the pseudo-X chromosome via whole-genome synteny with other hoverfly species