Sequencing the large genomes of sharks. We focused on the brownbanded bamboo shark Chiloscyllium punctatum, for which we recently tabled embryonic stages 8 , and the cloudy catshark Scyliorhinus torazame. Their whole genomes, measured to be approximately 4.7 and 6.7 Gbp, respectively, were sequenced de novo to obtain assemblies including megabase-long scaffolds (Supplementary Note 1.1). We also assembled the genome of the whale shark Rhincodon typus using short sequence reads previously generated 3 (Supplementary Note 1.2). Using these genome assemblies, we performed genome-wide gene prediction, assisted by transcript evidence and protein-level homology to other vertebrates. The obtained genome assemblies and gene models exhibit high coverage (Supplementary Fig. 1), and of these, the bamboo shark genome assembly achieved the highest continuity (N50 scaffold length, 1.9 Mbp) and completeness (97% of reference orthologues identified at least partially). Using the novel gene models, we constructed orthologue groups encompassing a diverse array of vertebrate species (see below). Our products outperform existing
The taxon Elasmobranchii (sharks and rays) contains one of the long-established evolutionary lineages of vertebrates with a tantalizing collection of species occupying critical aquatic habitats. To overcome the current limitation in molecular resources, we launched the Squalomix Consortium in 2020 to promote a genome-wide array of molecular approaches, specifically targeting shark and ray species. Among the various bottlenecks in working with elasmobranchs are their elusiveness and low fecundity as well as the large and highly repetitive genomes. Their peculiar body fluid composition has also hindered the establishment of methods to perform routine cell culturing required for their karyotyping. In the Squalomix consortium, these obstacles are expected to be solved through a combination of in-house cytological techniques including karyotyping of cultured cells, chromatin preparation for Hi-C data acquisition, and high fidelity long-read sequencing. The resources and products obtained in this consortium, including genome and transcriptome sequences, a genome browser powered by JBrowse2 to visualize sequence alignments, and comprehensive matrices of gene expression profiles for selected species are accessible through https://github.com/Squalomix/info.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.