The vision of the Earth BioGenome Project [1] is to complete reference genomes for all of the planet’s ~2M described eukaryotic species in the coming decade. To contribute to this global endeavour, the Darwin Tree of Life Project (DToL [2]) was launched in 2019 with the aim of generating complete genomes for the ~70k described eukaryotic species found in Britain and Ireland. One of the early tasks of the DToL project was to determine, define, and standardise the metadata that must accompany every sample contributing to the project. This ensures that high-quality contextual information is available for the associated data, providing richer information with which to search and filter datasets and enabling interoperability between datasets used for downstream analysis. Here we describe some of the key factors we considered in determining, defining, and documenting the metadata required for DToL project samples. The manifest and Standard Operating Procedure referred to throughout this paper are likely to be useful for other projects, and we encourage their re-use, provided the standards and rules set out here are maintained.
The Darwin Tree of Life (DToL) project aims to sequence and assemble high-quality genomes from all eukaryote species in Britain and Ireland, with the first phase of the project concentrating on family-level coverage plus species of particular ecological, biomedical or evolutionary interest. We summarise the processes involved in (1) assessing the UK arthropod fauna and the status of individual species on UK lists; (2) prioritising and collecting species for initial genome sequencing; (3) handling methods to ensure that high-quality genomic DNA is preserved; and (4) compiling standard operating procedures for processing specimens for genome sequencing, identification verification and voucher specimen curation. We briefly explore some lessons learned from the pilot phase of DToL and the impact of the Covid-19 pandemic.