Here the Human Pangenome Reference Consortium presents a first draft of the human pangenome reference. The pangenome contains 47 phased, diploid assemblies from a cohort of genetically diverse individuals1. These assemblies cover more than 99% of the expected sequence in each genome and are more than 99% accurate at the structural and base pair levels. Based on alignments of the assemblies, we generate a draft pangenome that captures known variants and haplotypes and reveals new alleles at structurally complex loci. We also add 119 million base pairs of euchromatic polymorphic sequences and 1,115 gene duplications relative to the existing reference GRCh38. Roughly 90 million of the additional base pairs are derived from structural variation. Using our draft pangenome to analyse short-read data reduced small variant discovery errors by 34% and increased the number of structural variants detected per haplotype by 104% compared with GRCh38-based workflows, which enabled the typing of the vast majority of structural variant alleles per sample.
Surfperches (Family Embiotocidae) are viviparous temperate reef fishes that brood their young. This life history trait translates into limited dispersal, strong population structure, and an unusually strong potential for local adaptation in a marine fish. As part of the California Conservation Genomics Project (CCGP), we sequenced the genome of the Black Surfperch, Embiotoca jacksoni, to establish a genomic model for understanding phylogeographic patterns of marine organisms in California. These patterns, in turn, may inform the design of marine protected areas using dispersal models based on genomic data. The genome of E. jacksoni is typical of marine fishes at less than 1Gb (genome size = 635 Mb), and our assembly is near-chromosome level (contig N50 = 6.5Mb, scaffold N50 = 15.5 Mb, BUSCO = 98.1%). Within the context of the CCGP, the genome will be used as a reference for future whole genome resequencing projects aimed at enhancing our knowledge of the population structure of the species, and efficacy of Marine Protected Areas across the state.
Genome assemblies are increasingly being used to identify adaptive genetic variation that can help prioritize the population management of protected species. This approach may be particularly relevant to species like Blainville’s horned lizard, Phrynosoma blainvillii, due to its specialized diet on noxious harvester ants, numerous adaptative traits for avoiding predation (e.g., cranial horns, dorsoventrally compressed body, cryptic coloration, and blood squirting from the orbital sinuses), and status as Species of Special Concern in California. Rangewide decline since the early 20 th century, the basis of its conservation status, has been driven mainly by habitat conversion, over-collecting, and invasion of a non-native ant that displaces its native ant prey base. Here, we report on a scaffold-level genome assembly for P. blainvillii as part of the California Conservation Genomics Project (CCGP), produced using Pacific Biosciences HiFi long reads and Hi-C chromatin-proximity sequencing technology. The de novo assembly has 78 scaffolds, a total length of ~2.21 Gb, a scaffold N50 length of ~352 Mb, and BUSCO score of 97.4%. This is the second species of Phrynosoma for which a reference genome has been assembled and represents a considerable improvement in terms of contiguity and completeness. Combined with the landscape genomics data being compiled by the CCGP, this assembly will help strategize efforts to maintain and/or restore local genetic diversity, where interventions like genetic rescue, translocation, and strategic land preservation may be the only means by which P. blainvillii and other low vagility species can survive in the fragmented habitats of California.
Keystone species are known to play a critical role in kelp forest health, including the well-known killer whales,sea otter,sea urchin,kelp trophic cascade in the Aleutian Islands, Alaska, USA. In California, a major player in the regulation of sea urchin abundance, and in turn, the health of kelp forests ecosystems, is a large wrasse, the California Sheephead, Semicossyphus pulcher. We present a reference genome for this ecologically important species that will serve as a key resource for future conservation research of California’s inshore marine environment utilizing genomic tools to address changes in life history traits, dispersal, range shifts, and ecological interactions among members of the kelp forest ecological assemblages. Our genome assembly of S. pulcher has a total length of 0.794 Gb, which is similar to many other marine fishes. The assembly is largely contiguous (N50 = 31.9Mb) and nearly complete (BUSCO single-copy core gene content = 98.1%). Within the context of the California Conservation Genomics Project (CCGP), the genome of Semicossyphus pulcher will be used as an important reference resource for ongoing whole genome resequencing efforts of the species.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.