The giant sequoia ( Sequoiadendron giganteum ) of California are massive, long-lived trees that grow along the U.S. Sierra Nevada mountains. As they grow primarily in isolated groves within a narrow range, conservation of existing trees has been a national goal for over 150 years.Genomic data are limited in giant sequoia, and the assembly and annotation of the first giant sequoia genome has been an important goal to allow marker development for restoration and management. Using Illumina and Oxford Nanopore sequencing combined with Dovetail chromosome conformation capture libraries, 8.125 Gbp of sequence was assembled into eleven chromosome-scale scaffolds. This giant sequoia assembly represents the first genome sequenced in the Cupressaceae family, and lays a foundation for using genomic tools to aid in giant sequoia conservation and management. Beyond conservation and management applications, the giant sequoia assembly is a resource for answering questions about the life history of this enigmatic and robust species. Here we provide an example by taking an inventory of the large and complex family of NLR type disease resistance genes.assembly and annotation presented here is an unprecedented resource in conifer genomics, both for the quality of the assembly and because it represents an understudied branch of the gymnosperm tree of life.
MATERIALS AND METHODS
Sequencing and assembly
Megagametophyte DNA extraction and sequencingCones were collected from a 1,360-year-old giant sequoia (SEGI21, Sillett et al., 2015) in Sequoia/Kings Canyon National Park in 2012. As in previous conifer genome sequencing projects (e.g. Zimin et al., 2014), the megagametophyte from a single fertilized seed was dissected out and its haploid DNA extracted with a Qiagen DNeasy Plant Kit (Hilden, Germany), followed by library preparation with an Illumina TruSeq Nano kit using the low throughput protocol. This megagametophyte library was then sequenced on 10 lanes of an Illumina HiSeq 4000 with 150 bp paired-end reads at the UC Davis Genome Center DNA Technologies Core facility.
Foliage DNA extraction and Nanopore sequencingIn 2017 foliage was collected from the upper canopy of the same giant sequoia tree (SEGI21).From this foliage, high molecular weight DNA was extracted following the protocol described here (dx.doi.org/10.17504/protocols.io.4vbgw2n) . Briefly, purified genomic DNA was isolated through a nuclei extraction and lysis protocol. First, mature leaf tissue was homogenized in liquid nitrogen until well-ground, then added to a gentle lysis buffer (after Zhang et al.,