The rodent Peromyscus leucopus is the natural reservoir of several tick-borne infections, including Lyme disease. To expand the knowledge base for this key species in life cycles of several pathogens, we assembled and scaffolded the P. leucopus genome. The resulting assembly was 2.45 Gb in total length, with 24 chromosome-length scaffolds harboring 97% of predicted genes. RNA sequencing following infection of P. leucopus with Borreliella burgdorferi, a Lyme disease agent, shows that, unlike blood, the skin is actively responding to the infection after several weeks. P. leucopus has a high level of segregating nucleotide variation, suggesting that natural resistance alleles to Crispr gene targeting constructs are likely segregating in wild populations. The reference genome will allow for experiments aimed at elucidating the mechanisms by which this widely distributed rodent serves as natural reservoir for several infectious diseases of public health importance, potentially enabling intervention strategies.
Background The spider Trichonephila antipodiana (Araneidae), commonly known as the batik golden web spider, preys on arthropods with body sizes ranging from ∼2 mm in length to insects larger than itself (>20‒50 mm), indicating its polyphagy and strong dietary detoxification abilities. Although it has been reported that an ancient whole-genome duplication event occurred in spiders, lack of a high-quality genome has limited characterization of this event. Results We present a chromosome-level T. antipodiana genome constructed on the basis of PacBio and Hi-C sequencing. The assembled genome is 2.29 Gb in size with a scaffold N50 of 172.89 Mb. Hi-C scaffolding assigned 98.5% of the bases to 13 pseudo-chromosomes, and BUSCO completeness analysis revealed that the assembly included 94.8% of the complete arthropod universal single-copy orthologs (n = 1,066). Repetitive elements account for 59.21% of the genome. We predicted 19,001 protein-coding genes, of which 96.78% were supported by transcriptome-based evidence and 96.32% matched protein records in the UniProt database. The genome also shows substantial expansions in several detoxification-associated gene families, including cytochrome P450 mono-oxygenases, carboxyl/cholinesterases, glutathione-S-transferases, and ATP-binding cassette transporters, reflecting the possible genomic basis of polyphagy. Further analysis of the T. antipodiana genome architecture reveals an ancient whole-genome duplication event, based on 2 lines of evidence: (i) large-scale duplications from inter-chromosome synteny analysis and (ii) duplicated clusters of Hox genes. Conclusions The high-quality T. antipodiana genome represents a valuable resource for spider research and provides insights into this species’ adaptation to the environment.
Background Despite marked recent improvements in long-read sequencing technology, the assembly of diploid genomes remains a difficult task. A major obstacle is distinguishing between alternative contigs that represent highly heterozygous regions. If primary and secondary contigs are not properly identified, the primary assembly will overrepresent both the size and complexity of the genome, which complicates downstream analysis such as scaffolding. Results Here we illustrate a new method, which we call HapSolo, that identifies secondary contigs and defines a primary assembly based on multiple pairwise contig alignment metrics. HapSolo evaluates candidate primary assemblies using BUSCO scores and then distinguishes among candidate assemblies using a cost function. The cost function can be defined by the user but by default considers the number of missing, duplicated and single BUSCO genes within the assembly. HapSolo performs hill climbing to minimize cost over thousands of candidate assemblies. We illustrate the performance of HapSolo on genome data from three species: the Chardonnay grape (Vitis vinifera), with a genome of 490 Mb, a mosquito (Anopheles funestus; 200 Mb) and the Thorny Skate (Amblyraja radiata; 2650 Mb). Conclusions HapSolo rapidly identified candidate assemblies that yield improvements in assembly metrics, including decreased genome size and improved N50 scores. Contig N50 scores improved by 35%, 9% and 9% for Chardonnay, mosquito and the thorny skate, respectively, relative to unreduced primary assemblies. The benefits of HapSolo were amplified by down-stream analyses, which we illustrated by scaffolding with Hi-C data. We found, for example, that prior to the application of HapSolo, only 52% of the Chardonnay genome was captured in the largest 19 scaffolds, corresponding to the number of chromosomes. After the application of HapSolo, this value increased to ~ 84%. The improvements for the mosquito’s largest three scaffolds, representing the number of chromosomes, were from 61 to 86%, and the improvement was even more pronounced for thorny skate. We compared the scaffolding results to assemblies that were based on PurgeDups for identifying secondary contigs, with generally superior results for HapSolo.
Pummelo (Citrus maxima or Citrus grandis) is a basic species and an important type for breeding in Citrus. Pummelo is used not only for fresh consumption but also for medicinal purposes. However, the molecular basis of medicinal traits is unclear. Here, compared with wild citrus species/Citrus-related genera, the content of 43 bioactive metabolites and their derivatives increased in the pummelo. Furthermore, we assembled the genome sequence of a variety for medicinal purposes with a long history, Citrus maxima 'Huazhouyou-tomentosa' (HZY-T), at the chromosome level with a genome size of 349.07 Mb. Comparative genomics showed that the expanded gene family in the pummelo genome was enriched in flavonoids-, terpenoid-, and phenylpropanoid biosynthesis. Using the metabolome and transcriptome of six developmental stages of HZY-T and Citrus maxima 'Huazhouyou-smooth' (HZY-S) fruit peel, we generated the regulatory networks of bioactive metabolites and their derivatives. We identified a novel MYB transcription factor, CmtMYB108, as an important regulator of flavone pathways. Both mutations and expression of CmtMYB108, which targets the genes PAL (phenylalanine ammonia-lyase) and FNS (flavone synthase), displayed differential expression between Citrus-related genera, wild citrus species and pummelo species. This study provides insights into the evolution-associated changes in bioactive metabolism during the origin process of pummelo.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.