The koala, the only extant species of the marsupial family Phascolarctidae, is classified as 'vulnerable' due to habitat loss and widespread disease. We sequenced the koala genome, producing a complete and contiguous marsupial reference genome, including centromeres. We reveal that the koala's ability to detoxify eucalypt foliage may be due to expansions within a cytochrome P450 gene family, and its ability to smell, taste and moderate ingestion of plant secondary metabolites may be due to expansions in the vomeronasal and taste receptors. We characterized novel lactation proteins that protect young in the pouch and annotated immune genes important for response to chlamydial disease. Historical demography showed a substantial population crash coincident with the decline of Australian megafauna, while contemporary populations had biogeographic boundaries and increased inbreeding in populations affected by historic translocations. We identified genetically diverse populations that require habitat corridors and instituting of translocation programs to aid the koala's survival in the wild.
The virulence of chlamydial infection in wild koalas is highly variable between individuals. Some koalas can be infected (PCR positive) with Chlamydia for long periods but remain asymptomatic, whereas others develop clinical disease. Chlamydia in the koala has traditionally been studied without regard to coinfection with other pathogens, although koalas are usually subject to infection with koala retrovirus (KoRV). Retroviruses can be immunosuppressive, and there is evidence of an immunosuppressive effect of KoRV in vitro. Originally thought to be a single endogenous strain, a new, potentially more virulent exogenous variant (KoRV-B) was recently reported. We hypothesized that KoRV-B might significantly alter chlamydial disease outcomes in koalas, presumably via immunosuppression. By studying sub-groups of Chlamydia and KoRV infected koalas in the wild, we found that neither total KoRV load (either viraemia or proviral copies per genome), nor chlamydial infection level or strain type, was significantly associated with chlamydial disease risk. However, PCR positivity with KoRV-B was significantly associated with chlamydial disease in koalas (p = 0.02961). This represents an example of a recently evolved virus variant that may be predisposing its host (the koala) to overt clinical disease when co-infected with an otherwise asymptomatic bacterial pathogen (Chlamydia).
A fundamental challenge in resolving evolutionary relationships across the tree of life is to account for heterogeneity in the evolutionary signal across loci. Studies of marsupial mammals have demonstrated that this heterogeneity can be substantial, leaving considerable uncertainty in the evolutionary timescale and relationships within the group. Using simulations and a new phylogenomic data set comprising nucleotide sequences of 1550 loci from 18 of the 22 extant marsupial families, we demonstrate the power of a method for identifying clusters of loci that support different phylogenetic trees. We find two distinct clusters of loci, each providing an estimate of the species tree that matches previously proposed resolutions of the marsupial phylogeny. We also identify a well-supported placement for the enigmatic marsupial moles (Notoryctes) that contradicts previous molecular estimates but is consistent with morphological evidence. The pattern of gene-tree variation across tree-space is characterized by changes in information content, GC content, substitution-model adequacy, and signatures of purifying selection in the data. In a simulation study, we show that incomplete lineage sorting can explain the division of loci into the two tree-topology clusters, as found in our phylogenomic analysis of marsupials. We also demonstrate the potential benefits of minimizing uncertainty from phylogenetic conflict for molecular dating. Our analyses reveal that Australasian marsupials appeared in the early Paleocene, whereas the diversification of present-day families occurred primarily during the late Eocene and early Oligocene. Our methods provide an intuitive framework for improving the accuracy and precision of phylogenetic inference and molecular dating using genome-scale data.
The koala retrovirus (KoRV) is implicated in several diseases affecting the koala (Phascolarctos cinereus). KoRV provirus can be present in the genome of koalas as an endogenous retrovirus (present in all cells via germline integration) or as exogenous retrovirus responsible for somatic integrations of proviral KoRV (present in a limited number of cells). This ongoing invasion of the koala germline by KoRV provides a powerful opportunity to assess the viral strategies used by KoRV in an individual. Analysis of a high-quality genome sequence of a single koala revealed 133 KoRV integration sites. Most integrations contain full-length, endogenous provirus; KoRV-A subtype. The second most frequent integrations contain an endogenous recombinant element (recKoRV) in which most of the KoRV protein-coding region has been replaced with an ancient, endogenous retroelement. A third set of integrations, with very low sequence coverage, may represent somatic cell integrations of KoRV-A, KoRV-B and two recently designated additional subgroups, KoRV-D and KoRV-E. KoRV-D and KoRV-E are missing several genes required for viral processing, suggesting they have been transmitted as defective viruses. Our results represent the first comprehensive analyses of KoRV integration and variation in a single animal and provide further insights into the process of retroviral-host species interactions.
SignificanceEndogenous retroviruses (ERVs) are proviral sequences that result from host germ-line invasion by exogenous retroviruses. The majority of ERVs are degraded. Using the koala retrovirus (KoRV) as a model system, we demonstrate that recombination with an ancient koala retroelement disables KoRV, and that recombination occurs frequently and early in the invasion process. Recombinant KoRVs (recKoRVs) are then able to proliferate in the koala germ line. This may in part explain the generally degraded nature of ERVs in vertebrate genomes and suggests that degradation via recombination is one of the earliest processes shaping retroviral genomic invasions.
BackgroundThe koala, Phascolarctos cinereus, is a biologically unique and evolutionarily distinct Australian arboreal marsupial. The goal of this study was to sequence the transcriptome from several tissues of two geographically separate koalas, and to create the first comprehensive catalog of annotated transcripts for this species, enabling detailed analysis of the unique attributes of this threatened native marsupial, including infection by the koala retrovirus.ResultsRNA-Seq data was generated from a range of tissues from one male and one female koala and assembled de novo into transcripts using Velvet-Oases. Transcript abundance in each tissue was estimated. Transcripts were searched for likely protein-coding regions and a non-redundant set of 117,563 putative protein sequences was produced. In similarity searches there were 84,907 (72%) sequences that aligned to at least one sequence in the NCBI nr protein database. The best alignments were to sequences from other marsupials. After applying a reciprocal best hit requirement of koala sequences to those from tammar wallaby, Tasmanian devil and the gray short-tailed opossum, we estimate that our transcriptome dataset represents approximately 15,000 koala genes. The marsupial alignment information was used to look for potential gene duplications and we report evidence for copy number expansion of the alpha amylase gene, and of an aldehyde reductase gene.Koala retrovirus (KoRV) transcripts were detected in the transcriptomes. These were analysed in detail and the structure of the spliced envelope gene transcript was determined. There was appreciable sequence diversity within KoRV, with 233 sites in the KoRV genome showing small insertions/deletions or single nucleotide polymorphisms. Both koalas had sequences from the KoRV-A subtype, but the male koala transcriptome has, in addition, sequences more closely related to the KoRV-B subtype. This is the first report of a KoRV-B-like sequence in a wild population.ConclusionsThis transcriptomic dataset is a useful resource for molecular genetic studies of the koala, for evolutionary genetic studies of marsupials, for validation and annotation of the koala genome sequence, and for investigation of koala retrovirus. Annotated transcripts can be browsed and queried at http://koalagenome.org.Electronic supplementary materialThe online version of this article (doi:10.1186/1471-2164-15-786) contains supplementary material, which is available to authorized users.
Natural history museums harbour a plethora of biological specimens which are of potential use in population and conservation genetic studies. Although technical advancements in museum genomics have enabled genome‐wide markers to be generated from aged museum specimens, the suitability of these data for robust biological inference is not well characterized. The aim of this study was to test the utility of museum specimens in population and conservation genomics by assessing the biological and technical validity of single nucleotide polymorphism (SNP) data derived from such samples. To achieve this, we generated thousands of SNPs from 47 red‐tailed black cockatoo (Calyptorhychus banksii) traditional museum samples (i.e. samples that were not collected with the primary intent of DNA analysis) and 113 fresh tissue samples (cryopreserved liver/muscle) using a restriction site‐associated DNA marker approach (DArTseq™). Thousands of SNPs were successfully generated from most of the traditional museum samples (with a mean age of 44 years, ranging from 5 to 123 years), although 38% did not provide useful data. These SNPs exhibited higher error rates and contained significantly more missing data compared with SNPs from fresh tissue samples, likely due to considerable DNA fragmentation. However, based on simulation results, the level of genotyping error had a negligible effect on inference of population structure in this species. We did identify a bias towards low diversity SNPs in older samples that appears to compromise temporal inferences of genetic diversity. This study demonstrates the utility of a RADseq‐based method to produce reliable genome‐wide SNP data from traditional museum specimens.
The Australian continent exhibits complex biogeographic patterns but studies of the impacts of Pleistocene climatic oscillation on the mesic environments of the Southern Hemisphere are limited. The koala (Phascolarctos cinereus), one of Australia’s most iconic species, was historically widely distributed throughout much of eastern Australia but currently represents a complex conservation challenge. To better understand the challenges to koala genetic health, we assessed the phylogeographic history of the koala. Variation in the maternally inherited mitochondrial DNA (mtDNA) Control Region (CR) was examined in 662 koalas sampled throughout their distribution. In addition, koala CR haplotypes accessioned to Genbank were evaluated and consolidated. A total of 53 unique CR haplotypes have been isolated from koalas to date (including 15 haplotypes novel to this study). The relationships among koala CR haplotypes were indicative of a single Evolutionary Significant Unit and do not support the recognition of subspecies, but were separated into four weakly differentiated lineages which correspond to three geographic clusters: a central lineage, a southern lineage and two northern lineages co-occurring north of Brisbane. The three geographic clusters were separated by known Pleistocene biogeographic barriers: the Brisbane River Valley and Clarence River Valley, although there was evidence of mixing amongst clusters. While there is evidence for historical connectivity, current koala populations exhibit greater structure, suggesting habitat fragmentation may have restricted female-mediated gene flow. Since mtDNA data informs conservation planning, we provide a summary of existing CR haplotypes, standardise nomenclature and make recommendations for future studies to harmonise existing datasets. This holistic approach is critical to ensuring management is effective and small scale local population studies can be integrated into a wider species context.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
334 Leonard St
Brooklyn, NY 11211
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.