Abstract:Background
Microsporidia are diverse spore forming, fungal-related obligate intracellular pathogens infecting a wide range of hosts. This diversity is reflected at the genome level with sizes varying by an order of magnitude, ranging from less than 3 Mb in Encephalitozoon species (the smallest known in eukaryotes) to more than 50 Mb in Edhazardia spp. As a paradigm of genome reduction in eukaryotes, the small Encephalitozoon genomes have attracted much attention with investigations revealing ge… Show more
“…2021 ). The structural completeness of our assembly was assessed using scripts from Mascarenhas dos Santos et al . (2023) , which allow for the identification of telomeres.…”
Section: Resultsmentioning
confidence: 99%
“…The highest quality genomes within the microsporidia are from the genus Encephalitozoon , which have an order of magnitude smaller genomes than H . tvaerminnensis and telomeric repeats of (TTAGG) n on both ends of all eleven chromosomes ( Mascarenhas dos Santos et al . 2023 ).…”
Section: Resultsmentioning
confidence: 99%
“…2018 ) and manual curation. The absence of contaminants in this final assembly was ascertained by BLAST homology searches against the NCBI database, and its completeness was assessed with check_for_telomeres.pl v.0.3 ( Mascarenhas dos Santos et al . 2023 ).…”
Section: Methodsmentioning
confidence: 99%
“…2022 ). Following recent successful efforts to obtain telomere-to-telomere assemblies in Encephalitozoon ( Mascarenhas dos Santos et al . 2023 ), a genus of microsporidia with small genomes, we focus here on a large-genome species, Hamiltosporidium tvaerminnensis .…”
Microsporidia are intracellular parasitic fungi whose genomes rank among the smallest of all known eukaryotes. A number of outstanding questions remain concerning the evolution of their large-scale variation in genome architecture, responsible for genome size variation of more than an order of magnitude. This genome report presents a first near-chromosomal assembly of a large-genome microsporidium, Hamiltosporidium tvaerminnensis. Combined Oxford Nanopore, Pacific Biosciences, and Illumina sequencing led to a genome assembly of 17 contigs, 11 of which represent complete chromosomes. Our assembly is 21.64 Mb in length, has an N50 of 1.44 Mb, and consists of 39.56% interspersed repeats. We introduce a novel approach in microsporidia, PacBio Iso-Seq, as part of a larger annotation pipeline for obtaining high-quality annotations of 3,573 protein-coding genes. Based on direct evidence from the full-length Iso-Seq transcripts, we present evidence for alternative polyadenylation and variation in splicing efficiency, which are potential regulation mechanisms for gene expression in microsporidia. The generated high-quality genome assembly is a necessary resource for comparative genomics that will help elucidate the evolution of genome architecture in response to intracellular parasitism.
“…2021 ). The structural completeness of our assembly was assessed using scripts from Mascarenhas dos Santos et al . (2023) , which allow for the identification of telomeres.…”
Section: Resultsmentioning
confidence: 99%
“…The highest quality genomes within the microsporidia are from the genus Encephalitozoon , which have an order of magnitude smaller genomes than H . tvaerminnensis and telomeric repeats of (TTAGG) n on both ends of all eleven chromosomes ( Mascarenhas dos Santos et al . 2023 ).…”
Section: Resultsmentioning
confidence: 99%
“…2018 ) and manual curation. The absence of contaminants in this final assembly was ascertained by BLAST homology searches against the NCBI database, and its completeness was assessed with check_for_telomeres.pl v.0.3 ( Mascarenhas dos Santos et al . 2023 ).…”
Section: Methodsmentioning
confidence: 99%
“…2022 ). Following recent successful efforts to obtain telomere-to-telomere assemblies in Encephalitozoon ( Mascarenhas dos Santos et al . 2023 ), a genus of microsporidia with small genomes, we focus here on a large-genome species, Hamiltosporidium tvaerminnensis .…”
Microsporidia are intracellular parasitic fungi whose genomes rank among the smallest of all known eukaryotes. A number of outstanding questions remain concerning the evolution of their large-scale variation in genome architecture, responsible for genome size variation of more than an order of magnitude. This genome report presents a first near-chromosomal assembly of a large-genome microsporidium, Hamiltosporidium tvaerminnensis. Combined Oxford Nanopore, Pacific Biosciences, and Illumina sequencing led to a genome assembly of 17 contigs, 11 of which represent complete chromosomes. Our assembly is 21.64 Mb in length, has an N50 of 1.44 Mb, and consists of 39.56% interspersed repeats. We introduce a novel approach in microsporidia, PacBio Iso-Seq, as part of a larger annotation pipeline for obtaining high-quality annotations of 3,573 protein-coding genes. Based on direct evidence from the full-length Iso-Seq transcripts, we present evidence for alternative polyadenylation and variation in splicing efficiency, which are potential regulation mechanisms for gene expression in microsporidia. The generated high-quality genome assembly is a necessary resource for comparative genomics that will help elucidate the evolution of genome architecture in response to intracellular parasitism.
“…Microsporidia from the genus Encephalitozoon are human-infecting pathogens causing chronic diarrhea, bronchitis, conjunctivitis and/or encephalitis in afflicted patients [9]. With tiny yet complete nuclear genomes totaling less than 3 Mbp, these obligate intracellular pathogens constitute paradigms of genome streamlining in eukaryotes [10]. Their closest known relatives from the genus Ordospora , whose members infect crustaceans but not humans, also arbor diminutive genomes [11].…”
Investigating collinearity between chromosomes is often used in comparative genomics to help identify gene orthologs, pinpoint genes that might have been overlooked as part of annotation processes and/or perform various evolutionary inferences. Collinear segments, also known as syntenic blocks, can be inferred from sequence alignments and/or from the identification of genes arrayed in the same order and relative orientations between investigated genomes. To help perform these analyses and assess their outcomes, we built a simple pipeline called SYNY (for synteny) that implements the two distinct approaches and produces different visualizations. The SYNY pipeline was built with ease of use in mind and runs on modest hardware. The pipeline is written in Perl and Python and is available on GitHub (https://github.com/PombertLab/SYNY) under the permissive MIT license.
Genomics is a powerful toolkit for unravelling how evolutionary processes drive organisms’ small- and large-scale genetic variation. Several outstanding questions remain concerning the evolution of genome size and architecture, especially in intracellular parasites. Microsporidia became a model for this field of study as they exhibit genome size variation of more than an order of magnitude. Here, we discuss evolution in the large-genome microsporidium Hamiltosporidium tvaerminnensis, a parasite of a water flea.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.