Motivation In silico identification of linear B-cell epitopes represents an important step in the development of diagnostic tests and vaccine candidates, by providing potential high-probability targets for experimental investigation. Current predictive tools were developed under a generalist approach, training models with heterogeneous data sets to develop predictors that can be deployed for a wide variety of pathogens. However, continuous advances in processing power and the increasing amount of epitope data for a broad range of pathogens indicate that training organism or taxon-specific models may become a feasible alternative, with unexplored potential gains in predictive performance. Results This paper shows how organism-specific training of epitope prediction models can yield substantial performance gains across several quality metrics when compared to models trained with heterogeneous and hybrid data, and with a variety of widely-used predictors from the literature. These results suggest a promising alternative for the development of custom-tailored predictive models with high predictive power, which can be easily implemented and deployed for the investigation of specific pathogens. Availability The data underlying this article, as well as the full reproducibility scripts, are available at https://github.com/fcampelo/OrgSpec-paper. The R package that implements the organism-specific pipeline functions is available at https://github.com/fcampelo/epitopes. Supplementary information Supplementary materials are available at Bioinformatics online.
Trypanosoma cruzi, a zoonotic kinetoplastid protozoan parasite, is the causative agent of American trypanosomiasis (Chagas disease). Having a very plastic, repetitive and complex genome, the parasite displays a highly diverse repertoire of surface molecules, with pivotal roles in cell invasion, immune evasion and pathogenesis. Before 2016, the complexity of the genomic regions containing these genes impaired the assembly of a genome at chromosomal level, making it impossible to study the structure and function of the several thousand repetitive genes encoding the surface molecules of the parasite. We here describe the genome assembly of the Sylvio X10/1 genome sequence, which since 2016 has been used as a reference genome sequence for T. cruzi clade I (TcI), produced using high coverage PacBio single-molecule sequencing. It was used to analyze deep Illumina sequence data from 34 T. cruzi TcI isolates and clones from different geographic locations, sample sources and clinical outcomes. Resolution of the surface molecule gene distribution showed the unusual duality in the organization of the parasite genome, a synteny of the core genomic region with related protozoa flanked by unique and highly plastic multigene family clusters encoding surface antigens. The presence of abundant interspersed retrotransposons in these multigene family clusters suggests that these elements are involved in a recombination mechanism for the generation of antigenic variation and evasion of the host immune response on these TcI strains. The comparative genomic analysis of the cohort of TcI strains revealed multiple cases of such recombination events involving surface molecule genes and has provided new insights into T. cruzi population structure.
Trypanosoma cruzi is the causative agent of Chagas disease, a neglected tropical disease that affects approximately 6 to 8 million people and for which there is no effective treatment or vaccine. The parasite expresses a family of surface proteins, named trans-sialidases, responsible for transferring sialic acid from host glycoconjugates to parasite mucins.
DNA topoisomerases are enzymes that modulate DNA topology. Among them, topoisomerase 3α is engaged in genomic maintenance acting in DNA replication termination, sister chromatid separation, and dissolution of recombination intermediates. To evaluate the role of this enzyme in Trypanosoma cruzi, the etiologic agent of Chagas disease, a topoisomerase 3α knockout parasite (TcTopo3α KO) was generated, and the parasite growth, as well as its response to several DNA damage agents, were evaluated. There was no growth alteration caused by the TcTopo3α knockout in epimastigote forms, but a higher dormancy rate was observed. TcTopo3α KO trypomastigote forms displayed reduced invasion rates in LLC-MK2 cells when compared with the wild-type lineage. Amastigote proliferation was also compromised in the TcTopo3α KO, and a higher number of dormant cells was observed. Additionally, TcTopo3α KO epimastigotes were not able to recover cell growth after gamma radiation exposure, suggesting the involvement of topoisomerase 3α in homologous recombination. These parasites were also sensitive to drugs that generate replication stress, such as cisplatin (Cis), hydroxyurea (HU), and methyl methanesulfonate (MMS). In response to HU and Cis treatments, TcTopo3α KO parasites showed a slower cell growth and was not able to efficiently repair the DNA damage induced by these genotoxic agents. The cell growth phenotype observed after MMS treatment was similar to that observed after gamma radiation, although there were fewer dormant cells after MMS exposure. TcTopo3α KO parasites showed a population with sub-G1 DNA content and strong γH2A signal 48 h after MMS treatment. So, it is possible that DNA-damaged cell proliferation due to the absence of TcTopo3α leads to cell death. Whole genome sequencing of MMS-treated parasites showed a significant reduction in the content of the multigene families DFG-1 and RHS, and also a possible erosion of the sub-telomeric region from chromosome 22, relative to non-treated knockout parasites. Southern blot experiments suggest telomere shortening, which could indicate genomic instability in TcTopo3α KO cells owing to MMS treatment. Thus, topoisomerase 3α is important for homologous recombination repair and replication stress in T. cruzi, even though all the pathways in which this enzyme participates during the replication stress response remains elusive.
Cruzipains are the main papain-like cysteine proteases of Trypanosoma cruzi, the protozoan parasite that causes Chagas disease. Encoded by a multigenic family, previous studies have estimated the presence of dozens of copies spread over multiple chromosomes in different parasite strains. Here, we describe the complete gene repertoire of cruzipain in three parasite strains, their genomic organization, and expression pattern throughout the parasite life cycle. Furthermore, we have analyzed primary sequence variations among distinct family members as well as structural differences between the main groups of cruzipains. Based on phylogenetic inferences and residue positions crucial for enzyme function and specificity, we propose the classification of cruzipains into two families (I and II), whose genes are distributed in two or three separate clusters in the parasite genome, according with the strain. Family I comprises nearly identical copies to the previously characterized cruzipain 1/cruzain, whereas Family II encompasses three structurally distinct sub-types, named cruzipain 2, cruzipain 3, and cruzipain 4. RNA-seq data derived from the CL Brener strain indicates that Family I genes are mainly expressed by epimastigotes, whereas trypomastigotes mainly express Family II genes. Significant differences in the active sites among the enzyme sub-types were also identified, which may play a role in their substrate selectivity and impact their inhibition by small molecules.
Leishmania braziliensis is the main causative agent of Tegumentary Leishmaniasis in the Americas. However, difficulties related to genome manipulation, experimental infection, and parasite growth have so far limited studies with this species. CRISPR-Cas9-based technology has made genome editing more accessible, and here we have successfully employed the LeishGEdit approach to attenuate L. braziliensis. We generated a transgenic cell line expressing Cas9 and T7 RNA polymerase, which was employed for the targeted deletion of centrin, a calcium-binding cytoskeletal protein involved in the centrosome duplication in eukaryotes. Centrin-deficient Leishmania exhibit growth arrest at the amastigote stage. Whole-genome sequencing of centrin-deficient L. braziliensis (LbCen−/−) did not indicate the presence of off-target mutations. In vitro, the growth rates of LbCen−/− and wild-type promastigotes were similar, but axenic and intracellular LbCen−/− amastigotes showed a multinucleated phenotype with impaired survival following macrophage infection. Upon inoculation into BALB/c mice, LbCen−/− were detected at an early time point but failed to induce lesion formation, contrary to control animals, infected with wild-type L. braziliensis. A significantly lower parasite burden was also observed in mice inoculated with LbCen−/−, differently from control mice. Given that centrin-deficient Leishmania sp. have become candidates for vaccine development, we propose that LbCen−/− can be further explored for the purposes of immunoprophylaxis against American Tegumentary Leishmaniasis.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.