Alternative splicing and polyadenylation were observed pervasively in eukaryotic messenger RNAs. These alternative isoforms could either be consequences of physiological regulation or stochastic noise of RNA processing. To quantify the extent of stochastic noise in splicing and polyadenylation, we analyzed the alternative usage of splicing and polyadenylation sites in Entamoeba histolytica using RNA-Seq. First, we identified a large number of rarely spliced alternative junctions and then showed that the occurrence of these alternative splicing events is correlated with splicing site sequence, occurrence of constitutive splicing events and messenger RNA abundance. Our results implied the majority of these alternative splicing events are likely to be stochastic error of splicing machineries, and we estimated the corresponding error rates. Second, we observed extensive microheterogeneity of polyadenylation cleavage sites, and the extent of such microheterogeneity is correlated with the occurrence of constitutive cleavage events, suggesting most of such microheterogeneity is likely to be stochastic. Overall, we only observed a small fraction of alternative splicing and polyadenylation isoforms that are unlikely to be solely stochastic, implying the functional relevance of alternative splicing and polyadenylation in E. histolytica is limited. Lastly, we revised the gene models and annotated their 3′UTR in AmoebaDB, providing valuable resources to the community.
Background Mycobacteria occupy various ecological niches and can be isolated from soil, tap water and ground water. Several cause diseases in humans and animals. To get deeper insight into our understanding of mycobacterial evolution focusing on tRNA and non-coding (nc)RNA, we conducted a comparative genome analysis of Mycobacterium mucogenicum ( Mmuc ) and Mycobacterium neoaurum ( Mneo ) clade members. Results Genome sizes for Mmuc- and Mneo -clade members vary between 5.4 and 6.5 Mbps with the complete Mmuc T (type strain) genome encompassing 6.1 Mbp. The number of tRNA genes range between 46 and 79 (including one pseudo tRNA gene) with 39 tRNA genes common among the members of these clades, while additional tRNA genes were probably acquired through horizontal gene transfer. Selected tRNAs and ncRNAs (RNase P RNA, tmRNA, 4.5S RNA, Ms1 RNA and 6C RNA) are expressed, and the levels for several of these are higher in stationary phase compared to exponentially growing cells. The rare tRNA Ile TAT isoacceptor and two for mycobacteria novel ncRNAs: the Lactobacillales -derived GOLLD RNA and a homolog to the antisense Salmonella typhimurium phage Sar RNA, were shown to be present and expressed in certain Mmuc -clade members. Conclusions Phages, IS elements, horizontally transferred tRNA gene clusters, and phage-derived ncRNAs appears to have influenced the evolution of the Mmuc - and Mneo -clades. While the number of predicted coding sequences correlates with genome size, the number of tRNA coding genes does not. The majority of the tRNA genes in mycobacteria are transcribed mainly from single genes and the levels of certain ncRNAs, including RNase P RNA (essential for the processing of tRNAs), are higher at stationary phase compared to exponentially growing cells. We provide supporting evidence that Ms1 RNA represents a mycobacterial 6S RNA variant. The evolutionary routes for the ncRNAs RNase P RNA, tmRNA and Ms1 RNA are different from that of the core genes. Electronic supplementary material The online version of this article (10.1186/s12862-019-1447-7) contains supplementary material, which is available to authorized users.
Mycobacterium marinum is the causative agent for the tuberculosis-like disease mycobacteriosis in fish and skin lesions in humans. Ubiquitous in its geographical distribution, M. marinum is known to occupy diverse fish as hosts. However, information about its genomic diversity is limited. Here, we provide the genome sequences for 15 M. marinum strains isolated from infected humans and fish. Comparative genomic analysis of these and four available genomes of the M. marinum strains M, E11, MB2 and Europe reveal high genomic diversity among the strains, leading to the conclusion that M. marinum should be divided into two different clusters, the “M”- and the “Aronson”-type. We suggest that these two clusters should be considered to represent two M. marinum subspecies. Our data also show that the M. marinum pan-genome for both groups is open and expanding and we provide data showing high number of mutational hotspots in M. marinum relative to other mycobacteria such as Mycobacterium tuberculosis. This high genomic diversity might be related to the ability of M. marinum to occupy different ecological niches.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.