Alternative splicing and polyadenylation were observed pervasively in eukaryotic messenger RNAs. These alternative isoforms could either be consequences of physiological regulation or stochastic noise of RNA processing. To quantify the extent of stochastic noise in splicing and polyadenylation, we analyzed the alternative usage of splicing and polyadenylation sites in Entamoeba histolytica using RNA-Seq. First, we identified a large number of rarely spliced alternative junctions and then showed that the occurrence of these alternative splicing events is correlated with splicing site sequence, occurrence of constitutive splicing events and messenger RNA abundance. Our results implied the majority of these alternative splicing events are likely to be stochastic error of splicing machineries, and we estimated the corresponding error rates. Second, we observed extensive microheterogeneity of polyadenylation cleavage sites, and the extent of such microheterogeneity is correlated with the occurrence of constitutive cleavage events, suggesting most of such microheterogeneity is likely to be stochastic. Overall, we only observed a small fraction of alternative splicing and polyadenylation isoforms that are unlikely to be solely stochastic, implying the functional relevance of alternative splicing and polyadenylation in E. histolytica is limited. Lastly, we revised the gene models and annotated their 3′UTR in AmoebaDB, providing valuable resources to the community.
Background Mycobacteria occupy various ecological niches and can be isolated from soil, tap water and ground water. Several cause diseases in humans and animals. To get deeper insight into our understanding of mycobacterial evolution focusing on tRNA and non-coding (nc)RNA, we conducted a comparative genome analysis of Mycobacterium mucogenicum ( Mmuc ) and Mycobacterium neoaurum ( Mneo ) clade members. Results Genome sizes for Mmuc- and Mneo -clade members vary between 5.4 and 6.5 Mbps with the complete Mmuc T (type strain) genome encompassing 6.1 Mbp. The number of tRNA genes range between 46 and 79 (including one pseudo tRNA gene) with 39 tRNA genes common among the members of these clades, while additional tRNA genes were probably acquired through horizontal gene transfer. Selected tRNAs and ncRNAs (RNase P RNA, tmRNA, 4.5S RNA, Ms1 RNA and 6C RNA) are expressed, and the levels for several of these are higher in stationary phase compared to exponentially growing cells. The rare tRNA Ile TAT isoacceptor and two for mycobacteria novel ncRNAs: the Lactobacillales -derived GOLLD RNA and a homolog to the antisense Salmonella typhimurium phage Sar RNA, were shown to be present and expressed in certain Mmuc -clade members. Conclusions Phages, IS elements, horizontally transferred tRNA gene clusters, and phage-derived ncRNAs appears to have influenced the evolution of the Mmuc - and Mneo -clades. While the number of predicted coding sequences correlates with genome size, the number of tRNA coding genes does not. The majority of the tRNA genes in mycobacteria are transcribed mainly from single genes and the levels of certain ncRNAs, including RNase P RNA (essential for the processing of tRNAs), are higher at stationary phase compared to exponentially growing cells. We provide supporting evidence that Ms1 RNA represents a mycobacterial 6S RNA variant. The evolutionary routes for the ncRNAs RNase P RNA, tmRNA and Ms1 RNA are different from that of the core genes. Electronic supplementary material The online version of this article (10.1186/s12862-019-1447-7) contains supplementary material, which is available to authorized users.
Mycobacterium marinum is the causative agent for the tuberculosis-like disease mycobacteriosis in fish and skin lesions in humans. Ubiquitous in its geographical distribution, M. marinum is known to occupy diverse fish as hosts. However, information about its genomic diversity is limited. Here, we provide the genome sequences for 15 M. marinum strains isolated from infected humans and fish. Comparative genomic analysis of these and four available genomes of the M. marinum strains M, E11, MB2 and Europe reveal high genomic diversity among the strains, leading to the conclusion that M. marinum should be divided into two different clusters, the “M”- and the “Aronson”-type. We suggest that these two clusters should be considered to represent two M. marinum subspecies. Our data also show that the M. marinum pan-genome for both groups is open and expanding and we provide data showing high number of mutational hotspots in M. marinum relative to other mycobacteria such as Mycobacterium tuberculosis. This high genomic diversity might be related to the ability of M. marinum to occupy different ecological niches.
We have used RNASeq and qRT-PCR to study mRNA levels for all σ-factors in different Mycobacterium marinum strains under various growth and stress conditions. We also studied their levels in M. marinum from infected fish and mosquito larvae. The annotated σ-factors were expressed and transcripts varied in relation to growth and stress conditions. Some were highly abundant such as sigA, sigB, sigC, sigD, sigE and sigH while others were not. The σ-factor mRNA profiles were similar after heat stress, during infection of fish and mosquito larvae. The similarity also applies to some of the known heat shock genes such as the α-crystallin gene. Therefore, it seems probable that the physiological state of M. marinum is similar when exposed to these different conditions. Moreover, the mosquito larvae data suggest that this is the state that the fish encounter when infected, at least with respect to σ-factor mRNA levels. Comparative genomic analysis of σ-factor gene localizations in three M. marinum strains and Mycobacterium tuberculosis H37Rv revealed chromosomal rearrangements that changed the localization of especially sigA, sigB, sigD, sigE, sigF and sigJ after the divergence of these two species. This may explain the variation in species-specific expression upon exposure to different growth conditions.
We provide the genome sequences of the type strains of the polychlorophenol-degrading Mycobacterium chlorophenolicum (DSM43826), the degrader of chlorinated aliphatics Mycobacterium chubuense (DSM44219) and Mycobacterium obuense (DSM44075) that has been tested for use in cancer immunotherapy. The genome sizes of M. chlorophenolicum, M. chubuense, and M. obuense are 6.93, 5.95, and 5.58 Mb with GC-contents of 68.4%, 69.2%, and 67.9%, respectively. Comparative genomic analysis revealed that 3,254 genes are common and we predicted approximately 250 genes acquired through horizontal gene transfer from different sources including proteobacteria. The data also showed that the biodegrading Mycobacterium spp. NBB4, also referred to as M. chubuense NBB4, is distantly related to the M. chubuense type strain and should be considered as a separate species, we suggest it to be named Mycobacterium ethylenense NBB4. Among different categories we identified genes with potential roles in: biodegradation of aromatic compounds and copper homeostasis. These are the first nonpathogenic Mycobacterium spp. found harboring genes involved in copper homeostasis. These findings would therefore provide insight into the role of this group of Mycobacterium spp. in bioremediation as well as the evolution of copper homeostasis within the Mycobacterium genus.
Heart failure affects 2–3% of adult Western population. Prevalence of heart failure with preserved left ventricular (LV) ejection fraction (HFpEF) increases. Studies suggest HFpEF patients to have altered myocardial structure and functional changes such as incomplete relaxation and increased cardiac stiffness. We hypothesised that patients undergoing elective coronary bypass surgery (CABG) with HFpEF characteristics would show distinctive gene expression compared to patients with normal LV physiology. Myocardial biopsies for mRNA expression analysis were obtained from sixteen patients with LV ejection fraction ≥45%. Five out of 16 patients (31%) had echocardiographic characteristics and increased NTproBNP levels indicative of HFpEF and this group was used as HFpEF proxy, while 11 patients had Normal LV physiology. Utilising principal component analysis, the gene expression data clustered into two groups, corresponding to HFpEF proxy and Normal physiology, and 743 differentially expressed genes were identified. The associated top biological functions were cardiac muscle contraction, oxidative phosphorylation, cellular remodelling and matrix organisation. Our results also indicate that upstream regulatory events, including inhibition of transcription factors STAT4 , SRF and TP53 , and activation of transcription repressors HEY2 and KDM5A , could provide explanatory mechanisms to observed gene expression differences and ultimately cardiac dysfunction in the HFpEF proxy group.
Members of the Mycobacterium chelonae-abscessus complex (MCAC) are close to the mycobacterial ancestor and includes both human, animal and fish pathogens. We present the genomes of 14 members of this complex: the complete genomes of Mycobacterium salmoniphilum and Mycobacterium chelonae type strains, seven M. salmoniphilum isolates, and five M. salmoniphilum-like strains including strains isolated during an outbreak in an animal facility at Uppsala University. Average nucleotide identity (ANI) analysis and core gene phylogeny revealed that the M. salmoniphilum-like strains are variants of the human pathogen Mycobacterium franklinii and phylogenetically close to Mycobacterium abscessus. Our data further suggested that M. salmoniphilum separates into three branches named group I, II and III with the M. salmoniphilum type strain belonging to group II. Among predicted virulence factors, the presence of phospholipase C (plcC), which is a major virulence factor that makes M. abscessus highly cytotoxic to mouse macrophages, and that M. franklinii originally was isolated from infected humans make it plausible that the outbreak in the animal facility was caused by a M. salmoniphilum-like strain. Interestingly, M. salmoniphilum-like was isolated from tap water suggesting that it can be present in the environment. Moreover, we predicted the presence of mutational hotspots in the M. salmoniphilum isolates and 26% of these hotspots overlap with genes categorized as having roles in virulence, disease and defense. We also provide data about key genes involved in transcription and translation such as sigma factor, ribosomal protein and tRNA genes.
The organization of genomic sequences is dynamic and undergoes change during the process of evolution. Many of the variations arise spontaneously and the observed genomic changes can either be distributed uniformly throughout the genome or be preferentially localized to some regions (hot spots) compared to others. Conversely cold spots may tend to accumulate very few variations or none at all. In order to identify such regions statistically, we have developed a method based on Shewhart Control Chart. The method was used for identification of hot and cold spots of single-nucleotide variations (SNVs) in Mycobacterium tuberculosis genomes. The predictions have been validated by sequencing some of these regions derived from clinical isolates. This method can be used for analysis of other genome sequences particularly infectious microbes.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.