Rfam is a database of RNA families where each of the 3444 families is represented by a multiple sequence alignment of known RNA sequences and a covariance model that can be used to search for additional members of the family. Recent developments have involved expert collaborations to improve the quality and coverage of Rfam data, focusing on microRNAs, viral and bacterial RNAs. We have completed the first phase of synchronising microRNA families in Rfam and miRBase, creating 356 new Rfam families and updating 40. We established a procedure for comprehensive annotation of viral RNA families starting with Flavivirus and Coronaviridae RNAs. We have also increased the coverage of bacterial and metagenome-based RNA families from the ZWD database. These developments have enabled a significant growth of the database, with the addition of 759 new families in Rfam 14. To facilitate further community contribution to Rfam, expert users are now able to build and submit new families using the newly developed Rfam Cloud family curation system. New Rfam website features include a new sequence similarity search powered by RNAcentral, as well as search and visualisation of families with pseudoknots. Rfam is freely available at https://rfam.org.
Sequence analyses of RNA virus genomes remain challenging owing to the exceptional genetic plasticity of these viruses. Because of high mutation and recombination rates, genome replication by viral RNA-dependent RNA polymerases leads to populations of closely related viruses, so-called "quasispecies." Standard (short-read) sequencing technologies are ill-suited to reconstruct large numbers of full-length haplotypes of (1) RNA virus genomes and (2) subgenome-length (sg) RNAs composed of noncontiguous genome regions. Here, we used a full-length, direct RNA sequencing (DRS) approach based on nanopores to characterize viral RNAs produced in cells infected with a human coronavirus. By using DRS, we were able to map the longest (∼26-kb) contiguous read to the viral reference genome. By combining Illumina and Oxford Nanopore sequencing, we reconstructed a highly accurate consensus sequence of the human coronavirus (HCoV)-229E genome (27.3 kb). Furthermore, by using long reads that did not require an assembly step, we were able to identify, in infected cells, diverse and novel HCoV-229E sg RNAs that remain to be characterized. Also, the DRS approach, which circumvents reverse transcription and amplification of RNA, allowed us to detect methylation sites in viral RNAs. Our work paves the way for haplotype-based analyses of viral quasispecies by showing the feasibility of intra-sample haplotype separation. Even though several technical challenges remain to be addressed to exploit the potential of the nanopore technology fully, our work illustrates that DRS may significantly advance genomic studies of complex virus populations, including predictions on long-range interactions in individual full-length viral RNA haplotypes.
SARS-CoV-2 (severe acute respiratory syndrome coronavirus 2) is a novel virus of the family Coronaviridae. The virus causes the infectious disease COVID-19. The biology of coronaviruses has been studied for many years. However, bioinformatics tools designed explicitly for SARS-CoV-2 have only recently been developed as a rapid reaction to the need for fast detection, understanding and treatment of COVID-19. To control the ongoing COVID-19 pandemic, it is of utmost importance to get insight into the evolution and pathogenesis of the virus. In this review, we cover bioinformatics workflows and tools for the routine detection of SARS-CoV-2 infection, the reliable analysis of sequencing data, the tracking of the COVID-19 pandemic and evaluation of containment measures, the study of coronavirus evolution, the discovery of potential drug targets and development of therapeutic strategies. For each tool, we briefly describe its use case and how it advances research specifically for SARS-CoV-2. All tools are free to use and available online, either through web applications or public code repositories. Contact: evbc@unj-jena.de
Structure predictions suggest a partial conservation of RNA structure elements in coronavirus terminal genome regions. Here, we determined the structures of stem-loops (SL) 1 and 2 of two alphacoronaviruses, human coronavirus (HCoV) 229E and NL63, by RNA structure probing and studied the functional relevance of these putative cis-acting elements. HCoV-229E SL1 and SL2 mutants generated by reverse genetics were used to study the effects on viral replication of single-nucleotide substitutions predicted to destabilize the SL1 and SL2 structures. The data provide conclusive evidence for the critical role of SL1 and SL2 in HCoV-229E replication and, in some cases, revealed parallels with previously characterized betacoronavirus SL1 and SL2 elements. Also, we were able to rescue viable HCoV-229E mutants carrying replacements of SL2 with equivalent betacoronavirus structural elements. The data obtained in this study reveal a remarkable degree of structural and functional conservation of 5'-terminal RNA structural elements across coronavirus genus boundaries.
Zika virus (ZIKV) is an arthropod-borne emerging pathogen causing febrile illness. ZIKV is associated Guillain-Barré syndrome and other neurological complications. Infection during pregnancy is associated with pregnancy complications and developmental and neurological abnormalities collectively defined as congenital Zika syndrome. There is still no vaccine or specific treatment for ZIKV infection. To identify host factors that can rescue cells from ZIKV infection, we used a genomescale CRISPR activation screen. Our highly ranking hits included a short list of interferon-stimulated genes (ISGs) previously reported to have antiviral activity. Validation of the screen results highlighted interferon lambda 2 (IFN-2) and interferon alpha-inducible protein 6 (IFI6) as genes providing high levels of protection from ZIKV. Activation of these genes had an effect on an early stage in viral infection. In addition, infected cells expressing single guide RNAs (sgRNAs) for both of these genes displayed lower levels of cell death than did the controls. Furthermore, the identified genes were significantly induced in ZIKV-infected placenta explants. Thus, these results highlight a set of ISGs directly relevant for rescuing cells from ZIKV infection or its associated cell death and substantiate CRISPR activation screens as a tool to identify host factors impeding pathogen infection. IMPORTANCE Zika virus (ZIKV) is an emerging vector-borne pathogen causing a febrile disease. ZIKV infection might also trigger Guillain-Barré syndrome, neuropathy, and myelitis. Vertical transmission of ZIKV can cause fetus demise, stillbirth, or severe congenital abnormalities and neurological complications. There is no vaccine or specific antiviral treatment against ZIKV. We used a genome-wide CRISPR activation screen, where genes are activated from their native promoters to identify host cell factors that protect cells from ZIKV infection or associated cell death. The results provide a better understanding of key host factors that protect cells from ZIKV infection and might assist in identifying novel antiviral targets.
Sequence analyses of RNA virus genomes remain challenging due to the exceptional genetic plasticity of these viruses. Because of high mutation and recombination rates, genome replication by viral RNA-dependent RNA polymerases leads to populations of closely related viruses, so-called 'quasispecies'. Standard (short-read) sequencing technologies are ill-suited to reconstruct large numbers of full-length haplotypes of (i) RNA virus genomes and (ii) subgenome-length (sg) RNAs comprised of noncontiguous genome regions. Here, we used a full-length, direct RNA sequencing (DRS) approach based on nanopores to characterize viral RNAs produced in cells infected with a human coronavirus. Using DRS, we were able to map the longest (∼26 kb) contiguous read to the viral reference genome. By combining Illumina and nanopore sequencing, we reconstructed a highly accurate consensus sequence of the human coronavirus (HCoV) 229E genome (27.3 kb). Furthermore, using long reads that did not require an assembly step, we were able to identify, in infected cells, diverse and novel HCoV-229E sg RNAs that remain to be characterized. Also, the DRS approach, which circumvents reverse transcription and amplification of RNA, allowed us to detect methylation sites in viral RNAs. Our work paves the way for haplotype-based analyses of viral quasispecies by demonstrating the feasibility of intra-sample haplotype separation. Even though several technical challenges remain to be addressed to exploit the potential of the nanopore technology fully, our work illustrates that direct RNA sequencing may significantly advance genomic studies of complex virus populations, including predictions on long-range interactions in individual full-length viral RNA haplotypes.
To identify genome-based features characteristic of the avian and human pathogen Chlamydia(C.) psittaci and related chlamydiae, we analyzed whole-genome sequences of 33 strains belonging to 12 species. Using a novel genome analysis tool termed Roary ILP Bacterial Annotation Pipeline (RIBAP), this panel of strains was shown to share a large core genome comprising 784 genes and representing approximately 80% of individual genomes. Analyzing the most variable genomic sites, we identified a set of features of C. psittaci that in its entirety is characteristic of this species: (i) a relatively short plasticity zone of less than 30,000 nt without a tryptophan operon (also in C. abortus, C. avium, C. gallinacea, C. pneumoniae), (ii) a characteristic set of of Inc proteins comprising IncA, B, C, V, X, Y (with homologs in C. abortus, C. caviae and C. felis as closest relatives), (iii) a 502-aa SinC protein, the largest among Chlamydia spp., and (iv) an elevated number of Pmp proteins of subtype G (14 in C. psittaci, 14 in Cand. C. ibidis). In combination with future functional studies, the common and distinctive criteria revealed in this study provide important clues for understanding the complexity of host-specific behavior of individual Chlamydia spp.
Thogotoviruses are tick-borne arboviruses that comprise a unique genus within the Orthomyxoviridae family. Infections with thogotoviruses primarily cause disease in livestock with occasional reports of human infections suggesting a zoonotic potential. In the past, multiple genetically distinct thogotoviruses were isolated mostly from collected ticks. However, many aspects regarding their phylogenetic relationships, morphological characteristics and virulence in mammals remain unclear. For the present comparative study, we used a collection of ten different thogotovirus isolates from different geographic areas. Next generation sequencing and subsequent phylogenetic analyses revealed a distinct separation of these viruses into two major clades – the Thogoto-like and Dhori-like viruses. Electron microscopy demonstrated a heterogeneous morphology with spherical and filamentous particles being present in virus preparations. To study their pathogenicity, we analyzed the viruses in a small animal model system. In intraperitoneally infected C57BL/6 mice, all isolates showed a tropism for liver, lung and spleen. Importantly, we did not observe horizontal transmission to uninfected, highly susceptible contact mice. The isolates enormously differed in their capacity to induce disease, ranging from subclinical to fatal outcomes. In vivo multi-step passaging experiments of two low-pathogenic isolates showed no increased virulence and sequence analyses of the passaged viruses indicated a high stability of the viral genomes after ten mouse passages. In summary, our analysis demonstrates the broad genetic and phenotypic variability within the thogotovirus genus. Moreover, thogotoviruses are well adapted to mammals but their horizontal transmission seems to depend on ticks as their vectors. Importance Since their discovery over sixty years ago, fifteen genetically distinct members of the thogotovirus genus have been isolated. These arboviruses belong to the Orthomyxovirus family and share many features with influenza viruses. However, numerous of these isolates have not been characterized in depth. In the present study, we comparatively analyzed a collection of ten different thogotovirus isolates to answer basic questions about their phylogenetic relationships, morphology and pathogenicity in mice. Our results highlight shared and unique characteristics of this diverse genus. Taken together, these observations provide a framework for the phylogenic classification and phenotypic characterization of newly identified thogotovirus isolates that could potentially cause severe human infections as exemplified by the recently reported, fatal Bourbon virus cases in the United States.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.