Viral metagenomics can be roughly divided into species richness-based studies and species abundance-based analyses. Viromic methods with different principles have been developed, but rational selection of these techniques according to different purposes requires comprehensive understanding of their properties.
Group A rotaviruses of the family Reoviridae is one of the important intestinal pathogens causing diarrhea in piglets and humans. A human-porcine reassortment rotavirus, GDJM1, was identified from outbreak of diarrhea in suckling piglets and it associated with 60.00% (324/540) morbidity and 20.99% (68/324) mortality in Guangdong Province of China in 2022. Thus, to further characterize the evolutionary diversity of GDJM1, all gene segments were analyzed. The genome constellation was G9-P[19]-I5-R1-C1-M1-A8-N1-T1-E1-H1. Nucleotide sequence identity and phylogenetic analyses showed that the VP6, VP7, NSP4 and NSP5 genes of GDJM1 were the most closely related to the respective genes of porcine strains, with the highest homology ranging from 95.65–98.55% identity. The remaining seven genes (VP1-VP4, NSP1-NSP3) were the most closely related to human strains, with the highest homology ranging from 91.83–96.69% similarity. Therefore, it is likely that GDJM1 emerged as the result of genetic reassortment between porcine and human rotaviruses. To our knowledge, this is the first report that a human-porcine reassortment G9P[19] RVA strain has been identified in mainland China, which providing important insights into evolutionary characterization of G9P[19] RVA strain, and reveals that the strain has a potential risk of cross-species transmission.
Porcine parvoviruses (PPVs) are a group of small non-enveloped viruses with seven species (porcine parvovirus 1–7, PPV1-7) have been identified. In this study, a novel porcine parvovirus, provisionally named porcine parvovirus 8 (PPV8), was initially identified via high-throughput sequencing (HTS) in porcine reproductive and respiratory syndrome virus-positive samples collected from swine herds in Guangdong province, 2021. The nearly full-length genome of PPV8 strain GDJM2021 is 4,380 nucleotides in length with two overlapping open ORFs encoding NS1 and VP1 respectively. Sequence analysis indicated that PPV8 shared 16.23–44.18% sequence identity at the genomic levels to PPV1-7 with the relatively highest homology to PPV1. PPV8-GDJM2021 shared 31.86–32.68% aa sequence identity of NS1 protein with those of PPV1 and porcine bufavirus (PBuV), and formed an independent branch neighboring to those formed by members of the genus Protoparvovirus. Of the 211 clinical samples collected from 1990 to 2021, 37 samples (17.5%) distributed over 12 regions in China were positive for PPV8 with time spanning 24 years (1998–2021). To our knowledge, this is the first report on the genomic characterization of the novel PPV8 and its epidemiological situations in China.
Widespread in public databases, the notorious contamination in virus reference databases often leads to confusing even wrong conclusions in applications like viral disease diagnosis and viromic analysis, highlighting the need of a high-quality database. Here, we report the comprehensive scrutiny and the purification of the largest viral sequence collections of GenBank and UniProt by detection and characterization of heterogeneous sequences (HGSs). A total of 766 nucleotide- and 276 amino acid-HGSs were determined with length up to 6,605 bp, which were widely distributed in 39 families, with many involving highly public health-related viruses, such as hepatitis C virus, Crimea-Congo hemorrhagic fever virus and filovirus. Majority of these HGSs are sequences of a wide range of hosts including humans, with the rest resulting from vectors, misclassification and laboratory components. However, these HGSs cannot be simply considered as exotic contaminants, since part of which are resultants of natural occurrence or artificial engineering of the viruses. Nevertheless, they significantly disturb the genomic analysis, and hence were deleted from the database. A further augmentation was implemented with addition of the risk and vaccine sequences, which finally results in a high-quality eukaryotic virus reference database (EVRD). EVRD showed higher accuracy and less time-consuming without coverage compromise by reducing false positives than other integrated databases in viromic analysis. EVRD is freely accessible with favorable application in viral disease diagnosis, taxonomic clustering, viromic analysis and novel virus detection.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.