UNIQmin, an alignment-free tool to study viral sequence diversity across taxonomic lineages: a case study of monkeypox virus

Li, Chong; Khan, Mohammad Asif

doi:10.1101/2022.08.09.503271

Cited by 1 publication

(1 citation statement)

References 43 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…The utility of UNIQmin was demonstrated for the species Dengue virus , genus Flavivirus , family Flaviviridae , and the superkingdom Viruses (all datasets before the COVID-19 pandemic) 1 . Herein, we applied UNIQmin to protein sequence data of SARS-CoV-2 and its higher ranks of taxonomic lineages, namely species (with and without the SARS-CoV-2 sub-species), genus and family ( Figure 1 ) to evaluate the effective viral sequence diversity at each rank 2 .…”

Section: Introductionmentioning

confidence: 99%

Negligible peptidome diversity of SARS-CoV-2 and its higher taxonomic ranks

Khan

2022

Preprint

Self Cite

View full text Add to dashboard Cite

The unprecedented increase in SARS-CoV-2 sequence data limits the application of alignment-dependent approaches to study viral diversity. Herein, we applied our recently published UNIQmin, an alignment-free tool to study the protein sequence diversity of SARS-CoV-2 (sub-species) and its higher taxonomic lineage ranks (species, genus, and family). Only less than 0.5% of the reported SARS-CoV-2 protein sequences are required to represent the inherent viral peptidome diversity, which only increases to a mere ~2% at the family rank. This is expected to remain relatively the same even with further increases in the sequence data. The findings have important implications in the design of vaccines, drugs, and diagnostics, whereby the number of sequences required for consideration of such studies is drastically reduced, short-circuiting the discovery process, while still providing for a systematic evaluation and coverage of the pathogen diversity.

show abstract

Section: Introductionmentioning

confidence: 99%