Abstract:A key step for broad viral detection using high-throughput sequencing (HTS) is optimizing the sample preparation strategy for extracting viral-specific nucleic acids since viral genomes are diverse: They can be single-stranded or double-stranded RNA or DNA, and can vary from a few thousand bases to over millions of bases, which might introduce biases during nucleic acid extraction. In addition, viral particles can be enveloped or non-enveloped with variable resistance to pre-treatment, which may influence thei… Show more
“…High-throughput sequencing (HTS) is a non-specific technique with the potential to detect both known and unknown adventitious agents including viruses 7,[14][15][16][17] . High-throughput molecular biology methods (HTS combined with a pan-viral microarray) had succeeded in detecting the contamination of Rotarix vaccine by a porcine circovirus 18 .…”
High-throughput sequencing (HTS) is capable of broad virus detection encompassing both known and unknown adventitious viruses in a variety of sample matrices. We describe the development of a general-purpose HTS-based method for the detection of adventitious viruses. Performance was evaluated using 16 viruses equivalent to well-characterized National Institutes of Health (NIH) virus stocks and another six viruses of interest. A viral vaccine crude harvest and a cell substrate matrix were spiked with 22 viruses. Specificity was demonstrated for all 22 viruses at the species level. Our method was capable of detecting and identifying adventitious viruses spiked at 10 4 genome copies per milliliter in a viral vaccine crude harvest and 0.01 viral genome copies spiked per cell in a cell substrate matrix. Moreover, 9 of the 11 NIH model viruses with published in vivo data were detected by HTS with an equivalent or better sensitivity (in a viral vaccine crude harvest). Our general-purpose HTS method is unbiased and highly sensitive for the detection of adventitious viruses, and has a large breadth of detection, which may obviate the need to perform in vivo testing.
“…High-throughput sequencing (HTS) is a non-specific technique with the potential to detect both known and unknown adventitious agents including viruses 7,[14][15][16][17] . High-throughput molecular biology methods (HTS combined with a pan-viral microarray) had succeeded in detecting the contamination of Rotarix vaccine by a porcine circovirus 18 .…”
High-throughput sequencing (HTS) is capable of broad virus detection encompassing both known and unknown adventitious viruses in a variety of sample matrices. We describe the development of a general-purpose HTS-based method for the detection of adventitious viruses. Performance was evaluated using 16 viruses equivalent to well-characterized National Institutes of Health (NIH) virus stocks and another six viruses of interest. A viral vaccine crude harvest and a cell substrate matrix were spiked with 22 viruses. Specificity was demonstrated for all 22 viruses at the species level. Our method was capable of detecting and identifying adventitious viruses spiked at 10 4 genome copies per milliliter in a viral vaccine crude harvest and 0.01 viral genome copies spiked per cell in a cell substrate matrix. Moreover, 9 of the 11 NIH model viruses with published in vivo data were detected by HTS with an equivalent or better sensitivity (in a viral vaccine crude harvest). Our general-purpose HTS method is unbiased and highly sensitive for the detection of adventitious viruses, and has a large breadth of detection, which may obviate the need to perform in vivo testing.
“…Following RNA extraction and viral RNA enrichment from the host RNA (Forth and Hoper, 2019;Houldcroft et al, 2017;Sathiamoorthy et al, 2018;Singanallur et al, 2019), the first critical quality control (QC) step (Fig. 2) is to test both quantity and integrity of the starting viral RNA (Hauck et al, 2018;Ng et al, 2018;Yang et al, 2016). The necessity to control virus tire or genome copy numbers in comparative studies has been demonstrated in several studies as false positive variant calls become more evident with lower material inputs (Gallet et al, 2017;Illingworth et al, 2017;McCrone and Lauring, 2016).…”
A B S T R A C TNext-generation sequencing (NGS) has revolutionized the scale and depth of biomedical sciences. Because of its unique ability for the detection of sub-clonal variants within genetically diverse populations, NGS has been successfully applied to analyze and quantify the exceptionally-high diversity within viral quasispecies, and many low-frequency drug-or vaccine-resistant mutations of therapeutic importance have been discovered. Although many works have intensively discussed the latest NGS approaches and applications in general, none of them has focused on applying NGS in viral quasispecies studies, mostly due to the limited ability of current NGS technologies to accurately detect and quantify rare viral variants. Here, we summarize several error-correction strategies that have been developed to enhance the detection accuracy of minority variants. We also discuss critical considerations for preparing a sequencing library from viral RNAs and for analyzing NGS data to unravel the mutational landscape.
“…High background from cellular nucleic acids will reduce the sensitivity of HTS virus detection [ 13 , 14 ]. In most cases, the cellular nucleic acids are the most crucial factor for influencing the sensitivity of virus detection, since they typically dominate the viral read number [ 11 , 15 , 16 ]. Reducing the cellular nucleic acid, such as by nuclease treatment prior to sequencing, may enhance the sensitivity of virus detection in downstream bioinformatics analysis.…”
Section: Factors Influencing Sensitivity Of Virus Detectionmentioning
confidence: 99%
“…As shown in Figure 1 (closed circles), without incorporation of steps to reduce the background, even an entire Illumina HiSeq flow cell run, generating 2.5 × 10 9 reads in a paired-end mode (for example 2 × 100 bp) would not likely produce a single PCV read below a sensitivity of 1 × 10 4 genomes/mL. While mechanisms of target enrichment and background removal are beyond the scope of this paper [ 11 ], when we explore the effect of improving this signal-to-noise ratio (S/N) by 1000× ( Figure 1 , closed squares), similar levels of sensitivity could then be obtained using a more modest platform (MiSeq, ~1.2 × 10 7 paired-end reads).…”
Section: Factors Influencing Sensitivity Of Virus Detectionmentioning
confidence: 99%
“…This paper presents our current thinking on bioinformatics pipelines with focus on the choices made in the analysis pipeline—including, but not limited to, sequences, aligners, databases, assembly, and how unmapped reads are managed. Discussions on sample selection and preparation, as well as virus spiking to determine sensitivity, are presented in the context of their influence on the pipeline analysis and details are discussed in another paper from AVDTIG members [ 11 ].…”
High-throughput sequencing (HTS) has demonstrated capabilities for broad virus detection based upon discovery of known and novel viruses in a variety of samples, including clinical, environmental, and biological. An important goal for HTS applications in biologics is to establish parameter settings that can afford adequate sensitivity at an acceptable computational cost (computation time, computer memory, storage, expense or/and efficiency), at critical steps in the bioinformatics pipeline, including initial data quality assessment, trimming/cleaning, and assembly (to reduce data volume and increase likelihood of appropriate sequence identification). Additionally, the quality and reliability of the results depend on the availability of a complete and curated viral database for obtaining accurate results; selection of sequence alignment programs and their configuration, that retains specificity for broad virus detection with reduced false-positive signals; removal of host sequences without loss of endogenous viral sequences of interest; and use of a meaningful reporting format, which can retain critical information of the analysis for presentation of readily interpretable data and actionable results. Furthermore, after alignment, both automated and manual evaluation may be needed to verify the results and help assign a potential risk level to residual, unmapped reads. We hope that the collective considerations discussed in this paper aid toward optimization of data analysis pipelines for virus detection by HTS.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.