2018
DOI: 10.1101/402370
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

PathoLive – Real-time pathogen identification from metagenomic Illumina datasets

Abstract: Over the past years, NGS has been applied in time critical applications such as pathogen diagnostics with promising results. Yet, long turnaround times have to be accepted to generate sufficient data, as the analysis can only be performed sequentially after the sequencing has finished. Additionally, the interpretation of results can be further complicated by various types of contaminations, clinically irrelevant sequences, and the sheer amount and complexity of the data. We designed and implemented PathoLive, … Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2019
2019
2021
2021

Publication Types

Select...
4
1

Relationship

4
1

Authors

Journals

citations
Cited by 5 publications
(5 citation statements)
references
References 85 publications
0
5
0
Order By: Relevance
“…In our experience, smaller databases that include only subsets of eukaryotic genomes increase the false-positive classification rate immensely. The resulting species list is filtered for human, animal or plant pathogens as well as for select CDC or USDA agents using the risk group database from the American Biological Safety Association (ABSA) as it was shown before for viruses in clinical samples (Tausch et al, 2018). For selected pathogens of interest, classified reads are extracted from the metagenomic dataset in order to (i) verify the classification with BLASTn and the nucleotide database from NCBI, (ii) estimate the closest distance to a published reference genome with Mash for subspecies identification and (iii) identify virulence factors using SRST2 using the Virulence Factor Database (VFDB).…”
Section: Resultsmentioning
confidence: 99%
“…In our experience, smaller databases that include only subsets of eukaryotic genomes increase the false-positive classification rate immensely. The resulting species list is filtered for human, animal or plant pathogens as well as for select CDC or USDA agents using the risk group database from the American Biological Safety Association (ABSA) as it was shown before for viruses in clinical samples (Tausch et al, 2018). For selected pathogens of interest, classified reads are extracted from the metagenomic dataset in order to (i) verify the classification with BLASTn and the nucleotide database from NCBI, (ii) estimate the closest distance to a published reference genome with Mash for subspecies identification and (iii) identify virulence factors using SRST2 using the Virulence Factor Database (VFDB).…”
Section: Resultsmentioning
confidence: 99%
“…DeePaC-Live could also form a part of more complex real-time pathogen detection workflows like PathoLive (Tausch et al, 2018b). For example, PAIPline (Andrusch et al, 2018) identifies pathogens in metagenomic and clinical samples via mapping and a BLAST follow-up analysis, but can only start the analysis after the sequencing is finished.…”
Section: Discussionmentioning
confidence: 99%
“…Since the analysis of HTS data for virus diagnostics requires bioinformatics as well as virological knowledge, collaboration between the two disciplines has been emphasized (32). Furthermore, automated pipelines for HTS-based virus diagnostics with unbiased evaluation of the pathogenicity and relevance of the pathogen detected have been implemented; these can help harmonize the analysis and interpretation of HTS sequence results (33).…”
Section: Discussionmentioning
confidence: 99%