2023
DOI: 10.1093/bioinformatics/btad041
|View full text |Cite
|
Sign up to set email alerts
|

NGSNGS: next-generation simulator for next-generation sequencing data

Abstract: Summary With the rapid expansion of the capabilities of the DNA sequencers throughout the different sequencing generations, the quantity of generated data has likewise increased. This evolution has also led to new bioinformatical methods, for which in silico data has become crucial when verifying the accuracy of a model or the robustness of a genomic analysis pipeline. Here we present a multithreaded next-generation simulation tool for next-generation sequencing data (NGSNGS), which simulates… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
3
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
4
3

Relationship

1
6

Authors

Journals

citations
Cited by 8 publications
(8 citation statements)
references
References 9 publications
0
3
0
Order By: Relevance
“…We used NGSNGS 23 to simulate single end sequencing data from forty individuals with varying data qualities– modern, degraded A, and degraded B (see Methods , Figure 2B-C ). The two sets of degraded DNA differ in data quality due to variability in deamination rates and fragmentation patterns.…”
Section: Resultsmentioning
confidence: 99%
“…We used NGSNGS 23 to simulate single end sequencing data from forty individuals with varying data qualities– modern, degraded A, and degraded B (see Methods , Figure 2B-C ). The two sets of degraded DNA differ in data quality due to variability in deamination rates and fragmentation patterns.…”
Section: Resultsmentioning
confidence: 99%
“…We simulated different input data in order to examine the impact of data sources and program parameters on the results, including sequencing depth, reference and parameter‐bootstrapping. We generated simulation datasets with coverage levels from 1 to 100×, leveraging the genome of A. thaliana from the Arabidopsis Information Resource (TAIR), specifically release TAIR10 (Lamesch et al., 2012), utilizing NGSNGS version 0.9 (Henriksen et al., 2023). Our approach incorporated two distinct sets of reference sequences: one set comprised actual reference sequences, identical to those used in Test I with Brassicaceae, and the other set consisted of simulated reference sequences.…”
Section: Methodsmentioning
confidence: 99%
“…We generated simulation datasets with coverage levels from 1 to 100×, leveraging the genome of A. thaliana from the Arabidopsis Information Resource (TAIR), specifically release TAIR10 (Lamesch et al, 2012), utilizing NGSNGS version 0.9 (Henriksen et al, 2023).…”
Section: Parameter and Performance Testingmentioning
confidence: 99%
“…To determine the accuracy of our regression framework, we used ngsBriggs inference on simulated files. Using the simulation software NGSNGS [10] we generated 100 files for the biotin- and 100 for the non-biotin model, equally separated into five groups with a varying number of reads, i.e. 10 3 , 10 4 , 10 5 , 10 6 and 10 7 to test different scenarios.…”
Section: Methodsmentioning
confidence: 99%