2009
DOI: 10.1007/978-1-4419-0176-7_6

A Parallel General-Purpose Synthetic Data Generator

Abstract: PSDG is a parallel synthetic data generator designed to generate "industrial sized" data sets quickly using cluster computing. PSDG depends on SDDL, a synthetic data description language that provides flexibility in the types of data we can generate.

Cited by 3 publications (1 citation statement)
References 3 publications
“…Benchmarks typically offer tools to generate datasets of varying size to test systems in different situations [29,43,31,44,30,18]. SmartBench's data generation tool 2 uses seed data from a real IoT deployment (our University building) including sensor data and metadata about the building and sensors, which is able to scale up/down to create a synthetic dataset.…”
Section: Data and Query Generator
Type: mentioning
confidence: 99%