2008
DOI: 10.1145/1328897.1328488
|View full text |Cite
|
Sign up to set email alerts
|

From dirt to shovels

Abstract: An ad hoc data source is any semistructured data source for which useful data analysis and transformation tools are not readily available. Such data must be queried, transformed and displayed by systems administrators, computational biologists, financial analysts and hosts of others on a regular basis. In this paper, we demonstrate that it is possible to generate a suite of useful data processing tools, including a semi-structured query engine, several format converters, a statistical analyzer and data visuali… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2010
2010
2020
2020

Publication Types

Select...
5
1

Relationship

1
5

Authors

Journals

citations
Cited by 20 publications
(1 citation statement)
references
References 27 publications
0
1
0
Order By: Relevance
“…In these contexts there have been many proposals aimed at minimizing the user effort needed to identify and annotate informative examples, e.g., Zhang (2008); Dalvi et al (2016); Budlong et al (2013); Bajaj et al (2015). Some proposals advocated the execution of an initial, fully unsupervised analysis of the data corpus followed by some explicit instructions from the user on how to proceed with the processing of, e.g., logs (Fisher et al, 2008), collections of facts (Etzioni et al, 2005), or relations (Yates et al, 2007).…”
Section: Related Workmentioning
confidence: 99%
“…In these contexts there have been many proposals aimed at minimizing the user effort needed to identify and annotate informative examples, e.g., Zhang (2008); Dalvi et al (2016); Budlong et al (2013); Bajaj et al (2015). Some proposals advocated the execution of an initial, fully unsupervised analysis of the data corpus followed by some explicit instructions from the user on how to proceed with the processing of, e.g., logs (Fisher et al, 2008), collections of facts (Etzioni et al, 2005), or relations (Yates et al, 2007).…”
Section: Related Workmentioning
confidence: 99%