2006
DOI: 10.1007/11890850_6
|View full text |Cite
|
Sign up to set email alerts
|

Towards Low Overhead Provenance Tracking in Near Real-Time Stream Filtering

Abstract: Abstract. Data streams flowing from the physical environment are as unpredictable as the environment itself. Radars go down, long haul networks drop packets, and readings are corrupted on the wire. Yet the data driven scientific models and data mining algorithms do not necessarily account for the inaccuracies when assimilating the data. Low overhead provenance collection partially solves this problem. We propose a data model and collection model for near real time provenance collection. We define a system arch… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
19
0

Year Published

2008
2008
2022
2022

Publication Types

Select...
5
2

Relationship

2
5

Authors

Journals

citations
Cited by 38 publications
(19 citation statements)
references
References 14 publications
0
19
0
Order By: Relevance
“…It can be categorized with coarse-grained provenance methods that identify dependencies between streams or sets of streams [27,26], and fine-grained methods that identify dependencies among individual stream elements [23,10,18,22]. Sansrimahachai et al [23] propose the Stream Ancestor Function -a reverse mapping function to express precise dependencies between input and output stream elements (fine-grained).…”
Section: Related Workmentioning
confidence: 99%
“…It can be categorized with coarse-grained provenance methods that identify dependencies between streams or sets of streams [27,26], and fine-grained methods that identify dependencies among individual stream elements [23,10,18,22]. Sansrimahachai et al [23] propose the Stream Ancestor Function -a reverse mapping function to express precise dependencies between input and output stream elements (fine-grained).…”
Section: Related Workmentioning
confidence: 99%
“…They incorporated both accuracy and provenance as an integrated part of data management and query processing. Vijayakkumar and Plale [32] introduced the provenance collection framework for an event processing system. Sarma et al [33] developed an architecture based on a decoupled strategy to compute provenance and confidence in probabilistic databases.…”
Section: History Of Trustworthiness Frameworkmentioning
confidence: 99%
“…First, Vijayakumar and Plale [32], proposed a provenance collection framework for collecting provenance with low overhead in stream filtering applications. They addressed the challenges of the data streaming environment and proposed data collection models with low overhead for handling rapid streams.…”
Section: Some Other Prominent Frameworkmentioning
confidence: 99%
“…Sensors and data streaming techniques are increasingly used in a wide range of applications, from science (weather forecast [393]) to health care (remote health monitoring [287]). Such streamed data are used by sophisticated simulation, modelling and analysis tools.…”
Section: Data Collections and Streamsmentioning
confidence: 99%
“…Vijayakumar and Plale [393,392] adopt a stream as the unit of data they track provenance for, so as to ensure a lightweight and low overhead approach to provenance recording in distributed stream applications. The downside is that the granularity of their provenance model is so coarse that it is unable to answer queries related to individual stream elements.…”
Section: Data Collections and Streamsmentioning
confidence: 99%