2021
DOI: 10.31219/osf.io/udqjr
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Reusing Interactive Analysis Workflows

Abstract: Interactive visual analysis has many advantages, but has the disadvantage that analysis processes and workflows cannot be easily stored and reused, which is in contrast to scripted analysis workflows using a programming language such as Python. In this paper, we introduce methods to semantically capture workflows in interactive visualization systems for different interactions such as selections, filters, categorizing/grouping, labeling, and aggregation. We design these workflows to be robust to updates in the … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
5
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
2
1

Relationship

1
2

Authors

Journals

citations
Cited by 3 publications
(5 citation statements)
references
References 29 publications
0
5
0
Order By: Relevance
“…For example, if a dataset is refined over time, possibly because of discrepancies recorded as data hunches, a new version of a dataset could be overlaid with data hunches recorded for the old version of the dataset, to see whether the hunches expressed still apply. Overlaying data hunches could be combined with an explicit comparison of datasets [24]. However, such a workflow would incur additional visual complexity and hence would require dedicated methods to manage that complexity.…”
Section: Discussion and Future Workmentioning
confidence: 99%
See 1 more Smart Citation
“…For example, if a dataset is refined over time, possibly because of discrepancies recorded as data hunches, a new version of a dataset could be overlaid with data hunches recorded for the old version of the dataset, to see whether the hunches expressed still apply. Overlaying data hunches could be combined with an explicit comparison of datasets [24]. However, such a workflow would incur additional visual complexity and hence would require dedicated methods to manage that complexity.…”
Section: Discussion and Future Workmentioning
confidence: 99%
“…By identifying data hunches as productive and insightful expert knowledge, we can re-interpret past work on collaborative visualization tools with this framing. For example, a visualization designer can incorporate commenting and discussion features to promote externalization of data hunches [30,47]; apply provenance tracking to record actions they took based on hunches to wrangle the data [15,19,24,25]; use visualization techniques like linked views and visualization states to show a collection of data hunches [35,46,67]. We see a wealth of opportunities for incorporating data hunches into old and new ways of visually analyzing data.…”
Section: Data Hunchesmentioning
confidence: 99%
“…Computational notebooks, such as Jupyter Notebooks [40] and R Markdown [41,42] are often discussed as a remedy for the issues we discuss here: they can be used to describe datasets and analysis steps, contain visualizations, and also contain executable code that (in theory [43]) ensures reproducibility of analysis. Due to these advantages, significant research has been devoted to understanding how analysts use computational notebooks [44,45,46] and to improve them [47,48,49,50].…”
Section: Methods Of Documenting Data and Analysismentioning
confidence: 99%
“…First, we need to make GUI-based visual analysis tools reproducible. The GUI-based tools our participants use do not support annotated histories or workflows, unlike various research prototypes [74,75,76,47]. For example, there is typically no way to comment on why some data was filtered out in the tools used by our participants.…”
Section: Design Opportunitiesmentioning
confidence: 99%
“…Recent works have advocated for the development of interactive visualizations in such computational environments in order to support reproducibility, streamline analysis and increase adoption of visualization systems [43,65]. To support these goals, tools have been developed that help users embed interactive visualizations [65], create dashboards [64,67], and reuse workflows [14] in JupyterLab. Studies have also developed tools to condense notebooks for better collaboration [51] and communication through interactive data comics [37] and presentation slides [71].…”
Section: Visualizations In Computational Environmentsmentioning
confidence: 99%