2019
DOI: 10.1109/tvcg.2019.2934209
|View full text |Cite
|
Sign up to set email alerts
|

Selection Bias Tracking and Detailed Subset Comparison for High-Dimensional Data

Abstract: Fig. 1. Exploratory cohort selection in high-dimensional datasets can lead to selection bias-unintended side-effects in variable distributions-that may go unnoticed by the user. Our selection bias tracking system and detailed cohort comparison visualizations, deployed in a medical temporal event sequence visual analytics tool, include (a) a cohort provenance tree to keep track of created cohorts and indicate when selection bias may have occurred, (b-d) a suite of high-dimensional cohort comparison visualizatio… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
30
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
6
1

Relationship

1
6

Authors

Journals

citations
Cited by 21 publications
(33 citation statements)
references
References 52 publications
1
30
0
Order By: Relevance
“…All these interactions enable the selection of particular sentences and then further exploration of the content with suggestions stemming from the system side. Abstract/elaborate is another interaction technique found in, e.g., Borland et al [BWZ*20] and can be interpreted as different granularities that the visualization allows users to explore the data. Sevastjanova et al [SBE*18] argue that a combination of visualization and verbalization methods is advantageous for generating wide and versatile insights into the structure and decision‐making processes of ML models.…”
Section: In‐depth Categorization Of Trust Against Facets Of Interamentioning
confidence: 99%
“…All these interactions enable the selection of particular sentences and then further exploration of the content with suggestions stemming from the system side. Abstract/elaborate is another interaction technique found in, e.g., Borland et al [BWZ*20] and can be interpreted as different granularities that the visualization allows users to explore the data. Sevastjanova et al [SBE*18] argue that a combination of visualization and verbalization methods is advantageous for generating wide and versatile insights into the structure and decision‐making processes of ML models.…”
Section: In‐depth Categorization Of Trust Against Facets Of Interamentioning
confidence: 99%
“…Gratzl et al [GGL ∗ 14] propose Domino that supports the flexible exploration of subsets and their relationships. Borland et al [BWZ ∗ 19] propose a visual analytics method to unbiasedly selecting a representative subset for a large dataset. Gotz et al [GZW ∗ 19] propose a method for interactively determining the most informative event subset in a specific analysis context.…”
Section: Related Workmentioning
confidence: 99%
“…We classified Similarity measures like cosine, Jaccard and Hellinger distance together with clustering, because they are often used to group data. For example, Borland et al (2020;Figure 5d) hierarchically aggregate similar events in an icicle plot to track selection bias in patient cohorts; Barlowe et al (2013;Figure 2b) rank protein flexibility plots by similarity to a target.…”
Section: Algorithmmentioning
confidence: 99%
“…( 2) Collapsing components removes visual clutter, for example lines in line graphs (Afzal et al, 2011) or parallel coordinates (Huang et al, 2019), identical rows in matrices (Dang et al, 2015), or similar points in scatter plots (Kwon et al, 2018). Conversely, expanding components shows extra information like individual lines instead of ribbons in parallel sets (Kwon et al, 2018), subsequences of sequential health records (Malik et al, 2015), or groups in icicle plots (Borland et al, 2020). ( 3) Zooming enlarges a visualization (e.g., Males et al, 2020;Kumar et al, 2015, Figure 3a).…”
Section: Interaction In Visual Analyticsmentioning
confidence: 99%