Danielle Albers Szafir scite author profile

Color is frequently used to encode values in visualizations. For color encodings to be effective, the mapping between colors and values must preserve important differences in the data. However, most guidelines for effective color choice in visualization are based on either color perceptions measured using large, uniform fields in optimal viewing environments or on qualitative intuitions. These limitations may cause data misinterpretation in visualizations, which frequently use small, elongated marks. Our goal is to develop quantitative metrics to help people use color more effectively in visualizations. We present a series of crowdsourced studies measuring color difference perceptions for three common mark types: points, bars, and lines. Our results indicate that peoples' abilities to perceive color differences varies significantly across mark types. Probabilistic models constructed from the resulting data can provide objective guidance for designers, allowing them to anticipate viewer perceptions in order to inform effective encoding design.

show abstract

Four types of ensemble coding in data visualizations

Szafir

et al. 2016

View full text Add to dashboard Cite

Ensemble coding supports rapid extraction of visual statistics about distributed visual information. Researchers typically study this ability with the goal of drawing conclusions about how such coding extracts information from natural scenes. Here we argue that a second domain can serve as another strong inspiration for understanding ensemble coding: graphs, maps, and other visual presentations of data. Data visualizations allow observers to leverage their ability to perform visual ensemble statistics on distributions of spatial or featural visual information to estimate actual statistics on data. We survey the types of visual statistical tasks that occur within data visualizations across everyday examples, such as scatterplots, and more specialized images, such as weather maps or depictions of patterns in text. We divide these tasks into four categories: identification of sets of values, summarization across those values, segmentation of collections, and estimation of structure. We point to unanswered questions for each category and give examples of such cross-pollination in the current literature. Increased collaboration between the data visualization and perceptual psychology research communities can inspire new solutions to challenges in visualization while simultaneously exposing unsolved problems in perception research.

show abstract

Designing for Depth Perceptions in Augmented Reality

Diaz

Walker²,

Szafir³

et al. 2017

View full text Add to dashboard Cite

Grand Challenges in Immersive Analytics

et al. 2021

View full text Add to dashboard Cite

Immersive Analytics is a quickly evolving field that unites several areas such as visualisation, immersive environments, and humancomputer interaction to support human data analysis with emerging technologies. This research has thrived over the past years with Publication rights licensed to ACM. ACM acknowledges that this contribution was authored or co-authored by an employee, contractor or affiliate of a national government. As such, the Government retains a nonexclusive, royalty-free right to publish or reproduce this article, or to allow others to do so, for Government purposes only.

show abstract

Where's My Data? Evaluating Visualizations with Missing Data

Song

Szafir

2019

IEEE Trans. Visual. Comput. Graphics

View full text Add to dashboard Cite

Many real-world datasets are incomplete due to factors such as data collection failures or misalignments between fused datasets. Visualizations of incomplete datasets should allow analysts to draw conclusions from their data while effectively reasoning about the quality of the data and resulting conclusions. We conducted a pair of crowdsourced studies to measure how the methods used to impute and visualize missing data may influence analysts' perceptions of data quality and their confidence in their conclusions. Our experiments used different design choices for line graphs and bar charts to estimate averages and trends in incomplete time series datasets. Our results provide preliminary guidance for visualization designers to consider when working with incomplete data in different domains and scenarios.

show abstract

Zika discourse in the Americas: A multilingual topic analysis of Twitter

et al. 2019

View full text Add to dashboard Cite

This work examines Twitter discussion surrounding the 2015 outbreak of Zika, a virus that is most often mild but has been associated with serious birth defects and neurological syndromes. We introduce and analyze a collection of 3.9 million tweets mentioning Zika geolocated to North and South America, where the virus is most prevalent. Using a multilingual topic model, we automatically identify and extract the key topics of discussion across the dataset in English, Spanish, and Portuguese. We examine the variation in Twitter activity across time and location, finding that rises in activity tend to follow to major events, and geographic rates of Zika-related discussion are moderately correlated with Zika incidence ( ρ = .398).

show abstract

Color Crafting: Automating the Construction of Designer Quality Color Ramps

Smart

Szafir

2020

IEEE Trans. Visual. Comput. Graphics

View full text Add to dashboard Cite

Fig. 1. Our approach uses design mining and unsupervised clustering techniques to produce automatically generated color ramps that capture designer practices. The four choropleth maps shown above utilize color ramps generated from our approach. Developers select a single guiding seed color, shown in the squares below each map, to generate a ramp. We then fit curves capturing structural patterns in designer practices in CIELAB (bottom) to these seed colors to generate ramps (middle, seed colors indicated by a black dot). We embody this technique in Color Crafter, a web-based tool that enables designers of all ability levels to generate high-quality custom color ramps.Abstract-Visualizations often encode numeric data using sequential and diverging color ramps. Effective ramps use colors that are sufficiently discriminable, align well with the data, and are aesthetically pleasing. Designers rely on years of experience to create high-quality color ramps. However, it is challenging for novice visualization developers that lack this experience to craft effective ramps as most guidelines for constructing ramps are loosely defined qualitative heuristics that are often difficult to apply. Our goal is to enable visualization developers to readily create effective color encodings using a single seed color. We do this using an algorithmic approach that models designer practices by analyzing patterns in the structure of designer-crafted color ramps. We construct these models from a corpus of 222 expert-designed color ramps, and use the results to automatically generate ramps that mimic designer practices. We evaluate our approach through an empirical study comparing the outputs of our approach with designer-crafted color ramps. Our models produce ramps that support accurate and aesthetically pleasing visualizations at least as well as designer ramps and that outperform conventional mathematical approaches.

show abstract

Rainbows Revisited: Modeling Effective Colormap Design for Graphical Inference

Reda

Szafir

2021

IEEE Trans. Visual. Comput. Graphics

View full text Add to dashboard Cite

blues plasma grey-red turbo Fig. 1. Eight example stimuli from Experiment 1. A single stimulus consists of a lineup of four color-coded scalar fields shown in a 2×2 grid. For each lineup, which of the four plots stands out as different? The answers are in Section 10. This graphical inference test enables us to determine the discriminative power of competing colormap designs. Our results give rise a new model for predicting a colormap's usefulness, particularly for tasks involving model-based inference and judgement.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.