2023
DOI: 10.31219/osf.io/3jrcm
Preprint
Doom or Deliciousness: Challenges and Opportunities for Visualization in the Age of Generative Models

Abstract: Generative text-to-image models (as exemplified by DALL-E, MidJourney, and Stable Diffusion) have recently made enormous technological leaps, demonstrating impressive results in many graphical domains—from logo design to digital painting and photographic composition. However, the quality of these results has led to existential crises in some fields of art, leading to questions about the role of human agency in the production of meaning in a graphical context. Such issues are central to visualization, and while…

Cited by 5 publications (8 citation statements) | References 12 publications
“…The rise of text‐to‐image generative models [RBL*21, RDN*22, SCS*22] has sparked numerous interface designs [SCK*23, BWS*23, FWW*23, SCM*23] to help humans co‐create with generative models – even for the creation of visualizations [SDBEA*23, WCA23]. Some designs focus on editing a single image at a time [ZRA23, CA23] using methods that either control the sampling process of a diffusion model, or explicitly train a new diffusion model for a target mode of interaction.…”
Section: Related Work
confidence: 99%
“…Similar to García-Peñalvo & Vázquez-Ingelmo (2023), we use "generative AI" as an umbrella term to describe the emerging class of text-to-text, text-to-image, and image-to-image models. These models have great potential to augment and replace human creativity in many application domains, such as visualization (Schetinger et al 2023) or as an aid to map generation (Juhász et al 2023). Only Juhász explored potential applications of text-to-text models in cartography.…”
Section: Literature Review
confidence: 99%
“…It has been trained on 5.85 billion images (Schuhmann et al 2022) and published by the CompVis group at LMU Munich and Stability AI (Rombach et al 2022). Schetinger et al (2023) illustrate a number of challenges and opportunities that have emerged. One of the key issues discussed is agency, which is used to describe the ability of analysts to modify the outcome of the content generation process.…”
Section: Introduction
confidence: 99%
“…In the visualization context, aesthetics refers to a quality or characteristic of a visual representation distinct from how clear, informative, or memorable it is. An alternate definition refers to the visual appeal or beauty of the representation [26,37]. We seek to systematically evaluate the applicability of the premise that beauty and functionality are intrinsically intertwined [5,10,22,38,39,41] for visualization.…”
Section: Visualization Aesthetics
confidence: 99%