CoSE: Compositional Stroke Embeddings

Aksan, Emre; Deselaers, Thomas; Tagliasacchi, Andrea; Hilliges, Otmar

doi:10.48550/arxiv.2006.09930

Cited by 3 publications

(3 citation statements)

References 19 publications

(28 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…-'harms' to mean impediments to technology deployment -'bad intentions' to be of users and not of technology developers -'harm' only in relation to war, government, or 'mass disaster' -or using terms like 'reliable', 'secure' without specifying for who "our work can bring both beneficial and harmful impacts and it really depends on the motivation of the users" (Hu et al, 2020) and "[the work is] academic in nature, and does not pose foreseeable risks regarding defense, security, and other sensitive fields." (Aksan et al, 2020;Wang et al, 2020b;Jiang et al, 2020) Outsourcing the ethical responsibility to others or other stages of technology deployment by ignoring theoretical or technical affordances for misuse and instead referencing biased inputs, engineering mistakes or malicious uses "there exist risks that some engineers [can] deliberately use the algorithm [in a way that would] harm the performance of the designed system" (Hu et al, 2020) Confusing technical advances with positive impact, by -assuming adoption of technical solutions to constitute a benefit -failing to question assumptions behind performance metrics -treating impact statement as a 'sales pitch' "Further extensions include applying [the method] in robotics. Machine learning for robotics is increasingly growing as a field and has potential of revolutionizing technology in the unprecedented way."…”

Section: Themes Example Quotes Examples Of Concerning Trendsmentioning

confidence: 99%

Overcoming Failures of Imagination in AI Infused System Development and Deployment

Boyarskaya,

Olteanu,

Crawford

2020

Preprint

View full text Add to dashboard Cite

NeurIPS 2020 requested that research paper submissions include impact statements on "potential nefarious uses and the consequences of failure." However, as researchers, practitioners and system designers, a key challenge to anticipating risks is overcoming what Clarke (1962) called 'failures of imagination.' The growing research on bias, fairness, and transparency in computational systems aims to illuminate and mitigate harms, and could thus help inform reflections on possible negative impacts of particular pieces of technical work. The prevalent notion of computational harms-narrowly construed as either allocational or representational harms-does not fully capture the open, context dependent, and unobservable nature of harms across the wide range of AI infused systems. The current literature focuses on a small range of examples of harms to motivate algorithmic fixes, overlooking the wider scope of probable harms and the way these harms might affect different stakeholders. The system affordances may also exacerbate harms in unpredictable ways, as they determine stakeholders' control (including of non-users) over how they use and interact with a system output. To effectively assist in anticipating harmful uses, we argue that frameworks of harms must be context-aware and consider a wider range of potential stakeholders, system affordances, as well as viable proxies for assessing harms in the widest sense.

show abstract

Section: Themes Example Quotes Examples Of Concerning Trendsmentioning

confidence: 99%

Overcoming Failures of Imagination in AI Infused System Development and Deployment

Boyarskaya,

Olteanu,

Crawford

2020

Preprint

View full text Add to dashboard Cite

show abstract

“…Vector image generation, e.g., sketches, strokes and icons, catches attention until very recently, despite that raster image generation has achieved great success (Radford, Metz, and Chintala 2015;Zhu et al 2017;Arjovsky, Chintala, and Bottou 2017). For example, SketchRNN (Ha and Eck 2017) models all strokes in a sketch as a sequence; Sketchformer (Ribeiro et al 2020) leverages Transformer to learn longer term temporal structure in the stroke sequence; DeepSVG (Carlier et al 2020) disentangles high-level shapes from the low-level commands to reconstruct complex icons; and CoSE (Aksan et al 2020) factors local appearance of a stroke from the global structure of the drawing to model stroke-based data. As graphic layouts have different data structures with aforementioned vector images, recent progress on them cannot be directly adopted.…”

Section: Related Workmentioning

confidence: 99%

Coarse-to-Fine Generative Modeling for Graphic Layouts

Jiang

Sun

Zhu

et al. 2022

AAAI

View full text Add to dashboard Cite

Even though graphic layout generation has attracted growing attention recently, it is still challenging to synthesis realistic and diverse layouts, due to the complicated element relationships and varied element arrangements. In this work, we seek to improve the performance of layout generation by incorporating the concept of regions, which consist of a smaller number of elements and appears like a simple layout, into the generation process. Specifically, we leverage Variational Autoencoder (VAE) as the overall architecture and decompose the decoding process into two stages. The first stage predicts representations for regions, and the second stage fills in the detailed position for each element within the region based on the predicted region representation. Compared to prior studies that merely abstract the layout into a list of elements and generate all the element positions in one go, our approach has at least two advantages. First, by the two-stage decoding, our approach decouples the complex layout generation task into several simple layout generation tasks, which reduces the problem difficulty. Second, the predicted regions can help the model roughly know what the graphic layout looks like and serve as global context to improve the generation of detailed element positions. Qualitative and quantitative experiments demonstrate that our approach significantly outperforms the existing methods, especially on the complex graphic layouts.

show abstract

“…Arguably, PixelCNN [ Van den Oord et al, 2016] can be viewed as an extreme case of this model class that generates one pixel at a time conditioned on previously generated ones without considering a latent space. There are also stroke based generative models like SPIRAL [Ganin et al, 2018], Cose [Aksan et al, 2020], and SketchEmbedNet [Wang et al, 2020]. SPIRAL generates images through a sequence of strokes while Cose and SketchEmbedNet focus on generating sketch images.…”

Section: Related Workmentioning

confidence: 99%

NP-DRAW: A Non-Parametric Structured Latent Variable Model for Image Generation

Zeng

Urtasun

Zemel

et al. 2021

Preprint

View full text Add to dashboard Cite

In this paper, we present a non-parametric structured latent variable model for image generation, called NP-DRAW, which sequentially draws on a latent canvas in a part-by-part fashion and then decodes the image from the canvas. Our key contributions are as follows. 1) We propose a nonparametric prior distribution over the appearance of image parts so that the latent variable "whatto-draw" per step becomes a categorical random variable. This improves the expressiveness and greatly eases the learning compared to Gaussians used in the literature. 2) We model the sequential dependency structure of parts via a Transformer, which is more powerful and easier to train compared to RNNs used in the literature. 3) We propose an effective heuristic parsing algorithm to pretrain the prior. Experiments on MNIST, Omniglot, CIFAR-10, and CelebA show that our method significantly outperforms previous structured image models like DRAW and AIR and is competitive to other generic generative models. Moreover, we show that our model's inherent compositionality and interpretability bring significant benefits in the low-data learning regime and latent space editing. Code is available at https://github.com/ZENGXH/NPDRAW.

show abstract

CoSE: Compositional Stroke Embeddings

Cited by 3 publications

References 19 publications

Overcoming Failures of Imagination in AI Infused System Development and Deployment

Overcoming Failures of Imagination in AI Infused System Development and Deployment

Coarse-to-Fine Generative Modeling for Graphic Layouts

NP-DRAW: A Non-Parametric Structured Latent Variable Model for Image Generation

Contact Info

Product

Resources

About