Yannic Kilcher scite author profile

Yannic Kilcher

5Publications

34Citation Statements Received

99Citation Statements Given

How they've been cited

How they cite others

Affiliations

Publications

Order By: Most citations

How does BERT capture semantics? A closer look at polysemous words

Yenicelik¹,

Schmidt²,

Kilcher³

2020

View full text Add to dashboard Cite

The recent paradigm shift to contextual word embeddings has seen tremendous success across a wide range of down-stream tasks. However, little is known on how the emergent relation of context and semantics manifests geometrically. We investigate polysemous words as one particularly prominent instance of semantic organization.Our rigorous quantitative analysis of linear separability and cluster organization in embedding vectors produced by BERT shows that semantics do not surface as isolated clusters but form seamless structures, tightly coupled with sentiment and syntax.

show abstract

Semantic Interpolation in Implicit Models

Kilcher¹,

Lucchi²,

Hofmann³

2017

Preprint

View full text Add to dashboard Cite

In implicit models, one often interpolates between sampled points in latent space. As we show in this paper, care needs to be taken to match-up the distributional assumptions on code vectors with the geometry of the interpolating paths. Otherwise, typical assumptions about the quality and semantics of in-between points may not be justified. Based on our analysis we propose to modify the prior code distribution to put significantly more probability mass closer to the origin. As a result, linear interpolation paths are not only shortest paths, but they are also guaranteed to pass through high-density regions, irrespective of the dimensionality of the latent space. Experiments on standard benchmark image datasets demonstrate clear visual improvements in the quality of the generated samples and exhibit more meaningful interpolation paths.

show abstract

FIGARO: Generating Symbolic Music with Fine-Grained Artistic Control

Dimitri¹,

Biggio²,

Kilcher³

et al. 2022

Preprint

View full text Add to dashboard Cite

Generating music with deep neural networks has been an area of active research in recent years. While the quality of generated samples has been steadily increasing, most methods are only able to exert minimal control over the generated sequence, if any. We propose the self-supervised description-to-sequence task, which allows for fine-grained controllable generation on a global level. We do so by extracting high-level features about the target sequence and learning the conditional distribution of sequences given the corresponding high-level description in a sequence-tosequence modelling setup. We train FIGARO (FIne-grained music Generation via Attentionbased, RObust control) by applying descriptionto-sequence modelling to symbolic music. By combining learned high level features with domain knowledge, which acts as a strong inductive bias, the model achieves state-of-the-art results in controllable symbolic music generation and generalizes well beyond the training distribution.

show abstract

Boosting Search Engines with Interactive Agents

Adolphs¹,

Boerschinger²,

Buck³

et al. 2021

Preprint

View full text Add to dashboard Cite

Can machines learn to use a search engine as an interactive tool for finding information? That would have far reaching consequences for making the world's knowledge more accessible. This paper presents first steps in designing agents that learn meta-strategies for contextual query refinements. Our approach uses machine reading to guide the selection of refinement terms from aggregated search results. Agents are then empowered with simple but effective search operators to exert fine-grained and transparent control over queries and search results. We develop a novel way of generating synthetic search sessions, which leverages the power of transformer-based generative language models through (self-)supervised learning. We also present a reinforcement learning agent with dynamically constrained actions that can learn interactive search strategies completely from scratch. In both cases, we obtain significant improvements over one-shot search with a strong information retrieval baseline. Finally, we provide an in-depth analysis of the learned search policies. * Work carried out in part during internships at Google.

show abstract

Rethinking Neural Networks With Benford's Law

Sahu¹,

Java²,

Shaikh³

et al. 2021

Preprint

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Yannic Kilcher

How does BERT capture semantics? A closer look at polysemous words

Semantic Interpolation in Implicit Models

FIGARO: Generating Symbolic Music with Fine-Grained Artistic Control

Boosting Search Engines with Interactive Agents

Rethinking Neural Networks With Benford's Law

Contact Info

Product

Resources

About