John Thickstun scite author profile

John Thickstun

5Publications

117Citation Statements Received

68Citation Statements Given

How they've been cited

127

115

How they cite others

Affiliations

University of Washington, Seattle University

Publications

Order By: Most citations

Invariances and Data Augmentation for Supervised Music Transcription

Thickstun

Harchaoui

Foster

et al. 2018

View full text Add to dashboard Cite

This paper explores a variety of models for frame-based music transcription, with an emphasis on the methods needed to reach state-of-the-art on human recordings. The translationinvariant network discussed in this paper, which combines a traditional filterbank with a convolutional neural network, was the top-performing model in the 2017 MIREX Multiple Fundamental Frequency Estimation evaluation. This class of models shares parameters in the log-frequency domain, which exploits the frequency invariance of music to reduce the number of model parameters and avoid overfitting to the training data. All models in this paper were trained with supervision by labeled data from the MusicNet dataset, augmented by random label-preserving pitch-shift transformations.

show abstract

An Information Bottleneck Approach for Controlling Conciseness in Rationale Extraction

Paranjape

Joshi

Thickstun

et al. 2020

View full text Add to dashboard Cite

Decisions of complex models for language understanding can be explained by limiting the inputs they are provided to a relevant subsequence of the original text -a rationale. Models that condition predictions on a concise rationale, while being more interpretable, tend to be less accurate than models that are able to use the entire context. In this paper, we show that it is possible to better manage the trade-off between concise explanations and high task accuracy by optimizing a bound on the Information Bottleneck (IB) objective. Our approach jointly learns an explainer that predicts sparse binary masks over input sentences without explicit supervision, and an end-task predictor that considers only the residual sentences. Using IB, we derive a learning objective that allows direct control of mask sparsity levels through a tunable sparse prior. Experiments on the ERASER benchmark demonstrate significant gains over previous work for both task performance and agreement with human rationales. Furthermore, we find that in the semi-supervised setting, a modest amount of gold rationales (25% of training examples with gold masks) can close the performance gap with a model that uses the full input. 1

show abstract

Diffusion-LM Improves Controllable Text Generation

Li¹,

Thickstun²,

Gulrajani³

et al. 2022

Preprint

View full text Add to dashboard Cite

Evaluating Human-Language Model Interaction

Lee¹,

Srivastava²,

Amelia³

et al. 2022

Preprint

View full text Add to dashboard Cite

An Information Bottleneck Approach for Controlling Conciseness in Rationale Extraction

Paranjape¹,

Joshi²,

Thickstun³

et al. 2020

Preprint

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

John Thickstun

Invariances and Data Augmentation for Supervised Music Transcription

An Information Bottleneck Approach for Controlling Conciseness in Rationale Extraction

Diffusion-LM Improves Controllable Text Generation

Evaluating Human-Language Model Interaction

An Information Bottleneck Approach for Controlling Conciseness in Rationale Extraction

Contact Info

Product

Resources

About