In neural machine translation (NMT), it has become standard to translate using subword units to allow for an open vocabulary and improve accuracy on infrequent words. Byte-pair encoding (BPE) and its variants are the predominant approach to generating these subwords, as they are unsupervised, resource-free, and empirically effective. However, the granularity of these subword units is a hyperparameter to be tuned for each language and task, using methods such as grid search. Tuning may be done incompletely or skipped entirely due to resource constraints, leading to sub-optimal performance. In this paper, we propose a method to automatically tune this parameter using only one training pass. We incrementally introduce new vocabulary online based on the held-out validation loss, beginning with smaller, general subwords and adding larger, more specific units over the course of training. Our method matches the results found with grid search, optimizing segmentation granularity without any additional training time. We also show benefits in training efficiency and performance improvements for rare words due to the way embeddings for larger units are incrementally constructed by combining those from smaller units.
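To make the granularity hyperparameter concrete, the following is a minimal sketch of standard BPE merges on a hypothetical toy corpus (the corpus, symbol format, and merge count are illustrative assumptions, not the paper's actual setup); the number of merges performed is exactly the granularity that the paper tunes automatically during training:

```python
from collections import Counter

def get_pair_counts(vocab):
    # vocab maps a space-separated symbol sequence to its corpus frequency
    pairs = Counter()
    for word, freq in vocab.items():
        symbols = word.split()
        for a, b in zip(symbols, symbols[1:]):
            pairs[(a, b)] += freq
    return pairs

def merge_pair(pair, vocab):
    # naive merge: replace the pair everywhere it occurs; a production
    # implementation would match on symbol boundaries to avoid partial hits
    old, new = " ".join(pair), "".join(pair)
    return {word.replace(old, new): freq for word, freq in vocab.items()}

# toy corpus: each word split into characters, with an end-of-word marker
vocab = {"l o w </w>": 5, "l o w e r </w>": 2,
         "n e w e s t </w>": 6, "w i d e s t </w>": 3}

# each merge adds one larger subword unit built from two smaller ones;
# stopping early yields smaller, more general units
for _ in range(3):
    pairs = get_pair_counts(vocab)
    best = max(pairs, key=pairs.get)
    vocab = merge_pair(best, vocab)
```

On this corpus the frequent suffix `est</w>` emerges as a single unit after three merges, illustrating how larger units are composed from smaller ones, mirroring how the paper's method constructs embeddings for new units from those of their parts.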
Automatic item generation (AIG) has the potential to greatly expand the number of items for educational assessments, while simultaneously allowing for a more construct-driven approach to item development. However, the traditional item modeling approach in AIG is limited in scope to content areas that are relatively easy to model (such as math problems), and depends on highly skilled content experts to create each model. In this paper we describe the interactive reading task, a transformer-based deep language modeling approach for creating reading comprehension assessments. This approach allows a fully automated process for the creation of source passages together with a wide range of comprehension questions about the passages. The format of the questions allows automatic scoring of responses with high fidelity (e.g., selected response questions). We present the results of a large-scale pilot of the interactive reading task, with hundreds of passages and thousands of questions. These passages were administered as part of the practice test of the Duolingo English Test. Human review of the materials and psychometric analyses of test taker results demonstrate the feasibility of this approach for automatic creation of complex educational assessments.
Neural methods for embedding entities are typically extrinsically evaluated on downstream tasks and, more recently, intrinsically using probing tasks. Downstream task-based comparisons are often difficult to interpret due to differences in task structure, while probing task evaluations often look at only a few attributes and models. We address both of these issues by evaluating a diverse set of eight neural entity embedding methods on a set of simple probing tasks, demonstrating which methods are able to remember words used to describe entities, learn type, relationship, and factual information, and identify how frequently an entity is mentioned. We also compare these methods in a unified framework on two entity linking tasks and discuss how they generalize to different model architectures and datasets.
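The idea behind a probing task can be sketched as follows: fit a simple classifier on frozen embedding vectors, and take its held-out accuracy as evidence that the embedding encodes the probed attribute. This is a generic sketch on synthetic data (the embeddings, the attribute, and the linear least-squares probe are all illustrative assumptions, not the paper's specific protocol):

```python
import numpy as np

rng = np.random.default_rng(0)
# hypothetical frozen entity embeddings; each embedding method under
# comparison would produce one such matrix (entities x dimensions)
X = rng.normal(size=(200, 16))
# hypothetical binary attribute, e.g. "is this entity of type person?",
# correlated here with one embedding dimension plus noise
y = (X[:, 0] + 0.1 * rng.normal(size=200) > 0).astype(float)

# linear probe: a least-squares fit on the frozen vectors; the embedding
# itself is never updated, only the probe's weights are learned
X_train, y_train, X_test, y_test = X[:100], y[:100], X[100:], y[100:]
w, *_ = np.linalg.lstsq(X_train, y_train - 0.5, rcond=None)
accuracy = ((X_test @ w > 0) == (y_test > 0.5)).mean()
```

Because the probe is deliberately simple, high accuracy indicates the attribute is linearly recoverable from the embedding rather than learned by a powerful classifier.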
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations: citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.