Qiang Ning scite author profile

Standard test sets for supervised learning evaluate in-distribution generalization. Unfortunately, when a dataset has systematic gaps (e.g., annotation artifacts), these evaluations are misleading: a model can learn simple decision rules that perform well on the test set but do not capture the abilities a dataset is intended to test. We propose a more rigorous annotation paradigm for NLP that helps to close systematic gaps in the test data. In particular, after a dataset is constructed, we recommend that the dataset authors manually perturb the test instances in small but meaningful ways that (typically) change the gold label, creating contrast sets. Contrast sets provide a local view of a model's decision boundary, which can be used to more accurately evaluate a model's true linguistic capabilities. We demonstrate the efficacy of contrast sets by creating them for 10 diverse NLP datasets (e.g., DROP reading comprehension, UD parsing, and IMDb sentiment analysis). Although our contrast sets are not explicitly adversarial, model performance is significantly lower on them than on the original test sets-up to 25% in some cases. We release our contrast sets as new evaluation benchmarks and encourage future dataset construction efforts to follow similar annotation processes.

show abstract

A Multi-Axis Annotation Scheme for Event Temporal Relations

Ning

Wu²,

Roth

2018

135

View full text Add to dashboard Cite

Existing temporal relation (TempRel) annotation schemes often have low interannotator agreements (IAA) even between experts, suggesting that the current annotation task needs a better definition. This paper proposes a new multi-axis modeling to better capture the temporal structure of events. In addition, we identify that event end-points are a major source of confusion in annotation, so we also propose to annotate TempRels based on start-points only. A pilot expert annotation effort using the proposed scheme shows significant improvement in IAA from the conventional 60's to 80's (Cohen's Kappa). This better-defined annotation scheme further enables the use of crowdsourcing to alleviate the labor intensity for each annotator. We hope that this work can foster more interesting studies towards event understanding. 1 This paper proposes a new approach to handling

show abstract

A Structured Learning Approach to Temporal Relation Extraction

Ning¹,

Feng²,

Roth³

2017

122

View full text Add to dashboard Cite

Identifying temporal relations between events is an essential step towards natural language understanding. However, the temporal relation between two events in a story depends on, and is often dictated by, relations among other events. Consequently, effectively identifying temporal relations between events is a challenging problem even for human annotators. This paper suggests that it is important to take these dependencies into account while learning to identify these relations and proposes a structured learning approach to address this challenge. As a byproduct, this provides a new perspective on handling missing relations, a known issue that hurts existing methods. As we show, the proposed approach results in significant improvements on the two commonly used data sets for this problem.

show abstract

“Going on a vacation” takes longer than “Going for a walk”: A Study of Temporal Commonsense Understanding

Zhou¹,

Khashabi²,

Ning³

et al. 2019

100

View full text Add to dashboard Cite

Understanding time is crucial for understanding events expressed in natural language. Because people rarely say the obvious, it is often necessary to have commonsense knowledge about various temporal aspects of events, such as duration, frequency, and temporal order. However, this important problem has so far received limited attention. This paper systematically studies this temporal commonsense problem. Specifically, we define five classes of temporal commonsense, and use crowdsourcing to develop a new dataset, MCTACO , that serves as a test set for this task. We find that the best current methods used on MCTACO are still far behind human performance, by about 20%, and discuss several directions for improvement. We hope that the new dataset and our study here can foster more future research on this topic. 1 1 The dataset, annotation interfaces, guidelines, and qualification tests are available at:

show abstract

Joint Reasoning for Temporal and Causal Relations

Ning¹,

Feng²,

Wu³

et al. 2018

109

View full text Add to dashboard Cite

Understanding temporal and causal relations between events is a fundamental natural language understanding task. Because a cause must occur earlier than its effect, temporal and causal relations are closely related and one relation often dictates the value of the other. However, limited attention has been paid to studying these two relations jointly. This paper presents a joint inference framework for them using constrained conditional models (CCMs). Specifically, we formulate the joint problem as an integer linear programming (ILP) problem, enforcing constraints that are inherent in the nature of time and causality. We show that the joint inference framework results in statistically significant improvement in the extraction of both temporal and causal relations from text. 1

show abstract

Joint Event and Temporal Relation Extraction with Shared Representations and Structured Prediction

Han¹,

Ning²,

Peng³

2019

View full text Add to dashboard Cite

We propose a joint event and temporal relation extraction model with shared representation learning and structured prediction. The proposed method has two advantages over existing work. First, it improves event representation by allowing the event and relation modules to share the same contextualized embeddings and neural representation learner. Second, it avoids error propagation in the conventional pipeline systems by leveraging structured inference and learning methods to assign both the event labels and the temporal relation labels jointly. Experiments show that the proposed method can improve both event extraction and temporal relation extraction over state-of-the-art systems, with the end-to-end F 1 improved by 10% and 6.8% on two benchmark datasets respectively.

show abstract

TORQUE: A Reading Comprehension Dataset of Temporal Ordering Questions

Ning¹,

Wu²,

Han³

et al. 2020

View full text Add to dashboard Cite

A critical part of reading is being able to understand the temporal relationships between events described in a passage of text, even when those relationships are not explicitly stated. However, current machine reading comprehension benchmarks have practically no questions that test temporal phenomena, so systems trained on these benchmarks have no capacity to answer questions such as "what happened before/after [some event]?" We introduce TORQUE, a new English reading comprehension benchmark built on 3.2k news snippets with 21k human-generated questions querying temporal relationships. Results show that RoBERTa-large achieves an exact-match score of 51% on the test set of TORQUE, about 30% behind human performance. 1

show abstract

High‐resolution ¹H‐MRSI of the brain using short‐TE SPICE

Lam

Ning

et al. 2016

Magnetic Resonance in Med

View full text Add to dashboard Cite

Purpose To improve signal-to-noise ratio (SNR) for high-resolution spectroscopic imaging using a subspace-based technique known as SPICE (SPectroscopic Imaging by exploiting spatiospectral CorrElation). Methods The proposed method is based on a union-of-subspaces model of MRSI signals, which exploits the partial separability properties of water, lipid, baseline and metabolite signals. Enabled by this model, a special scheme is used for accelerated data acquisition, which includes a double-echo CSI component used to collect a “training” dataset (for determination of the basis functions) and a short-TE EPSI component used to collect a sparse “imaging” dataset (for determination of the overal spatiospectral distributions). A set of signal processing algorithms are developed to remove the water and lipid signals and jointly reconstruct the metabolite and baseline signals. Results In vivo 1H-MRSI results show that the proposed method can effectively remove the remaining water and lipid signals from sparse MRSI data acquired at 20 ms TE. Spatiospectral distributions of metabolite signals at 2 mm in-plane resolution with good SNR were obtained in a 15.5 min scan. Conclusions The proposed method can effectively remove nuisance signals and reconstruct high-resolution spatiospectral functions from sparse data to make short-TE SPICE possible. The method should prove useful for high-resolution 1H-MRSI of the brain with improved SNR.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

334 Leonard St

Brooklyn, NY 11211

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Qiang Ning

Evaluating Models’ Local Decision Boundaries via Contrast Sets

A Multi-Axis Annotation Scheme for Event Temporal Relations

A Structured Learning Approach to Temporal Relation Extraction

“Going on a vacation” takes longer than “Going for a walk”: A Study of Temporal Commonsense Understanding

Joint Reasoning for Temporal and Causal Relations

Joint Event and Temporal Relation Extraction with Shared Representations and Structured Prediction

TORQUE: A Reading Comprehension Dataset of Temporal Ordering Questions

High‐resolution ¹H‐MRSI of the brain using short‐TE SPICE

Contact Info

Product

Resources

About

Qiang Ning

Evaluating Models’ Local Decision Boundaries via Contrast Sets

A Multi-Axis Annotation Scheme for Event Temporal Relations

A Structured Learning Approach to Temporal Relation Extraction

“Going on a vacation” takes longer than “Going for a walk”: A Study of Temporal Commonsense Understanding

Joint Reasoning for Temporal and Causal Relations

Joint Event and Temporal Relation Extraction with Shared Representations and Structured Prediction

TORQUE: A Reading Comprehension Dataset of Temporal Ordering Questions

High‐resolution 1H‐MRSI of the brain using short‐TE SPICE

Contact Info

Product

Resources

About

High‐resolution ¹H‐MRSI of the brain using short‐TE SPICE