Neville Ryant scite author profile

Abstract. Lexical classifications have proved useful in supporting various natural language processing (NLP) tasks. The largest verb classification for English is Levin's (1993) work which defined groupings of verbs based on syntactic and semantic properties. VerbNet (Kipper et al., 2000;Kipper-Schuler, 2005) -the largest computational verb lexicon currently available for English -provides detailed syntactic-semantic descriptions of Levin classes. While the classes included are extensive enough for some NLP use, they are not comprehensive. Korhonen and Briscoe (2004) have proposed a significant extension of Levin's classification which incorporates 57 novel classes for verbs not covered (comprehensively) by Levin. Korhonen and Ryant (2005) have recently supplemented this with another extension including 53 additional classes. This article describes the integration of these two extensions into VerbNet. The result is a comprehensive Levin-style classification for English verbs providing over 90% token coverage of the Proposition Bank data and thus can be highly useful for practical applications.

show abstract

The Second DIHARD Diarization Challenge: Dataset, Task, and Baselines

Ryant¹,

Church²,

Cieri³

et al. 2019

140

117

View full text Add to dashboard Cite

This paper introduces the second DIHARD challenge, the second in a series of speaker diarization challenges intended to improve the robustness of diarization systems to variation in recording equipment, noise conditions, and conversational domain. The challenge comprises four tracks evaluating diarization performance under two input conditions (single channel vs. multi-channel) and two segmentation conditions (diarization from a reference speech segmentation vs. diarization from scratch). In order to prevent participants from overtuning to a particular combination of recording conditions and conversational domain, recordings are drawn from a variety of sources ranging from read audiobooks to meeting speech, to child language acquisition recordings, to dinner parties, to web video. We describe the task and metrics, challenge design, datasets, and baseline systems for speech enhancement, speech activity detection, and diarization. 1 See, for instance, the release of IBM's diarization API in 2017. The feature worked well for simple cases, but when run by users on real inputs, the performance was found to be lacking, especially for overlaps, back-channels, and short turns.

show abstract

CHiME-6 Challenge: Tackling Multispeaker Speech Recognition for Unsegmented Recordings

Watanabe¹,

Mandel²,

Barker³

et al. 2020

167

108

View full text Add to dashboard Cite

From Light to Rich ERE: Annotation of Entities, Relations, and Events

Song¹,

Bies²,

Strassel³

et al. 2015

View full text Add to dashboard Cite

We describe the evolution of the Entities, Relations and Events (ERE) annotation task, created to support research and technology development within the DARPA DEFT program. We begin by describing the specification for Light ERE annotation, including the motivation for the task within the context of DEFT. We discuss the transition from Light ERE to a more complex Rich ERE specification, enabling more comprehensive treatment of phenomena of interest to DEFT.

show abstract

CHiME-6 Challenge:Tackling Multispeaker Speech Recognition for Unsegmented Recordings

Watanabe¹,

Mandel

Barker³

et al. 2020

Preprint

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Neville Ryant

A large-scale classification of English verbs

The Second DIHARD Diarization Challenge: Dataset, Task, and Baselines

CHiME-6 Challenge: Tackling Multispeaker Speech Recognition for Unsegmented Recordings

From Light to Rich ERE: Annotation of Entities, Relations, and Events

CHiME-6 Challenge:Tackling Multispeaker Speech Recognition for Unsegmented Recordings

Contact Info

Product

Resources

About