Evaluating Learning Language Representations

Karlgren, Jussi; Callin, Jimmy; Collins-Thompson, Kevyn; Gyllensten, Amaru Cuba; Ekgren, Ariel; Jurgens, David; Korhonen, Anna; Olsson, Fredrik; Sahlgren, Magnus; Schütze, Hinrich

doi:10.1007/978-3-319-24027-5_25

Cited by 4 publications

(2 citation statements)

References 5 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Evaluating learning lexical resources is challenging for several reasons [1]. Firstly, operationalizable intrinsic measures for knowledge based models risk being irrelevant for system performance or measure outcome, rather than learning process.…”

Section: Know That the Component Does What Is Expected Of It;mentioning

confidence: 99%

“…To generate this task, we create a set of sentences containing a number of sentences which occurred naturally and have remained unaltered, as well as a number of sentences which occurred naturally but have had one or more words within them swapped for some other word. We call this set a coconut, as per examples given by Karlgren et al [1]. The task is then to sort these sentences in order of likelihood of having occurred naturally.…”

Section: Plausible Utterancesmentioning

confidence: 99%

See 1 more Smart Citation

Plausibility Testing for Lexical Resources

Parks

Karlgren

Stymne

2017

Lecture Notes in Computer Science

Self Cite

View full text Add to dashboard Cite

Abstract. This paper describes principles for evaluation metrics for lexical components and an implementation of them based on requirements from practical information systems. Evaluating information system componentsThe performance of a component in a complex processing pipeline can influence the function of downstream components, meaning that end-to-end testing also must be performed on entire systems, using approaches based on use cases with target notions that validate the function of the system for the purpose it is built, such as many of the evaluation measures formulated in workshops at CLEF. But a task-based evaluation does not reveal the performance of individual components. Evaluation of knowledge-based components in an information system should be done systematically, ideally in ways which are similar to unit tests done for other technical components, motivated by the need for a development and maintenance team to:

show abstract

Section: Know That the Component Does What Is Expected Of It;mentioning

confidence: 99%

Section: Plausible Utterancesmentioning

confidence: 99%