Bruno Taillé scite author profile

Bruno Taillé

3Publications

35Citation Statements Received

75Citation Statements Given

How they've been cited

How they cite others

Affiliations

BNP Paribas (France), Laboratoire de Recherche en Informatique de Paris 6

Publications

Order By: Most citations

Let’s Stop Incorrect Comparisons in End-to-end Relation Extraction!

Taillé¹,

Guigue²,

Scoutheeten³

et al. 2020

View full text Add to dashboard Cite

Despite efforts to distinguish three different evaluation setups (Bekoulis et al., 2018a,b), numerous end-to-end Relation Extraction (RE) articles present unreliable performance comparison to previous work. In this paper, we first identify several patterns of invalid comparisons in published papers and describe them to avoid their propagation. We then propose a small empirical study to quantify the most common mistake's impact and evaluate it leads to overestimating the final RE performance by around 5% on ACE05. We also seize this opportunity to study the unexplored ablations of two recent developments: the use of language model pretraining (specifically BERT) and span-level NER. This meta-analysis emphasizes the need for rigor in the report of both the evaluation setting and the dataset statistics. We finally call for unifying the evaluation setting in end-to-end RE 1 .

show abstract

Contextualized Embeddings in Named-Entity Recognition: An Empirical Study on Generalization

Taillé

Guigue

Gallinari

2020

View full text Add to dashboard Cite

Contextualized embeddings use unsupervised language model pretraining to compute word representations depending on their context. This is intuitively useful for generalization, especially in Named-Entity Recognition where it is crucial to detect mentions never seen during training. However, standard English benchmarks overestimate the importance of lexical over contextual features because of an unrealistic lexical overlap between train and test mentions. In this paper, we perform an empirical analysis of the generalization capabilities of state-of-the-art contextualized embeddings by separating mentions by novelty and with out-of-domain evaluation. We show that they are particularly beneficial for unseen mentions detection, especially out-of-domain. For models trained on CoNLL03, language model contextualization leads to a +1.2% maximal relative micro-F1 score increase in-domain against +13% out-of-domain on the WNUT dataset 1 .

show abstract

Let's Stop Incorrect Comparisons in End-to-end Relation Extraction!

Taillé¹,

Guigue²,

Scoutheeten³

et al. 2020

Preprint

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Bruno Taillé

Let’s Stop Incorrect Comparisons in End-to-end Relation Extraction!

Contextualized Embeddings in Named-Entity Recognition: An Empirical Study on Generalization

Let's Stop Incorrect Comparisons in End-to-end Relation Extraction!

Contact Info

Product

Resources

About