Findings of the Association for Computational Linguistics: EMNLP 2020 2020
DOI: 10.18653/v1/2020.findings-emnlp.239
|View full text |Cite
|
Sign up to set email alerts
|

Hierarchical Pre-training for Sequence Labelling in Spoken Dialog

Abstract: Sequence labelling tasks like Dialog Act and Emotion/Sentiment identification are a key component of spoken dialog systems. In this work, we propose a new approach to learn generic representations adapted to spoken dialog, which we evaluate on a new benchmark we call Sequence labellIng evaLuatIon benChmark fOr spoken laN-guagE benchmark (SILICONE). SILICONE is model-agnostic and contains 10 di↵erent datasets of various sizes. We obtain our representations with a hierarchical encoder based on transformer archit… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
10
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
6
3

Relationship

3
6

Authors

Journals

citations
Cited by 20 publications
(13 citation statements)
references
References 49 publications
0
10
0
Order By: Relevance
“…This work paves the way for using and developing new alternative methods to improve the learning (e.g new estimator of mu-tual information (Colombo et al, 2021a), Wasserstein Barycenters (Colombo et al, 2021b), Data Depths (Staerman et al, 2021), Extreme Value Theory (Jalalzai et al, 2020)). A future line of research involves using this methods for emotion (Colombo et al, 2019;Witon et al, 2018) and dialog act (Chapuis et al, 2021(Chapuis et al, , 2020a classification with pretrained model tailored for spoken language (Dinkar et al, 2020).…”
Section: Discussionmentioning
confidence: 99%
“…This work paves the way for using and developing new alternative methods to improve the learning (e.g new estimator of mu-tual information (Colombo et al, 2021a), Wasserstein Barycenters (Colombo et al, 2021b), Data Depths (Staerman et al, 2021), Extreme Value Theory (Jalalzai et al, 2020)). A future line of research involves using this methods for emotion (Colombo et al, 2019;Witon et al, 2018) and dialog act (Chapuis et al, 2021(Chapuis et al, , 2020a classification with pretrained model tailored for spoken language (Dinkar et al, 2020).…”
Section: Discussionmentioning
confidence: 99%
“…In addition, we use the TweetEval Dataset Xiong et al [2019] for tweet classification on sentiment (59,899), hate detection (12,970) and emotion recognition (5,052). Furthermore, the SILICONE Dataset Chapuis et al [2020] is also used for the tasks of emotion detection (Semaine 13,708) and utterance sentiment analysis (Meld-S 5,627).…”
Section: Datasetsmentioning
confidence: 99%
“…Among available contrast measures, the Fisher-Rao distance is parameter-free and thus, it is easy to use in practice while the AB-Divergence achieves better results but requires to select α and β. Future work includes extending our metrics to new tasks such as SLU (Chapuis et al 2020(Chapuis et al , 2021Dinkar et al 2020;Colombo, Clavel, and Piantanida 2021), controlled sentence generation (Colombo et al 2019(Colombo et al , 2021b and multi-modal learning (Colombo et al 2021a;Garcia et al 2019).…”
Section: Summary and Concluding Remarksmentioning
confidence: 99%