Interspeech 2020 2020
DOI: 10.21437/interspeech.2020-2666
|View full text |Cite
|
Sign up to set email alerts
|

Ensembling End-to-End Deep Models for Computational Paralinguistics Tasks: ComParE 2020 Mask and Breathing Sub-Challenges

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
23
0
3

Year Published

2021
2021
2023
2023

Publication Types

Select...
3
3

Relationship

0
6

Authors

Journals

citations
Cited by 25 publications
(26 citation statements)
references
References 18 publications
0
23
0
3
Order By: Relevance
“…In an other effort of correlating speech signals with breathing signals, an ensemble system with fusion at both feature and decision level of two approaches is presented by Markitantov et al. [15] . One of the two approaches is a 1D-CNN based end-to-end model having two LSTM layers stacked above it.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…In an other effort of correlating speech signals with breathing signals, an ensemble system with fusion at both feature and decision level of two approaches is presented by Markitantov et al. [15] . One of the two approaches is a 1D-CNN based end-to-end model having two LSTM layers stacked above it.…”
Section: Introductionmentioning
confidence: 99%
“…The attention step is found to improve the metrics by 0.003 r-value absolute, from r = 0.728 to r = 0.731, and.726 % F1 value absolute, from 74.743 to 75.469 for the two tasks, respectively. All the three studies mentioned above [14] , [15] , [16] worked with the data set provided in the Breathing Sub-challenge of Interspeech 2020 ComParE [13] .…”
Section: Introductionmentioning
confidence: 99%
“…Markitantov et al. [58] submitted five different models to the MSC. These models are all based on two models, ResNet18v1 and ResNet18v2, which are variations of the standard ResNet18 [21] .…”
Section: Challenge Results and Contributionsmentioning
confidence: 99%
“…The approaches introduced in [58] are all generic audio-based approaches that depend on variations of the standard ResNet18 model. As such, they can easily be used for other audio tasks without much change.…”
Section: Challenge Results and Contributionsmentioning
confidence: 99%
See 1 more Smart Citation