Interspeech 2018 2018
DOI: 10.21437/interspeech.2018-1772
|View full text |Cite
|
Sign up to set email alerts
|

Recognition of Echolalic Autistic Child Vocalisations Utilising Convolutional Recurrent Neural Networks

Abstract: Autism spectrum conditions (ASC) are a set of neurodevelopmental conditions partly characterised by difficulties with communication. Individuals with ASC can show a variety of atypical speech behaviours, including echolalia or the 'echoing' of another's speech. We herein introduce a new dataset of 15 Serbian ASC children in a human-robot interaction scenario, annotated for the presence of echolalia amongst other ASC vocal behaviours. From this, we propose a four-class classification problem and investigate the… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
6
0
1

Year Published

2019
2019
2024
2024

Publication Types

Select...
4
3
2

Relationship

3
6

Authors

Journals

citations
Cited by 15 publications
(7 citation statements)
references
References 22 publications
0
6
0
1
Order By: Relevance
“…The recent advancements in Deep Learning (DL) suggest that end-to-end learning (i.e., ML models that learn directly from the raw data and no features are needed) can outperform the classic, feature-based ML. For example, DL has achieved breakthrough performance in tasks such as pattern recognition problems [4], image processing [5], [6], natural language processing [7], [8], speech and audio processing [9], [10], and sensor data processing [11], [12]. For CHF detection, a successful combination of classic ML and end-to-end DL can outperform each single approach [13].…”
Section: Introductionmentioning
confidence: 99%
“…The recent advancements in Deep Learning (DL) suggest that end-to-end learning (i.e., ML models that learn directly from the raw data and no features are needed) can outperform the classic, feature-based ML. For example, DL has achieved breakthrough performance in tasks such as pattern recognition problems [4], image processing [5], [6], natural language processing [7], [8], speech and audio processing [9], [10], and sensor data processing [11], [12]. For CHF detection, a successful combination of classic ML and end-to-end DL can outperform each single approach [13].…”
Section: Introductionmentioning
confidence: 99%
“…CapsNets were originally introduced to compensate the generalisation problem of CNNs towards observing novel viewpoints in images [16], [24]. CNNs using spectrograms as input often encounter difficulties when recognising data from classes with fine-grained similarities [31], [45], [46]. With this in mind, we developed our framework around CapsNets to evaluate whether a better generalisation for spectrogram classification can be achieved.…”
Section: Discussionmentioning
confidence: 99%
“…spektrogramy, cepstrogramy, chromagramy itp.) [6,30,31] lub postać parametryczną sygnału muzycznego, czyli wektor cech (np. wektor współczynników mel-cepstralnych lub parametry wykorzystujące standard MPEG-7) [13,14,16].…”
Section: Klasyfikacja Emocjiunclassified