2021
DOI: 10.1109/tetc.2020.3003085
|View full text |Cite
|
Sign up to set email alerts
|

Bag of Models Based Embeddings for Assessment of Neurological Disorders Using Speech Intelligibility

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
1
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(2 citation statements)
references
References 30 publications
0
1
0
Order By: Relevance
“…We use speaker embeddings extracted from a pre-trained x-vector system [23] and apply principal component analysis (PCA) to reduce the dimensionality of these x-vectors from 512 to 32 to avoid overfitting. x-vectors have previously been used to estimate speech intelligibility [24] due to their ability to encode many properties about a speaker (i.e. speaking style, speaking rate, etc.)…”
Section: Datamentioning
confidence: 99%
“…We use speaker embeddings extracted from a pre-trained x-vector system [23] and apply principal component analysis (PCA) to reduce the dimensionality of these x-vectors from 512 to 32 to avoid overfitting. x-vectors have previously been used to estimate speech intelligibility [24] due to their ability to encode many properties about a speaker (i.e. speaking style, speaking rate, etc.)…”
Section: Datamentioning
confidence: 99%
“…Srinivasan et al [56] proposed a multi-view representation-based disordered speech recognition system based on auditory image-based features and cepstral characteristics, showing improved performance in recognizing very low intelligibility words compared to conventional methods. Chandrakala et al [57] presented a bag-ofmodels (BoM)-based approach that uses adjusted Gaussian mixture model (AGMM)-based embeddings for impaired speech-intelligibility evaluation. They tested the method on two datasets and discovered that it outperformed the supervector, hybrid GMM/SVM, i-vector, and x-vector-based techniques in terms of prediction error and reliability for intelligibilitylevel evaluation and score predictions.…”
Section: Assessing Speech-signal Impairmentsmentioning
confidence: 99%