2021
DOI: 10.30684/etj.v39i1b.1905
|View full text |Cite
|
Sign up to set email alerts
|

Age Estimation in Short Speech Utterances Based on Bidirectional Gated-Recurrent Neural Networks

Abstract: Recently, age estimates from speech have received growing interest as they are important for ‎many applications like custom call routing, targeted marketing, or user-profiling. In this work, an ‎automatic system to estimate age in short speech utterances without ‎depending on the text is proposed. From each utterance frame, four ‎groups of features are extracted and then 10 statistical functionals are measured for each ‎extracted dimension of the features, to be followed by dimensionality reduction using Linea… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
3
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(1 citation statement)
references
References 24 publications
(36 reference statements)
0
1
0
Order By: Relevance
“…Multiple acoustic aspects of a speaker's speech have been retrieved by numerous research, however, it is still unclear which acoustic elements are most appropriate for the various tasks of speaker profiling [10]. Furthermore, despite ongoing research, accurately estimating height, age, and gender with a minimal feature set using advanced machine learning techniques is still difficult [11]. This is because there are many sources of variability that overlap, including the speaker's gender, health, and emotional state, which can all affect speech as well as the design of the sound production system.…”
Section: Introductionmentioning
confidence: 99%
“…Multiple acoustic aspects of a speaker's speech have been retrieved by numerous research, however, it is still unclear which acoustic elements are most appropriate for the various tasks of speaker profiling [10]. Furthermore, despite ongoing research, accurately estimating height, age, and gender with a minimal feature set using advanced machine learning techniques is still difficult [11]. This is because there are many sources of variability that overlap, including the speaker's gender, health, and emotional state, which can all affect speech as well as the design of the sound production system.…”
Section: Introductionmentioning
confidence: 99%