2023
DOI: 10.1049/rsn2.12525
|View full text |Cite
|
Sign up to set email alerts
|

Speaker identification using Ultra‐Wideband measurement of voice

Haoxuan Li,
Chong Tang,
Shelly Vishwakarma
et al.

Abstract: Voice identification is being increasingly adopted in various domains, including security infrastructures, intelligent home systems, and personalised digital assistants. Notably, it harbours significant promise in transforming healthcare, especially in electronic health record detecting and speech impairment monitoring such as aphasia. Current strategies such as acoustic models based on deep learning, voice bio‐metrics, and spectrogram analysis, have been identified with several drawbacks including vulnerabili… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
2

Relationship

1
1

Authors

Journals

citations
Cited by 2 publications
(4 citation statements)
references
References 17 publications
0
4
0
Order By: Relevance
“…Zhenghui Li et al introduce an innovative approach to radarbased human activity recognition across six domains, with adaptive thresholding and holistic optimisation, significantly improving classification accuracy [2]. Li et al propose a groundbreaking voice identification method using Ultra-Wideband technology, leveraging micro-Doppler shifts during speech production to achieve close to 90% accuracy in healthcare applications [3]. [9].…”
Section: Papers In the Special Issuementioning
confidence: 99%
“…Zhenghui Li et al introduce an innovative approach to radarbased human activity recognition across six domains, with adaptive thresholding and holistic optimisation, significantly improving classification accuracy [2]. Li et al propose a groundbreaking voice identification method using Ultra-Wideband technology, leveraging micro-Doppler shifts during speech production to achieve close to 90% accuracy in healthcare applications [3]. [9].…”
Section: Papers In the Special Issuementioning
confidence: 99%
“…Obtaining large-scale, accurately annotated datasets of aphasic speech is challenging due to the time-consuming nature of manual annotation and the limited availability of such data [15,16,19,24,25]. Without sufficient data for training, deep learning models may struggle to generalize effectively across different types and severities of aphasia, leading to reduced performance and reliability in real-world applications [22,26].…”
Section: Limited Annotated Datamentioning
confidence: 99%
“…Balancing model complexity with computational efficiency is essential for developing effective deep learning solutions for aphasia [25,26]. While complex models may offer superior performance in recognizing the nuanced features of aphasic speech, they often come with increased computational costs and resource requirements, making them impractical for real-time applications or deployment on resource-constrained devices.…”
Section: Optimal Model Complexitymentioning
confidence: 99%
See 1 more Smart Citation