2023
DOI: 10.1109/access.2023.3243690
|View full text |Cite
|
Sign up to set email alerts
|

Streaming End-to-End Target-Speaker Automatic Speech Recognition and Activity Detection

Abstract: Automatic speech recognition of a target speaker in the presence of interfering speakers remains a challenging issue. One approach to tackle this problem is target-speaker speech recognition, which conditions the recognition process on an embedding that characterizes the voice of the target speaker. This enables recognizing only the speech of the target speaker while ignoring interferences. In this work, we propose an end-to-end target-speaker speech recognition system based on a neural transducer architecture… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 42 publications
0
1
0
Order By: Relevance
“…Target speaker tracking is employed in many speech-related tasks to retrieve the information of a specific speaker, including target speaker automatic speech recognition (TS-ASR) [55], target speaker speech separation [56] and TS-VAD [19]. The similarity of these tasks is that they all use a speaker profile to focus on the speech of interest, which consequently refines results corresponding to that particular speaker.…”
Section: Target Speaker Voice Activity Detectionmentioning
confidence: 99%
“…Target speaker tracking is employed in many speech-related tasks to retrieve the information of a specific speaker, including target speaker automatic speech recognition (TS-ASR) [55], target speaker speech separation [56] and TS-VAD [19]. The similarity of these tasks is that they all use a speaker profile to focus on the speech of interest, which consequently refines results corresponding to that particular speaker.…”
Section: Target Speaker Voice Activity Detectionmentioning
confidence: 99%