2011
DOI: 10.1007/s12046-011-0050-4

Current trends in multilingual speech processing

Abstract: In this paper, we describe recent work at Idiap Research Institute in the domain of multilingual speech processing and provide some insights into emerging challenges for the research community. Multilingual speech processing has been a topic of ongoing interest to the research community for many years and the field is now receiving renewed interest owing to two strong driving forces. Firstly, technical advances in speech recognition and synthesis are posing new challenges and opportunities to researc…

Cited by 28 publications (16 citation statements)
References 86 publications
“…In another direction, modelling speakers with a speaker discriminative Deep Neural Network (DNN) has shown good performance for SV [7,8]. Motivated by the success of DNNs in the context of speaker, speech [9,10] and image recognition tasks, we explore the application of DNNs for the Random-digit task. We believe that the DNN based speaker embedding features can be useful for representing the invariant speaker characteristics.…”
Section: Introduction (mentioning)
confidence: 99%
“…Expanding the coverage of the world's languages in Automatic Speech Recognition (ASR) and Text-to-Speech (TTS) systems have been attracting much interest in both academia and industry [1,2]. Conventional phonetically-based speech processing systems require pronunciation dictionaries that map phonetic units to words.…”
Section: Introduction (mentioning)
confidence: 99%
“…In recent years, cross-lingual speech synthesis has been a popular topic in text-to-speech synthesis (TTS) research [1], [2]. Since cross-lingual speech synthesis can synthesize speech in different languages with the same or a different speaker's voice, it has been widely used in human-computer interaction,…”
Section: Introduction (mentioning)
confidence: 99%