2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2018
DOI: 10.1109/icassp.2018.8461545
|View full text |Cite
|
Sign up to set email alerts
|

Bayesian Models for Unit Discovery on a Very Low Resource Language

Abstract: Developing speech technologies for low-resource languages has become a very active research field over the last decade. Among others, Bayesian models have shown some promising results on artificial examples but still lack of in situ experiments. Our work applies state-of-the-art Bayesian models to unsupervised Acoustic Unit Discovery (AUD) in a real low-resource language scenario. We also show that Bayesian models can naturally integrate information from other resourceful languages by means of informative prio… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
15
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
5
3

Relationship

4
4

Authors

Journals

citations
Cited by 16 publications
(15 citation statements)
references
References 14 publications
0
15
0
Order By: Relevance
“…There are two main research strands in UAM. The first strand formulates the problem as discovering a finite set of phoneme-like speech units [5], [6], [12]. This is often referred to as acoustic unit/model discovery (AUD) [5], [8].…”
Section: Introductionmentioning
confidence: 99%
“…There are two main research strands in UAM. The first strand formulates the problem as discovering a finite set of phoneme-like speech units [5], [6], [12]. This is often referred to as acoustic unit/model discovery (AUD) [5], [8].…”
Section: Introductionmentioning
confidence: 99%
“…The base measure p(η) defines a prior probability that a soundrepresented by an HMM with parameters η-is an acoustic unit. Earlier works on Bayesian AUD [8,9,15,16] use exponential family distributions as the base measure. These distributions, while mathematically convenient since they form conjugate priors, do not incorporate any knowledge about phones.…”
Section: Problem Definitionmentioning
confidence: 99%
“…2 Toplines and baselines. A baseline system is provided, consisting of a pipeline with a nonparametric Bayesian acoustic unit discovery system [6,7], and a parametric speech synthesizer based on Merlin [8]. As linguistic features, we use contextual information (leading and preceding phones, number of preceding and following phones in current sentence), but no features related to prosody, articulatory features (vowel, nasal, and so on), or part-of-speech (noun, verb, adjective, and so on).…”
Section: Unsupervised Unit Discovery For Speech Synthesismentioning
confidence: 99%