2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD) 2017
DOI: 10.1109/sped.2017.7990428

The SWARA speech corpus: A large parallel Romanian read speech dataset

Cited by 22 publications (8 citation statements)
References 7 publications
“…High-quality speech corpora is essential in neural speech synthesis. In this work we start from the large parallel Romanian dataset called SWARA [20]. SWARA contains 17 volunteer speakers each reading aloud between 1000 and 1500 utterances (the same across all speakers) in a controlled studio environment.…”
Section: Speech Corpus
Citation type: mentioning
confidence: 99%
“…The training data for our systems consists of the SWARA Romanian multispeaker parallel corpus [24]. It includes 18 speakers: 10 female and 8 male voices, with the number of utterances per speaker being between 1000 and 1500.…”
Section: A Training Data and Speaker Data Augmentation
Citation type: mentioning
confidence: 99%
“…The Irish script was generated from the Corpas na Gaeilge Comhaimseartha (Corpus of Contemporary Irish) [17]. The Romanian script was developed using the The SWARA Speech Corpus [18]. Not all accents have unique recording scripts.…”
Section: Current Resources 21 Language Resources
Citation type: mentioning
confidence: 99%