2016
DOI: 10.1007/978-3-319-43958-7_40

HAVRUS Corpus: High-Speed Recordings of Audio-Visual Russian Speech

Cited by 26 publications (13 citation statements)
References 10 publications
“…Among them we find the single speaker RM-3000 corpus [54] which contains a vocabulary of 1,000 different words and 3,000 utterances. In contrast, we find several multi-speaker databases, namely OuluVS2 [82], TCD-TIMIT [84], HAVRUS [85], IBM AV-ASR [83], VLRF [37] and AV Digits [86], which contain 53, 62, 20, 262, 24 and 53 subjects, respectively. OuluVS2 contains recordings of speakers uttering phrases and sentences; each speaker repeated three times a set of 10 daily-use phrases (similar to OuluVS) and read 10 TIMIT sentences randomly chosen from a total of 530 sentences.…”
Section: Word and Sentence Recognition
Citation type: mentioning
confidence: 99%
“…OuluVS2 contains recordings of speakers uttering phrases and sentences; each speaker repeated three times a set of 10 daily-use phrases (similar to OuluVS) and read 10 TIMIT sentences randomly chosen from a total of 530 sentences. On the other hand, the TCD-TIMIT dataset contains more than 6,900 different sentences and nearly 14,000 utterances while the HAVRUS database [85], in Russian, provides 4,000 utterances from 20 speakers. The IBM AV-ASR database is a large corpus whose sentences contain more than 10,000 words, but unfortunately it is not publicly available.…”
Section: Word and Sentence Recognition
Citation type: mentioning
confidence: 99%
“…IV2 [192] database was a sentence level database based on French, with 300 people participating in the recording, each speaking 15 French sentences. There are also Czech databases UWB-05-HSAVC [187] and UWB-07-ICAV [193], NDUTAVSC [168] database for German, HAVRUS [199] database for Russian, BL [196] database for French, and VLRF [202] database for Spanish.…”
Section: Word Phrase and Sentence Recognition
Citation type: mentioning
confidence: 99%
“…The HAVRUS corpus of audio-visual Russian speech with high-speed video recordings [3] was used as the database. The corpus consists of recordings of 20 Russian speakers (10 men and 10 women), each of whom uttered 200 selected phrases: the 130 training phrases were taken from two phonetically representative texts and were the same for all speakers, while the 70 test phrases were telephone numbers and differed from speaker to speaker.…”
unclassified