2022
DOI: 10.48550/arxiv.2203.16844
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Open Source MagicData-RAMC: A Rich Annotated Mandarin Conversational(RAMC) Speech Dataset

Abstract: This paper introduces a high-quality rich annotated Mandarin conversational (RAMC) speech dataset called MagicData-RAMC. The MagicData-RAMC corpus contains 180 hours of conversational speech data recorded from native speakers of Mandarin Chinese over mobile phones with a sampling rate of 16 kHz. The dialogs in MagicData-RAMC are classified into 15 diversified domains and tagged with topic labels, ranging from science and technology to ordinary life. Accurate transcription and precise speaker voice activity tim… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
2

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(3 citation statements)
references
References 31 publications
(34 reference statements)
0
3
0
Order By: Relevance
“…• R05: MAGICDATA Mandarin Chinese Conversational Speech Corpus [22]. This dataset contains 180 hours of conversational speech from 633 speakers, recorded by mobile phone in a quiet environment.…”
Section: Design Policymentioning
confidence: 99%
“…• R05: MAGICDATA Mandarin Chinese Conversational Speech Corpus [22]. This dataset contains 180 hours of conversational speech from 633 speakers, recorded by mobile phone in a quiet environment.…”
Section: Design Policymentioning
confidence: 99%
“…We evaluate our proposed method on two Chinese conversation datasets: MagicData-RAMC [44] and HKUST [45]. The HKUST dataset comprises telephone recordings of conversations, while the MagicData-RAMC dataset consists of microphone recordings of conversations captured in a quiet environment.…”
Section: A Datasetmentioning
confidence: 99%
“…1) MagicData-RAMC: The MagicData-RAMC dataset [44] comprises 180 hours of Chinese conversational speech data, distributed as 150 hours for the training set, 20 hours for the development set, and 10 hours for the test set. The dataset features conversations from 663 distinct speakers.…”
Section: A Datasetmentioning
confidence: 99%