2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU) 2017
DOI: 10.1109/asru.2017.8269012
|View full text |Cite
|
Sign up to set email alerts
|

Composite embedding systems for ZeroSpeech2017 Track1

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

1
48
0

Year Published

2017
2017
2022
2022

Publication Types

Select...
4
2

Relationship

0
6

Authors

Journals

citations
Cited by 15 publications
(49 citation statements)
references
References 14 publications
1
48
0
Order By: Relevance
“…A DNN was trained using these labels to generate BNF or posteriorgram representation. In [5], [14], language-mismatched ASR systems were utilized to decode the target speech, and frame labels were generated from the ASR decoding lattices. In [30], BNF representation was generated by applying multi-task learning with both indomain and out-of-domain data [25].…”
Section: Related Work a Deep Learning Approaches To Unsupervisementioning
confidence: 99%
See 4 more Smart Citations
“…A DNN was trained using these labels to generate BNF or posteriorgram representation. In [5], [14], language-mismatched ASR systems were utilized to decode the target speech, and frame labels were generated from the ASR decoding lattices. In [30], BNF representation was generated by applying multi-task learning with both indomain and out-of-domain data [25].…”
Section: Related Work a Deep Learning Approaches To Unsupervisementioning
confidence: 99%
“…The frame labels for out-of-domain data were obtained by HMM forced alignment, while the labels for in-domain data were from DPGMM clustering [12]. In [5], [14], [31], a DNN AM was trained with transcribed data of an out-of-domain language, and used to extract BNFs or posteriorgrams from target speech.…”
Section: Related Work a Deep Learning Approaches To Unsupervisementioning
confidence: 99%
See 3 more Smart Citations