2014
DOI: 10.1016/j.csl.2013.05.002
|View full text |Cite
|
Sign up to set email alerts
|

Unsupervised training of an HMM-based self-organizing unit recognizer with applications to topic classification and keyword discovery

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
50
0

Year Published

2015
2015
2023
2023

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 73 publications
(50 citation statements)
references
References 5 publications
0
50
0
Order By: Relevance
“…Each speech segment is represented by the mean of frame-level spectral feature vectors (e.g., MFCC), and -means is used to cluster the mean feature vectors. Segmental GMM (SGMM) [22] is another segment labeling method that is commonly used in the literature [7]- [19]. SGMM explicitly models the dynamic trajectory of the spectral features within each segment with a polynomial function of time.…”
Section: A Unsupervised Acoustic Modeling Techniquesmentioning
confidence: 99%
See 2 more Smart Citations
“…Each speech segment is represented by the mean of frame-level spectral feature vectors (e.g., MFCC), and -means is used to cluster the mean feature vectors. Segmental GMM (SGMM) [22] is another segment labeling method that is commonly used in the literature [7]- [19]. SGMM explicitly models the dynamic trajectory of the spectral features within each segment with a polynomial function of time.…”
Section: A Unsupervised Acoustic Modeling Techniquesmentioning
confidence: 99%
“…In [7], [19], [20], an approach similar to ASM was investigated, and the discovered speech units were referred to as self-organized units (SOUs). As discussed in Section I, the ASM framework consists of three stages.…”
Section: A Unsupervised Acoustic Modeling Techniquesmentioning
confidence: 99%
See 1 more Smart Citation
“…Figure 6 shows an example of a two level hierarchical representation of a speech signal. On the first hierarchical level the aim is to discover the acoustic building blocks of speech, the phonemes, and to learn a statistical model for each of them, the acoustic model [11,56,53,47]. In speech recognition, the acoustic model usually consists of Hidden Markov Models (HMMs), where each HMM emits a time series of vectors of cepstral coefficients.…”
Section: Representation Learning From Sequential Datamentioning
confidence: 99%
“…A similar model but, without constraints on the topology of the HMMs was studied in [12]. Siu et al [16] first use a segmental GMM (SGMM) to generate a transcription of the data and then iteratively train a standard HMM to improve the transcriptions. Note that the number of allowed states are here defined in advance.…”
Section: Introductionmentioning
confidence: 99%