TENCON 2015 - 2015 IEEE Region 10 Conference 2015
DOI: 10.1109/tencon.2015.7373137
|View full text |Cite
|
Sign up to set email alerts
|

Exploration of vowel onset and offset points for hybrid speech segmentation

Abstract: Automatic segmentation of speech using embedded reestimation of monophone hidden Markov models (HMMs) followed by forced alignment may not give accurate boundaries. Group delay (GD) processing for refining the boundaries at the syllable level is attempted earlier. This paper aims at exploring vowel onset point (VOP) and vowel offset or end point (VEP) for correcting the boundaries obtained using HMM alignment. HMM models the class information well, however may not detect the exact boundary. In case of VOPs and… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2016
2016
2021
2021

Publication Types

Select...
3
2

Relationship

2
3

Authors

Journals

citations
Cited by 5 publications
(3 citation statements)
references
References 11 publications
0
3
0
Order By: Relevance
“…Using this manually marked starting label, we synchronize the source (loudspeaker) signal and the 4-channel recorded audio signals. Considering the start of the audio as an anchor point, we segment all the sample sounds with energy based evidence [27,28,29] and manual observation. In this way, we achieve 988 segmented audio files and a TSP signal for each DOA angle.…”
Section: Post-processingmentioning
confidence: 99%
“…Using this manually marked starting label, we synchronize the source (loudspeaker) signal and the 4-channel recorded audio signals. Considering the start of the audio as an anchor point, we segment all the sample sounds with energy based evidence [27,28,29] and manual observation. In this way, we achieve 988 segmented audio files and a TSP signal for each DOA angle.…”
Section: Post-processingmentioning
confidence: 99%
“…Other approaches are based on Hidden Markov Model (HMM) (Daniel and James, 2017) such as (Lefevre et al, 2002) that combines a K-Means classifier with Hidden Markov Models in order to analyze audio segment using several audio features based either on segment or frame. Another method base on HMM is (Biswajit et al, 2015) that aims at exploring Vowel Onset Point (VOP) and Vowel offset or End Point (VEP) for correcting the boundaries obtained using HMM alignment. HMM models the class information well, but it may not detect the exact boundary.…”
Section: Word (N)mentioning
confidence: 99%
“…In literature, sonorant segmentation is performed by using mel frequency cepstral coefficients (MFCCs), knowledge based acoustic features or a combination of both [2], [24]. Recently in [23], [25], features based on both spectral and source information are proposed and a hierarchical algorithm is developed to detect sonorant and non-sonorant regions in continuous speech. However, the feature may not have potential to further divide the sonorant regions based on the degree of sonority associated with the sound.…”
Section: B Usefulness Of Sonority Featurementioning
confidence: 99%