The platform will undergo maintenance on Sep 14 at about 7:45 AM EST and will be unavailable for approximately 2 hours.
Interspeech 2017 2017
DOI: 10.21437/interspeech.2017-790
|View full text |Cite
|
Sign up to set email alerts
|

Vowel Onset Point Detection Using Sonority Information

Abstract: Vowel onset point (VOP) refers to the starting event of a vowel, that may be reflected in different aspects of the speech signal. The major issue in VOP detection using existing methods is the confusion among the vowels and other categories of sounds preceding them. This work explores the usefulness of sonority information to reduce this confusion and improve VOP detection. Vowels are the most sonorant sounds followed by semivowels, nasals, voiced fricatives, voiced stops. The sonority feature is derived from … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2019
2019
2021
2021

Publication Types

Select...
4
1

Relationship

2
3

Authors

Journals

citations
Cited by 5 publications
(2 citation statements)
references
References 15 publications
0
2
0
Order By: Relevance
“…Using this manually marked starting label, we synchronize the source (loudspeaker) signal and the 4-channel recorded audio signals. Considering the start of the audio as an anchor point, we segment all the sample sounds with energy based evidence [27,28,29] and manual observation. In this way, we achieve 988 segmented audio files and a TSP signal for each DOA angle.…”
Section: Post-processingmentioning
confidence: 99%
“…Using this manually marked starting label, we synchronize the source (loudspeaker) signal and the 4-channel recorded audio signals. Considering the start of the audio as an anchor point, we segment all the sample sounds with energy based evidence [27,28,29] and manual observation. In this way, we achieve 988 segmented audio files and a TSP signal for each DOA angle.…”
Section: Post-processingmentioning
confidence: 99%
“…Similarly, entropy is considered as an evidence to detect the speech in noisy conditions [16]. The vowel-like regions belong to high SNR portion of speech signals and are less affected by noise [17][18][19]. Similarly, glottal activity detection and sonorant region detection are performed to identify the speech regions in a noisy scenario [20,21].…”
Section: Introductionmentioning
confidence: 99%