2013 International Conference on Computing, Management and Telecommunications (ComManTel) 2013
DOI: 10.1109/commantel.2013.6482394
|View full text |Cite
|
Sign up to set email alerts
|

Speech/non-speech detection in Malay language spontaneous speech

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
4
0

Year Published

2014
2014
2020
2020

Publication Types

Select...
4
2
2

Relationship

1
7

Authors

Journals

citations
Cited by 8 publications
(4 citation statements)
references
References 16 publications
0
4
0
Order By: Relevance
“…Speech energy is closely related to the amplitude of the speech (Izzad et al, 2013). Instead of calculating the total energy of each frame, in this research the energy stability of the speech is measured based on the amplitude transition from one frame to another.…”
Section: Proposed Speech Energy Extraction Using Local Maximamentioning
confidence: 99%
See 1 more Smart Citation
“…Speech energy is closely related to the amplitude of the speech (Izzad et al, 2013). Instead of calculating the total energy of each frame, in this research the energy stability of the speech is measured based on the amplitude transition from one frame to another.…”
Section: Proposed Speech Energy Extraction Using Local Maximamentioning
confidence: 99%
“…The use of energy parameter is customary but not limited in endpoint detection only. It is also beneficial in consonant and vowel detection in (Izzad et al, 2013). However, sum of energy calculated from short time speech frame is unable to detect the energy variation from the consonant and vowel in the elongation.…”
mentioning
confidence: 99%
“…However, those observation was done with a goal to detect turn-taking, thus the major boundaries have role in signaling the completion of turn-taking unit of a speaker only at the end segment of the sentence. Similar approaches for audio sentence presented in [2,3] have been proposed to improve the automatic detection. However, PPh segment is usually defined as a group of words that carries the intended speech meaning.…”
Section: Introductionmentioning
confidence: 99%
“…Since users' sounds are usually natural and non-planned, they are usually illustrated by repetition, artificial initiate, partial words, discontinue in the core and re-starts, or additional linguistic occurrence such as cough etc.. The speech detection [2] section should be capable to extort away of the verbal communication indication, a speech progression permitting the semantic analyzer to infer the significance of the user's speech. The spontaneous speech recognition is mainly complicated due to representation divergence, for example when a person talks to another person as in conventions or discussions, interview or over telephone conversations.…”
Section: Introductionmentioning
confidence: 99%