2008
DOI: 10.1109/icassp.2008.4518720
|View full text |Cite
|
Sign up to set email alerts
|

New approach to voiced onset detection in speech signal and its application for frame error concealment

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
3
0
3

Year Published

2010
2010
2023
2023

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 11 publications
(6 citation statements)
references
References 2 publications
0
3
0
3
Order By: Relevance
“…Since the only difference in these two front-ends is the filter bank type, a tentative reason is that the mel triangular filter shape has sharper onset than the gammatone filter, thus may render better characterization of the temporal masking effect. This is based on the theory in [14,15] that human ears tend to focus more on the onset of the power envelope than on the falling edge. We also found (not shown in this work) that in the presence of stationary noise, such as white noise or pink noise, the same experiment had different results: the gammatone+power front-end performed constantly better than the mel+power front-end.…”
Section: Comparison With Other Front-endsmentioning
confidence: 99%
“…Since the only difference in these two front-ends is the filter bank type, a tentative reason is that the mel triangular filter shape has sharper onset than the gammatone filter, thus may render better characterization of the temporal masking effect. This is based on the theory in [14,15] that human ears tend to focus more on the onset of the power envelope than on the falling edge. We also found (not shown in this work) that in the presence of stationary noise, such as white noise or pink noise, the same experiment had different results: the gammatone+power front-end performed constantly better than the mel+power front-end.…”
Section: Comparison With Other Front-endsmentioning
confidence: 99%
“…Based on the component energy distributed in specific frequencies, the burst and voicing onsets could be located. The same idea was also utilized by Lemyre et al, 5 who employed a TEO on bandpass filtered signals to derive subband energy profiles. Then the energy profiles were compared to appropriate thresholds to determine the locations of voicing onsets.…”
Section: Introductionmentioning
confidence: 97%
“…Предварительная сегментация речевого сигнала на различные фонетические группы используется во многих алгоритмах обработки и кодирования речи [1,2,3,4,5]. Обработка сигнала, учитывающая его характеристики, позволяет улучшить качество звука в устройствах кодирования и декодирования.…”
Section: Introductionunclassified
“…Существует множество алгоритмов разделения речи на различные классы звуков [1,2,3,4,5]. Их общей особенностью является зависимость от характеристик речевого сигнала, для обработки которого они предназначены, и, при применении их к сигналам с отличными характеристиками, качество разделения обычно ухудшается [4].…”
Section: Introductionunclassified