Interspeech 2010 2010
DOI: 10.21437/interspeech.2010-614
|View full text |Cite
|
Sign up to set email alerts
|

Perceptual wavelet decomposition for speech segmentation

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
3
0

Year Published

2014
2014
2024
2024

Publication Types

Select...
4
1
1

Relationship

1
5

Authors

Journals

citations
Cited by 10 publications
(5 citation statements)
references
References 7 publications
0
3
0
Order By: Relevance
“…DNN-based structures have demonstrated superior effectiveness compared to alternative architectures employed in the Arabic ASR system [51]. Various well-known techniques such as Principal Component Analysis (PCA) [52], Independent Component Analysis (ICA) [53], Wavelet Analysis [54], and Linear Discriminant Analysis (LDA) [55] have been employed for deriving speech characteristics from acoustic waveform. Among these methods, PCA is commonly used to find patterns in input data.…”
Section: B Recent Advances In Asr Technologiesmentioning
confidence: 99%
“…DNN-based structures have demonstrated superior effectiveness compared to alternative architectures employed in the Arabic ASR system [51]. Various well-known techniques such as Principal Component Analysis (PCA) [52], Independent Component Analysis (ICA) [53], Wavelet Analysis [54], and Linear Discriminant Analysis (LDA) [55] have been employed for deriving speech characteristics from acoustic waveform. Among these methods, PCA is commonly used to find patterns in input data.…”
Section: B Recent Advances In Asr Technologiesmentioning
confidence: 99%
“…This often includes orthographic or phonetic transcription data, which contributes in parallel to the speech or training algorithms. Methods falling into this category encompass techniques using Hidden Markov Models (HMMs) [24], Dynamic Time Warping (DTW) [25], or Artificial Neural Networks (ANNs) [26,27].…”
Section: ) Automatic Segmentationmentioning
confidence: 99%
“…A substantial body of research adopts aided segmentation techniques. For example, B Ziółko [25] employed Discrete Wavelet Transform (DWT) analysis using the Wavelet Method to identify the start and end of phonemes. Kamarauskas [28] utilized perceptron and backpropagation artificial neural networks to recognize distinctive phonemes, with backpropagation showing lower recognition error rates but requiring longer training times.…”
Section: ) Automatic Segmentationmentioning
confidence: 99%
“…The methods for boundary detection can be based on using bidirectional LSTM networks, 39,40 wavelet analysis, [42][43][44] graph-based structural analysis, 45 rules describing the power spectrum 46 or formants 47 and various features extracted from the spectrogram, for example, visual features 48,49 or auditory attention features. 50 The methods for boundary detection also have a relevant application in the task of segmentation with orthographic or phonetic transcription provided, where they can be used as additional boundary correction procedures.…”
Section: Related Workmentioning
confidence: 99%
“…The methods for boundary detection can be based on using bidirectional LSTM networks, 39,40 wavelet analysis, 42‐44 graph‐based structural analysis, 45 rules describing the power spectrum 46 or formants 47 and various features extracted from the spectrogram, for example, visual features 48,49 or auditory attention features 50 …”
Section: Introductionmentioning
confidence: 99%