IEEE International Conference on Acoustics Speech and Signal Processing 1993
DOI: 10.1109/icassp.1993.319366
|View full text |Cite
|
Sign up to set email alerts
|

An overlap-add technique based on waveform similarity (WSOLA) for high quality time-scale modification of speech

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
178
0
5

Year Published

2001
2001
2017
2017

Publication Types

Select...
6
3

Relationship

0
9

Authors

Journals

citations
Cited by 279 publications
(183 citation statements)
references
References 4 publications
0
178
0
5
Order By: Relevance
“…시간 영역에서의 시간축 변환 기술의 대표적인 예 로는 Synchronized OverLap and Add(SOLA), [1] OverLapAdd technique based on Waveform Similarity(WSOLA), [2] Pitch Synchronized OverLap and Add(PSOLA) [3] [5,6] 참고문헌…”
Section: 임상준 김형순unclassified
“…시간 영역에서의 시간축 변환 기술의 대표적인 예 로는 Synchronized OverLap and Add(SOLA), [1] OverLapAdd technique based on Waveform Similarity(WSOLA), [2] Pitch Synchronized OverLap and Add(PSOLA) [3] [5,6] 참고문헌…”
Section: 임상준 김형순unclassified
“…That is, the original analysis phases are kept during synthesis for the transient bins. Subsequently, as the analysis window slides over the transient, the same gain reduction is applied for the transient bins, as during the onset of the transient (16). The bins are retained in the set of transient bins until their transientness decays to a value smaller than 0.5, or until the analysis frame slides completely away from the detected transient center.…”
Section: Transient Preservationmentioning
confidence: 99%
“…The main challenge in TSM is in simultaneously preserving the subjective quality of these distinct components. Standard time-domain TSM methods, such as the synchronized overlap-add (SOLA) [15], the waveform-similarity overlap-add [16], and the pitch-synchronous overlap-add [17] techniques, are considered to provide high-quality TSM for quasi-harmonic signals. When these methods are applied to polyphonic signals, however, only the most dominant periodic pattern of the input waveform is preserved, while other periodic components suffer from phase jump artifacts at the synthesis frame boundaries.…”
Section: Introductionmentioning
confidence: 99%
“…The existing discontinuity between the successive segments and the misaligned interpolation of them result to artefacts and distortions that are detrimental to speech quality. Waveform similarity OLA (WSOLA) [23], on the other hand, searches for a position that has a maximal local similarity (e.g., maximise the cross-correlation function) with the last…”
Section: Figmentioning
confidence: 99%