2017
DOI: 10.3390/app7121293
|View full text |Cite
|
Sign up to set email alerts
|

Audio Time Stretching Using Fuzzy Classification of Spectral Bins

Abstract: A novel method for audio time stretching has been developed. In time stretching, the audio signal's duration is expanded, whereas its frequency content remains unchanged. The proposed time stretching method employs the new concept of fuzzy classification of time-frequency points, or bins, in the spectrogram of the signal. Each time-frequency bin is assigned, using a continuous membership function, to three signal classes: tonalness, noisiness, and transientness. The method does not require the signal to be exp… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
14
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
4
3
2

Relationship

1
8

Authors

Journals

citations
Cited by 28 publications
(30 citation statements)
references
References 21 publications
0
14
0
Order By: Relevance
“…The difference in quality between β < 1 and β > 1 was mentioned briefly by Sylvestre and Kabal (1992) and shown in early results from this testing in (Roberts and Paliwal, 2019). Since the release of the MATLAB TSM Toolbox , the included algorithms, PV, IPL, WSOLA and HPTSM, have been used in most evaluations, while comparisons to commercial algorithms are rare (Damskägg and Välimäki, 2017;Karrer et al, 2006). The source audio used during testing also varies between papers with some papers using the files provided with the MATLAB TSM Toolbox.…”
Section: Algorithms and Quality Evaluationmentioning
confidence: 95%
See 1 more Smart Citation
“…The difference in quality between β < 1 and β > 1 was mentioned briefly by Sylvestre and Kabal (1992) and shown in early results from this testing in (Roberts and Paliwal, 2019). Since the release of the MATLAB TSM Toolbox , the included algorithms, PV, IPL, WSOLA and HPTSM, have been used in most evaluations, while comparisons to commercial algorithms are rare (Damskägg and Välimäki, 2017;Karrer et al, 2006). The source audio used during testing also varies between papers with some papers using the files provided with the MATLAB TSM Toolbox.…”
Section: Algorithms and Quality Evaluationmentioning
confidence: 95%
“…Fuzzy classification of spectral bins (FuzzyPV) (Damskägg and Välimäki, 2017), is an extension of the IPL. Spectral bins are given a degree of membership to three classes, sinusoidal, noise and transient, resulting in a fuzzy classification of each bin.…”
Section: Algorithms and Quality Evaluationmentioning
confidence: 99%
“…Damskägg and Välimäki [4] address a problem known as time-scale modification in which the objective is to temporally stretch or compress a given audio signal while preserving properties like pitch and timbre. To handle different signal characteristics, the main idea of the paper is to modify the phase of the signal's time-frequency bins in an adaptive fashion using an implicit bin-wise fuzzy classification based on three classes (sinusoid, noise, transient).…”
Section: Audio Signal Processingmentioning
confidence: 99%
“…The testing subset, containing 240 files, was created using three additional methods to time-scale 20 reference files at a random β in each band of 0.25 < β < 0.5, 0.5 < β < 0.8, 0.8 < β < 1 and 1 < β < 2. Elastique by Zplane Development, the Phase Vocoder using fuzzy classification of bins (FuzzyPV) by Damskägg and Välimäki (2017) and Non-Negative Matrix Factorisation Time-Scale Modification (NMFTSM) by Roma et al (2019) were used to generate the testing subset. Finally, an evaluation subset was generated by processing the testing subset reference files with all previously mentioned methods, in addition to the Scaled Phase-Locking Phase Vocoder (SPL) by Laroche and Dolson (1999), IPL and SPL variants of PhaVoRIT (IPL and SPL) by Karrer et al (2006) and Epoch Synchronous Overlap-Add (ESOLA) by Rudresh et al (2018).…”
mentioning
confidence: 99%