2016
DOI: 10.3390/app6110368

Spectral Envelope Transformation in Singing Voice for Advanced Pitch Shifting

Abstract: The aim of the present work is to perform a step towards more natural pitch shifting techniques in singing voice for its application in music production and entertainment systems. In this paper, we present an advanced method to achieve natural modifications when applying a pitch shifting process to singing voice by modifying the spectral envelope of the audio excerpt. To this end, an all-pole model has been selected to model the spectral envelope, which is estimated using a constrained non-linear opti…

Citations: cited by 4 publications (6 citation statements)
References: 30 publications (33 reference statements)
“…The length of the median filter in the frequency direction was 500 Hz, which corresponds to 46 bins. In the time direction, the length of the median filter was chosen to be 200 ms, but the number of frames it corresponds to depends on the analysis hop size, which is determined by the TSM factor according to (10). Finally, the transient detection threshold was set to t_d = 10^-4 = 0.00010.…”
Section: Discussion
Citation type: mentioning
confidence: 99%
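
The unit conversions in the statement above can be illustrated with a minimal sketch. The quoted text gives neither the sample rate nor the FFT size, so the values below (44.1 kHz and a 4096-point FFT) are assumptions, picked only because they reproduce the stated 46 bins. The hop sizes are likewise hypothetical, and the function names are illustrative rather than taken from the citing paper.

```python
def hz_to_bins(width_hz, sample_rate, fft_size):
    """Convert a filter width in Hz to a number of FFT bins.

    Adjacent bins are sample_rate / fft_size Hz apart.
    """
    return round(width_hz * fft_size / sample_rate)


def ms_to_frames(duration_ms, sample_rate, hop_size):
    """Convert a filter length in ms to a number of analysis frames.

    Consecutive frames are hop_size samples apart, so the frame count
    changes whenever the analysis hop size changes.
    """
    return round(duration_ms / 1000 * sample_rate / hop_size)


# Assumed settings (not stated in the quoted text).
SAMPLE_RATE = 44100
FFT_SIZE = 4096

print(hz_to_bins(500, SAMPLE_RATE, FFT_SIZE))  # 46 under these assumptions

# The same 200 ms spans a different number of frames for each hop size.
for hop in (256, 512, 1024):  # hypothetical analysis hop sizes
    print(hop, ms_to_frames(200, SAMPLE_RATE, hop))
```

This shows why the statement fixes the frequency-direction length in bins but leaves the time-direction length in milliseconds: the bin count depends only on the fixed analysis settings, whereas the frame count follows the hop size.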
“…Audio time stretching has numerous applications, such as fast browsing of speech recordings [4], music production [5], foreign language and music learning [6], fitting of a piece of music to a prescribed time slot [7], and slowing down the soundtrack for slow-motion video [8]. Additionally, TSM is often used as a processing step in pitch shifting, which aims at changing the frequencies in the signal without changing its duration [2,3,7,9,10].…”
Section: Introduction
Citation type: mentioning
confidence: 99%
“…The same approach could also be used for applying other rules, such as proposed in [1] or [29] for tuning formants frequencies to the f0 for more realistic pitch transformations.…”
Section: Discussion
Citation type: mentioning
confidence: 99%
“…Another possible approach that we propose to use here is pole modification, based on an all-pole model of spectral envelope. This possibility is mentioned in [29], but has been discarded for being too complicated. A similar approach was used in [30] and [31] to modify formants, focusing on controlling pole interaction.…”
Section: Proposed Transformation Approach
Citation type: mentioning
confidence: 99%
“…Thirdly, the envelope theory (ET) is known as an auxiliary field method [28,29]; therefore, we could use this convenient method for the wave comparison between the simulation and the measurements. The envelope correlation algorithm is applied to compute the maximum correlation coefficient in order to choose the most similar envelope waveforms in a quantitative approach and locate the flaws.…”
Section: Approximation Based On Envelope Comparison
Citation type: mentioning
confidence: 99%