2013 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference 2013
DOI: 10.1109/apsipa.2013.6694355
|View full text |Cite
|
Sign up to set email alerts
|

Temporally variable multi-aspect N-way morphing based on interference-free speech representations

Abstract: Voice morphing is a powerful tool for exploratory research and various applications. A temporally variable multiaspect morphing is extended to enable morphing of arbitrarily many voices in a single step procedure. The proposed method is implemented based on interference-free representations of periodic signals and found to yield highly-naturally sounding manipulated voices which are useful for investigating human perception of voice. The formulation of the proposed method is general enough to be applicable to … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
32
0

Year Published

2014
2014
2024
2024

Publication Types

Select...
6
2
1
1

Relationship

1
9

Authors

Journals

citations
Cited by 41 publications
(37 citation statements)
references
References 27 publications
0
32
0
Order By: Relevance
“…Since morphing techniques can enrich the level of stimulus without salient loss of naturalness, they have been used in many facial imagerelated experiments, such as those involving facial recognition [14] and attractiveness perception [15]. TANDEM-STRAIGHT [16] is a speech analysis, modification and re-synthesis framework, which can similarly deconstruct a speech signal based on the sourcefilter model. TANDEM-STRAIGHT extracts the F0 and aperiodicity of the input speech signal as the source parameters.…”
Section: Voice Morphingmentioning
confidence: 99%
“…Since morphing techniques can enrich the level of stimulus without salient loss of naturalness, they have been used in many facial imagerelated experiments, such as those involving facial recognition [14] and attractiveness perception [15]. TANDEM-STRAIGHT [16] is a speech analysis, modification and re-synthesis framework, which can similarly deconstruct a speech signal based on the sourcefilter model. TANDEM-STRAIGHT extracts the F0 and aperiodicity of the input speech signal as the source parameters.…”
Section: Voice Morphingmentioning
confidence: 99%
“…We plan to use WORLD for voice conversion purposes such as voice morphing [37] and statistic parametric speech synthesis [38]. Its real-time applications include a singing synthesizer.…”
Section: Discussionmentioning
confidence: 99%
“…Background noise removal and normalization of the recordings for average intensity (-12 dBFS) and duration (500 milliseconds) was done in Audacity software. These recordings were used to generate voice morphs spanning a voice identity continuum between two participants (acquaintances) by using TANDEM-STRAIGHT (Kawahara, Morise, Banno, & Skuk, 2013) (e.g. a voice morph can be generated such that it contains 30% of person A's, 70% of person B's voice).…”
Section: Auditory Tasksmentioning
confidence: 99%