Temporally variable multi-aspect N-way morphing based on interference-free speech representations

Kawahara, Hideki; Morise, Masanori; Banno, Hideki; Skuk, Verena G.

doi:10.1109/apsipa.2013.6694355

Cited by 41 publications

(37 citation statements)

References 27 publications

“…Since morphing techniques can enrich the level of stimulus without salient loss of naturalness, they have been used in many facial image-related experiments, such as those involving facial recognition [14] and attractiveness perception [15]. TANDEM-STRAIGHT [16] is a speech analysis, modification and re-synthesis framework, which can similarly deconstruct a speech signal based on the source-filter model. TANDEM-STRAIGHT extracts the F0 and aperiodicity of the input speech signal as the source parameters.…”

Section: Voice Morphing (mentioning)

confidence: 99%

Impact of acoustic similarity on efficiency of verbal information transmission via subtle prosodic cues

Chen, Kitaoka, Takeda (2016). J AUDIO SPEECH MUSIC PROC.

In this study, we investigate the effect of tiny acoustic differences on the efficiency of prosodic information transmission. Study participants listened to textually ambiguous sentences, which could be understood with prosodic cues, such as syllable length and pause length. Sentences were uttered in voices similar to the participant's own voice and in voices dissimilar to their own voice. The participants then identified which of four pictures the speaker was referring to. Both the eye movement and response time of the participants were recorded. Eye tracking and response time results both showed that participants understood the textually ambiguous sentences faster when listening to voices similar to their own. The results also suggest that tiny acoustic features, which do not contain verbal meaning, can influence the processing of verbal information.

“…We plan to use WORLD for voice conversion purposes such as voice morphing [37] and statistic parametric speech synthesis [38]. Its real-time applications include a singing synthesizer.…”

Section: Discussion (mentioning)

confidence: 99%

WORLD: A Vocoder-Based High-Quality Speech Synthesis System for Real-Time Applications

Morise, Yokomori, Ozawa (2016). IEICE Trans. Inf. & Syst.

Self Cite

SUMMARY: A vocoder-based speech synthesis system, named WORLD, was developed in an effort to improve the sound quality of real-time applications using speech. Speech analysis, manipulation, and synthesis on the basis of vocoders are used in various kinds of speech research. Although several high-quality speech synthesis systems have been developed, real-time processing has been difficult with them because of their high computational costs. This new speech synthesis system offers not only high sound quality but also fast processing. It consists of three analysis algorithms and one synthesis algorithm proposed in our previous research. The effectiveness of the system was evaluated by comparing its output against natural speech including consonants. Its processing speed was also compared with those of conventional systems. The results showed that WORLD was superior to the other systems in terms of both sound quality and processing speed. In particular, it was over ten times faster than the conventional systems, and the real time factor (RTF) indicated that it was fast enough for real-time processing.
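The real time factor mentioned in the summary above is conventionally the ratio of processing time to the duration of the processed audio, with values below 1 meaning the system keeps up with real time. A minimal sketch of that ratio follows; the function name and the timings are hypothetical and are not taken from the WORLD paper.

```python
def real_time_factor(processing_seconds: float, audio_seconds: float) -> float:
    """Return processing time divided by audio duration (the usual RTF definition)."""
    if audio_seconds <= 0:
        raise ValueError("audio_seconds must be positive")
    return processing_seconds / audio_seconds


# Hypothetical timings for illustration only: 0.5 s of computation for 5 s of speech.
rtf = real_time_factor(0.5, 5.0)
print(f"RTF = {rtf:.2f}, real-time capable: {rtf < 1.0}")
```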

“…Background noise removal and normalization of the recordings for average intensity (-12 dBFS) and duration (500 milliseconds) was done in Audacity software. These recordings were used to generate voice morphs spanning a voice identity continuum between two participants (acquaintances) by using TANDEM-STRAIGHT (Kawahara, Morise, Banno, & Skuk, 2013) (e.g. a voice morph can be generated such that it contains 30% of person A's, 70% of person B's voice).…”

Section: Auditory Tasks (mentioning)

confidence: 99%

Breathing affects self-other voice discrimination in a bodily state with increased otherness

Orepic, Faivre, Rognini, et al. (2020). Preprint

A growing number of studies have focused on identifying cognitive processes that are modulated by interoceptive signals. Here we investigated whether interoception affects self-processing, by assessing changes in self-voice perception as a function of respiratory and cardiac cycles. Considering the fundamental role interoception plays in bodily self-consciousness, we additionally applied conflicting sensorimotor stimulation inducing a state characterized by a loss of self and increased otherness, and investigated its effects in self-other voice perception. Our data reveal that breathing, but not heartbeat, affects self-voice perception, by showing that participants (N = 30) discriminated self-voice from other voices better during inspiration, while being in the state of increased otherness and especially when hearing voices of other people. Loudness judgement of equivalent self-related stimuli was unaffected by breathing. Combining interoception and voice perception with self-monitoring framework, these data extend recent findings on breathing-dependent cognition to self-processing.
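The citation statement for this entry describes a morph containing 30% of person A's voice and 70% of person B's. As a rough illustration of what such a weighting means for a single parameter, the sketch below blends two time-aligned F0 contours in the log-frequency domain. It is not the TANDEM-STRAIGHT implementation: the function name, the choice of the log domain, and the assumption that the contours are already aligned frame by frame are all illustrative assumptions.

```python
import math


def morph_log_f0(f0_a, f0_b, weight_b):
    """Blend two time-aligned F0 contours (in Hz) in the log-frequency domain.

    weight_b is the share of speaker B, so weight_b = 0.7 gives a morph that is
    30% speaker A and 70% speaker B. Hypothetical helper for illustration only.
    """
    if not 0.0 <= weight_b <= 1.0:
        raise ValueError("weight_b must lie in [0, 1]")
    if len(f0_a) != len(f0_b):
        raise ValueError("contours must be time-aligned and of equal length")
    return [
        math.exp((1.0 - weight_b) * math.log(a) + weight_b * math.log(b))
        for a, b in zip(f0_a, f0_b)
    ]


# Made-up contours in Hz: speaker A is lower-pitched than speaker B.
contour_a = [100.0, 110.0, 120.0]
contour_b = [200.0, 210.0, 220.0]
print(morph_log_f0(contour_a, contour_b, 0.7))
```

A full morph would apply weights like these to several aspects of the signal (for example spectral envelope and aperiodicity as well as F0), possibly with a different weight per aspect and per time frame.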
