Investigation of Efficiency and Observation of Voice Quality Parameters in Ideal User’s Voice Creation using UTAU and Interactive Differential Evolution

Inoue, Asami; Fukumoto, Manabu

doi:10.5057/jjske.tjske-d-19-00002

Cited by 2 publications

(3 citation statements)

References 2 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Paired comparison-based IDE, which is known as an efficient and faster way to evaluate, is used [16]. The paired comparison-based IDE was used in the previous studies to create content related to a sense of hearing [12,13]. Furthermore, the paired comparisonbased IDE was also used in the creation of content related to other senses [19,20].…”

Section: Interactive Differential Evolutionmentioning

confidence: 99%

“…Originally, UTAU is used for creating singing voices as the name says (UTAU comes from a Japanese verb "歌う" that means "sing"). Some studies used UTAU [11,12]; however, this area is undeveloped. Figure 2 is a screenshot of UTAU.…”

Section: Utaumentioning

confidence: 99%

“…Interactive Evolutionary Computation (IEC) was known as a method that obtains the user's media content based on a stochastic search [6,7], and previous studies created sound content using IECs [8][9][10][11][12][13][14]. The purpose of this study is to propose a method to produce the voice data of the model pronunciation similar to the user's voice by using UTAU [15] and interactive differential evolution (IDE) [16], one of the IECs.…”

Section: Introductionmentioning

confidence: 99%

See 2 more Smart Citations

Making an English Speech Similar to the User’s Voice using UTAU and Interactive Differential Evolution

Miyamoto

Gan

FUKUMOTO

2023

IJAE

Self Cite

View full text Add to dashboard Cite

Practicing English pronunciation is difficult for non-native speakers because of the differences in vowels and consonants. There are several ways to practice them such as Shadowing, however, if the voice's features greatly differ from the learner's voice, it should be difficult for learners to reproduce. To solve this problem, we propose a method to make the pronunciation data of the model pronunciation similar to the learner's voice by using UTAU and Interactive Differential Evolution. A listening experiment was conducted with the concrete system of IDE and UTAU. Twelve examinees participated in the experiment through ten generations based on paired comparisons for making the voices similar to their own voices inside their heads. As a result, we could successfully make the voices similar to the examinees' voices. Since it has paired comparison, we believe that the paired comparison-based IDE is a better method than the general Interactive Genetic Algorithm with scoring.

show abstract