Synthetic Voice Harmonization: A Fast and Precise Method

Gremes, Juan; Palavecino, Nicola; Seeber, Lucas; Herrero, Santiago Montero

doi:10.1109/ism.2015.122

Cited by 1 publication

(1 citation statement)

References 9 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…In the information society, speech is used not only in its original form but also through many digital electronic devices (Zhang and Kuo, 2001) mobile phones grow rapidly, wired telephone systems become digital and Internet phones and audio are common in use. On the other hand, computers may also speak to humans by synthetic voice (Juan et al, 2015) and listen to us using speech recognition. To understand these processes for both human and machine, we have to study carefully the structures and functions of spoken language: how to produce and perceive it and how speech technology may help us to communicate (Hennig and Chellali, 2012) Speech segmentation is the process of splitting the speech into separately words and each word is saved in separated audio file for the upcoming processing as shown in Fig.…”

Section: Introductionmentioning

confidence: 99%

Speech Segmentation Using Dynamic Windows and Thresholds for Arabic and English Languages

Jazyah¹

2018

Journal of Computer Science

View full text Add to dashboard Cite

Segmentation of audio data such as human speech (splitting each word in separate audio file-.WAV file) has been a major concern when working with multimedia such as recordings from radio or TV. The main focus of the segmentation of boundaries of spoken language has been on using energy and zero crossing thresholds for endpoint detection. Errors in endpoint detection are still a main cause of low accuracy of segmentation systems. The goal of this research is to develop an efficient algorithm in order to segment the speech of human in both languages of English and Arabic in different speaking speed with high accuracy. Simulation results show that the developed algorithm achieved high accuracy when segmenting human speech in English language up to 91.6% in average, while it is 89.0% of Arabic language.

show abstract

Section: Introductionmentioning

confidence: 99%

Speech Segmentation Using Dynamic Windows and Thresholds for Arabic and English Languages

Jazyah¹

2018

Journal of Computer Science

View full text Add to dashboard Cite

show abstract

Synthetic Voice Harmonization: A Fast and Precise Method

Cited by 1 publication

References 9 publications

Speech Segmentation Using Dynamic Windows and Thresholds for Arabic and English Languages

Speech Segmentation Using Dynamic Windows and Thresholds for Arabic and English Languages

Contact Info

Product

Resources

About