Abstract:Musicians often lack the ability to harmonize their voices within a track. To help with this matter, a tool can be developed for detecting the scale or key in which a track is sung and synthesizing pitches to make a triadchord or a tetrachord (combinations of three or four notes that fit in the scale's harmony) for each corresponding tone in the melody. In this paper, we present a fast and precise method to detect the pitch of voice and shift it to the appropriate frequencies, consequently building up a harmon… Show more
“…In the information society, speech is used not only in its original form but also through many digital electronic devices (Zhang and Kuo, 2001) mobile phones grow rapidly, wired telephone systems become digital and Internet phones and audio are common in use. On the other hand, computers may also speak to humans by synthetic voice (Juan et al, 2015) and listen to us using speech recognition. To understand these processes for both human and machine, we have to study carefully the structures and functions of spoken language: how to produce and perceive it and how speech technology may help us to communicate (Hennig and Chellali, 2012) Speech segmentation is the process of splitting the speech into separately words and each word is saved in separated audio file for the upcoming processing as shown in Fig.…”
Segmentation of audio data such as human speech (splitting each word in separate audio file-.WAV file) has been a major concern when working with multimedia such as recordings from radio or TV. The main focus of the segmentation of boundaries of spoken language has been on using energy and zero crossing thresholds for endpoint detection. Errors in endpoint detection are still a main cause of low accuracy of segmentation systems. The goal of this research is to develop an efficient algorithm in order to segment the speech of human in both languages of English and Arabic in different speaking speed with high accuracy. Simulation results show that the developed algorithm achieved high accuracy when segmenting human speech in English language up to 91.6% in average, while it is 89.0% of Arabic language.
“…In the information society, speech is used not only in its original form but also through many digital electronic devices (Zhang and Kuo, 2001) mobile phones grow rapidly, wired telephone systems become digital and Internet phones and audio are common in use. On the other hand, computers may also speak to humans by synthetic voice (Juan et al, 2015) and listen to us using speech recognition. To understand these processes for both human and machine, we have to study carefully the structures and functions of spoken language: how to produce and perceive it and how speech technology may help us to communicate (Hennig and Chellali, 2012) Speech segmentation is the process of splitting the speech into separately words and each word is saved in separated audio file for the upcoming processing as shown in Fig.…”
Segmentation of audio data such as human speech (splitting each word in separate audio file-.WAV file) has been a major concern when working with multimedia such as recordings from radio or TV. The main focus of the segmentation of boundaries of spoken language has been on using energy and zero crossing thresholds for endpoint detection. Errors in endpoint detection are still a main cause of low accuracy of segmentation systems. The goal of this research is to develop an efficient algorithm in order to segment the speech of human in both languages of English and Arabic in different speaking speed with high accuracy. Simulation results show that the developed algorithm achieved high accuracy when segmenting human speech in English language up to 91.6% in average, while it is 89.0% of Arabic language.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.