Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones

Moulines, Éric; Charpentier, F.

doi:10.1016/0167-6393(90)90021-z

Cited by 973 publications

(496 citation statements)

References 10 publications

Supporting

Mentioning

467

Contrasting

Unclassified

Order By: Relevance

“…We hence artificially modified F 0 and/or duration cues associated to the last syllable of each fragment using PSOLA (Pitch Synchronous Overlap and Add; Moulines and Charpentier 1990) in Praat (Boersma and Weenink 2009) in order to manipulate boundary strength (Table 3) for a total of 120 stimuli (20 NPs Â 2 boundary levels Â 3 acoustic combinations).…”

Section: Methods 321 Materialsmentioning

confidence: 99%

Prosodic boundary strength guides syntactic parsing of French utterances

Michelas

D’Imperio²

2015

Laboratory Phonology

View full text Add to dashboard Cite

This study tests how prosodic boundary strength (i.e., categorical differences between Accentual Phrase, AP, and intermediate phrase, ip, boundaries) per se affects the syntactic parsing of spoken utterances in French. Two forced-choice perception experiments demonstrated that French listeners use prosodic boundary strength (either AP or ip boundaries) at the end of noun phrases (e.g., La nana du sauna 'The girl who manages the sauna') to choose whether NPs are likely to be followed by a prepositional phrase (e.g., d'Héléna 'of Héléna') or instead by a verb phrase (e.g., déconseille 'advises against'). Experiment 1 employed fragments extracted from natural speech stimuli, while Experiment 2 made use of resynthesized speech, in which fundamental frequency and durational cues to AP and ip boundaries were independently manipulated. In Experiment 1, results show that listeners prefer PP completions following AP boundaries and VP completions after ip boundaries. Experiment 2 shows that preboundary duration cues consistent with the presence of an AP boundary successfully guide listeners to prefer PP completions, while both fundamental frequency and duration cues consistent with an ip boundary are necessary to induce VP completions. We hence argue that prosodic boundary strength at the right edge of an utterance fragment influences syntactic parsing decisions in French.

show abstract

Section: Methods 321 Materialsmentioning

confidence: 99%

Prosodic boundary strength guides syntactic parsing of French utterances

Michelas

D’Imperio²

2015

Laboratory Phonology

View full text Add to dashboard Cite

show abstract

“…The quality of the generated contours is evaluated acoustically. This is achieved by resynthesizing the original utterance with the newly generated F 0 contour using the PSOLA (Pitch Synchronous Overlap and Add) resynthesis method (Moulines and Charpentier 1990). Thus, in a small-scale perceptual experiment the resynthesized utterances taken from ToBI and the Boston Radio News Corpus (the only generally available prosodically labelled corpus of American English) are assessed as to their naturalness as well as their similarity to / differences from the respective originals.…”

Section: Introductionmentioning

confidence: 99%

Rules for the generation of ToBI-based American English intonation

Jilka

Möhler

Dogil

1999

Speech Communication

View full text Add to dashboard Cite

“…An /aː/ vowel from one of the speaker's accented productions of sagte which had F1 values closest to the F1 median of this speaker's /a/ and /aː/ tokens was manipulated in duration to create 11 equidistant steps using Praat's (Boersma and Weenink 2012) implementation of the PSOLA algorithm (Moulines and Charpentier 1990). The 11 steps ranged from 40 ms to 112 ms; these selected durations were based on the model speaker's range of target vowel durations in two small pilot studies.…”

Section: Methodsmentioning

confidence: 99%

The relationship between prosodic weakening and sound change: evidence from the German tense/lax vowel contrast

Harrington

Kleber

Reubold

et al. 2015

Laboratory Phonology

View full text Add to dashboard Cite

Abstract:The study tests a model of sound change based on how prosodic weakening affects shortening in polysyllabic words. Twenty-nine L1-German speakers produced minimal pairs differing in vowel tensity in both monosyllables /zakt, zaːkt/ and disyllables /zaktə, zaːktə/. The target words were produced in accented and deaccented contexts. The duration ratio between the vowel and the following /kt/ cluster was less for lax than tense vowels and less for disyllables than monosyllables. Under deaccentuation, there was an approximation of tense and lax vowels towards each other but no influence due to the mono-vs. disyllabic difference. On the other hand, Gaussian /a/ vs. /aː/ classifications of these data showed a lesser influence due to the syllable count in deaccented words. Compatibly, when the same speakers as listeners classified synthetic sackt-sagt and sackte-sagte continua, they were shown to compensate for the syllable count differences, but to a lesser extent in a deaccented context. Deaccentuation may therefore provide the conditions for sound change to take place by which /aː/ shortens in polysyllabic words; it may do so because the association between coarticulation and the source that gives rise to it is hidden to a greater extent than in accented contexts.

show abstract

Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones

Cited by 973 publications

References 10 publications

Prosodic boundary strength guides syntactic parsing of French utterances

Prosodic boundary strength guides syntactic parsing of French utterances

Rules for the generation of ToBI-based American English intonation

The relationship between prosodic weakening and sound change: evidence from the German tense/lax vowel contrast

Contact Info

Product

Resources

About