Interspeech 2015 2015
DOI: 10.21437/interspeech.2015-231
|View full text |Cite
|
Sign up to set email alerts
|

Neuromorphic based oscillatory device for incremental syllable boundary detection

Abstract: Syllables are considered as basic supra-segmental units, used mainly in prosodic modelling. It has long been thought that efficient syllabification algorithms may also provide valuable cues for improved segmental (acoustic) modelling. However, the best current syllabification methods work offline, considering the power envelope of whole utterance.In this paper we introduce a new method for detection of syllable boundaries based on a model of speech parsing into syllables by neural oscillations in human auditor… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
5
0

Year Published

2016
2016
2024
2024

Publication Types

Select...
3
2
1

Relationship

1
5

Authors

Journals

citations
Cited by 7 publications
(6 citation statements)
references
References 16 publications
0
5
0
Order By: Relevance
“…Precoss-β was built by including oscillating state-dependent precisions in a previous generative model version (Precoss) (Hovsepyan, Olasagasti and Giraud, 2020). The model input consists of a speech reduced auditory spectrogram (Chi, Ru and Shamma, 2005) and of its slow amplitude modulations (Hyafil et al, 2015), both extracted English sentences of the TIMIT database (Garofolo et al, 1993) (see Hovespyan et al 2020 (Hovsepyan, Olasagasti and for details about speech input generation). In Precoss-β, the activation of the appropriate syllable unit generates the corresponding auditory spectrogram with a flexible duration determined by eight gamma units (Figure 1).…”
Section: Precoss-β Architecture and Oscillating Precisionsmentioning
confidence: 99%
See 2 more Smart Citations
“…Precoss-β was built by including oscillating state-dependent precisions in a previous generative model version (Precoss) (Hovsepyan, Olasagasti and Giraud, 2020). The model input consists of a speech reduced auditory spectrogram (Chi, Ru and Shamma, 2005) and of its slow amplitude modulations (Hyafil et al, 2015), both extracted English sentences of the TIMIT database (Garofolo et al, 1993) (see Hovespyan et al 2020 (Hovsepyan, Olasagasti and for details about speech input generation). In Precoss-β, the activation of the appropriate syllable unit generates the corresponding auditory spectrogram with a flexible duration determined by eight gamma units (Figure 1).…”
Section: Precoss-β Architecture and Oscillating Precisionsmentioning
confidence: 99%
“…Briefly, for each sentence, a 6-channel reduced auditory spectrogram was calculated with a biologically plausible model of the auditory periphery (Chi, Ru and Shamma, 2005). Additionally, slow amplitude modulation of the sentence waveform was calculated following procedures described in Hyafil and colleagues (Hyafil and Cernak, 2015;Hyafil et al, 2015). Syllable boundaries in the input sentences were defined with the Tsylb2 (Fisher, 1996) programme based on the phonemic transcriptions provided in the TIMIT database (Garofolo et al, 1993).…”
Section: Speech Input and Syllabificationmentioning
confidence: 99%
See 1 more Smart Citation
“…To estimate syllable boundaries from the speech signal, a neuromorphic oscillatory device is used, based on modelling brain neural oscillations at syllable frequency. This results in highly noise robust incremental syllable boundary detection [54]. It is built around an interconnected network composed of 10 excitatory and 10 inhibitory leaky integrate-and-fire neurons.…”
Section: Coding Of Prosodic Informationmentioning
confidence: 99%
“…frequencies around formants likely to provide more information about syllable boundaries because of the vocalisation process. Syllable boundaries are characterised by local minima of the weighted signal; these can be generalised to a convolution of the temporal kernel and the weighted signal [54].…”
Section: Trainingmentioning
confidence: 99%