ICASSP '85. IEEE International Conference on Acoustics, Speech, and Signal Processing
DOI: 10.1109/icassp.1985.1168461
|View full text |Cite
|
Sign up to set email alerts
|

Instantaneous-frequency distribution vs. time: An interpretation of the phase structure of speech

Abstract: A new time-frequency display is constructed based on the phase of the running short-time Fourier transform, specifically the distribution of its time derivative. Typical results are given for speech, indicating more precise location of formants than is usual for the spectrogram.Some insights as to the pertinent structural aspects of the speech signal can clearly be gained from study of the auditory system in humans and other mammals. The bulk of experimental evidence indicates that the ear performs some sort o… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
13
0

Publication Types

Select...
5
3
2

Relationship

0
10

Authors

Journals

citations
Cited by 29 publications
(13 citation statements)
references
References 3 publications
0
13
0
Order By: Relevance
“…Murthy and her colleagues have recently used GDF-based features for ASR (Murthy and Gadde, 2003;Hegde et al, 2004b,c;Alsteris and Paliwal, 2005). The time-derivative of the phase spectrum, most often referred to as the instantaneous frequency distribution (IFD), has been used in the past for pitch extraction (Abe et al, 1995;Charpentier, 1986;Nakatani et al, 2003) and formant extraction (Potamianos and Maragos, 1996;Friedman, 1985). Potamianos and Maragos (2001), Dimitriadis and Maragos (2003), Paliwal and Atal (2003) and Wang et al (2003) have recently investigated IFD-based features for ASR.…”
Section: Introductionmentioning
confidence: 97%
“…Murthy and her colleagues have recently used GDF-based features for ASR (Murthy and Gadde, 2003;Hegde et al, 2004b,c;Alsteris and Paliwal, 2005). The time-derivative of the phase spectrum, most often referred to as the instantaneous frequency distribution (IFD), has been used in the past for pitch extraction (Abe et al, 1995;Charpentier, 1986;Nakatani et al, 2003) and formant extraction (Potamianos and Maragos, 1996;Friedman, 1985). Potamianos and Maragos (2001), Dimitriadis and Maragos (2003), Paliwal and Atal (2003) and Wang et al (2003) have recently investigated IFD-based features for ASR.…”
Section: Introductionmentioning
confidence: 97%
“…These two methods are also provided with step-by-step algorithms here, and their performance relative to the naïve benchmark is anecdotally and mathematically evaluated. A third independent method was also published, 15 but this seems to be a partial foreshadowing of the method of Auger and Flandrin, 3 and so will not be separately treated here. A host of more distantly related work on increasing precision in time-frequency analysis has also been published over the years 16 but also will not be considered because the contributions do not directly modify the spectrogram as closely as our main subject, the TCIF spectrogram.…”
Section: Brief History Of the Tcif Spectrogrammentioning
confidence: 97%
“…Algorithmic improvements were proposed and, thanks to the many developments that occurred the field, the scope of the technique was considerably enlarged, far beyond only the spectrogram case. In parallel, other related techniques were developed independently(e.g., the "ridge and skeleton" method [5,13,19,20], the "instantaneous frequency density" [17], the "differential spectral analysis" [18] or the "synchrosqueezing" technique [29]).…”
Section: Some Historical Commentsmentioning
confidence: 99%