2017
DOI: 10.1121/1.4999059
|View full text |Cite
|
Sign up to set email alerts
|

The role of short-time intensity and envelope power for speech intelligibility and psychoacoustic masking

Abstract: The generalized power spectrum model [GPSM; Biberger and Ewert (2016). J. Acoust. Soc. Am. 140, 1023-1038], combining the "classical" concept of the power-spectrum model (PSM) and the envelope power spectrum-model (EPSM), was demonstrated to account for several psychoacoustic and speech intelligibility (SI) experiments. The PSM path of the model uses long-time power signal-to-noise ratios (SNRs), while the EPSM path uses short-time envelope power SNRs. A systematic comparison of existing SI models for several … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

1
40
0

Year Published

2018
2018
2022
2022

Publication Types

Select...
6
1
1

Relationship

2
6

Authors

Journals

citations
Cited by 21 publications
(41 citation statements)
references
References 39 publications
1
40
0
Order By: Relevance
“…These studies on AM perception have provided support for the development of computational models of AM processing based on the concept of a modulation filterbank (Dau et al, 1997a). These models are used to predict the perception of AM cues and have recently accounted for a wide range of detection, discrimination and identification data in adults (e.g., Biberger and Ewert, 2017).…”
Section: Sensory Factors Involved In Temporal Processingmentioning
confidence: 97%
“…These studies on AM perception have provided support for the development of computational models of AM processing based on the concept of a modulation filterbank (Dau et al, 1997a). These models are used to predict the perception of AM cues and have recently accounted for a wide range of detection, discrimination and identification data in adults (e.g., Biberger and Ewert, 2017).…”
Section: Sensory Factors Involved In Temporal Processingmentioning
confidence: 97%
“…The GPSM q ( Biberger et al., 2018 ) represents an audio quality extension of the GPSM, which has been demonstrated to predict the results of many psychoacoustic and speech intelligibility experiments ( Biberger & Ewert, 2016 , 2017 ). GPSM q applies a linear, fourth-order gammatone filterbank with bandwidth equal to the equivalent rectangular bandwidth of the auditory filter (ERB N ; Glasberg & Moore, 1990 ; Moore & Glasberg, 1983 ) that simulates the behavior of the basilar membrane, followed by calculating the low-pass filtered Hilbert envelope (cutoff frequency of 150 Hz) to account for decreased modulation sensitivity at high modulation frequencies.…”
Section: Audio Quality Modelsmentioning
confidence: 99%
“…Biberger and Ewert combined the Power Spectrum Model (PSM; Fletcher, 1940 ; Patterson & Moore, 1986 ) and Envelope Power Spectrum Model (EPSM; Ewert & Dau, 2000 ) with multiresolution analysis as suggested by Jørgensen et al. (2013) , denoted the Generalized Power Spectrum Model (GPSM), which has been demonstrated to predict the results of several experiments on psychoacoustic masking and speech intelligibility ( Biberger & Ewert, 2016 , 2017 ). Recently, Biberger and colleagues suggested the Generalized Power Spectrum Model for quality (GPSM q ; Biberger et al., 2018 ) that has been shown to predict the perception of a large variety of monaural distortions.…”
mentioning
confidence: 99%
“…Viemeister, ; Dau et al ., ; Ewert & Dau, ) and the most recent attempts successfully described a wide range of behavioural data ( e.g . Piechowiak et al ., ; Jepsen et al ., ; Dau et al ., ; Biberger & Ewert, , ). In comparison, very few modelling attempts have been made to model for FM sensitivity (Hartmann & Klein, ; Moore & Sek, ; Ernst & Moore, ; Paraouty et al ., ).…”
Section: Introductionmentioning
confidence: 99%