The role of short-time intensity and envelope power for speech intelligibility and psychoacoustic masking

Biberger, Thomas; Ewert, Stephan D.

doi:10.1121/1.4999059

Cited by 21 publications

(41 citation statements)

References 39 publications


“…These studies on AM perception have provided support for the development of computational models of AM processing based on the concept of a modulation filterbank (Dau et al, 1997a). These models are used to predict the perception of AM cues and have recently accounted for a wide range of detection, discrimination and identification data in adults (e.g., Biberger and Ewert, 2017).…”

Section: Sensory Factors Involved In Temporal Processing

mentioning

confidence: 97%

Development of temporal auditory processing in childhood: Changes in efficiency rather than temporal-modulation selectivity

Cabrera, Varnet, Buss, et al. 2019

The Journal of the Acoustical Society of America


The ability to detect amplitude modulation (AM) is essential to distinguish the spectro-temporal features of speech from those of a competing masker. Previous work shows that AM sensitivity improves until 10 years of age. This may relate to the development of sensory factors (tuning of AM filters, susceptibility to AM masking) or to changes in processing efficiency (reduction in internal noise, optimization of decision strategies). To disentangle these hypotheses, three groups of children (5-11 years) and one of young adults completed psychophysical tasks measuring thresholds for detecting sinusoidal AM (with a rate of 4, 8, or 32 Hz) applied to carriers whose inherent modulations exerted different amounts of AM masking. Results showed that between 5 and 11 years, AM detection thresholds improved and that susceptibility to AM masking slightly increased. However, the effects of AM rate and carrier were not associated with age, suggesting that sensory factors are mature by 5 years. Subsequent modelling indicated that reducing internal noise by a factor 10 accounted for the observed developmental trends. Finally, children's consonant identification thresholds in noise related to some extent to AM sensitivity. Increased efficiency in AM detection may support better use of temporal information in speech during childhood.
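The sinusoidal AM stimuli mentioned in this abstract can be illustrated with a minimal sketch. Only the modulation rates (4, 8, and 32 Hz) come from the abstract; the pure-tone carrier, carrier frequency, duration, sampling rate, and modulation depth below are assumptions chosen for illustration, and the study itself used carriers with differing inherent modulations.

```python
import math

def sam_tone(rate_hz, depth, carrier_hz=1000.0, dur_s=1.0, fs=16000):
    """Sinusoidally amplitude-modulated tone.

    s(t) = (1 + m * sin(2*pi*fm*t)) * sin(2*pi*fc*t)
    carrier_hz, dur_s and fs are illustrative assumptions.
    """
    n = int(dur_s * fs)
    samples = []
    for i in range(n):
        t = i / fs
        modulator = 1.0 + depth * math.sin(2.0 * math.pi * rate_hz * t)
        carrier = math.sin(2.0 * math.pi * carrier_hz * t)
        samples.append(modulator * carrier)
    return samples

def depth_db(depth):
    """Modulation depth m expressed in dB, i.e. 20*log10(m)."""
    return 20.0 * math.log10(depth)

if __name__ == "__main__":
    # The three AM rates named in the abstract; m = 0.5 is an assumed depth.
    for rate in (4, 8, 32):
        stimulus = sam_tone(rate, depth=0.5)
        print(rate, len(stimulus), round(depth_db(0.5), 2))
```

AM detection thresholds are commonly reported as the smallest detectable depth in dB (20 log10 m), which is why the helper `depth_db` is included.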


“…The GPSMq (Biberger et al., 2018) represents an audio quality extension of the GPSM, which has been demonstrated to predict the results of many psychoacoustic and speech intelligibility experiments (Biberger & Ewert, 2016, 2017). GPSMq applies a linear, fourth-order gammatone filterbank with bandwidth equal to the equivalent rectangular bandwidth of the auditory filter (ERB_N; Glasberg & Moore, 1990; Moore & Glasberg, 1983) that simulates the behavior of the basilar membrane, followed by calculating the low-pass filtered Hilbert envelope (cutoff frequency of 150 Hz) to account for decreased modulation sensitivity at high modulation frequencies.…”

Section: Audio Quality Models

mentioning

confidence: 99%

“…Biberger and Ewert combined the Power Spectrum Model (PSM; Fletcher, 1940; Patterson & Moore, 1986) and Envelope Power Spectrum Model (EPSM; Ewert & Dau, 2000) with multiresolution analysis as suggested by Jørgensen et al. (2013), denoted the Generalized Power Spectrum Model (GPSM), which has been demonstrated to predict the results of several experiments on psychoacoustic masking and speech intelligibility (Biberger & Ewert, 2016, 2017). Recently, Biberger and colleagues suggested the Generalized Power Spectrum Model for quality (GPSMq; Biberger et al., 2018) that has been shown to predict the perception of a large variety of monaural distortions.…”

mentioning

confidence: 99%

Instrumental Quality Predictions and Analysis of Auditory Cues for Algorithms in Modern Headphone Technology

et al. 2021

Self Cite


Smart headphones or hearables use different types of algorithms such as noise cancelation, feedback suppression, and sound pressure equalization to eliminate undesired sound sources or to achieve acoustical transparency. Such signal processing strategies might alter the spectral composition or interaural differences of the original sound, which might be perceived by listeners as monaural or binaural distortions and thus degrade audio quality. To evaluate the perceptual impact of these distortions, subjective quality ratings can be used, but these are time consuming and costly. Auditory-inspired instrumental quality measures can be applied with less effort and may also be helpful in identifying whether the distortions impair the auditory representation of monaural or binaural cues. Therefore, the goals of this study were (a) to assess the applicability of various monaural and binaural audio quality models to distortions typically occurring in hearables and (b) to examine the effect of those distortions on the auditory representation of spectral, temporal, and binaural cues. Results showed that the signal processing algorithms considered in this study mainly impaired (monaural) spectral cues. Consequently, monaural audio quality models that capture spectral distortions achieved the best prediction performance. A recent audio quality model that predicts monaural and binaural aspects of quality was revised based on parts of the current data involving binaural audio quality aspects, leading to improved overall performance indicated by a mean Pearson linear correlation of 0.89 between obtained and predicted ratings.
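The first citation statement for this paper says the gammatone filters have a bandwidth equal to ERB_N (Glasberg & Moore, 1990). As a rough illustration, the sketch below evaluates the widely used Glasberg and Moore (1990) formulas for ERB_N and the ERB-number scale. The 100 to 8000 Hz range and the one-ERB spacing of centre frequencies are assumptions for illustration only; the filterbank settings actually used in GPSMq are not given in the text above.

```python
import math

def erb_n(freq_hz):
    """ERB_N in Hz of the auditory filter centred at freq_hz.

    Glasberg & Moore (1990): ERB_N = 24.7 * (4.37 * F + 1), F in kHz.
    """
    return 24.7 * (4.37 * freq_hz / 1000.0 + 1.0)

def erb_number(freq_hz):
    """Position on the ERB-number scale: 21.4 * log10(4.37 * F + 1)."""
    return 21.4 * math.log10(4.37 * freq_hz / 1000.0 + 1.0)

def inverse_erb_number(erb_num):
    """Frequency in Hz corresponding to a given ERB-number."""
    return (10.0 ** (erb_num / 21.4) - 1.0) * 1000.0 / 4.37

def center_frequencies(low_hz, high_hz, step=1.0):
    """Centre frequencies spaced `step` ERB-numbers apart.

    The range and spacing are hypothetical choices, not taken from the model.
    """
    lo, hi = erb_number(low_hz), erb_number(high_hz)
    count = int(math.floor((hi - lo) / step + 1e-9)) + 1
    return [inverse_erb_number(lo + k * step) for k in range(count)]

if __name__ == "__main__":
    for fc in center_frequencies(100.0, 8000.0):
        print(round(fc, 1), round(erb_n(fc), 1))
```

Each printed pair is a centre frequency and the bandwidth a filter at that frequency would have under the ERB_N assumption. The subsequent envelope stage (Hilbert envelope, 150 Hz low-pass) is not sketched here because the text does not state the filter order.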


“…Viemeister; Dau et al.; Ewert & Dau) and the most recent attempts successfully described a wide range of behavioural data (e.g. Piechowiak et al.; Jepsen et al.; Dau et al.; Biberger & Ewert). In comparison, very few modelling attempts have been made to model for FM sensitivity (Hartmann & Klein; Moore & Sek; Ernst & Moore; Paraouty et al.).…”

Section: Introduction

mentioning

confidence: 99%

A two‐path model of auditory modulation detection using temporal fine structure and envelope cues

Ewert, Paraouty, Lorenzi 2018

Eur J of Neuroscience

Self Cite


A model using temporal-envelope cues was previously developed to explain perceptual interference effects between amplitude modulation and frequency modulation (FM). As that model could not accurately predict FM sensitivity and the interference effects, temporal fine structure (TFS) cues were added to the model. Thus, following the initial stage of the model consisting of a linear filter bank simulating cochlear filtering, processing was split into an 'envelope path' based on envelope power cues and a 'TFS path' based on a measure of the distribution of time intervals between successive zero-crossings. This yielded independent detectability indices for envelope and TFS cues, which were optimally combined to produce a single decision statistic. Independent internal noises in the envelope and TFS paths were adjusted to match the data. Simulations indicate that TFS cues are required to account for FM data for young normal-hearing listeners and that TFS processing is impaired for both older normal-hearing and hearing-impaired listeners. The role of TFS was further assessed by relating the monaural FM sensitivity to measures of interaural phase difference, commonly assumed to rely on binaural TFS sensitivity. The model demonstrates that binaural TFS sensitivity is considerably lower than monaural TFS sensitivity. Similar to FM thresholds, interaural phase difference sensitivity declined with age and hearing loss, although higher degradations were observed in binaural temporal processing compared to monaural processing. Overall, this model provides a novel tool to explore the mechanisms involved in FM processing in the normal auditory system and the degradations in FM sensitivity with ageing and hearing loss.
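Two ingredients named in this abstract can be sketched in a few lines: the time intervals between successive zero-crossings, and the combination of independent detectability indices. The sketch assumes the standard signal-detection-theory rule for independent Gaussian cues, d'_combined = sqrt(d'_env^2 + d'_TFS^2); the abstract says only that the indices were "optimally combined", so whether the published model uses exactly this rule is an assumption. The function names and the test tone are hypothetical, and the model's actual statistic on the interval distribution is not reproduced.

```python
import math

def zero_crossing_intervals(signal, fs):
    """Intervals (s) between successive zero-crossings of `signal`.

    Crossing times are refined by linear interpolation between samples.
    """
    times = []
    for i in range(1, len(signal)):
        a, b = signal[i - 1], signal[i]
        if (a < 0.0 <= b) or (a >= 0.0 > b):
            frac = a / (a - b)  # fractional position of the crossing
            times.append((i - 1 + frac) / fs)
    return [t2 - t1 for t1, t2 in zip(times, times[1:])]

def combine_dprime(d_env, d_tfs):
    """Root-sum-of-squares combination of two independent d' values
    (assumed rule; see the note above)."""
    return math.hypot(d_env, d_tfs)

if __name__ == "__main__":
    fs = 16000
    # 500 Hz tone with a small phase offset: crossings every half period.
    tone = [math.sin(2.0 * math.pi * 500.0 * i / fs + 0.1) for i in range(fs // 10)]
    intervals = zero_crossing_intervals(tone, fs)
    print(len(intervals), sum(intervals) / len(intervals))
    print(combine_dprime(1.0, 1.0))
```

For an unmodulated tone the intervals are essentially constant at half the period; frequency modulation spreads their distribution, which is the kind of cue a TFS path could exploit.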
