Text as a Supplement to Speech in Young and Older Adults

Krull, Vidya; Humes, Larry E.

doi:10.1097/aud.0000000000000234

Cited by 17 publications

(23 citation statements)

References 52 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Interestingly, they actually found evidence for increases in listening effort (operationalized as reaction time) with the visual task, despite subjective reports that the task was easier. In contrast, our results fit with the prior literature on assistive text captioning which suggest that visual text cues can provide a substantial benefit to speech processing (e.g., Krull & Humes, 2016;Gordon-Salant & Callahan, 2009;Grossman & Rajan, 2017).…”

Section: Discussionsupporting

confidence: 89%

“…One recent paper estimated that some popular ASR systems have word error rates between 9 and 34%; Këpuska & Bohouta, 2017). As noted previously, both Krull and Humes (2016) and Zekveld and colleagues (2008) used ASR technology to generate the text that accompanied the speech, which would allow for the introduction of realistic captioning errors.…”

Section: Discussionmentioning

confidence: 99%

See 1 more Smart Citation

Text captioning buffers against the effects of background noise and hearing loss on memory for speech

Payne¹,

Silcox²,

Crandell³

et al. 2020

Preprint

View full text Add to dashboard Cite

Objectives. Everyday speech understanding frequently occurs in perceptually demanding environments, for example due to background noise and normal age-related hearing loss. The resulting degraded speech signals increase listening effort, which gives rise to negative downstream effects on subsequent memory and comprehension, even when speech is intelligible. In two experiments, we explored whether the presentation of realistic assistive text captioned speech offsets the negative effects of background noise and hearing impairment on multiple measures of speech memory.Design. In Experiment 1, young normal hearing adults (N = 48) listened to sentences for immediate recall and delayed recognition memory. Speech was presented in quiet or in two levels of background noise. Sentences were either presented as speech only or as text captioned speech. Thus, the experiment followed a 2 (caption vs no caption) x 3 (no noise, +7 dB SNR, +3 dB SNR) within-subjects design. In Experiment 2, a group of older adults (age range : 61 – 80, N = 31), with varying levels of hearing acuity completed the same experimental task as in Experiment 1. For both experiments, immediate recall, recognition memory accuracy, and recognition memory confidence were analyzed via general(ized) linear mixed effects models. In addition, we examined individual differences as a function of hearing acuity in Experiment 2.Results. In Experiment 1, we found that the presentation of realistic text-captioned speech in young normal-hearing listeners improved immediate recall, delayed recognition memory accuracy, and memory confidence compared to speech alone. Moreover, text captions attenuated the negative effects of background noise on all speech memory outcomes. In Experiment 2, we replicated the same pattern of results in a sample of older adults with varying levels of hearing acuity. Moreover, we showed that the negative effects of hearing loss on speech memory in older adulthood were attenuated by the presentation of text captions.Conclusion. Collectively, these findings suggest that listeners can rapidly integrate text and speech, and that the simultaneous presentation of text can offset the negative effects of effortful listening on speech memory.

show abstract

Section: Discussionsupporting

confidence: 89%

Section: Discussionmentioning

confidence: 99%

Text captioning buffers against the effects of background noise and hearing loss on memory for speech

Payne¹,

Silcox²,

Crandell³

et al. 2020

Preprint

View full text Add to dashboard Cite

show abstract

“…Captions may be helpful in a variety of conditions where, for example, the sound system is poor, there are foreign accents, there is interfering speech or noise, or the viewer’s speech perception is affected by hearing loss. Both older and younger individuals modulate their emphasis on captions versus spoken speech depending on a variety of factors, including the SNR of the acoustic signal and the accuracy of the caption (Krull & Humes, 2016). Assuming an accurate caption, one can learn the content of the spoken message simply by reading the caption text.…”

Section: Discussionmentioning

confidence: 99%

The Effect of Aging and Priming on Same/Different Judgments Between Text and Partially Masked Speech

et al. 2017

View full text Add to dashboard Cite

Objectives It is well known from previous research that when listeners are told what they are about to hear before a degraded or partially masked auditory signal is presented, the speech signal “pops out” of the background and becomes considerably more intelligible. The goal of this research was to explore whether this priming effect is as strong in older adults as in younger adults. Design Fifty-six adults – 28 older and 28 younger – listened to “nonsense” sentences spoken by a female talker in the presence of a two-talker speech masker (also female) or a fluctuating speech-like noise masker at five signal-to-noise ratios. Just prior to, or just after, the auditory signal was presented, a typed caption was displayed on a computer screen. The caption sentence was either identical to the auditory sentence or differed by one key word. The subjects’ task was to decide whether the caption and auditory messages were the same or different. Discrimination performance was reported in d′. The strength of the pop-out perception was inferred from the improvement in performance that was expected from the caption-before order of presentation. A subset of 12 subjects from each group made confidence judgments as they gave their responses, and also completed several cognitive tests. Results Data showed a clear order effect for both subject groups and both maskers, with better same-different discrimination performance for the caption-before condition than the caption-after condition. However, for the two-talker masker, the younger adults obtained a larger and more consistent benefit from the caption-before order than the older adults across signal-to-noise ratios. Especially at the poorer signal-to-noise ratios, older subjects showed little evidence that they experienced the pop-out effect that is presumed to make the discrimination task easier. On average, older subjects also appeared to approach the task differently, being more reluctant than younger subjects to report that the captions and auditory sentences were same. Correlation analyses indicated a significant negative association between age and priming benefit in the two-talker masker and non-significant associations between priming benefit in this masker and either high-frequency hearing loss or performance on the cognitive tasks. Conclusions Previous studies have shown that older adults are at least as good, if not better, at exploiting context in speech recognition, as compared to younger adults. The current results are not in disagreement with those findings but suggest that, under some conditions, the automatic priming process that may contribute to context benefits is not as strong in older as in younger adults.

show abstract

“…In other words, spatially separating the target from the masker 'releases' the target from auditory masking. Somewhat surprisingly, providing listeners with information from a different modality prior to or during the presentation of masked sentences facilitates speech reception in noise (Freyman, Balakrishnan, & Helfer, 2004;Krull & Humes, 2016). For example, Freyman, Balakrishnan, and Helfer had participants report the last word of a syntactically correct but semantically anomalous sentence (e.g., BA rose could paint a fish.^) when it was masked by two other simultaneously presented sentences of the same type spoken in two different voices.…”

mentioning

confidence: 99%

Do age and linguistic background alter the audiovisual advantage when listening to speech in the presence of energetic and informational masking?

Avivi-Reich

Puka

Schneider

2017

Atten Percept Psychophys

View full text Add to dashboard Cite

We examined how the type of masker presented in the background affected the extent to which visual information enhanced speech recognition, and whether the effect was dependent on or independent of age and linguistic competence. In the present study, young speakers of English as a first language (YEL1) and English as a second language (YEL2), as well as older speakers of English as a first language (OEL1), were asked to complete an audio (A) and an audiovisual (AV) speech recognition task in which they listened to anomalous target sentences presented against a background of one of three masker types (noise, babble, and competing speech). All three main effects were found to be statistically significant (group, masker type, A vs. AV presentation type). Interesting two-way interactions were found between masker type and group and between masker type and presentation type; however, no interactions were found between group (age and/or linguistic competence) and presentation type (A vs. AV). The results of this study, while they shed light on the effect of masker type on the AV advantage, suggest that age and linguistic competence have no significant effects on the extent to which a listener is able to use visual information to improve speech recognition in background noise.

show abstract

Text as a Supplement to Speech in Young and Older Adults

Cited by 17 publications

References 52 publications

Text captioning buffers against the effects of background noise and hearing loss on memory for speech

Text captioning buffers against the effects of background noise and hearing loss on memory for speech

The Effect of Aging and Priming on Same/Different Judgments Between Text and Partially Masked Speech

Do age and linguistic background alter the audiovisual advantage when listening to speech in the presence of energetic and informational masking?

Contact Info

Product

Resources

About