Objectives. Everyday speech understanding frequently occurs in perceptually demanding environments, for example due to background noise and normal age-related hearing loss. The resulting degraded speech signals increase listening effort, which gives rise to negative downstream effects on subsequent memory and comprehension, even when speech is intelligible. In two experiments, we explored whether the presentation of realistic assistive text captioned speech offsets the negative effects of background noise and hearing impairment on multiple measures of speech memory.Design. In Experiment 1, young normal hearing adults (N = 48) listened to sentences for immediate recall and delayed recognition memory. Speech was presented in quiet or in two levels of background noise. Sentences were either presented as speech only or as text captioned speech. Thus, the experiment followed a 2 (caption vs no caption) x 3 (no noise, +7 dB SNR, +3 dB SNR) within-subjects design. In Experiment 2, a group of older adults (age range : 61 – 80, N = 31), with varying levels of hearing acuity completed the same experimental task as in Experiment 1. For both experiments, immediate recall, recognition memory accuracy, and recognition memory confidence were analyzed via general(ized) linear mixed effects models. In addition, we examined individual differences as a function of hearing acuity in Experiment 2.Results. In Experiment 1, we found that the presentation of realistic text-captioned speech in young normal-hearing listeners improved immediate recall, delayed recognition memory accuracy, and memory confidence compared to speech alone. Moreover, text captions attenuated the negative effects of background noise on all speech memory outcomes. In Experiment 2, we replicated the same pattern of results in a sample of older adults with varying levels of hearing acuity. Moreover, we showed that the negative effects of hearing loss on speech memory in older adulthood were attenuated by the presentation of text captions.Conclusion. Collectively, these findings suggest that listeners can rapidly integrate text and speech, and that the simultaneous presentation of text can offset the negative effects of effortful listening on speech memory.