“…Active learning for speech recognition aims at identifying the most informative utterances to be manually transcribed from a large pool of unlabeled speech. This topic has been extensively explored on a number of different fronts, including the use of uncertainty-based sampling to select informative speech samples [6,7,8,9], active learning for low-resource speech recognition [10,11], combined active and semi-supervised learning [12] and active learning for arXiv:2103.03142v1 [cs.SD] 4 Mar 2021 end-to-end ASR systems [13,14]. In active learning, the goal is to select informative speech samples that are subsequently transcribed, while our work focuses on the reverse problem of selecting informative sentences that are subsequently recorded as speech.…”