“…As the dataset has been a popular and useful resource, it has been further extended with captions in other languages such as Chinese (Li et al, 2016) and Turkish (Unal et al, 2016). However, (Federmann and Lewis, 2017) 4.5-10h audio 7k-18k segments de, en, fr, ja, zh IWSLT '18 (Niehues et al, 2018) 1,565 audio clips 171k segments de, en LibriSpeech (Kocabiyikoglu et al, 2018) 236h audio 131k segments en, fr MuST-C (Di Gangi et al, 2019a) 385-504h audio 211k-280k segments 10 languages MaSS (Boito et al, 2019) 18.5-23h audio 8.2k segments 8 languages as these captions were independently crowdsourced, they are not translations of each other, which makes them less effective for MMT.…”