“…Monolingual spoken data is increasingly needed to achieve linguistic coverage in automatic speech recognition and text-to-speech research and development. Examples include the Switchboard corpus (Godfrey & Holliman, 1993) for English conversational telephone speech; the CALLHOME speech corpora (Canavan, Graff, & Zipperlen, 1997), consisting of telephone conversations in several languages; the English Boston University Radio Speech Corpus (Ostendorf, Price, & Shattuck-Hufnagel, 1996); Rhapsodie (Lacheret et al., 2014), a French speech corpus with prosodic, syntactic and orthographic annotations; DEMoS (Parada-Cabaleiro et al., 2019), an Italian emotional speech corpus; RSC (Georgescu et al., 2020), a Romanian read speech corpus for automatic speech recognition; and TV3Parla (Külebi & Öktem, 2018) and ParlamentParla (Külebi et al., 2020), parliamentary and television speech corpora for Catalan.…”