“…The data comprised 60 recorded audio files in which a female native Japanese speaker read 30 artificially constructed sentences aloud twice at a natural speed. The sentences comprised 3 The source code of Prosodic DAA is available at https://github.com/EmergentSystemLabStudent/Prosodic-DAA 4 Japanese vowel native speech dataset: https://github.com/EmergentSystemLabStudent/aioi dataset five words {aioi, aue, ao, ie, uo}, which consisted of five Japanese vowels {a, i, u, e, o} representing {ä, i, W B , e fl , o fl } in phonetic symbols respectively. By combining the 5 words, the 30 sentences included 25 two-word sentences, e.g., "aioi," "aue ie," and "uo ao," and five three-word sentences i.e., "aioi uo ie," "aue ao ie," "ao ie ao," "ie uo," and "uo aue ie," were prepared.…”