“…ASR takes into account (i) acoustic, lexical, and syntactic information; and (ii) semantic knowledge. Acoustic model (AM) processing includes speech coding [1], speech enhancement [2], source separation [2,3], speech security (e.g., steganography [4][5][6] and watermarking [7][8][9]), and other technologies that can all be used in audio analysis. On the other hand, semantic model (SM) processing, commonly known in the literature as language model (LM) processing, includes all techniques of natural language processing (NLP).…”