“…There is comparatively less work in the literature on automated analysis of code-switched speech, partially due to the relative lack of structured corpora (as compared to those for textbased work) and also potentially because it also poses yet another significant challenge in the form of speech recognition for multiple languages. Nonetheless, some researchers have made strong strides in spoken corpus development to support such research in certain language pairs, for instance, Mandarin-English [21,22], Cantonese-English [23] and Hindi-English [24], which have in turn led to developments in automatic speech recognition [25,26] and language modeling [27]. However, these are limited; there remains a need for more codeswitched speech resources in these and other languages to spur research into the automated processing and analysis of such data.…”