“…Pitch, as a perceptual measure of the fundamental frequency (F0) of speech signals [1], is a powerful prosodic cue for auditory perception. Pitch features have long been known to be useful for recognition of normal speech, especially in tonal languages such as Mandarin [2,3,4], Cantonese [5,6], Vietnamese [7,8] and Thai [9,10], since pitch is an informative cue for distinguishing the tones of these languages [11]. In non-tonal languages, for instance English [12,13,14] and Japanese [15,16], it is also feasible to treat pitch as auxiliary information, concatenating it with acoustic features to improve speech recognition performance.…”