“…Five large acoustic feature sets were extracted from each individual spoken utterance, including (a) envelope modulation spectrum (EMS; 60 features), (b) rhythm metrics (12 features), (c) long-term average spectrum (LTAS; 99 features), (d) mel-frequency cepstral coefficients (MFCCs; 234 features), and (e) voice report (24 features). These feature sets, which have been previously reported on (e.g., Berisha, Liss, Sandoval, Utianski, & Spanias, 2014;Tu, Berisha, & Liss, 2017;Tu, Jiao, Berisha, & Liss, 2016;Willi et al, 2018), are considered to reflect rhythmic (EMS, rhythm metrics), articulatory (LTAS, MFCC), and phonatory (voice report) dimensions of speech signal behavior (e.g., Cleveland, Sunberg, & Stone, 2001;Dellwo, Fourcin, & Abberton, 2013;Liss, LeGendre, & Lotto, 2010; also see Borrie et al, 2019, for dimension justification). For comprehensive details of feature calculation, please refer to Supplemental Material S1.…”