“…Other studies explored mismatches in multisensory contexts, including the congruity of appearance and voice (Mitchell et al., 2011; Hastie et al., 2017; Cabral et al., 2017; Stein and Ohler, 2018; McGinn and Torre, 2019). One drawback of these studies is that they usually evaluate artificial agents with subjective measures, such as fear and eeriness (Mitchell et al., 2011), credibility or attractiveness (Stein and Ohler, 2018), politeness and lifelikeness (Hastie et al., 2017), likability, expressiveness, and understandability (Cabral et al., 2017), drawings (Mara et al., 2020), or emotion labeling (Tsiourti et al., 2019), rather than with more objective measures such as reaction time or accuracy, which would be more informative about the basic perceptual processes.…”