“…facial movement, expression, head pose) [34,36], conversational behaviors (e.g., voice activity, adjacency pairs, backchannels, turn length) [18,35,37], laughter [38], and posture [39]. Engagement recognition modules based on these multi-modal features were implemented in agent systems and empirically tested with real users [36].…”