“…In a typical videoconference, these conversational cues, including facial expressions, tone of voice, and gestures, must be conveyed by the camera. However, operating the camera manually can be tedious and time-consuming, so many of the cues that people rely on in ordinary face-to-face communication can be lost [22], [23], [24]. This loss reduces the sense of presence in the videoconference.…”