“… Chen et al. (2022) considered both text and image as input and quantified their ambiguity by computing cross-modal correlations. Others moved beyond text and image, and leveraged contextual features on propagation (Bian et al., 2020; Zhou and Zafarani, 2019), source (Yuan, Ma, Zhou, Han, & Hu, 2020), and time (Allein et al., 2021; Song, Shu, and Wu, 2021). Shu, Mahudeswaran, Wang, and Liu (2020), for example, combined multiple contextual features in multimodal, hierarchical propagation networks using linguistic, structural, and temporal features from micro-level and macro-level propagation networks to detect fake news on Twitter.…”