“…Indeed, familiar voice recognition is relatively robust to speech content generated variations in low‐level acoustic features (e.g., Zäske et al, 2010), or vocal disguise (e.g., Hollien et al, 1982). Both face (Jeffrey et al, 2011; Valentine, 1991) and voice representations (Latinus & Belin, 2011; Lavan, Knight, & McGettigan, 2019; Papcun et al, 1989; Patel, 2008) also appear to be coded using corresponding modality‐specific mechanisms (see also recent research consistent with this proposal: Lavan & Mcgettigan, 2019; Lavan et al, 2020). Nevertheless, although initial coding and recognition processes are physiologically independent, once face or voice representations are accessed (Belin et al, 2004), cross‐modal familiar identity‐related information and representations (i.e., name, face, voice) can be linked (for a recent review see Tsantani et al, 2019).…”