“…In recent years, deep neural networks (DNNs) have emerged as a powerful tool for representing complex visual data, such as images (LeCun et al, 2015) or videos (Liu et al, 2020). In the auditory domain, DNNs have been shown to provide valuable representations-so-called feature or latent spaces-for modeling the cerebral processing of sound (brain encoding) (speech: Kell et al, 2018;Millet et al, 2022;Tuckute & Feather, 2023; semantic content: Caucheteux et 3 al., 2023;Giordano et al, 2023;music: Güçlü et al, 2016), or reconstructing the stimuli listened by a participant (brain decoding) (Akbari et al, 2019). They have not yet been used to explain cerebral representations of identity-related information due in part to the focus on speech information (von Kriegstein et al, 2003).…”