“…Despite the long-standing and widespread attention, the AGC task is extremely challenging due to multiple factors. Previous works differ with respect to 1) the notion of genre, which represents a quite heterogeneous object of study (Biber and Conrad, 2009), 2) the typologies of text properties, either token- (Crossley and Louwerse, 2007;HaCohen-Kerner et al, 2020;Mehler et al, 2010;Santini, 2004) or sentence-based characteristics (Cimino et al, 2017;Fang and Cao, 2010;Stamatatos et al, 2001;Wan et al, 2019) or also accounting for the discourse structure of texts (Sun et al, 2021), 3) the machine learning approach adopted (Worsham and Kalita, 2018), and 4) the source exploited to classify the genre, either the book content (Rahul et al, 2021;Shamir, 2020;Worsham and Kalita, 2018), title or summary (Ozsarfati et al, 2019), descriptions on websites (Sobkowicz et al, 2018), or cover design (Buczkowski et al, 2018).…”