“…Most previous work within the NLP community applies distant reading (Jockers, 2013) to large collections of books, focusing on modeling different aspects of narratives such as plots and event sequences (Chambers and Jurafsky, 2009;McIntyre and Lapata, 2010;Goyal et al, 2010;Eisenberg and Finlayson, 2017), characters (Bamman et al, 2014;Iyyer et al, 2016;Chaturvedi et al, , 2017, and narrative similarity (Chaturvedi et al, 2018). In the same vein, researchers in computational literary analysis have combined statistical techniques and linguistics theories to perform quantitative analysis on large narrative texts (Michel et al, 2011;Franzosi, 2010;Underwood, 2016;Jockers and Kirilloff, 2016;Long and So, 2016), but these attempts largely rely on techniques such as word counting, topic modeling, and naive Bayes classifiers and are therefore not able to capture the meaning of sentences or paragraphs (Da, 2019).…”