“…This “turn” entails not only the use of increasingly sophisticated statistical techniques in quantitative research, but also the expectation that even primarily qualitatively oriented studies will contain some degree of quantification of the phenomenon in question (see also Janda 2013). As a result, recent years have seen an increasing number of data-driven studies on language, gender, and sexuality, mainly focusing on aspects of the representation and construction of identities (e.g., Caldas-Coulthard & Moon 2010; Moon 2014; Baker & Levon 2015a, 2015b; Potts 2015; Norberg 2016; Taylor 2016); Konnelly (this issue), on the varying patterns of representation of men and women in US broadcast talk, contributes to this growing body of research. The advantage of large corpora in such studies lies in affording researchers the ability to uncover latent patterns of representation and to avoid “cherry-picking” only those results that fit their expectations, although, as noted by Baker (2014), corpus linguistic studies always involve an element of subjectivity, and therefore researcher reflexivity and the use of triangulation remain extremely important.…”