“…Beyond literature, several evaluation corpora for authorship attribution studies have been built covering certain text domains such as online newspaper articles (Stamatatos, et al, 2000;Diederich, et al, 2003;Luyckx & Daelemans, 2005;Sanderson & Guenter, 2006), e-mail messages (de Vel, et al, 2001;Koppel & Schler, 2003), online forum messages (Argamon, et al, 2003;Abbasi & Chen, 2005;Zheng, et al, 2006), newswire stories (Khmelev & Teahan, 2003a;Zhao & Zobel, 2005), blogs (Koppel, Schler, Argamon, & Messeri, 2006), etc. Alternatively, corpora built for other purposes have also been used in the framework of authorship attribution studies including parts of the Reuters-21578 corpus (Teahan & Harper, 2003;Marton, et al, 2005), the Reuters Corpus Volume 1 (Khmelev & Teahan, 2003a;Madigan, et al, 2005;Stamatatos, 2007) and the TREC corpus (Zhao & Zobel, 2005) that were initially built for evaluating thematic text categorization tasks.…”