“…Some of the attributes targeted for extraction focus on demographic related information, such as gender/age (Koppel et al, 2002;Mukherjee and Liu, 2010;Burger et al, 2011;Van Durme, 2012;, race/ethnicity (Pennacchiotti and Popescu, 2011;Eisenstein et al, 2011;Rao et al, 2011;, location (Bamman et al, 2014), yet other aspects are mined as well, among them emotion and sentiment , personality types (Schwartz et al, 2013;, user political affiliation (Cohen and Ruths, 2013;Volkova and Durme, 2015), mental health diagnosis (Coppersmith et al, 2015) and even lifestyle choices such as coffee preference (Pennacchiotti and Popescu, 2011). The task is typically approached from a machine learning perspective, with data originating from a variety of user generated content, most often microblogs (Pennacchiotti and Popescu, 2011;Coppersmith et al, 2015;, article com-ments to news stories or op-ed pieces (Riordan et al, 2014), social posts (originating from sites such as Facebook, MySpace, Google+) (Gong et al, 2012), or discussion forums on particular topics (Gottipati et al, 2014). Classification labels are then assigned either based on manual annotations , self identified user attributes (Pennacchiotti and Popescu, 2011), affiliation with a given discussion forum type, or online surveys set up to link a social media user identification to the responses provided (Schwartz et al, 2013).…”