An Integrated Model for User Attribute Discovery: A Case Study on Political Affiliation Identification

Gottipati, Swapna; Qiu, Minghui; Liu, Yang; Zhu, Feida; Jiang, Jing

doi:10.1007/978-3-319-06608-0_36

Cited by 3 publications

(1 citation statement)

References 23 publications

“…Some of the attributes targeted for extraction focus on demographic related information, such as gender/age (Koppel et al, 2002; Mukherjee and Liu, 2010; Burger et al, 2011; Van Durme, 2012), race/ethnicity (Pennacchiotti and Popescu, 2011; Eisenstein et al, 2011; Rao et al, 2011), location (Bamman et al, 2014), yet other aspects are mined as well, among them emotion and sentiment, personality types (Schwartz et al, 2013), user political affiliation (Cohen and Ruths, 2013; Volkova and Durme, 2015), mental health diagnosis (Coppersmith et al, 2015) and even lifestyle choices such as coffee preference (Pennacchiotti and Popescu, 2011). The task is typically approached from a machine learning perspective, with data originating from a variety of user generated content, most often microblogs (Pennacchiotti and Popescu, 2011; Coppersmith et al, 2015), article comments to news stories or op-ed pieces (Riordan et al, 2014), social posts (originating from sites such as Facebook, MySpace, Google+) (Gong et al, 2012), or discussion forums on particular topics (Gottipati et al, 2014). Classification labels are then assigned either based on manual annotations, self identified user attributes (Pennacchiotti and Popescu, 2011), affiliation with a given discussion forum type, or online surveys set up to link a social media user identification to the responses provided (Schwartz et al, 2013).…”

Section: Related Work (mentioning)

confidence: 99%

A Computational Analysis of the Language of Drug Addiction

Strapparava; Mihalcea

2017

Proceedings of the 15th Conference of the European Chapter of The Association for Computational Linguistics: Volume 2

We present a computational analysis of the language of drug users when talking about their drug experiences. We introduce a new dataset of over 4,000 descriptions of experiences reported by users of four main drug types, and show that we can predict with an F1-score of up to 88% the drug behind a certain experience. We also perform an analysis of the dominant psycholinguistic processes and dominant emotions associated with each drug type, which sheds light on the characteristics of drug users.
