2011
DOI: 10.1007/978-3-642-22327-3_16
|View full text |Cite
|
Sign up to set email alerts
|

‘twazn me!!! ;(’ Automatic Authorship Analysis of Micro-Blogging Messages

Abstract: Abstract. In this paper we propose a set of stylistic markers for automatically attributing authorship to micro-blogging messages. The proposed markers include highly personal and idiosyncratic editing options, such as 'emoticons', interjections, punctuation, abbreviations and other low-level features. We evaluate the ability of these features to help discriminate the authorship of Twitter messages among three authors. For that purpose, we train SVM classifiers to learn stylometric models for each author based… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
8
0

Year Published

2013
2013
2021
2021

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 52 publications
(13 citation statements)
references
References 9 publications
(6 reference statements)
0
8
0
Order By: Relevance
“…Notwithstanding, in recent years forensic linguists have acknowledged the potential of computational analyses in forensic settings (see e.g. Sousa-Silva et al 2011;Grieve et al 2018), so the provision of forensic linguistics expertise is increasingly seen as indissociable from a lower or higher degree of computational analysis.…”
Section: Computational Forensic Linguisticsmentioning
confidence: 99%
“…Notwithstanding, in recent years forensic linguists have acknowledged the potential of computational analyses in forensic settings (see e.g. Sousa-Silva et al 2011;Grieve et al 2018), so the provision of forensic linguistics expertise is increasingly seen as indissociable from a lower or higher degree of computational analysis.…”
Section: Computational Forensic Linguisticsmentioning
confidence: 99%
“…Textual genre and length are key issues in AA. Most work on social media [2,3,33] have to tackle with the limited size of the texts.…”
Section: Related Workmentioning
confidence: 99%
“…[9] worked on gender classification (Author Profiling task) based on Twitter data in Portuguese. Silva et al [33] focused on idiosyncratic usage on a corpus of social media data using SVM.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations