2016
DOI: 10.1007/978-3-319-47874-6_15
|View full text |Cite
|
Sign up to set email alerts
|

Relevancer: Finding and Labeling Relevant Information in Tweet Collections

Abstract: Abstract. We introduce a tool that supports knowledge workers who want to gain insights from a tweet collection, but due to time constraints cannot go over all tweets. Our system first pre-processes, de-duplicates, and clusters the tweets. The detected clusters are presented to the expert as so-called information threads. Subsequently, based on the information thread labels provided by the expert, a classifier is trained that can be used to classify additional tweets. As a case study, the tool is evaluated on … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2017
2017
2022
2022

Publication Types

Select...
3
1
1

Relationship

2
3

Authors

Journals

citations
Cited by 5 publications
(3 citation statements)
references
References 7 publications
(8 reference statements)
0
3
0
Order By: Relevance
“…We could extend the training set with cluster mining using Relevancer (Hürriyetoglu et al, 2016), use the rule-based system to extend the training set (Hürriyetoglu, 2019), or use the rule-based system to generate fine-grained data that can be used in a multi-task setting.…”
Section: Discussionmentioning
confidence: 99%
“…We could extend the training set with cluster mining using Relevancer (Hürriyetoglu et al, 2016), use the rule-based system to extend the training set (Hürriyetoglu, 2019), or use the rule-based system to generate fine-grained data that can be used in a multi-task setting.…”
Section: Discussionmentioning
confidence: 99%
“…We could extend the training set with cluster mining using Relevancer (Hürriyetoglu et al, 2016), use the rule-based system to extend the training set (Hürriyetoglu, 2019), or use the rulebased system to generate fine-grained data that can be used in a multi-task setting.…”
Section: Discussionmentioning
confidence: 99%
“…For the information retrieval process positively labeled documents in a dataset are important and should not be missed, therefore achieving high recall is extremely important. However, there is generally a large number of documents that are relevant or not to the concerned topic and doing close reading for all documents and annotating them requires lots of time and resources (Hürriyetoglu et al, 2016;Hürriyetoǧlu et al, 2017). Therefore, ranking documents according to relevance to the investigated class may help to reduce close reading time and decrease the likelihood of missing critical information.…”
Section: Introductionmentioning
confidence: 99%