2013
DOI: 10.1613/jair.3940
|View full text |Cite
|
Sign up to set email alerts
|

Topic Segmentation and Labeling in Asynchronous Conversations

Abstract: Topic segmentation and labeling is often considered a prerequisite for higher-level conversation analysis and has been shown to be useful in many Natural Language Processing (NLP) applications. We present two new corpora of email and blog conversations annotated with topics, and evaluate annotator reliability for the segmentation and labeling tasks in these asynchronous conversations. We propose a complete computational framework for topic segmentation and labeling in asynchronous conversations. Our approach e… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
50
0

Year Published

2014
2014
2024
2024

Publication Types

Select...
3
3
2

Relationship

1
7

Authors

Journals

citations
Cited by 45 publications
(50 citation statements)
references
References 60 publications
(77 reference statements)
0
50
0
Order By: Relevance
“…We also compute average and maximum counts for different types of data to better understand what scale the visualization needs to deal with. These values are computed based on a set of 20 Slashdot blogs which comes with human generated topic annotations [JCN13]. Here, the topics and the sentiment are added, since they can be useful for performing almost all of the tasks in Table 1.…”
Section: From User Requirements To Design Principlesmentioning
confidence: 99%
See 2 more Smart Citations
“…We also compute average and maximum counts for different types of data to better understand what scale the visualization needs to deal with. These values are computed based on a set of 20 Slashdot blogs which comes with human generated topic annotations [JCN13]. Here, the topics and the sentiment are added, since they can be useful for performing almost all of the tasks in Table 1.…”
Section: From User Requirements To Design Principlesmentioning
confidence: 99%
“…Since the internet revolution and the subsequent rise of social media, an ever‐increasing amount of human conversations are generated in many different modalities. These conversations are primarily asynchronous in nature, where the participants communicate with each other at different times (e.g., emails, blogs, forums, Twitter, Facebook) [JCN13]. Often many people contribute to the discussion, which can quickly become very long with hundreds of comments and replies.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Even the interpretations of topics can vary among people according to expertise and the current task in hand. In fact, during topic annotations by human experts, there was considerable disagreement on the number of topics and on the assignment of sentences to topic clusters (Joty et al, 2013b). Depending on user's mental model and current tasks, the topic modeling results may require to be more specific in some cases, and more generic in other cases.…”
Section: Human In the Loop: Interactive Topic Revisionmentioning
confidence: 99%
“…We test this hypothesis using a baseline that extracts labels from the comment clusters only. We adopt phrase or term as the most suitable linguistic unit to represent labels as evidenced by several previous studies (Mei et al, 2007;Joty et al, 2013;. This paper is organised as follows.…”
Section: Introductionmentioning
confidence: 99%