Proceedings of the 2009 International Database Engineering & Applications Symposium (IDEAS '09), 2009
DOI: 10.1145/1620432.1620447

Near-duplicate detection for web-forums

Cited by 11 publications
(9 citation statements)
References 9 publications
“…whether the problem presented in the thread is solved or not) classification of forum threads [15]. Additionally, threading information has been shown to enhance retrieval effectiveness for post-level retrieval [7,4], thread-level retrieval [4,5], sentence-level shallow information extraction [16], and near-duplicate thread detection [17]. Moreover, Wang and Rose [13] demonstrated that initiation-response pairs (e.g.…”
Section: Related Work (mentioning)
confidence: 99%
“…Besides the approaches focused on Web pages or documents, Muthmann et al [20] proposed their work to identify threads with near-duplicate content and to group these threads in the search results. They incorporated text-based features, features based on extracted entities for products, and structure-based features to capture the near-duplicate threads.…”
Section: Related Work (mentioning)
confidence: 99%
“…Along with the increasing requirements, near-duplicate detection has received much attentions in recent years [24,25,11,26,20].…”
Section: Introduction (mentioning)
confidence: 99%
“…However, finding a solution to a specific problem is not as easy as it might seem. In fact, it has been estimated that many of the forum questions have already been answered at the time they are posted, but were not found [41].…”
Section: Searching In Developer Network (mentioning)
confidence: 99%