2000
DOI: 10.21236/ada477533
|View full text |Cite
|
Sign up to set email alerts
|

Comparing Effectiveness in TDT and IR

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
12
0

Year Published

2002
2002
2015
2015

Publication Types

Select...
3
2
1

Relationship

0
6

Authors

Journals

citations
Cited by 10 publications
(15 citation statements)
references
References 0 publications
0
12
0
Order By: Relevance
“…It was argued by Allan et al [4] that there were different similarity measures, ranging from cosine distance measure to distributional language models. An example of similarity measure was given in which agglomerative clustering methods were extensively explored to determine if a news story is describing a new event or a redundant one as compared to the previously identified documents.…”
Section: Literature Reviewmentioning
confidence: 99%
See 1 more Smart Citation
“…It was argued by Allan et al [4] that there were different similarity measures, ranging from cosine distance measure to distributional language models. An example of similarity measure was given in which agglomerative clustering methods were extensively explored to determine if a news story is describing a new event or a redundant one as compared to the previously identified documents.…”
Section: Literature Reviewmentioning
confidence: 99%
“…There is a large volume of published studies utilising TDT test data for evaluation, in accordance with [4]. The TDT benchmark collection is poorly annotated: TDT5 covers 278,109 English news events but only 100 themes and approximately 4,500 labelled documents.…”
Section: Benchmark Datasetmentioning
confidence: 99%
“…Each new document is then compared to the previous ones, and if it has similarity to the closest document (or centroid) below a certain threshold, the new document is declared as a First Story or new event. This approach is used in the UMass and the CMU system [4].…”
Section: Vector Space Modelmentioning
confidence: 99%
“…Most approaches to the story link detection task relied on text similarity. For example the cosine similarity and the clarity metric have been found to be very effective [3,16]. In addition, many approaches focused on matching named entities.…”
Section: Related Workmentioning
confidence: 99%