Proceedings of the 9th International Workshop on Semantic Evaluation (SemEval 2015) 2015
DOI: 10.18653/v1/s15-2143

USAAR-CHRONOS: Crawling the Web for Temporal Annotations

Abstract: This paper describes the USAAR-CHRONOS participation in the Diachronic Text Evaluation task of SemEval-2015 to identify the time period of historical text snippets. We adapt a web crawler to retrieve the original source of the text snippets and determine the publication year of the retrieved texts from their URLs. We report a precision score of >90% in identifying the text epoch. Additionally, by crawling and cleaning the website that hosts the source of the text snippets, we present Daikon, a corpus that can …
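The abstract says the publication year is read from the URL of the retrieved source, but the exact rule is not given here. The following is a minimal sketch of one way this could work, assuming the year appears in the URL as a standalone four-digit token. The function name `year_from_url`, the 1500-2099 range, and the example URLs are illustrative assumptions, not taken from the paper.

```python
import re
from typing import Optional

# Assumption: a publication year is a four-digit number between 1500 and 2099
# that is not embedded in a longer run of digits (e.g. an article ID).
YEAR_RE = re.compile(r"(?<!\d)(1[5-9]\d{2}|20\d{2})(?!\d)")


def year_from_url(url: str) -> Optional[int]:
    """Return the first plausible year found in the URL, or None if there is none."""
    match = YEAR_RE.search(url)
    return int(match.group(1)) if match else None


if __name__ == "__main__":
    # Hypothetical URLs, used only to illustrate the behaviour.
    print(year_from_url("http://example.org/archive/1843/05/article-12.html"))
    print(year_from_url("http://example.org/about"))
```

Taking the first match is a deliberate simplification; a real archive might place other four-digit numbers earlier in the path, in which case a site-specific pattern would be safer.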

Cited by 3 publications (1 citation statement). References 14 publications.

“…The Wikipedia List of Lists of Lists (LOLOL) is a crowdsourced list of lists of terms. We adapted the customized crawler (Tan et al., 2014; Tan and Ordan, 2015) to crawl for tables or bullet points in the Wikipedia subpages of the LOLOL for the food domain. We started the crawl from these seed pages under the bullet point of https://en.wikipedia.org/wiki/List_of_lists_of_lists#Food_and_drink.…”
Section: Extending a Taxonomy With Wikipedia List Of Lists Of Lists (mentioning)
confidence: 99%
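
The citing paper describes crawling Wikipedia subpages for tables or bullet points. As a rough illustration of that extraction step only (not the authors' crawler), the sketch below collects the text of list items and table cells from an HTML string using Python's standard `html.parser`. The class name `ListItemCollector`, the helper `extract_terms`, and the sample HTML are hypothetical.

```python
from html.parser import HTMLParser
from typing import List


class ListItemCollector(HTMLParser):
    """Collect the visible text of every top-level <li> and <td> element."""

    TARGET_TAGS = ("li", "td")

    def __init__(self) -> None:
        super().__init__()
        self.items: List[str] = []
        self._depth = 0          # nesting depth inside target tags
        self._buffer: List[str] = []

    def handle_starttag(self, tag, attrs):
        if tag in self.TARGET_TAGS:
            if self._depth == 0:
                self._buffer = []
            self._depth += 1

    def handle_endtag(self, tag):
        if tag in self.TARGET_TAGS and self._depth > 0:
            self._depth -= 1
            if self._depth == 0:
                # Normalise whitespace and skip empty cells.
                text = " ".join("".join(self._buffer).split())
                if text:
                    self.items.append(text)

    def handle_data(self, data):
        if self._depth > 0:
            self._buffer.append(data)


def extract_terms(html: str) -> List[str]:
    """Return the text of bullet points and table cells found in the HTML."""
    collector = ListItemCollector()
    collector.feed(html)
    collector.close()
    return collector.items


if __name__ == "__main__":
    sample = (
        "<ul><li><a href='/wiki/X'>List of cheeses</a></li>"
        "<li>List of  breads</li></ul>"
        "<table><tr><td>Tea</td></tr></table>"
    )
    print(extract_terms(sample))
```

Nested list items are merged into their enclosing item here, which keeps the sketch short; fetching the pages and following links to subpages is left out entirely.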