Proceedings of the 19th ACM International Conference on Information and Knowledge Management 2010
DOI: 10.1145/1871437.1871735

Using Wikipedia categories for compact representations of chemical documents

Abstract: Today, Web pages are usually accessed using text search engines, whereas documents stored in the deep Web are accessed through domain-specific Web portals. These portals rely on external knowledge bases, respectively ontologies, mapping documents to more general concepts allowing for suitable classifications and navigational browsing. Since automatically generated ontologies are still not satisfactory for advanced information retrieval tasks, most portals heavily rely on hand-crafted domain-specific ontologies…

Cited by 7 publications (4 citation statements)
References 8 publications (9 reference statements)
“…In fact, numerous previous works leverage the use of Wikipedia categories. For example, Köhncke and Balke [11] exploit Wikipedia categories in order to generate useful descriptions for chemical documents. In their work, they identify chemical entities in documents and extract the categories of these entities.…”
Section: Related Work (mentioning)
confidence: 99%
“…Additionally, categories are organized in a graph in which sub-categories reference to top-level categories. The English Wikipedia has a total of 23 top-level categories (Main topic classifications), which we use to represent a profile 11 . The creation of semantically enhanced profiles consists of three stages.…”
Section: Fingerprints (mentioning)
confidence: 99%
“…Wikipedia categories have been successfully exploited with this purpose in different works: e.g. to describe chemical documents (Köhncke and Balke, 2010), to identify topics of interest for Twitter users (Michelson and Macskassy, 2010), and also to improve Web video categorization (Chen et al, 2010). Moreover, (Hahn et al, 2010) have shown that the structured information gathered from Wikipedia infoboxes can be used to answer complex questions, like "Which Rivers flow into the Rhine and are longer than 50 kilometers?"…”
Section: Multimodal Analytics and Semantic Enrichment (mentioning)
confidence: 99%
“…They fail to capture other useful semantic knowledge, e.g. Wikipedia category that contains much meaningful information in the form of a hierarchical ontology [11]. In addition, these methods failed to model and cluster documents represented with multiple feature space (or relations).…”
Section: Introduction (mentioning)
confidence: 99%