Viktor de Boer scite author profile

Wielinga

2006

Abstract. In this document we describe our approach to a specific subtask of ontology population, the extraction of instances of relations. We present a generic approach with which we are able to extract information from documents on the Web. The method exploits redundancy of information to compensate for loss of precision caused by the use of domain independent extraction methods. In this paper, we present the general approach and describe our implementation for a specific relation instance extraction task in the art domain. For this task, we describe experiments, discuss evaluation measures and present the results.

show abstract

A redundancy-based method for the extraction of relation instances from the Web

International Journal of Human-Computer Studies

Wielinga

2007

Classifying Web Pages With Visual Features

Boer¹,

Lupascu³

2010

Abstract:To automatically classify and process web pages, current systems use the textual content of those pages, including both the displayed content and the underlying (HTML) code. However, a very important feature of a web page is its visual appearance. In this paper, we show that using generic visual features we can classify the web pages for several different types of tasks. The features used in this document are simple color and edge histograms, Gabor and texture features. These were extracted using an off-the-shelf visual feature extraction method. In three experiments, we classify web pages based on their aesthetic value, their recency and the type of website. Results show that these simple, global visual features already produce good classification results. We also introduce an online tool that uses the trained classifiers to assess new web pages.

show abstract

Relation Instantiation for Ontology Population Using the Web

Wielinga

2007

Web Page Classification Using Image Analysis Features

Lupascu³

2011