1998
DOI: 10.1007/bfb0026695
|View full text |Cite
|
Sign up to set email alerts
|

First-order learning for Web mining

Abstract: Abstract. We present compelling evidence that the World Wide Web is a domain in which applications can benefit from using first-order learning methods, since the graph structure inherent in hypertext naturally lends itself to a relational representation. We demonstrate strong advantages for two applications -learning classifiers for Web pages, and learning rules to discover relations among pages. I n t r o d u c t i o nIn recent years, there has been a large body of research centered around the topic of learni… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2001
2001
2017
2017

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 22 publications
(3 citation statements)
references
References 2 publications
(4 reference statements)
0
3
0
Order By: Relevance
“…Existing ILP systems include FOIL (Weber, 1996), which is a system for learning intensional concept definitions from relational tuples. It has recently been applied to web mining (Craven et al, 1998). GOLEM (Weber, 1996) is a 'classic' among empirical ILP systems.…”
Section: Inductive Logic Theorymentioning
confidence: 99%
“…Existing ILP systems include FOIL (Weber, 1996), which is a system for learning intensional concept definitions from relational tuples. It has recently been applied to web mining (Craven et al, 1998). GOLEM (Weber, 1996) is a 'classic' among empirical ILP systems.…”
Section: Inductive Logic Theorymentioning
confidence: 99%
“…The graph structure of the Web makes it an interesting domain for relational learning [2]. Moreover, Craven, Slattery, and Nigam demonstrated that for several Web-based learning tasks, a relational learning algorithm can learn more accurate classifiers than a common statistical approach [3]. Therefore, many researchers have been done to apply relational learning algorithms to web page classification.…”
Section: Introductionmentioning
confidence: 99%
“…It took a user-defined feature set together with a set of hand tagged training documents and learned rules for extraction. Craven et al [4] reported that greater accuracy could be achieved by representing each web page as a node in graph and each hyperlink an edge. Cardie [5] provided a list of learning-based IE problems, including the difficulty of obtaining enough training data and the lack of corpora annotated with the appropriate semantic and domainspecific supervisory information.…”
Section: Introductionmentioning
confidence: 99%