Proceedings of the 13th International World Wide Web Conference on Alternate Track Papers & Posters - WWW Alt. '04 2004
DOI: 10.1145/1013367.1013545

OntoMiner

Abstract: In this paper, we present automated techniques for bootstrapping and populating specialized domain ontologies by organizing and mining a set of relevant overlapping Web sites provided by the user. We develop algorithms that detect and utilize HTML regularities in the Web documents to turn them into hierarchical semantic structures encoded as XML. Next, we present tree-mining algorithms that identify key domain concepts and their taxonomical relationships. We also extract semi-structured concept instances annot…

Cited by 13 publications
References 3 publications