2007
DOI: 10.1007/978-3-540-75563-0_38
|View full text |Cite
|
Sign up to set email alerts
|

Automatic Hidden-Web Table Interpretation by Sibling Page Comparison

Abstract: Abstract. The longstanding problem of automatic table interpretation still illudes us. Its solution would not only be an aid to table processing applications such as large volume table conversion, but would also be an aid in solving related problems such as information extraction and semi-structured data management. In this paper, we offer a conceptual modeling solution for the common special case in which so-called sibling pages are available. The sibling pages we consider are pages on the hidden web, commonl… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
5
0

Year Published

2009
2009
2011
2011

Publication Types

Select...
5
1

Relationship

4
2

Authors

Journals

citations
Cited by 7 publications
(5 citation statements)
references
References 18 publications
0
5
0
Order By: Relevance
“…A user can then modify the form, 9 if desired, and use it to harvest information. We have implemented this reverse-engineering of tables into FOCIH forms based on a system called TISP (Table Interpretation for Sibling Pages) [29,30]. TISP converts tables from sites like hidden-web sites that have machinegenerated sibling pages into FOCIH forms and thus into FOCIH-generated ontologies.…”
Section: Further Reduction Of Labor-intensive Tasksmentioning
confidence: 99%
“…A user can then modify the form, 9 if desired, and use it to harvest information. We have implemented this reverse-engineering of tables into FOCIH forms based on a system called TISP (Table Interpretation for Sibling Pages) [29,30]. TISP converts tables from sites like hidden-web sites that have machinegenerated sibling pages into FOCIH forms and thus into FOCIH-generated ontologies.…”
Section: Further Reduction Of Labor-intensive Tasksmentioning
confidence: 99%
“…There is ample evidence that automating table understanding, or even merely verifying claims to this effect, is very difficult [21,22,23]. As mentioned, our project is the front end of a larger undertaking that endeavors to create narrow-domain ontologies by combining information from web tables [1,24,25]. Suppose, for instance, that we process the left-hand table in Figure 8 and include it into the ontology.…”
Section: Labeled Table Candidates For Which Wang Notation Exists Are mentioning
confidence: 99%
“…We call our table-interpretation system TISP (Table Interpretation with Sibling Pages) [39]. Table from WormBase.…”
Section: Model(s)amino Acids 2 ) → 406 Aa Where Identificationgenementioning
confidence: 99%