2003 Symposium on Applications and the Internet Workshops, 2003. Proceedings.
DOI: 10.1109/saintw.2003.1210172
|View full text |Cite
|
Sign up to set email alerts
|

Information extraction from HTML pages and its integration

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
2
0
1

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(3 citation statements)
references
References 4 publications
0
2
0
1
Order By: Relevance
“…Peneliti lain melakukan integrasi data dari sejumlah tabel web yang berbentuk column atau row wise [6]. Mereka menambahkan kasus jika dalam satu cell terdapat beberapa pasangan atribut-data values.…”
Section: Tabel III Contoh Mixed-cellunclassified
“…Peneliti lain melakukan integrasi data dari sejumlah tabel web yang berbentuk column atau row wise [6]. Mereka menambahkan kasus jika dalam satu cell terdapat beberapa pasangan atribut-data values.…”
Section: Tabel III Contoh Mixed-cellunclassified
“…This method focuses on the Japanese language. Itai et al [10] integrate various tables in a specific domain into an XML document based on a DTD (Document Type Definition). This method segments table cells into attributes and values by applying SVM (Support Vector Machines) and HMM (Hidden Markov Model) techniques.…”
Section: Related Workmentioning
confidence: 99%
“…The former determines whether the TABLE tags are used to represent genuine tables or not [3]. Meanwhile, the latter analyzes the logical structure of a table and extracts its attribute-value relationships [1][2] [4][5][6][7][8][9][10][11][12][13][14][15][16][17][18][19][20]. In this paper, we focus on structure recognition.…”
Section: Introductionmentioning
confidence: 99%