2010
DOI: 10.1016/j.eswa.2010.05.027

Tag tree template for Web information and schema extraction

Cited by 13 publications (15 citation statements)
References 18 publications
“…Main processes include message extraction from webpages, message filtering, and message storage [17].…”
Section: A the Framework Of The Proposed Methods (mentioning)
confidence: 99%
“…An automatic program which can get the pages from the Web is designed. We parse the pages using a tag-based template extraction method [17] and get the information of each post. On the other hand, we select the trading data, including stock code, closing price, and so on, from China A-share stock market between 2009 and 2012.…”
Section: A Dataset and Experiments Methods (mentioning)
confidence: 99%
“…On the contrary, more different rules are encouraged to use when facing different tasks. In addition, two third-party tools can function together: HTML tidy [3] and HTML Parser [7]. The former is a proposal that is intended to preprocess web documents by fixing their HTML code and converting it into XHTML.…”
Section: Discussion (mentioning)
confidence: 99%
“…Ji et al [19] proposed a tag tree algorithm, in which they detected and removed the shared part among web pages with the same template, and then the main text is retained. Also some other methods extract the knowledge with Regex rules from the HTML pages.…”
Section: Related Work (mentioning)
confidence: 99%
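
The last statement above describes the idea of detecting and removing the part shared among pages built from the same template, so that only the main text remains. The following is a minimal sketch of that general idea, not the cited paper's algorithm: it assumes that any text found at the same tag path in every page is template content. The names `PathTextCollector`, `collect`, and `main_text` are hypothetical, and the sketch uses only Python's standard `html.parser` module.

```python
from collections import Counter
from html.parser import HTMLParser


class PathTextCollector(HTMLParser):
    """Collect (tag path, text) pairs from one HTML document."""

    # Void elements never get a closing tag, so they are kept off the stack.
    VOID = {"br", "img", "hr", "meta", "link", "input"}

    def __init__(self):
        super().__init__()
        self.stack = []   # currently open tags, outermost first
        self.items = []   # (tag path, text) pairs in document order

    def handle_starttag(self, tag, attrs):
        if tag not in self.VOID:
            self.stack.append(tag)

    def handle_endtag(self, tag):
        # Pop back to the matching open tag, tolerating unclosed children.
        if tag in self.stack:
            while self.stack and self.stack.pop() != tag:
                pass

    def handle_data(self, data):
        text = data.strip()
        if text:
            self.items.append(("/".join(self.stack), text))


def collect(html):
    """Return the (tag path, text) pairs of one page."""
    parser = PathTextCollector()
    parser.feed(html)
    parser.close()
    return parser.items


def main_text(pages):
    """Drop text that occurs at the same tag path in every page.

    Assumption: content shared by all pages is template, the rest is main text.
    """
    parsed = [collect(page) for page in pages]
    counts = Counter()
    for items in parsed:
        counts.update(set(items))  # count each pair once per page
    shared = {item for item, n in counts.items() if n == len(pages)}
    return [[text for path, text in items if (path, text) not in shared]
            for items in parsed]


pages = [
    "<html><body><div>Home</div><p>Stock A rose.</p><div>Footer</div></body></html>",
    "<html><body><div>Home</div><p>Stock B fell.</p><div>Footer</div></body></html>",
]
print(main_text(pages))
```

With a single input page every pair counts as shared, so the sketch needs at least two pages from the same template to be useful.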