2012
DOI: 10.3844/jcssp.2012.2053.2061
|View full text |Cite
|
Sign up to set email alerts
|

Web Document Segmentation Using Frequent Term Sets for Summarization

Abstract: Query sensitive summarization aims at extracting the query relevant contents from web documents. Web page segmentation focuses on reducing the run time overhead of the summarization systems by grouping the related contents of a web page into segments. At query time, query relevant segments of the web page are identified and important sentences from these segments are extracted to compose the summary. DOM tree structures of the web documents are utilized to perform the segmentation of the contents. Leaf nodes o… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2014
2014
2020
2020

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 19 publications
0
2
0
Order By: Relevance
“…The most common one (though used in only four publications) is that of a visual "block" with coherent content [9,24,26,37]. Other definitions characterize segments by their edges [12,13], as being semantically self-contained [16], as distinct [30], or as labeled with a heading [28]. Only two papers resort to HTML/DOM elements or sub-trees as segment building blocks [9,24].…”
Section: Concept Formation: Page Segmentmentioning
confidence: 99%
“…The most common one (though used in only four publications) is that of a visual "block" with coherent content [9,24,26,37]. Other definitions characterize segments by their edges [12,13], as being semantically self-contained [16], as distinct [30], or as labeled with a heading [28]. Only two papers resort to HTML/DOM elements or sub-trees as segment building blocks [9,24].…”
Section: Concept Formation: Page Segmentmentioning
confidence: 99%
“…WSDL provide the foundation for composition of web service, by providing the support in information exchange between the service, it is not rich enough to specify the semantic of the composition and they are not understand by machine. Pasupathi et al (2012), focused on segment the content of web document that highly related with query. It is an simple attempt made over the text comparision.…”
Section: Science Publicationsmentioning
confidence: 99%