Proceedings of the Fourteenth ACM Conference on Hypertext and Hypermedia - HYPERTEXT '03 2003
DOI: 10.1145/900095.900097
|View full text |Cite
|
Sign up to set email alerts
|

Enhanced web document summarization using hyperlinks

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
12
0

Year Published

2006
2006
2019
2019

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 21 publications
(12 citation statements)
references
References 0 publications
0
12
0
Order By: Relevance
“…Then they analyze the candidate pages and select the best sentences containing links to the web page heuristically. Delort et al [54] extended and improved this approach by using an algorithm trying to select a sentence about the same topic that covers as many aspects of the web page as possible.…”
Section: A Web Summarizationmentioning
confidence: 99%
“…Then they analyze the candidate pages and select the best sentences containing links to the web page heuristically. Delort et al [54] extended and improved this approach by using an algorithm trying to select a sentence about the same topic that covers as many aspects of the web page as possible.…”
Section: A Web Summarizationmentioning
confidence: 99%
“…This text often provides a descriptive summary of a web page (e.g., "Access to papers published within the last year by members of the NLP group"). Proponents of using context to provide summary sentences argue that a web site includes multimedia, may cover diverse topics, and it may be hard for a summarizer to distinguish good summary content from bad [47]. The earliest work on this approach was carried out to provide snippets for each result from a search engine [2].…”
Section: Web Page Summarizationmentioning
confidence: 99%
“…To determine a summary, their system issued a search for a URL, selected all sentences containing a link to that URL and the best sentence was identified using heuristics. Delort et al [47] use a very similar procedure to select context sentences. They extend the older approach through an algorithm that allows selection of a sentence that covers as many aspects of the web page as possible and that is on the same topic.…”
Section: Web Page Summarizationmentioning
confidence: 99%
“…Distinctiveness can be measured by classic measure of TF-IDF, which is where K is the number of occurrences of the word in the "document" or text to be summarized, N is a sample of documents, and n is the number of those documents in that sample having the word at least once. Other useful input for text summarization are the headings of pages linked to (Delort, Bouchon-Meunier, & Rifqi, 2003) since neighbor pages provide content clues. Content can also be classified into semantic units by aggregating clues or even by "parsing" the page display.…”
Section: Content Rating By Importancementioning
confidence: 99%