Proceedings of the 1st ACM/IEEE-CS Joint Conference on Digital Libraries 2001
DOI: 10.1145/379437.379454

Personalized spiders for web search and analysis

Abstract: Searching for useful information on the World Wide Web has become increasingly difficult. While Internet search engines have been helping people to search on the web, low recall rate and outdated indexes have become more and more problematic as the web grows. In addition, search tools usually present to the user only a list of search results, failing to provide further personalized analysis which could help users identify useful information and comprehend these results. To alleviate these problems, we propose …

Cited by 39 publications (30 citation statements)
References 27 publications
“…Spiders, also known as crawlers, wanderers, or Webbots, are defined as "software programs that traverse the World Wide Web information space by following hypertext links and retrieving Web documents by standard HTTP protocol" (Cheong, 1996). Since the early days of the Web, spiders have been widely used to build the underlying databases of search engines (e.g., Pinkerton, 1994), to perform personal search (e.g., Chau et al, 2001), to archive particular Web sites or even the whole Web (e.g., Kahle, 1997), or to collect Web statistics (e.g., Broder et al, 2000).…”
Section: Web Mining and Social Network Analysis (citation type: mentioning)
confidence: 99%
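
The traversal this statement describes, following hypertext links from page to page, can be illustrated with a small breadth-first sketch. It is not the cited authors' implementation. The function name `crawl_breadth_first`, the `fetch_links` callback, the `max_pages` limit, and the in-memory `SITE` link table are all hypothetical; a real spider would fetch pages over HTTP and parse their links instead of reading a dictionary.

```python
from collections import deque
from urllib.parse import urljoin, urlparse


def crawl_breadth_first(start_url, fetch_links, max_pages=100):
    """Return pages in breadth-first order, staying on the start URL's host.

    fetch_links is a hypothetical callback that returns the hrefs found on
    a page; here it is backed by a dictionary rather than real HTTP requests.
    """
    host = urlparse(start_url).netloc
    seen = {start_url}           # URLs already queued, to avoid revisiting
    queue = deque([start_url])   # FIFO queue gives breadth-first order
    order = []
    while queue and len(order) < max_pages:
        url = queue.popleft()
        order.append(url)
        for href in fetch_links(url):
            target = urljoin(url, href)  # resolve relative links
            if urlparse(target).netloc == host and target not in seen:
                seen.add(target)
                queue.append(target)
    return order


# Hypothetical link table standing in for a small Web site.
SITE = {
    "http://example.org/": ["/a", "/b", "http://other.example/x"],
    "http://example.org/a": ["/c", "/"],
    "http://example.org/b": ["/c"],
    "http://example.org/c": [],
}

order = crawl_breadth_first("http://example.org/", lambda u: SITE.get(u, []))
print(order)
```

The `seen` set keeps the loop from revisiting pages that link back to each other, and the host check keeps the crawl on one site, as a site-specific spider would.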
“…Intelligence Spider, or CI Spider, collects Web pages on a real-time basis from Web sites specified by the user and performs indexing and categorization analysis on them, to provide the user with a comprehensive view of the Web sites of interest (Chau et al, 2001b; Chen et al, 2002). The second tool, Meta Spider, has similar functionalities as the CI Spider, but instead of performing breadth-first search on a particular Web site, connects to different search engines on the Internet and integrates the results (Chau et al, 2001b;).…”
Section: Proposed Approaches (citation type: mentioning)
confidence: 99%
“…The second tool, Meta Spider, has similar functionalities as the CI Spider, but instead of performing breadth-first search on a particular Web site, connects to different search engines on the Internet and integrates the results (Chau et al, 2001b;). The architecture of CI Spider and Meta Spider is shown in …”
Section: Proposed Approaches (citation type: mentioning)
confidence: 99%
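
The statement above says Meta Spider connects to several search engines and integrates their results, but it does not say how the integration is done. As one possible illustration, the sketch below interleaves ranked result lists round-robin and drops duplicate URLs. The function name `merge_results`, the merge strategy, and the sample result lists are assumptions made for this example, not details taken from the cited work.

```python
from itertools import zip_longest


def merge_results(result_lists):
    """Interleave ranked result lists and drop duplicate URLs.

    Round-robin interleaving is an assumed strategy chosen for illustration.
    """
    merged, seen = [], set()
    # zip_longest yields one rank tier at a time, padding short lists with None.
    for tier in zip_longest(*result_lists):
        for url in tier:
            if url is not None and url not in seen:
                seen.add(url)
                merged.append(url)
    return merged


# Hypothetical ranked results from two search engines.
engine_a = ["http://example.org/1", "http://example.org/2", "http://example.org/3"]
engine_b = ["http://example.org/2", "http://example.org/4"]

merged = merge_results([engine_a, engine_b])
print(merged)
```

Interleaving by rank tier keeps each engine's top hits near the top of the merged list without needing comparable relevance scores across engines.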
“…Also, more computation, time, and memory are often required. These limitations may be severe, especially for Web-based search engines that have to handle thousands to millions of search queries per day [6].…”
Section: Web Content Mining and Post-retrieval Analysis (citation type: mentioning)
confidence: 99%