2006
DOI: 10.1002/0470108096
|View full text |Cite
|
Sign up to set email alerts
|

Data Mining the Web

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
10
0

Year Published

2014
2014
2023
2023

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 76 publications
(10 citation statements)
references
References 0 publications
0
10
0
Order By: Relevance
“…Besides that there are two formats, which is Common Log File and Extended Common Log File Format, The common log format (CLF or "clog") is supported by a variety of web server applications and includes the following seven fields: If a field is unused in a particular entry dash "-" marks the omitted field. [6] The extended common log format (ECLF) is a variation of the common log format, formed by appending two additional fields onto the end of the record, the referrer field, and the user agent field [7].…”
Section: Figure 1-sample Of Server Log Filementioning
confidence: 99%
“…Besides that there are two formats, which is Common Log File and Extended Common Log File Format, The common log format (CLF or "clog") is supported by a variety of web server applications and includes the following seven fields: If a field is unused in a particular entry dash "-" marks the omitted field. [6] The extended common log format (ECLF) is a variation of the common log format, formed by appending two additional fields onto the end of the record, the referrer field, and the user agent field [7].…”
Section: Figure 1-sample Of Server Log Filementioning
confidence: 99%
“…They suggest using concept learning methods to generate unique descriptions of sets of web content. This can be used to find the similar qualities of a new web page or a document [8]. In order to group or cluster the web content several techniques like Hierarchical Agglomerative Clustering, k-Means Clustering and Probability-Based Clustering can be used.…”
Section: Web Data Mining Frameworkmentioning
confidence: 99%
“…In order to group or cluster the web content several techniques like Hierarchical Agglomerative Clustering, k-Means Clustering and Probability-Based Clustering can be used. To evaluate the data models generated by the clusters Similarity based criterion functions, Probabilistic criterion function, MDL based model and Feature evaluations can be used [8].…”
Section: Web Data Mining Frameworkmentioning
confidence: 99%
“…Web content mining has two perspectives: 1-Data Recovery, which aims to improve the process of searching or filtering the data to the users and 2-Database formation which aims to provide a model of Web data by integrate them. Web content mining includes various activities such as [9], [10], [11], [12], [15]:…”
Section: Web Content Miningmentioning
confidence: 99%
“…2-The Marco model which is actually a Marco chain of rank M that changing the state of a system depends to the current state and the M-1 previous states. Web structure mining applications includes determining quality associated with a subject, page classification, Web navigation, finding Web communities, adaptive Web sites design and personalization [9], [10], [11], [12].…”
Section: Web Structure Miningmentioning
confidence: 99%