Data Mining Patterns 2008
DOI: 10.4018/978-1-59904-162-9.ch009
|View full text |Cite
|
Sign up to set email alerts
|

Mining XML Documents

Abstract: XML documents are becoming ubiquitous because of their rich and flexible format that can be used for a variety of applications. Giving the increasing size of XML collections as information sources, mining techniques that traditionally exist for text collections or databases need to be adapted and new methods to be invented to exploit the particular structure of XML documents. Basically XML documents can be seen as trees, which are well known to be complex structures. This chapter describes various ways of usin… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2008
2008
2023
2023

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(1 citation statement)
references
References 23 publications
0
1
0
Order By: Relevance
“…One of the reasons these business process log data store in this semi-structured format is the capability of XML format to represent the contextual information among different attribute or metadata in a domain unambiguous method. Nevertheless, it is quite challenging to perform statistical analysis and data mining technique to XML data because of the complex data structure and dimensions (structure dimension and content dimension) [19]. Due to the similar characteristic of XML document and trees structured data, many researchers modelled XML document as an ordered, labelled and rooted trees.…”
Section: Related Workmentioning
confidence: 99%
“…One of the reasons these business process log data store in this semi-structured format is the capability of XML format to represent the contextual information among different attribute or metadata in a domain unambiguous method. Nevertheless, it is quite challenging to perform statistical analysis and data mining technique to XML data because of the complex data structure and dimensions (structure dimension and content dimension) [19]. Due to the similar characteristic of XML document and trees structured data, many researchers modelled XML document as an ordered, labelled and rooted trees.…”
Section: Related Workmentioning
confidence: 99%