2016
DOI: 10.18637/jss.v074.i04
|View full text |Cite
|
Sign up to set email alerts
|

R Package clickstream: Analyzing Clickstream Data with Markov Chains

Abstract: Clickstream analysis is a useful tool for investigating consumer behavior, market research and software testing. I present the clickstream package which provides functionality for reading, clustering, analyzing and writing clickstreams in R. The package allows for a modeling of lists of clickstreams as zero-, first-and higher-order Markov chains. I illustrate the application of clickstream for a list of representative clickstreams from an online store.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
24
0

Year Published

2017
2017
2022
2022

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 27 publications
(25 citation statements)
references
References 11 publications
1
24
0
Order By: Relevance
“…Various packages mention MCs -related models in the CRAN repository, whereby a few of them will be now reviewed. The clickstream package (Scholz, 2016), on CRAN since 2014, aims to model websites click stream using higher order Markov Chains. It provides a MarkovChain S4 class that is similar to the markovchain class.…”
Section: Analysis Of Existing Dtmc-related Softwarementioning
confidence: 99%
“…Various packages mention MCs -related models in the CRAN repository, whereby a few of them will be now reviewed. The clickstream package (Scholz, 2016), on CRAN since 2014, aims to model websites click stream using higher order Markov Chains. It provides a MarkovChain S4 class that is similar to the markovchain class.…”
Section: Analysis Of Existing Dtmc-related Softwarementioning
confidence: 99%
“…First, it has been used in previous studies in similar problem domains, e.g., Yasami et al [29], Muniyandi et al [21] and Mori et al [20]. Secondly, k-means is well suited for the problem at hand [5] and has commonly available implementations [23,27]. It should, however, be noted that the anomaly detection approach presented in the present study is technologically independent from the k-means clustering algorithm.…”
Section: Initial Labelingmentioning
confidence: 99%
“…A method that fits well with describing users' behavior on the macroscale is analyzing users' behavior as a sequence of events [17]. This way of analyzing provides insight into the order in which the user navigates the system, whereas the more classical approach to log file analysis for health care information systems only describes quantities of usage (eg, number of times pages are visited, mean duration of visiting a page).…”
Section: Log File Analysismentioning
confidence: 99%
“…The probabilities are calculated by means of Markov Chain modeling, meaning that, in contrast to summarizing sequential patterns, the purpose is to predict future usage behavior. Probabilities can be calculated by using zero-, first-, and higher-order Markov Chains [17]. These orders differ in that the next page category is predicted only on the basis of the current page category that is visited by the user (first order) or on a combination of the current page category with the page categories that the user was visiting before the current page category (higher order).…”
Section: Heat Mappingmentioning
confidence: 99%