2010
DOI: 10.1007/s11390-010-9403-4
|View full text |Cite
|
Sign up to set email alerts
|

Negative Selection of Written Language Using Character Multiset Statistics

Abstract: We study the combination of symbol frequence analysis and negative selection for anomaly detection of discrete sequences where conventional negative selection algorithms are not practical due to data sparsity. Theoretical analysis on ergodic Markov chains is used to outline the properties of the presented anomaly detection algorithm and to predict the probability of successful detection. Simulations are used to evaluate the detection sensitivity and the resolution of the analysis on both generated artificial d… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2012
2012
2017
2017

Publication Types

Select...
2

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 17 publications
0
2
0
Order By: Relevance
“…Their article reported the results of experiments done by different participants. Pöllä and Honkela (2010) applied English Wikipedia in their study on "the combination of symbol frequency analysis and negative selection algorithm for anomaly detection of discrete sequences" (2010, p.1256), concluding that "the baseline result of the Wikipedia edit detection experiment is promising" (2010, p.1265).…”
Section: Data Miningmentioning
confidence: 99%
“…Their article reported the results of experiments done by different participants. Pöllä and Honkela (2010) applied English Wikipedia in their study on "the combination of symbol frequency analysis and negative selection algorithm for anomaly detection of discrete sequences" (2010, p.1256), concluding that "the baseline result of the Wikipedia edit detection experiment is promising" (2010, p.1265).…”
Section: Data Miningmentioning
confidence: 99%
“…To overcome the limitations of negative selection-based anomaly detection techniques in sparse data cases, Pöllä and Honkela [104] proposed a combination of symbol frequency analysis and negative selection. Wikipedia was employed as a real-world data to evaluate 8 the sensitivity of the proposed anomaly detection algorithm.…”
Section: Information Retrievalmentioning
confidence: 99%