2007
DOI: 10.1017/s0001867800001634
|View full text |Cite
|
Sign up to set email alerts
|

Improved compound Poisson approximation for the number of occurrences of any rare word family in a stationary markov chain

Abstract: We derive a new compound Poisson distribution with explicit parameters to approximate the number of overlapping occurrences of any set of words in a Markovian sequence. Using the Chen-Stein method, we provide a bound for the approximation error. This error converges to 0 under the rare event condition, even for overlapping families, which improves previous results. As a consequence, we also propose Poisson approximations for the declumped count and the number of competing renewals.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
30
0
4

Year Published

2008
2008
2019
2019

Publication Types

Select...
6
1
1

Relationship

0
8

Authors

Journals

citations
Cited by 20 publications
(34 citation statements)
references
References 12 publications
0
30
0
4
Order By: Relevance
“…General error-bounds for this approximation are known only for motifs consisting of a single module as long as no version of the motif is a proper sub-word of another version of it (Roquain and Schbath 2007); in particular, bounds for the error are unknown for motifs with two or more modules with unbounded gaps. However, due to Markov's inequality (Durrett 2004), for an arbitrary motif m we have P[W $ 1] # E(W), where E(W) denotes the expected value of W. But, due to the stationarity of our probabilistic model and the linearity of the expectation operator, E(W) = Np, and hence P[W $ 1] # Np.…”
Section: Poisson Versus Upper-bound Approximationmentioning
confidence: 99%
“…General error-bounds for this approximation are known only for motifs consisting of a single module as long as no version of the motif is a proper sub-word of another version of it (Roquain and Schbath 2007); in particular, bounds for the error are unknown for motifs with two or more modules with unbounded gaps. However, due to Markov's inequality (Durrett 2004), for an arbitrary motif m we have P[W $ 1] # E(W), where E(W) denotes the expected value of W. But, due to the stationarity of our probabilistic model and the linearity of the expectation operator, E(W) = Np, and hence P[W $ 1] # Np.…”
Section: Poisson Versus Upper-bound Approximationmentioning
confidence: 99%
“…We can basically classify these approximations in three categories: 1) Gaussian approximations (Cowan, 1991;Kleffe & Borodovski, 1997;Nuel, 2010;Pevzner et al, 1989;Prum et al, 1995); 2) Poisson approximations Erhardsson (2000); Geske et al (1995); Godbole (1991); Reinert & Schbath (1999); Roquain & Schbath (2007); 3) large deviations approximations Denise et al (2001);Nuel (2004). In this chapter we deliberately left aside the Poisson-based approximations and considered only two of these approximations: the (Near-) Gaussian approximations with NG h (n), and the large deviations based approximations with CB(n) and BR(n).…”
Section: Discussionmentioning
confidence: 99%
“…See also Lladser et al [16]. Approximations using the normal distribution and the Poisson distribution are in Schbath [31], Nicodème et al [21], Reinert et al [28] and Roquain and Schbath [30]. Large deviations asymptotics are in Régnier and Szpankowski [27], Régnier [25] and Régnier and Denise [26].…”
Section: P Pudlomentioning
confidence: 99%
“…The leading term of the asymptotics of g a (n, it) when t is close to the origin is due to Mann [17] (pp. [24][25][26][27][28][29][30][31][32][33], and is given in Proposition 4.2 below.…”
Section: Asymptotics On the Fourier Transform Of S Nmentioning
confidence: 99%