NML Computation Algorithms for Tree-Structured Multinomial Bayesian Networks

Kontkanen, Petri; Wettig, Hannes; Myllymäki, Petri

doi:10.1155/2007/90947

Cited by 6 publications

(9 citation statements)

References 24 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…According to the MDL principle, learning can be equated with finding regularities with data. Consequently the more the data is compressed the more the data is learnt [9].…”

Section: Minimum Description Length Principlementioning

confidence: 99%

See 1 more Smart Citation

Improved spatially adaptive MDL denoising of images using normalized maximum likelihood density

Meena¹,

Annadurai²

2008

Image and Vision Computing

View full text Add to dashboard Cite

“…According to the MDL principle, learning can be equated with finding regularities with data. Consequently the more the data is compressed the more the data is learnt [9].…”

Section: Minimum Description Length Principlementioning

confidence: 99%

“…The model class is only used as a technical device for constructing an efficient code for describing the data. [9].…”

Section: Minimum Description Length Principlementioning

confidence: 99%

Improved spatially adaptive MDL denoising of images using normalized maximum likelihood density

Meena¹,

Annadurai²

2008

Image and Vision Computing

View full text Add to dashboard Cite

“…Exact and computationally tractable formulas are rare: results for multinomial models are given in [10], and for Bayesian networks with structural restrictions in [11], [12], [13]; more references can be found in [3] and [4]. Similarly to the present work, in the context of structural equation models, Preacher et al [14] estimate the normalizing coefficient by sampling random data-sets from a uniform distribution using Markov chain Monte Carlo (MCMC) methods.…”

Section: Introductionmentioning

confidence: 99%

Monte Carlo estimation of minimax regret with an application to MDL model selection

Roos

2008

2008 IEEE Information Theory Workshop

View full text Add to dashboard Cite

Abstract-Minimum description length (MDL) model selection, in its modern NML formulation, involves a model complexity term which is equivalent to minimax/maximin regret. When the data are discrete-valued, the complexity term is a logarithm of a sum of maximized likelihoods over all possible data-sets. Because the sum has an exponential number of terms, its evaluation is in many cases intractable. In the continuous case, the sum is replaced by an integral for which a closed form is available in only a few cases. We present an approach based on Monte Carlo sampling, which works for all model classes, and gives strongly consistent estimators of the minimax regret. The estimates convergence almost surely to the correct value with increasing number of iterations. For the important class of Markov models, one of the presented estimators is particularly efficient: in empirical experiments, accuracy that is sufficient for model selection is usually achieved already on the first iteration, even for long sequences.

show abstract

“…The FFT method involves utilization of Newton's method and is explained in the paper [1]. However, the usefulness of this approach is unclear as some earlier tests with the multinomial normalizing term [12] show that the used floating point numbers must have very high precision in practical cases. This is due to the fact that the values of the normalizing terms can be quite large, and consequently, as the data size increases, the precision of the floating point numbers must also increase.…”

Section: Theorem 2 (The Miller Formula) If Two Formal Power Series Arementioning

confidence: 99%

“…The computational complexity of computing the NML criterion for a Naive Bayes model is the same as for this algorithm, as the numerator of (1) is trivial to compute. Further information on computing the stochastic complexity for Naive Bayes models can be found in papers [11,12].…”

Section: The Algorithmmentioning

confidence: 99%

Fast NML Computation for Naive Bayes Models

Mononen

Myllymäki

Discovery Science

Self Cite

View full text Add to dashboard Cite

Abstract. The Minimum Description Length (MDL) is an informationtheoretic principle that can be used for model selection and other statistical inference tasks. One way to implement this principle in practice is to compute the Normalized Maximum Likelihood (NML) distribution for a given parametric model class. Unfortunately this is a computationally infeasible task for many model classes of practical importance. In this paper we present a fast algorithm for computing the NML for the Naive Bayes model class, which is frequently used in classification and clustering tasks. The algorithm is based on a relationship between powers of generating functions and discrete convolution. The resulting algorithm has the time complexity of O(n 2 ), where n is the size of the data.

show abstract

NML Computation Algorithms for Tree-Structured Multinomial Bayesian Networks

Cited by 6 publications

References 24 publications

Improved spatially adaptive MDL denoising of images using normalized maximum likelihood density

Improved spatially adaptive MDL denoising of images using normalized maximum likelihood density

Monte Carlo estimation of minimax regret with an application to MDL model selection

Fast NML Computation for Naive Bayes Models

Contact Info

Product

Resources

About