1994
DOI: 10.1007/3-540-58473-0_141

Inducing probabilistic grammars by Bayesian model merging

Abstract: We describe a framework for inducing probabilistic grammars from corpora of positive samples. First, samples are incorporated by adding ad-hoc rules to a working grammar; subsequently, elements of the model (such as states or nonterminals) are merged to achieve generalization and a more compact representation. The choice of what to merge and when to stop is governed by the Bayesian posterior probability of the grammar given the data, which formalizes a trade-off between a close fit to the data and a default pr…
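The procedure outlined in the abstract is a best-first search over models scored by their Bayesian posterior. The Python skeleton below is a minimal sketch of that loop under an assumed Model interface (incorporate, candidate_merges, merge, log_prior, and log_likelihood are illustrative names, not the paper's code): incorporate every sample as ad-hoc structure, then repeatedly apply the single merge that most improves log P(M) + log P(D | M), stopping when no merge helps.

```python
# Minimal sketch of best-first Bayesian model merging (assumed interface,
# not the paper's implementation). A Model object is expected to expose:
#   incorporate(sample)   -- add ad-hoc structure covering one sample
#   candidate_merges()    -- iterable of (element_a, element_b) pairs
#   merge(a, b)           -- return a new, more general model
#   log_prior()           -- log P(M), favouring compact models
#   log_likelihood(data)  -- log P(data | M)

def log_posterior(model, data):
    # Unnormalised log posterior: log P(M) + log P(data | M).
    return model.log_prior() + model.log_likelihood(data)

def bayesian_model_merging(model, data):
    # 1. Data incorporation: one ad-hoc rule / state path per sample.
    for sample in data:
        model.incorporate(sample)

    # 2. Best-first merging: apply the single best merge at each step,
    #    stopping when no merge improves the posterior.
    current_score = log_posterior(model, data)
    while True:
        best_model, best_score = None, current_score
        for a, b in model.candidate_merges():
            candidate = model.merge(a, b)
            score = log_posterior(candidate, data)
            if score > best_score:
                best_model, best_score = candidate, score
        if best_model is None:          # no improving merge: stop
            return model
        model, current_score = best_model, best_score
```

Stopping when no candidate merge raises the posterior is what realizes the stated trade-off: the likelihood term rewards a close fit to the data, while the prior term rewards simpler, more compact grammars.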


Citations: cited by 144 publications (108 citation statements)
References: 18 publications (21 reference statements)
“…For the HMM parameter estimation, we apply an incremental learning scheme utilizing the best-first model merging framework [8,9]. Model merging is inspired by the observation that, when faced with new situations, humans and animals alike drive their learning process by first storing individual examples (memory-based learning) when few data points are available and gradually switching to a parametric learning scheme to allow for better generalization as more and more data becomes available [10].…”
Section: HMM (mentioning)
confidence: 99%
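As a concrete illustration of the "store examples first, generalize later" scheme this snippet describes, the toy Python sketch below (hypothetical data structures, not the cited implementation) incorporates each training string as its own chain of HMM states with emission and transition counts, then merges two states by pooling their counts; in the full framework such a merge would only be kept if it improved the Bayesian posterior.

```python
from collections import defaultdict

def incorporate(hmm, sample):
    """Add one sample as a dedicated chain of states from START to END."""
    prev = "START"
    for i, symbol in enumerate(sample):
        state = f"{sample}_{i}"              # one fresh state per symbol
        hmm["emit"][state][symbol] += 1
        hmm["trans"][prev][state] += 1
        prev = state
    hmm["trans"][prev]["END"] += 1

def merge_states(hmm, a, b):
    """Merge state b into state a, pooling emission and transition counts."""
    for sym, c in hmm["emit"].pop(b, {}).items():
        hmm["emit"][a][sym] += c
    for nxt, c in hmm["trans"].pop(b, {}).items():
        hmm["trans"][a][nxt if nxt != b else a] += c
    for src in list(hmm["trans"]):           # redirect incoming edges b -> a
        if b in hmm["trans"][src]:
            hmm["trans"][src][a] += hmm["trans"][src].pop(b)

hmm = {"emit": defaultdict(lambda: defaultdict(int)),
       "trans": defaultdict(lambda: defaultdict(int))}
incorporate(hmm, "ab")
incorporate(hmm, "abb")
merge_states(hmm, "ab_0", "abb_0")   # both states emit 'a'
# 'ab_0' now emits 'a' with count 2 and precedes both 'ab_1' and 'abb_1'.
```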
“…Stolcke and Omohundro [58] propose a technique called Bayesian Model Merging (BMM): first, strings that are observed in the data are incorporated by adding ad-hoc rules to form an initial grammar; then, the grammar is made more concise by merging some of the rules. Stolcke and Omohundro [58] discuss two incarnations of their technique, one in which the models are probabilistic context-free grammars (PCFGs), and another in which they are hidden Markov models (HMMs). In the former, rules are merged by identifying non-terminal symbols A and B if the rule A → B is in the grammar; this leads to (over-)generalizations, and renders the grammar more compact.…”
Section: Computational Grammar Induction (mentioning)
confidence: 99%
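The non-terminal merge described in this snippet can be illustrated with a small, hypothetical rule-set representation (not the cited code): when the unit production A → B is present, B is identified with A, B's right-hand sides are transferred to A, and every occurrence of B is rewritten as A. Note how the example introduces the recursive rule S → S VP, the kind of (over-)generalization the snippet mentions.

```python
def merge_nonterminals(rules, a, b):
    """Identify non-terminal b with a in a rule set {lhs: [rhs tuples]}."""
    merged = {}
    for lhs, rhss in rules.items():
        new_lhs = a if lhs == b else lhs
        for rhs in rhss:
            new_rhs = tuple(a if sym == b else sym for sym in rhs)
            if new_rhs != (new_lhs,):            # drop the trivial rule X -> X
                merged.setdefault(new_lhs, []).append(new_rhs)
    return merged

# Ad-hoc grammar after incorporating samples; S -> NP makes NP mergeable with S.
rules = {
    "S":  [("NP", "VP"), ("NP",)],
    "NP": [("det", "noun")],
    "VP": [("verb", "NP")],
}
print(merge_nonterminals(rules, "S", "NP"))
# {'S': [('S', 'VP'), ('det', 'noun')], 'VP': [('verb', 'S')]}
```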