Proceedings of the Fourth International Conference on Spoken Language Processing (ICSLP '96)
DOI: 10.1109/icslp.1996.607138

Class phrase models for language modeling

Abstract: Previous attempts to automatically determine multi-words as the basic unit for language modeling have been successful in extending bigram models [10, 9, 2, 8] to improve the perplexity of the language model and/or the word accuracy of the speech decoder. However, none of these techniques has given improvements over the trigram model so far, except on the rather controlled ATIS task [8]. We therefore propose an algorithm that minimizes the perplexity of a bigram model directly. The new algor…
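The abstract stops short of the algorithm itself, but the core idea of scoring multi-word units by the bigram perplexity they produce can be illustrated with a toy sketch. The Python below is an illustrative assumption, not the authors' method: it fuses a candidate word pair into a single token and compares add-alpha smoothed bigram perplexity before and after the merge. The corpus, helper names, and smoothing choice are all placeholders.

```python
import math
from collections import Counter

def merge_multiwords(tokens, multiwords):
    """Fuse adjacent word pairs listed in `multiwords` into single tokens,
    e.g. ('new', 'york') -> 'new_york'. Hypothetical helper for illustration."""
    out, i = [], 0
    while i < len(tokens):
        if i + 1 < len(tokens) and (tokens[i], tokens[i + 1]) in multiwords:
            out.append(tokens[i] + "_" + tokens[i + 1])
            i += 2
        else:
            out.append(tokens[i])
            i += 1
    return out

def bigram_logprob(tokens, alpha=1.0):
    """Total log-probability of the stream under an add-alpha smoothed
    bigram model trained on the same stream (a deliberately toy setup)."""
    unigrams = Counter(tokens)
    vocab = len(unigrams)
    bigrams = Counter(zip(tokens, tokens[1:]))
    return sum(c * math.log((c + alpha) / (unigrams[w1] + alpha * vocab))
               for (w1, _), c in bigrams.items())

corpus = ("i flew to new york </s> i live in new york </s> "
          "new york is big </s>").split()
merged = merge_multiwords(corpus, {("new", "york")})

# Normalize both runs by the ORIGINAL word count so perplexities over
# different unit inventories remain comparable.
ppl_words = math.exp(-bigram_logprob(corpus) / len(corpus))
ppl_fused = math.exp(-bigram_logprob(merged) / len(corpus))
print(f"words only          : {ppl_words:6.2f}")
print(f"with 'new_york' unit: {ppl_fused:6.2f}")
```

In the paper's setting one would presumably iterate such a comparison over many candidate pairs on a training corpus and keep the units that lower perplexity; the details here are assumptions, not the published algorithm.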

Cited by 26 publications (20 citation statements)
References 8 publications
“…By using (6) as the objective function, we observed that the resulting segmentations yield promising applications in n-gram topic modeling, named entity recognition and Chinese segmentation. However, in the spirit of Ries et al. (1996), attempts to minimize perplexity instead of maximizing (6) resulted in larger segments, and the segment quality definition of Section 1 was not met.…”
Section: Discussion and Future Work
confidence: 99%
“…In natural language, statistical modeling has mainly been applied to modeling word sequences (Jurafsky & Martin, 2000). The idea of using class models for phrases rather than words was presented by Ries, Buø and Waibel (1996), with the result called class phrase models. There, words were grouped into short phrases, and the phrase class is the sequence of word classes in the phrase.…”
Section: Discussion
confidence: 99%
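As a concrete reading of that description, the following minimal sketch derives a phrase's class as the sequence of its words' classes. The word-class map and vocabulary are made up for this illustration, not taken from the paper.

```python
# Toy word-class map; classes and vocabulary are assumptions for illustration.
word_class = {
    "i": "PRON", "you": "PRON",
    "want": "VERB", "need": "VERB",
    "to": "TO",
    "fly": "VERB", "go": "VERB",
}

def phrase_class(phrase):
    """Map a phrase (a tuple of words) to its class phrase: the sequence
    of the words' classes."""
    return tuple(word_class[w] for w in phrase)

# Distinct phrases can share one class phrase, so their counts can be
# pooled when estimating phrase probabilities.
assert phrase_class(("i", "want", "to", "fly")) == ("PRON", "VERB", "TO", "VERB")
assert phrase_class(("you", "need", "to", "go")) == ("PRON", "VERB", "TO", "VERB")
```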
“…The analogy to the work described here is revealed by considering words as notes and phrases as segments. The phrase representation used by Ries, Buø and Waibel (1996) can be viewed as an instance of the lift viewpoint constructor, where the basic event attributes rather than abstract event classes are lifted. Therefore the viewpoint representation may prove useful in natural language modeling.…”
Section: Discussion
confidence: 99%
“…In gram-based approaches (Riccardi, Pieraccini & Bocchieri, 1996; Hu, Turin & Brown, 1997; Ristad & Thomas, 1997; Siu & Ostendorf, 1997; Niesler & Woodland, 1999), models take into account variable-length dependencies by conditioning the probability of each word on a context of variable length. In contrast, in phrase-based approaches (Suhm & Waibel, 1994; Deligne & Bimbot, 1995; Masataki & Sagisaka, 1996; Ries, Buø & Waibel, 1996; Matsunaga & Sagayama, 1997; Riccardi & Bangalore, 1998; Siu, 1998), sentences are structured into variable-length phrases and probabilities are assigned to phrases instead of words. The probability of each phrase may be conditioned on the preceding phrases, in just the same way that the probability of a word is conditioned on the preceding words in a gram-based framework.…”
Section: Introduction
confidence: 95%
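To make that contrast concrete, here is a minimal sketch of the phrase-based side under an assumed hand-made segmentation and maximum-likelihood estimates; none of it is drawn from the cited systems. Each phrase is conditioned on the preceding phrase exactly as a word is conditioned on the preceding word in a gram-based bigram model.

```python
from collections import Counter

# Hand-segmented toy sentences; the segmentation itself is assumed, not
# produced by any of the methods cited above.
segmented = [
    [("i",), ("want", "to", "fly"), ("to",), ("new", "york")],
    [("i",), ("want", "to", "fly"), ("to",), ("boston",)],
]

history_counts, pair_counts = Counter(), Counter()
for phrases in segmented:
    for prev, cur in zip(phrases, phrases[1:]):
        history_counts[prev] += 1
        pair_counts[(prev, cur)] += 1

def p_phrase(cur, prev):
    """Maximum-likelihood estimate of P(current phrase | previous phrase)."""
    return pair_counts[(prev, cur)] / history_counts[prev]

print(p_phrase(("new", "york"), ("to",)))  # 0.5: 'new york' follows 'to'
                                           # in one of its two occurrences
```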