A novel channel distortion measure for vector quantization and a fuzzy model for codebook index assignment

Siu, Kai-Chung; Meng, Helen

doi:10.21437/eurospeech.1999-451

Cited by 18 publications

(9 citation statements)

References 7 publications


“…This work extends our previous effort in the use of semiautomatically induced grammars for bi-directional English-Chinese machine translation using an example-based approach [1,2]. Our parallel experimental corpora includes the English ATIS-3 Class A sentences (training set, test set 1993 and 1994) with their Chinese translations.…”

Section: Introduction (mentioning)

confidence: 90%

“…We ran our grammar induction procedure with the distance metrics KL, MN and GI to generate three grammars respectively: G_KL, G_MN and G_GI. We compared these three grammars at every tenth iteration until the stopping criterion is met. When we evaluated with the ATIS training set and rank the grammars in decreasing order of precision (P), we observed (G_MN > G_GI > G_KL) across the various iterations.…”

Section: Spatial Cluster Terminals (mentioning)

confidence: 99%

“…The BLEU (Bilingual Evaluation Understudy) metric was recently proposed by IBM [8] for automatic evaluation of machine translation. BLEU compares variable length phrases of the translated result against multiple… [Footnote: All other experimental parameters are controlled.]”

Section: Automatic Machine Translation Evaluation (mentioning)

confidence: 99%
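The clipped n-gram matching at the core of BLEU can be illustrated with a minimal Python sketch. This is not the citing paper's evaluation code. The function names `ngrams` and `modified_precision` are hypothetical, and the sketch covers only the modified n-gram precision against multiple reference translations; full BLEU additionally combines several n-gram orders with a geometric mean and applies a brevity penalty.

```python
from collections import Counter


def ngrams(tokens, n):
    """Return all contiguous n-grams of a token list as tuples."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def modified_precision(candidate, references, n):
    """Clipped n-gram precision of one candidate against several references.

    Each candidate n-gram is credited at most as many times as it occurs
    in the single reference where it is most frequent.
    """
    cand_counts = Counter(ngrams(candidate, n))
    if not cand_counts:
        return 0.0

    # Maximum count of every n-gram over all reference translations.
    max_ref = Counter()
    for ref in references:
        for gram, count in Counter(ngrams(ref, n)).items():
            max_ref[gram] = max(max_ref[gram], count)

    clipped = sum(min(count, max_ref[gram]) for gram, count in cand_counts.items())
    return clipped / sum(cand_counts.values())


if __name__ == "__main__":
    candidate = "the the the the the the the".split()
    references = [
        "the cat is on the mat".split(),
        "there is a cat on the mat".split(),
    ]
    print(modified_precision(candidate, references, 1))
```

Clipping is what prevents a degenerate candidate that repeats one frequent word from scoring well: in the usage example, "the" is credited only twice because no single reference contains it more than twice.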

“…In spatial clustering, words or multi-word entities with similar left and right linguistic contexts are clustered together based on the symmetrized divergence (Div) that is applied to the left and right linguistic contexts of the entity pair. Div incorporates the Kullback-Leibler (KL) distance metric [1,2] (See Equation 1). In general, temporal clustering generates phrasal categories in the grammar and spatial clustering generates semantic categories.…”

Section: Introduction (mentioning)

confidence: 99%
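A symmetrized KL divergence over left and right context distributions might look like the following Python sketch. The quoted statement refers to an "Equation 1" that is not reproduced here, so the exact form of Div is unknown; summing KL in both directions and adding the left-context and right-context terms are assumptions, as is the additive smoothing. All function names are hypothetical.

```python
import math
from collections import Counter


def context_distribution(contexts, vocab, alpha=1.0):
    """Smoothed probability distribution over neighbouring words.

    Additive smoothing (alpha) is an assumption made here so that
    KL stays finite when a word is unseen in one of the contexts.
    """
    counts = Counter(contexts)
    total = len(contexts) + alpha * len(vocab)
    return {w: (counts[w] + alpha) / total for w in vocab}


def kl(p, q):
    """Kullback-Leibler divergence D(p || q) over a shared vocabulary."""
    return sum(p[w] * math.log(p[w] / q[w]) for w in p)


def symmetrized_divergence(p, q):
    """KL summed in both directions, so the result is symmetric in p and q."""
    return kl(p, q) + kl(q, p)


def entity_distance(left_a, right_a, left_b, right_b, vocab):
    """Assumed combination: left-context plus right-context divergence."""
    left = symmetrized_divergence(
        context_distribution(left_a, vocab), context_distribution(left_b, vocab)
    )
    right = symmetrized_divergence(
        context_distribution(right_a, vocab), context_distribution(right_b, vocab)
    )
    return left + right


if __name__ == "__main__":
    vocab = ["to", "from", "in", "please", "tomorrow"]
    # Hypothetical neighbouring words observed around two entities.
    d = entity_distance(
        ["to", "from", "to"], ["please", "tomorrow"],
        ["to", "from", "from"], ["tomorrow", "please"],
        vocab,
    )
    print(d)
```

In an agglomerative setting, the pair of entities with the smallest such distance would be merged first, so entities that appear in similar left and right contexts end up in the same cluster.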


Example-based bi-directional Chinese-English machine translation with semi-automatically induced grammars

Siu, Meng, Wong

2003

8th European Conference on Speech Communication and Technology (Eurospeech 2003)

Self Cite


We have previously developed a framework for bi-directional English-to-Chinese/Chinese-to-English machine translation using semi-automatically induced grammars from unannotated corpora. The framework adopts an example-based machine translation (EBMT) approach. This work reports on three extensions to the framework. First, we investigate the comparative merits of three distance metrics (Kullback-Leibler, Manhattan-Norm and Gini Index) for agglomerative clustering in grammar induction. Second, we seek an automatic evaluation method that can also consider multiple translation outputs generated for a single input sentence based on the BLEU metric. Third, our previous investigation shows that Chinese-to-English translation has lower performance due to incorrect use of English inflectional forms -a consequence of random selection among translation alternatives. We present an improved selection strategy that leverages information from the example parse trees in our EBMT paradigm.

“…Acquiring domain information is a resource intensive effort but is a necessary part of making language technologies useful. Automatic techniques have been described for language modeling [1,2] and as an adjunct to grammar writing [3,4,5] for spoken language systems. Parallel efforts, though with somewhat different goals and approaches, exist in text processing [6].…”

Section: Introduction (mentioning)

confidence: 99%

Automatic concept identification in goal-oriented conversations

Chotimongkol, Rudnicky

2002

7th International Conference on Spoken Language Processing (ICSLP 2002)


We address the problem of identifying key domain concepts automatically from an unannotated corpus of goal-oriented human-human conversations. We examine two clustering algorithms, one based on mutual information and another based on Kullback-Leibler distance. To compare the results from both techniques quantitatively, we evaluate the output clusters against reference concept labels using precision and recall metrics adopted from the evaluation of the topic identification task. However, since our system allows more than one cluster to be associated with each concept, an additional metric, a singularity score, is added to better capture cluster quality. Based on the proposed quality metrics, the results show that Kullback-Leibler-based clustering outperforms mutual information-based clustering for both the optimal quality and the quality achieved using an automatic stopping criterion.


Concept Discovery and Automatic Semantic Annotation for Language Understanding in an Information-Query Dialogue System Using Latent Dirichlet Allocation and Segmental Methods

Camelin, Detienne, Huet, et al.

2013

Communications in Computer and Information Science
