A stack decoder approach to approximate string matching

Huerta, Juan M.

doi:10.1145/1835449.1835634

Cited by 2 publications

(3 citation statements)

References 6 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…This approach is important because it improves the accuracy of an off line system using a position index by using an approximation of the string edit distance without sacrificing speed. The results are within 2.5% error (Huerta, 2010b). In this paper, we will focus exclusively on the on-line approach.…”

Section: Approximate String Edit Distance Computationmentioning

confidence: 74%

“…To reduce this mismatch between off-line and on-line methods, it is possible to approximate the SED (and the related Longest Common Subsequence computation) based on a stack computation and information derived from a positional index. This computation is possible through the use of a Stack structure and a A* like algorithm as described in (Huerta, 2010b). In that paper, Huerta proposed a method that takes O(m s log s) operations on average where s is the depth of the stack (typically much smaller than T, or m) instead of O(T) using a positional index.…”

Section: Approximate String Edit Distance Computationmentioning

confidence: 99%

See 1 more Smart Citation

Towards Efficient Translation Memory Search Based on Multiple Sentence Signatures

Huerta¹

2011

Speech and Language Technologies

View full text Add to dashboard Cite

Section: Approximate String Edit Distance Computationmentioning

confidence: 74%

Section: Approximate String Edit Distance Computationmentioning

confidence: 99%

Towards Efficient Translation Memory Search Based on Multiple Sentence Signatures

Huerta¹

2011

Speech and Language Technologies

View full text Add to dashboard Cite

“…In previous work [4,5] we evaluated the ability of a stack approach to recall an optimal match given a query, as well as the speed compared to naIve SED. We now want to investigate the question of n-gram equivalence, namely, to How does S2LM behave as compared to n-gram SLM in a LM rescoring task?…”

Section: Discussionmentioning

confidence: 99%

Subsequence similarity language models

Huerta

2011

2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Self Cite

View full text Add to dashboard Cite

In this work we present the Subsequence Similarity Language Model (S2-LM) which is a new approach to language modeling based on string similarity. As a language model, S2-LM generates scores based on the closest matching string given a very large corpus. In this paper we describe the properties and advantages of our approach and describe efficient methods to carry out its computation. We describe an n-best rescoring experiment intended to show that S2-LM can be adjusted to behave as an n-gram SLM model.

show abstract

A stack decoder approach to approximate string matching

Cited by 2 publications

References 6 publications

Towards Efficient Translation Memory Search Based on Multiple Sentence Signatures

Towards Efficient Translation Memory Search Based on Multiple Sentence Signatures

Subsequence similarity language models

Contact Info

Product

Resources

About