Dana Shapira scite author profile

Dana Shapira

4Publications

104Citation Statements Received

21Citation Statements Given

How they've been cited

134

104

How they cite others

Affiliations

Ariel University, Ashkelon Academic College, Brandeis University

Publications

Order By: Most citations

Edit distance with move operations

Shapira

Storer

2007

Journal of Discrete Algorithms

View full text Add to dashboard Cite

Abstract. The traditional edit-distance problem is to find the minimum number of insert-character and delete-character (and sometimes change character) operations required to transform one string into another. Here we consider the more general problem of strings being represented by a singly linked list (one character per node) and being able to apply these operations to the pointer associated with a vertex as well as the character associated with the vertex. That is, in O(1) time, not only can characters be inserted or deleted, but also substrings can be moved or deleted. We limit our attention to the ability to move substrings and leave substring deletions for future research. Note that O(1) time substring move operations imply O(1) substring exchange operations as well, a form of transformation that has been of interest in molecular biology. We show that this problem is NP-complete, show that a "recursive" sequence of moves can be simulated with at most a constant factor increase by a non-recursive sequence, and present a polynomial time greedy algorithm for non-recursive moves with a worst-case log factor approximation to optimal. The development of this greedy algorithm shows how to reduce moves of substrings to moves of characters, and how to convert moves with characters to only insert and deletes of characters.

show abstract

Edit Distance with Move Operations

Shapira

Storer

2002

View full text Add to dashboard Cite

Adapting the Knuth–Morris–Pratt algorithm for pattern matching in Huffman encoded texts

Shapira

Daptardar

2006

Information Processing & Management

View full text Add to dashboard Cite

We perform compressed pattern matching in Huffman encoded texts. A modified Knuth-Morris-Pratt (KMP) algorithm is used in order to overcome the problem of false matches, i.e., an occurrence of the encoded pattern in the encoded text that does not correspond to an occurrence of the pattern itself in the original text. We propose a bitwise KMP algorithm that can move one extra bit in the case of a mismatch, since the alphabet is binary. To avoid processing any encoded text bit more than once, a preprocessed table is used to determine how far to back up when a mismatch is detected, and is defined so that the encoded pattern is always aligned with the start of a codeword in the encoded text. We combine our KMP algorithm with two Huffman decoding algorithms which handle more than a single bit per machine operation; Skeleton trees defined by Klein [1], and numerical comparisons between special canonical values and portions of a sliding window presented in Moffat and Turpin [3]. We call the combined algorithms sk-kmp and win-kmp respectively.The following table compares our algorithms with cgrep of Moura et al. As can be seen, the KMP variants are faster than the methods corresponding to "decompress and search" but slower than cgrep. However, when compression performance is important or when one does not want to re-compress Huffman encoded files in order to use cgrep, the proposed algorithms are the better choice.

show abstract

Boosting the Compression of Rewriting on Flash Memory

Klein

Shapira

2014

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Dana Shapira

Edit distance with move operations

Edit Distance with Move Operations

Adapting the Knuth–Morris–Pratt algorithm for pattern matching in Huffman encoded texts

Boosting the Compression of Rewriting on Flash Memory

Contact Info

Product

Resources

About