Proceedings of the Seventeenth Annual ACM Conference on Computer Science: Computing Trends in the 1990's (1989)
DOI: 10.1145/75427.75440

A more cost effective algorithm for finding perfect hash functions

Abstract: …of these contexts there are large static collections of records which must be indexed by keys, usually English words or phrases. Hashing is one method that can provide the desired rapid access, with little overhead in space, but unless the hash function chosen is suitable, there can be a considerable loss in performance due to collisions. Given adequate space, dynamic hashing methods have been developed to deal with this problem in changeable collections [ENBO88]. With static data sets, however, it is possible…
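The distinction the abstract draws, between collision-prone ordinary hashing and a perfect hash function for a static key set, can be illustrated with a small sketch. The key set and the hash form below (a multiplier over summed character codes, modulo a table size; perfect but not minimal) are illustrative only, not the paper's algorithm, which is far more cost-effective for large sets:

```python
# Brute-force search for a perfect hash over a small static key set.
# The hash form and key set are illustrative stand-ins, not the
# paper's construction.
KEYS = ["apple", "banana", "cherry", "date", "elderberry"]
TABLE_SIZE = 8  # larger than the key set: perfect, though not minimal

def h(word, a, m=TABLE_SIZE):
    # Multiplier a scrambles the summed character codes before the MOD.
    return (a * sum(ord(c) for c in word)) % m

def find_perfect_multiplier(keys, m=TABLE_SIZE, limit=10000):
    # A hash is perfect when every key lands in a distinct slot.
    for a in range(1, limit):
        if len({h(k, a, m) for k in keys}) == len(keys):
            return a
    return None

a = find_perfect_multiplier(KEYS)
print(a, [h(k, a) for k in KEYS])
```

Once such a function is found, every lookup on the static set costs one probe with no collision handling; the cost of the search itself is what algorithms like the paper's aim to reduce.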

Citing publications range from 1989 to 1994.
Cited by 5 publications (8 citation statements); references 7 publications.
“…is of concern to us only insofar that good pseudo-random functions can be found which map the keys to integers. Several authors have given families of pseudo-random functions which work well for character string keys [4,5,6,7,10].…”
Section: Preliminaries
confidence: 99%
“…2 would work well as a perfect hashing algorithm because it completely eliminates the pattern collision problem, and also because it produces minimal word lists that can be ordered in any way (the list shown is alphabetical). If the trie array were stored in memory, the CPU time needed to index a word through the trie would actually be less than that of other perfect hashing functions currently in use for handling large word sets [22], [9]. For example, to look up the word CORN using Sager's algorithm [22] (or Fox's improvement of it [9]) requires that seven characters be indexed from the word, along with 11 additions, three MOD operations, and three array indexings.…”
Section: The Trie Data Structure
confidence: 99%
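The per-lookup cost cited above (character indexings, additions, MOD operations, and array accesses) comes from the general three-part form used by Sager-style minimal perfect hash functions, h(w) = (h0(w) + g[h1(w)] + g[h2(w)]) mod n. The sketch below is hypothetical: the component hashes and the exhaustive search for the table g are toy stand-ins, not the construction of Sager's or Fox's algorithm, which builds g far more cheaply for large word sets:

```python
from itertools import product

# Sketch of the three-part lookup form behind Sager-style minimal
# perfect hash functions: h(w) = (h0(w) + g[h1(w)] + g[h2(w)]) mod n.
# The component hashes and the brute-force search for g are
# illustrative stand-ins, not Sager's or Fox's construction.

KEYS = ["corn", "hash", "trie"]   # toy static key set
N = len(KEYS)                     # minimal table: one slot per key
R = 5                             # size of the auxiliary table g

def h0(w): return len(w) % N
def h1(w): return sum(ord(c) for c in w) % R
def h2(w): return sum((i + 1) * ord(c) for i, c in enumerate(w)) % R

def lookup(w, g):
    # One lookup: a few character indexings, additions, MODs, and
    # two g-table accesses -- the cost profile cited in the text.
    return (h0(w) + g[h1(w)] + g[h2(w)]) % N

def find_g(keys):
    # Exhaustive search over g, feasible only for toy sets; real
    # algorithms construct g from a graph of the (h1, h2) pairs.
    for g in product(range(N), repeat=R):
        if len({lookup(w, g) for w in keys}) == len(keys):
            return g
    return None

g = find_g(KEYS)
print(g, [lookup(w, g) for w in KEYS])
```

The trie lookup the citing authors describe avoids the additions and MOD operations entirely, which is the basis of their CPU-time comparison.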
“…CD-ROMs also have slow track seek times, which makes multiple accesses to the disk extremely frustrating for the user of the CD-ROM. Perfect hashing functions have been used in this environment to provide quick access to large medical dictionaries [9] and other word lists. The perfect hashing function allows any word to be accessed from the disk with only one disk access operation.…”
Section: Introduction
confidence: 99%
“…After further investigation we developed a modified algorithm [16] requiring O(m^3) time. With this algorithm we were able to find MPHFs for sets of over a thousand words.…”
Section: Practical Minimal Perfect Hash Functions For Large Databases
confidence: 99%
“…After careful analysis of Sager's algorithm [32] and our enhanced version [16], Heath made three crucial observations that serve as the foundation of our new algorithms:…”
Section: Key Concepts Of New Algorithms
confidence: 99%