Free access
Proceedings
2023 Proceedings of the Symposium on Algorithm Engineering and Experiments (ALENEX)

PaCHash: Packed and Compressed Hash Tables

Abstract

We introduce PaCHash, a hash table that stores its objects contiguously in an array without intervening space, even if the objects have variable size. In particular, each object can be compressed using standard compression techniques. A small search data structure allows locating the objects in constant expected time. PaCHash is most naturally described as a static external hash table where it needs a constant number of bits of internal memory per block of external memory. Here, in some sense, PaCHash beats a lower bound on the space consumption of k-perfect hashing. An implementation for fast SSDs needs about 5 bits of internal memory per block of external memory, requires only one disk access (of variable length) per search operation, and has small internal search overhead compared to the disk access cost. Our experiments show that it has lower space consumption than all previous approaches even when considering objects of identical size.

Formats available

You can view the full content in the following formats:

Information & Authors

Information

Published In

cover image Proceedings
2023 Proceedings of the Symposium on Algorithm Engineering and Experiments (ALENEX)
Pages: 162 - 175
ISBN (Online): 978-1-61197-756-1

History

Published online: 12 January 2023

Authors

Affiliations

Metrics & Citations

Metrics

Citations

If you have the appropriate software installed, you can download article citation data to the citation manager of your choice. Simply select your manager software from the list below and click Download.

Cited By

View Options

View options

PDF

View PDF

Figures

Tables

Media

Share

Share

Copy the content Link

Share with email

Email a colleague

Share on social media