HashMap for a set that is known ahead of time

Nils · October 16, 2021, 12:37am

I have a set of 1 to 20 seemingly random and distinct integers that need to be mapped to another set of integers. I've been using the standard HashMap until now but I think it's probably possible to optimize the process.

I've done some research and I think what I need is a perfect hash function. I know there is the phf crate, but it only seems to work for keys that are known at compile time. My keys are generated at runtime.

Is there a crate that allows creating hashmaps that are optimized for a precise set of values? Or do I need to implement it myself? If so, is there an obvious algorithm that I should implement?
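For a handful of u64 keys, one simple option is to brute-force a multiplier at runtime until every key lands in its own slot. Below is a minimal std-only sketch of that idea. `TinyPerfectMap`, `build`, and `max_tries` are hypothetical names for illustration and do not come from phf or any other crate. It assumes the keys are distinct (duplicate keys always collide, so `build` returns `None`) and uses at least twice as many slots as keys so that a collision-free multiplier tends to turn up quickly.

```rust
/// Hypothetical helper type for illustration; it does not come from any crate.
pub struct TinyPerfectMap {
    multiplier: u64,
    shift: u32,
    // Each slot holds at most one (key, value) pair.
    slots: Vec<Option<(u64, u64)>>,
}

impl TinyPerfectMap {
    /// Tries up to `max_tries` odd multipliers and keeps the first one that
    /// sends every key to a different slot. Returns None if none is found.
    pub fn build(entries: &[(u64, u64)], max_tries: u32) -> Option<Self> {
        // Power-of-two slot count, at least 2x the number of keys.
        let slot_count = (entries.len().max(1) * 2).next_power_of_two();
        // Keep only the top log2(slot_count) bits of the product.
        let shift = 64 - slot_count.trailing_zeros();
        let mut seed: u64 = 0x9E37_79B9_7F4A_7C15;
        for _ in 0..max_tries {
            // Simple linear congruential step to generate the next candidate.
            seed = seed
                .wrapping_mul(6364136223846793005)
                .wrapping_add(1442695040888963407);
            let multiplier = seed | 1; // force the multiplier to be odd
            let mut slots: Vec<Option<(u64, u64)>> = vec![None; slot_count];
            let collision_free = entries.iter().all(|&(key, value)| {
                let idx = (key.wrapping_mul(multiplier) >> shift) as usize;
                // `replace` returns the previous content: Some(..) means collision.
                slots[idx].replace((key, value)).is_none()
            });
            if collision_free {
                return Some(Self { multiplier, shift, slots });
            }
        }
        None
    }

    /// One multiply, one shift, one comparison per lookup.
    pub fn get(&self, key: u64) -> Option<u64> {
        let idx = (key.wrapping_mul(self.multiplier) >> self.shift) as usize;
        match self.slots[idx] {
            // The stored key is compared so that non-members are rejected.
            Some((k, v)) if k == key => Some(v),
            _ => None,
        }
    }
}
```

Whether this beats a regular HashMap with a cheap hasher for 1 to 20 keys would need to be measured.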

Thanks!

scottmcm · October 16, 2021, 1:43am

It's not obvious to me that it'd be worth doing a bunch of optimizing at runtime to figure out a PHF for whatever you happen to get.

If you're using the default HashMap, though, that comes with a DoS-resistant Hasher, which would be the easiest place to optimize things. The easiest speed improvement would probably just be to use the fxhash crate by replacing HashMap with FxHashMap, which is just a type alias of HashMap with a different hasher, so all the methods you're used to will still be there.
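To illustrate what "a type alias of HashMap with a different hasher" means, here is a minimal std-only sketch. `FastHasher`, `FastHashMap`, and `build_map` are hypothetical names, and the rotate-xor-multiply step is a stand-in for a fast non-cryptographic hash, not the fxhash crate's actual source. With the real crate you would import its `FxHashMap` instead of defining any of this.

```rust
use std::collections::HashMap;
use std::hash::{BuildHasherDefault, Hasher};

/// Stand-in for a fast, non-DoS-resistant hasher (hypothetical).
#[derive(Default)]
pub struct FastHasher {
    state: u64,
}

impl Hasher for FastHasher {
    // Generic fallback: feed the bytes in one at a time.
    fn write(&mut self, bytes: &[u8]) {
        for &b in bytes {
            self.write_u64(u64::from(b));
        }
    }

    // u64 keys take this path: one rotate, one xor, one multiply.
    fn write_u64(&mut self, word: u64) {
        self.state = (self.state.rotate_left(5) ^ word).wrapping_mul(0x517c_c1b7_2722_0a95);
    }

    fn finish(&self) -> u64 {
        self.state
    }
}

/// Same HashMap type and methods, only the hasher parameter differs.
pub type FastHashMap<K, V> = HashMap<K, V, BuildHasherDefault<FastHasher>>;

/// Builds a map from (key, value) pairs using the alias above.
pub fn build_map(entries: &[(u64, u64)]) -> FastHashMap<u64, u64> {
    entries.iter().copied().collect()
}
```

Because only the third type parameter changes, `get`, `insert`, and the rest behave exactly as on the default map.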

Nils · October 16, 2021, 1:52am

I'm currently using a custom Hasher implementation that does nothing other than assigning its input to its state. This works because I'm only receiving u64 keys.

My get operation is quite performance-critical, so I would prefer spending a bit more time optimizing up front over losing a few CPU cycles each time I need to query a value.
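A hasher of that kind could look roughly like the sketch below. This is a guess at the shape of such an implementation, not the code actually used here; `IdentityHasher` and `IdentityMap` are hypothetical names.

```rust
use std::collections::HashMap;
use std::hash::{BuildHasherDefault, Hasher};

/// Hypothetical pass-through hasher: the "hash" is the u64 key itself.
#[derive(Default)]
pub struct IdentityHasher(u64);

impl Hasher for IdentityHasher {
    // Only u64 keys are supported, so the byte-slice path is left out.
    fn write(&mut self, _bytes: &[u8]) {
        unimplemented!("IdentityHasher only supports u64 keys");
    }

    // u64's Hash impl calls write_u64, so the key is stored unchanged.
    fn write_u64(&mut self, value: u64) {
        self.0 = value;
    }

    fn finish(&self) -> u64 {
        self.0
    }
}

/// HashMap keyed by u64 that uses the pass-through hasher.
pub type IdentityMap<V> = HashMap<u64, V, BuildHasherDefault<IdentityHasher>>;
```

One trade-off to keep in mind: the map sees the raw key bits, so keys that share bit patterns may cluster in the table in a way a mixing hasher would avoid.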

Fredrik · October 16, 2021, 2:02am

It would be interesting to see a crate that JIT compiles simple functions like this to optimized machine code.

IndianBoy42 · October 16, 2021, 2:05am

This may seem weird, but if you only have 1-20 keys, a simple linear search may be the fastest, especially if you can vectorize it. For this you should have two separate arrays for the keys and the values: search the keys (2-8 at a time, depending on AVX support and whether the loop is branchless), then use the matching index to fetch the value.
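A minimal sketch of the two-array layout, assuming u64 keys and values. `LinearMap` and its methods are hypothetical names. It does not use explicit SIMD intrinsics; `get_branchless` just avoids the early exit, which may make it easier for the compiler to auto-vectorize, though that depends on the compiler and target and is not guaranteed.

```rust
/// Keys and values kept in two parallel vectors (hypothetical type).
pub struct LinearMap {
    keys: Vec<u64>,
    values: Vec<u64>,
}

impl LinearMap {
    /// Splits the (key, value) pairs into the two parallel vectors.
    pub fn new(entries: &[(u64, u64)]) -> Self {
        let (keys, values): (Vec<u64>, Vec<u64>) = entries.iter().copied().unzip();
        Self { keys, values }
    }

    /// Plain linear scan that stops at the first matching key.
    pub fn get(&self, key: u64) -> Option<u64> {
        self.keys
            .iter()
            .position(|&k| k == key)
            .map(|i| self.values[i])
    }

    /// Scans every key without an early exit. Assumes keys are distinct,
    /// so at most one index can match.
    pub fn get_branchless(&self, key: u64) -> Option<u64> {
        let mut found = usize::MAX; // sentinel: no match yet
        for (i, &k) in self.keys.iter().enumerate() {
            if k == key {
                found = i;
            }
        }
        // usize::MAX is out of bounds, so a miss yields None.
        self.values.get(found).copied()
    }
}
```

Keeping the keys contiguous means the scan touches only key data; the values array is read once, at the matching index.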

Fredrik · October 16, 2021, 3:14am

Here's a proof of concept JIT compiled map runnable on the playground. I would be interested in knowing how it compares to other alternatives in terms of performance.

It allocates 29 bytes of memory for each entry, but is rounded up to the page size, which is typically 4096 bytes. Memory management could be improved to pack multiple maps into a single page.
For simplicity, it does a linear search, but could be improved by striking a balance between binary search and linear search.
It only supports x64 Linux, but if it performs well and someone likes to make a crate out of it, support for more platforms should be added.
Having an enormous number of entries in the map is undefined behavior, due to the four-byte address of branches wrapping around. A crate with a safe interface would need to take care of this.

Nils · October 16, 2021, 3:41am

Unfortunately, I'm currently on Windows so I can't test it outside the playground. It's pretty impressive nonetheless and I'd love to see how this performs compared to the other existing solutions.

EDIT: I've done some benchmarks this morning. For those who might end up needing it later, here's what I found on my machine. input is the number of keys inserted (those are u64s), and it's mapped to the average time taken to find a random value of the set.

ahash, fxhash and nohash are hashing algorithms (well, nohash doesn't hash the values, and I guess it was a mistake on my part). linear is a linear search in an array (not vectorized; I'm not sure how this can be done). binary is a binary search in a sorted array.

I'm curious to see how the JIT map and a perfect hash function can perform for this special case, and I might come back to edit this post if I find something that outperforms those. For now I guess I will use fxhash, as proposed by scottmcm, which outperforms the other solutions.
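For reference, the binary variant can be sketched with the standard library's slice search. This is an illustration of the technique, not the benchmark code; `SortedMap` is a hypothetical name.

```rust
/// (key, value) pairs kept sorted by key (hypothetical type).
pub struct SortedMap {
    entries: Vec<(u64, u64)>,
}

impl SortedMap {
    /// Sorts once up front so that lookups can use binary search.
    pub fn new(mut entries: Vec<(u64, u64)>) -> Self {
        entries.sort_unstable_by_key(|&(k, _)| k);
        Self { entries }
    }

    /// O(log n) lookup over the sorted pairs.
    pub fn get(&self, key: u64) -> Option<u64> {
        self.entries
            .binary_search_by_key(&key, |&(k, _)| k)
            .ok()
            .map(|i| self.entries[i].1)
    }
}
```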

Fredrik · October 16, 2021, 11:57am

Can you share the benchmark code?

Nils · October 16, 2021, 1:18pm

No problem, @Fredrik! It uses criterion so it's just cargo bench. Here's a pastebin link.

Fredrik · October 16, 2021, 9:30pm

My results on x64 GNU/Linux are significantly different from yours. fxhash isn't that low, and is even higher than nohash. ahash is doing far worse. The JIT collections are looking good. As I saw you're using a set rather than a map, I made a JIT set which generates fewer instructions than the JIT map.

system · January 14, 2022, 9:30pm

This topic was automatically closed 90 days after the last reply. We invite you to open a new topic if you have further questions or comments.
