HashMap key access across threads where data is non-Send

John_Nagle · October 2, 2021, 6:02am

Here's an interesting little problem. I have a HashMap<K,V>. K, the key type, is Copy and Send. V, the data type, is not Send, because it contains an Rc cell. I really need V to be an Rc.

I want to find out, from another thread, if the HashMap contains a key k.

The problem is, I can't put the HashMap inside an Arc<Mutex<HashMap<K,V>>> because V is not Send. So there's no way to test if K is in the map.

Right now, as a workaround, I'm maintaining a HashSet in parallel to to the HashMap. The HashSet just has a list of the keys in the HashMap. The HashSet can be put inside Arc<Mutex<HashSet>>. Works, but doubles storage and time.

Is there a better way?

Kestrer · October 2, 2021, 7:05am

If you're OK with unsafe you could write a custom accessor type that wraps your shared map with a restricted key-only API, and then unsafely implement Send/Sync for it.

Alternatively, you could make the main thread an actor that responds to "is this key in the map" queries from other threads over a channel. That has the nice effect of avoiding shared mutability entirely.

alice · October 2, 2021, 8:49am

That's a quite interesting problem. I agree that you could do it correctly with unsafe.

quinedot · October 2, 2021, 9:29am

I had the same thoughts and wrote this up, but waited to post it until others chimed in. You're technically relying on HashMap to be well-behaved and that Rc's documentation is exaggerating from a pedantic point of view (i.e. it's OK to Send an Rc that isn't manipulated).

alice · October 2, 2021, 11:08am

Note also that this is not ok when using an Arc to share it across threads. This is because if the key-only version is the last Arc to get dropped, then you are running the destructor of non-Send values on a new thread.

John_Nagle · October 3, 2021, 6:08am

OK, there's no really great solution. Didn't think so. Just wanted to see if someone was going to say "Oh, that is easy using the XYZ crate" or "you need to use feature ABC" or something.

This is actually the first time, in a year, that I've hit a locking granularity problem. I expected this to come up more often when new to Rust.

2e71828 · October 3, 2021, 7:42am

I think that maintaining two parallel datastructures is probably required, but you might be able to get better performance with a different distribution of responsibilities. A generational arena like thunderdome, for example, could let you avoid calculating the hash value multiple times:

Shared: Arc<Mutex<HashMap<K, thunderdome::Index>>>
Thread local: thunderdome::Arena<V>

system · January 1, 2022, 7:42am

This topic was automatically closed 90 days after the last reply. We invite you to open a new topic if you have further questions or comments.

Topic		Replies	Views
How do I force multithreaded mutation on an Arc<HashMap<>>?	16	1831	December 11, 2022
[SOLVED] Is this way of using map with rc correct? help	6	953	January 12, 2023
How to share a HashMap between threads help	12	15034	February 20, 2021
Share String between key and value of HashMap help	8	3590	January 12, 2023
Ways to share updated hashmap in multiple threads without using Arc<mutex> and Rwlock	9	4988	June 15, 2022

HashMap key access across threads where data is non-Send

Related topics