Stuffing a small int & a flag into a u32

zeroexcuses · September 2, 2021, 2:07pm

Suppose we have an index that fits in a u27, and a flag that fits in a u5. We can store both in a u32 as either:

(index << 5) + flag
(flag << 27) + index

In practice, is there any advantage/standard for picking one over the other? [Context: compressing stuff in b tree nodes, so not using 2 u32 is important]

steffahn · September 2, 2021, 2:18pm

You could do it without bit shifting, using u32 flags that are multiples of 2²⁷, combine them with | (bitwise or), and extract the values with & (bitwise and) and an appropriate mask. In that case, the representation would be the

(flag << 27) + index

version. Disclaimer: I don’t know about practical advantages/standards and don’t have any experience doing something like this myself.

zeroexcuses · September 2, 2021, 2:20pm

This is clever.

For extracting the lower k bits, for k < 32, do we do

... & ((1 << k) - 1)

or is there a more idiomatic way to do this ?

steffahn · September 2, 2021, 2:27pm

I’d assume your k is actually a constant? Then it doesn’t really matter how you calculate it, and I would maybe just put the mask in a const.

const LOWER_27_BITS: u32 = (1 << 27) - 1;

I’m personally not aware of any more idiomatic approach. You have to make sure to ~~use 28 in the shift for getting 27 bits to be 1~~ use parentheses even though the shift is used somewhat like an exponential here, and exponentiation usually binds tighter than addition. Maybe it’s slightly less confusing to use 2.pow then…

const LOWER_27_BITS: u32 = 2_u32.pow(27) - 1;

Edit: I mean, you can also do it with a literal

const LOWER_27: u32 = 0b_00000111_11111111_11111111_11111111;

Edit2: You can also work with the “complement” of the 27 being 5 (i.e. the two add to 32):

const LOWER_27: u32 = !0 >> 5;

Again, I personally don’t know if any of these would be considered “most idiomatic”.

quinedot · September 2, 2021, 8:25pm

I like using enums for flags. You can use something like no_panic to make the test a compile time assert. I'd also have utility methods, etc.

There are a number of crates for this purpose too, but I haven't tried them yet.

tczajka · September 3, 2021, 6:22am

Actually you have to make sure to use 27 in the shift, your code will give you 28 ones.

steffahn · September 3, 2021, 8:13am

Thanks, good catch. Makes sense now, thinking about it again. I've messed up my testing by forgetting the parentheses, so that the result of 1 << 28 - 1 looked closer to the correct anwer. But only because that's 1 << (28 - 1).

So the code I had wouldn't have given 27 ones or 28 ones but instead a single one followed by 27 zeroes.

system · December 2, 2021, 8:13am

This topic was automatically closed 90 days after the last reply. We invite you to open a new topic if you have further questions or comments.

Topic		Replies	Views
Rust bit splitting	2	400	November 22, 2021
Saving a f32's bits in a usize on a big endian machine help	3	722	January 12, 2023
F32 -> f28, f29?	5	852	March 31, 2022
Best way to get the 3 bytes of a small u32? help	15	6764	January 12, 2023
Reading out bytes in memory: u32,f32 <-> [u8;4]	5	997	January 12, 2023

Stuffing a small int & a flag into a u32

Related Topics