Question about BTreeSet implementation

droundy · June 3, 2022, 3:03am

I'm wondering about the implementation of BTreeSet::from_iter(), which collects the values into a Vec and then sorts them with a stable sort and then iterates over that to construct the BTreeSet.
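For readers following along, the steps described above can be sketched with the public API only. This is not the standard library's code: `from_iter_sketch` is a hypothetical name, and the final `collect()` merely stands in for the private bulk-build routine that std uses at that point.

```rust
use std::collections::BTreeSet;

/// Hypothetical helper that mirrors the three steps described in the post.
/// It is an illustration, not the actual `FromIterator` implementation.
fn from_iter_sketch<T: Ord, I: IntoIterator<Item = T>>(iter: I) -> BTreeSet<T> {
    // Step 1: buffer every element in an intermediate Vec (one extra allocation).
    let mut buf: Vec<T> = iter.into_iter().collect();

    // Step 2: sort the buffer. `slice::sort` is a stable sort.
    buf.sort();

    // Step 3: build the set from the sorted buffer. std uses a private
    // bulk-build function here; the public `collect()` is only a stand-in.
    buf.into_iter().collect()
}
```

For instance, `from_iter_sketch(vec![3, 1, 2, 3, 1])` produces a set that iterates as 1, 2, 3.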

It seems terribly wasteful in cases where the iterator is already sorted, as happens e.g. in Sub, where the ordered difference between two BTreeSets is collected into another BTreeSet.
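A small sketch of the case in question: the `difference` iterator of a `BTreeSet` already yields its elements in ascending order, so re-sorting them adds nothing. The helper names `difference_in_order` and `is_strictly_ascending` are made up for this illustration.

```rust
use std::collections::BTreeSet;

/// Collects `a - b` into a Vec in the order the `difference` iterator yields it.
fn difference_in_order(a: &BTreeSet<i32>, b: &BTreeSet<i32>) -> Vec<i32> {
    a.difference(b).copied().collect()
}

/// Returns true if every element is strictly greater than its predecessor.
fn is_strictly_ascending(v: &[i32]) -> bool {
    v.windows(2).all(|w| w[0] < w[1])
}
```

The `Sub` operator (`&a - &b`) returns the same elements as a new `BTreeSet`, which is where the collect step comes in.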

It's hard to imagine the standard library leaving on the floor such an obvious optimization as to call BTreeMap::bulk_build_from_sorted_iter when it knows it has a sorted iterator. Presumably there is a trick that makes `collect` fast. What is it?

cuviper · June 3, 2022, 3:55am

It's not so strange to me that nobody noticed this case, but you're welcome to send a pull request if you'd like to improve it! I think the BitAnd intersection can do the same thing.

H2CO3 · June 3, 2022, 4:24am

Well, but checking for a sorted iterator would require traversing it. There are two problems with that:

1. It's linear, so it might approximately double (or at least significantly increase) the running time of collect if the iterator is long.
2. You would have to collect it into a data structure anyway because from_iter() needs to consume the elements, so if the elements didn't go into some sort of allocation, there would be no way to actually build the set after checking for sortedness.

So I don't think the general FromIterator impl can be reasonably expected to perform a check for sorted input and not allocate.
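To make the two problems concrete, here is a minimal sketch of what a generic sortedness check would have to look like. `buffer_and_check_sorted` is a hypothetical helper, not anything in std. It has to walk the whole iterator and keep every element in a buffer, because an iterator can only be consumed once.

```rust
/// Hypothetical helper: drains the iterator into a Vec and reports whether
/// the elements arrived in strictly ascending order.
fn buffer_and_check_sorted<T: Ord, I: IntoIterator<Item = T>>(iter: I) -> (Vec<T>, bool) {
    let mut buf: Vec<T> = Vec::new();
    let mut ascending = true;
    for item in iter {
        // Compare with the previously buffered element, if there is one.
        if let Some(last) = buf.last() {
            if *last >= item {
                ascending = false;
            }
        }
        // The element must be stored: it cannot be read from the iterator again.
        buf.push(item);
    }
    (buf, ascending)
}
```

Only after this pass could a caller decide whether to skip the sort, and by then the allocation has already happened.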

ssomers · June 3, 2022, 11:18am

Note that this function comes from a fairly recent PR to begin with. It already is the trick that makes collect fast, because:

1. No need to look up the tree node for each inserted key.
2. Requires fewer allocations because it creates a compact tree: fewer tree nodes with most of them filled up to capacity. The resulting map is most likely faster for lookups but actually slower if you then need to insert more keys into it, but that's a different topic.

On the other hand, the use of bulk_build_from_sorted_iter in from_iter slows us down if the iterator yields only a few elements. Up to 11 elements, there's only one tree node to be allocated, and the price of allocating an intermediate vector needs to be offset by sorting smartly to beat the linear searches in tree nodes. Certainly if the iterator yields just one or two elements, you pay for that extra allocation (compared to the old naive implementation). That may be what the last remark in the PR reports.
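The figure of 11 elements is consistent with a node holding at most 2B - 1 keys for a branching parameter B = 6. The sketch below assumes those values, which I believe match std's internal node module, but they are implementation details and not part of any public API. `fits_in_single_node` is a hypothetical helper.

```rust
/// Assumed branching parameter. This is an internal detail of std and an
/// assumption of this sketch, not a documented guarantee.
const B: usize = 6;

/// Maximum number of keys one node can hold under that assumption.
const CAPACITY: usize = 2 * B - 1;

/// Hypothetical helper: would a set of `len` elements fit in one tree node?
fn fits_in_single_node(len: usize) -> bool {
    len <= CAPACITY
}
```

Under these assumptions, any input of up to 11 elements needs a single node allocation, so the intermediate vector is a second allocation on top of it.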

But anyway, if the caller knows the keys are ascending, like all the BTree iterators are, using bulk_build_from_sorted_iter is low-hanging fruit.

ssomers · June 3, 2022, 1:50pm

You could copy or base a PR on this:

https://github.com/ssomers/rust/tree/btree_from_sorted_iter

If you don't, I'll make a PR myself, but then it's probably going to lie untouched for months or years. I don't know why, must be on some blacklist or something.

droundy · June 7, 2022, 3:20pm

Did you end up submitting this pull request?

ssomers · June 8, 2022, 5:57am

It's here.

droundy · June 10, 2022, 3:05pm

Great, and already merged! Thanks!

marcoDreis · June 10, 2022, 4:12pm

interesting information

system · September 8, 2022, 4:12pm

This topic was automatically closed 90 days after the last reply. We invite you to open a new topic if you have further questions or comments.
