Merge sort: Vec<T> vs Vec<Vec<T>>

zeroexcuses · February 8, 2019, 8:56pm

Suppose we are doing merge sort of A & B, of n_A and n_B elements.
If A, B, and output are stored as Vec, then we will need:

( n_A + n_B + (n_A + n_B) * sizeof(T) bytes of storage

On the other hand, if we stored A, B, and outputa s Vec<Vec> ... with the inner Vec having K elements, then at any given time, we will only need:

(n_A + n_B + K * 2) * sizeof(T) + (n_A/k + n_B/k)*sizeof(ptr) bytes of storage right?

Because if we into_iter A and B, we consume Vec's of size K as we march along / allocate new blocks in the output.

If the above is correct, does Rust have any crates built around Vec<Vec> ?

dthul · February 8, 2019, 9:26pm

I don't know whether there exists any crate that offers such a data structure. Since the memory savings are only a factor of two I wonder how useful this is in practice. If your input vectors are so large that a factor of two will make the difference between fitting in memory and not fitting in memory, a more common solution would be to stream over the input data instead of keeping it in memory in the first place.

zeroexcuses · February 8, 2019, 9:28pm

Yeah, that's a good point.:

2 * data > RAM > data

is a very rare situation indeed.

Topic		Replies	Views
Create merge sort in rust	16	4295	November 21, 2021
Merge 2 arrays into bigger array help	3	488	April 5, 2023
Merge sorting 100GB worth of data	8	1205	January 12, 2023
Feedback on how Improve and be more memory efficient code review	7	378	February 1, 2023
Datastructure help	7	425	August 3, 2022

Merge sort: Vec<T> vs Vec<Vec<T>>

Related Topics