Currently it's possible to split slices at index and get the parts, even mutable slices since they don't overlap. Vec itself implements a split_off method that allows the vector to be split at index, it however triggers an allocation of the new vector.
I'm interested in a method that would consume a Vec<T> and produce two owned instances of Vec<T> without the extra allocation step. Given an input vector of (pointer / capacity / length) = (N / 10 / 6), splitting it at offset 3 would produce a pair of (N / 3 / 3) and (N+3 / 7 / 3). Basically splitting the original allocated region between the two.
I'm still kind of new to how heap and stack works, is that even possible?
It is conceivable that this could be made to work, and I've occasionally wished for it as well. But as far as I know, there's currently no way to tell the allocator "Hey I got this piece of memory from you, now please pretend you gave it to me as two separate, contiguous allocations". In fact, this may be fundamentally incompatible with how some allocators work (e.g., if it stores some metadata right next to each allocation). But I wish it were possible on a best-effort basis, I just don't know if any allocators provide an interface to do it.
From the practical point of view I suspect that spliting a slice would be the right solution in the majority of the cases.
An interesting point is that if you do split a vector into two vectors, then the first one does not have any spare capacity, so any push will case an allocation. If you want to avoid allocations, then you must use the first vector as essentially a slice anyway.
Since the allocator API doesn't allow deallocating just part of the original allocation, the returned values would need shared ownership of the buffer, using something like Rc. For example (playground):
This has a few downsides. The contents of the Vec can't be mutated without interior mutability or unsafe code. The Rc<Vec<T>> has an unnecessary extra heap allocation and double pointer indirection. (Rc<[T]> would be a better fit, but I don't think there's any way to get one from a Vec<T> yet.) And of course, reference counting has some runtime overhead. There may be some cases where this is useful, but overall it has little advantage over a borrowed slice.
There are cases where this can be performed very easily in divide and conquer algorithms. You just have to save the indices in variables. But I don't know how well this works on your algorithm.
Some allocators do that (dlmalloc). High performance allocators tend to dedicate each block of memory requested from the OS to a single allocation size, and tightly pack chunks of that size into the block with no slack space or per-allocation metadata. To determine how much space is being freed in a call to free, they look up the page the pointer belongs to in a global table. This is also incompatible with splitting.
You're probably best off making your own allocator, really. One nice thing is that mmap and its equivalents on other OSes are splittable: you can deallocate individual pages of an mmapped block. So for small allocations you could delegate to normal malloc and just reallocate upon splitting, while for large ones you could track which pages belong to what, preserving O(1). Since the split point might not be on a page boundary, you'd have to reference-count pages, though...