Safe to transmute lifetimes inside Vec?

Coder-256 · February 5, 2025, 5:39pm

Let's say I want to reuse a Vec<&'a i32> as Vec<&'b i32>. Transmuting should be safe if the vector is empty, right?

Now consider a more generic API:

pub trait Transmutable {
    type Assoc<'a>;
}

pub fn transmute_vec_lifetime<'a, 'b, X: Transmutable>(
    mut vec: Vec<X::Assoc<'a>>,
) -> Vec<X::Assoc<'b>> {
    vec.clear();
    unsafe { std::mem::transmute(vec) }
}

Is this code sound for arbitrary implementations of Transmutable? Are there any types where this would not be a valid transformation, or any known API where this would break the safety assumptions?

Example usage:

struct Foo<'a>(&'a i32);
struct FooTransmute;
impl Transmutable for FooTransmute {
    type Assoc<'a> = Foo<'a>;
}
fn transmute_foo<'a, 'b>(vec: Vec<Foo<'a>>) -> Vec<Foo<'b>> {
    transmute_vec_lifetime::<FooTransmute>(vec)
}

alice · February 5, 2025, 5:51pm

You can't use transmute for this, but if you use Vec::from_raw_parts together with a pointer cast then that should work.

drewtato · February 5, 2025, 6:38pm

Just doing .into_iter().map(|_| unreachable!()).collect() might also work.

cod10129 · February 5, 2025, 6:40pm

It should look like this without transmute:

pub trait Transmutable {
    type Assoc<'a>;
}

pub fn transmute_vec_lifetime<'a, 'b, X: Transmutable>(
    mut vec: Vec<X::Assoc<'a>>,
) -> Vec<X::Assoc<'b>> {
    vec.clear();
    let mut vec = std::mem::ManuallyDrop::new(vec);
    let len = vec.len();
    let cap = vec.capacity();
    let ptr: *mut X::Assoc<'a> = vec.as_mut_ptr();
    let ptr = ptr.cast::<X::Assoc<'b>>();
    unsafe {
        Vec::from_raw_parts(ptr, len, cap)
    }
}

This code would be a lot nicer if into_raw_parts was stable because you have to take some subtle stuff into account in this code:

You have to not drop the original Vec, and ManuallyDrop is generally better than mem::forget.
If you call as_mut_ptr before the len/capacity calls, the &Vec created to call capacity could invalidate the mutable pointer.^[1]
^[2] There can’t be any edge cases where Assoc<'a> could have a different layout than Assoc<'b>. (which I don’t think is possible)
You have to call clear before you wrap in ManuallyDrop to not leak the vector’s buffer if an element’s Drop impl panics.

I saw this before on this forum somewhere, but I don’t remember where. See quinedot’s post below for more information on whether and when this pointer gets invalidated. ↩︎
also a problem if you had into_raw_parts ↩︎

alice · February 5, 2025, 6:49pm

No that's wrong. You need to call clear before you wrap it in ManuallyDrop. Otherwise you leak memory on panic.

quinedot · February 5, 2025, 7:08pm

Whether a mutable pointer is invalidated or not is defined at the memory model level (e.g. stacked borrows), and could cause problems. The library consideration is, as I understand it, does Vec have the same strong ownership qualities (noalias) as Box does?^[1] As far as I know that's an undecided question, and thus it would be unsound to rely on it not being true in the implementation today.

More discussion here and here, though they're sort of lengthy.

so far ↩︎

Coder-256 · February 5, 2025, 8:25pm

I see, as_mut_ptr()/from_raw_parts() definitely seems sounder than an actual transmute.

Anyway, I'm now realizing a simpler and strictly more general version would be:

pub fn transmute_vec<T, U>(mut vec: Vec<T>) -> Vec<U> {
    const { assert!(std::mem::size_of::<T>() == std::mem::size_of::<U>()) };
    const { assert!(std::mem::align_of::<T>() == std::mem::align_of::<U>()) };
    vec.clear();
    let mut vec = std::mem::ManuallyDrop::new(vec);
    let len = vec.len();
    let cap = vec.capacity();
    let ptr = vec.as_mut_ptr();
    unsafe { Vec::from_raw_parts(ptr.cast(), len, cap) }
}

Per the docs of Vec::from_raw_parts, really the core requirement is that the size and alignment of the allocation are correct. The actual type doesn't even matter. Not sure how I didn't think of this earlier.

Edit: technically it's unclear whether the pointer returned from as_mut_ptr() is valid to pass to from_raw_parts() if the vector didn't allocate, but I imagine it's probably fine.

cod10129 · February 5, 2025, 9:47pm

Fixed that.

EDIT: this part is wrong, see the discussion below about it
It turns out that in the source of Vec::clear, a panic in T::drop will cause the remainder of the elements to be leaked. However, the Vec will still free the buffer, making the change worthwhile.

When I said “problems” in the first version I really meant segfaults etc., but that didn’t really make sense since the rest of the sentence was about UB. I edited my message (mostly to make it defer to yours), and thanks for the description and the links.

Well, the pointer if there's no allocation is just the alignment transmuted into a pointer type (RawVecInner::new_in). That means that the pointer was definitely not allocated with the global allocator. The global allocator can definitely be changed to one that will never accept^[1] pointers in, say, the zero page and just never return them from allocating methods.

You can simply fix this by adding an assert!(vec.capacity() > 0), but take care to put it before you move the Vec into the ManuallyDrop.^[2]

Also, ZSTs could be a problem since Vec<ZST>'s capacity is set to usize::MAX and the pointer is dangling. assert!(size_of::<T>() != 0) should work here. Since the whole purpose of your function is to reuse the capacity of the vector, then ZSTs should never be used with it (since then the allocation would be zero-sized).

when passed to dealloc ↩︎
or else the panic from the assert would leak the Vec's memory ↩︎

alice · February 6, 2025, 6:59am

No, a panic will not leak the rest of the elements. It uses the destructor of slice, which continues to run drop on panic of an element.

197g · February 6, 2025, 11:13am

I find this point quite interesting. Can we provably assume this? I had sketched a crate for the remainder of the operation well before these kinds of associated types were possible. If such a safe trait is sound then it'd be easy to construct the token that the crate relies on for the Vec's clear-then-reassemble operation. The reuse of the storage only depends on type layout, that trait is the only question where lifetimes play a role specifically.

zirconium-n · February 6, 2025, 11:24am

You need lifetime specialized associated types to have different layout.

cod10129 · February 6, 2025, 2:21pm

EDIT: the point I’m trying to make here is wrong, see the discussion below

pub fn clear(&mut self) {
    let elems: *mut [T] = self.as_mut_slice();

    // SAFETY:
    // - `elems` comes directly from `as_mut_slice` and is therefore valid.
    // - Setting `self.len` before calling `drop_in_place` means that,
    //   if an element's `Drop` impl panics, the vector's `Drop` impl will
    //   do nothing (leaking the rest of the elements) instead of dropping
    //   some twice.
    unsafe {
        self.len = 0;
        ptr::drop_in_place(elems);
    }
}

This is the source code of Vec::clear. Vec::drop will still drop a slice, but one of zero length. clear() has to set the length to zero before dropping the elements to prevent them from being potentially dropped twice (which is what the second bullet of the SAFETY comment says).

alice · February 6, 2025, 8:47pm

The slice is created before the length is set to zero.

I recommend trying to write some code and see if you can get it to leak elements. I'll bet you can't.

jumpnbrownweasel · February 6, 2025, 9:15pm

Is the problem that the comment in clear is misleading? It is talking about dropping of the elements by the Vec Drop impl, but this makes no sense to me.

quinedot · February 6, 2025, 9:33pm

The behavior has changed at least once in the past.

github.com/rust-lang/rust

Make the semantics of Vec::truncate(N) consistent with slices.

master ← gnzlbg:simplify_truncate

opened 03:19PM - 13 Sep 19 UTC

gnzlbg

+12 -22

This commit simplifies the implementation of `Vec::truncate(N)` and makes its s…emantics identical to dropping the `[vec.len() - N..]` sub-slice tail of the vector, which is the same behavior as dropping a vector containing the same sub-slice. This changes two unspecified aspects of `Vec::truncate` behavior: * the drop order, from back-to-front to front-to-back, * the behavior of `Vec::truncate` on panics: if dropping one element of the tail panics, currently, `Vec::truncate` panics, but with this PR all other elements are still dropped, and if dropping a second element of the tail panics, with this PR, the program aborts. Programs can trivially observe both changes. For example ([playground](https://play.rust-lang.org/?version=stable&mode=debug&edition=2018&gist=7bef575b83b06e82b3e3529e4edbcac7)): ```rust fn main() { struct Bomb(usize); impl Drop for Bomb { fn drop(&mut self) { panic!(format!("{}", self.0)); } } let mut v = vec![Bomb(0), Bomb(1)]; std::panic::catch_unwind(std::panic::AssertUnwindSafe(|| { v.truncate(0); })); assert_eq!(v.len(), 1); std::mem::forget(v); } ``` panics printing `1` today and succeeds. With this change, it panics printing `0` first (due to the drop order change), and then aborts with a double-panic printing `1`, just like dropping the `[Bomb(0), Bomb(1)]` slice does, or dropping `vec![Bomb(0), Bomb(1)]` does. This needs to go through a crater run. r? @SimonSapin

(At the time of that PR, clear was a call to truncate(0).)

cod10129 · February 7, 2025, 12:18am

I tried (Playground) and it doesn't work, but now I don't know why. Let me walk through what I thought my playground would do:

vec.clear() is called
It creates a pointer *mut [T] to the buffer
vec.len is set to 0
The slice pointer is drop_in_placed, which starts by dropping the first element. It prints "Dropping Printer #1", and then panic!s.
That panic triggers an unwind, which unwinds up out of Printer::drop, [T]::drop, Vec::clear, and then main, causing the Vec to be dropped.
Since the len was previously set to zero, all Vec::drop does is free the underlying buffer of the Vec without dropping any elements.
main unwinds, exiting the program.

In this model, Printer(2) is never dropped. But in reality, it is dropped while unwinding and causes an abort. What did I not think of above? Please correct me.

(Since elems in Vec::clear is a raw pointer, it won't drop the slice. The backtrace reports Printer(2) being dropped in Vec::clear though, is there some strange magic in [T]::drop/drop_in_place?)

quinedot · February 7, 2025, 12:43am

That's not what happens.

vec.clear() is called
It creates a pointer *mut [T] to the buffer
vec.len is set to 0
The slice pointer is drop_in_placed, which starts by dropping the first element. It prints "Dropping Printer #1", and then panic!s.

(Differences start here)

Unwinding is caught
Attempt to drop the slice continues, dropping the second element. It prints "Dropping Printer #2", and then panic!s
A backtrace is printed and the process is aborted (not unwound) due to panicking a second time

I don't know where the implementation is, but see the description in the PR I linked above (which is basically the bullet points above). looks awhile.... Probably this is the code, which I think came from here.

Compare and contrast with only panicking once.

jumpnbrownweasel · February 7, 2025, 12:52am

Thanks for the explanation!

I wonder why it's considered safe to continue dropping after the first panic, since unwinding has started.

steffahn · February 7, 2025, 3:40am

Dropping things is the main thing that happens during unwind, where's the safety concern? The panic happened while dropping one of the slice element, this element isn't touched again^[1], but the logic continues in dropping the remaining elements whose destruction hadn't even been started yet.

The only possible "bad" effect is that another panic will abort then, but panicking destructors are discouraged anyway, and apparently the choice here is that it's more desirable to risk an abort than unnecessarily leaking values.

except for any panic handling that happens within its destructor, before the logic for slices/arrays is even gaining back control ↩︎

alice · February 7, 2025, 7:16am

It's the same everywhere. On panic, local variables continue to be dropped. On panic of one field in a struct, other fields continue to be dropped. On panic in a hash map, other elements continue to be dropped. Exceptions are very rare.

Topic		Replies	Views
How to cache a vector's capacity? help	60	2133	August 29, 2023
Is My Highly Unsafe Code Correct? In Place Mapping a Vector code review	23	1724	October 19, 2023
Unsafe Rust is... tough? help	11	2705	December 27, 2020
Rustnomincon `Drain` and double dropping help	14	276	March 14, 2025
Please help me let me keep my allocation help	33	1228	October 12, 2020

Safe to transmute lifetimes inside Vec?

Related topics