Generic function using variable of unknown size

Blastula · January 15, 2019, 4:06am

For fun I'm trying to figure out how to write a generic function that takes a std::fs::File and reads size <T> from the file into <T>. It feels a little strange to me that when trying to create a [u8] to fill that the Rust compiler complains that it doesn't know statically the size. It seems to me that it should know at compilation based on <T> though. I know this is easily possible. I'm just not that familiar with the language.

So I'm asking for help in understanding how I help the compiler understand. The code I have so far is

pub fn write_to_type<T: Sized>(f: &mut std::fs::File) -> T {
    let mut bytes = [0u8; std::mem::size_of::<T>];
    f.read_exact(&mut bytes).unwrap();
    unsafe { std::mem::transmute::<[u8; 2], T>(bytes) }
}

The error I'm getting is

unsized locals are gated as an unstable feature [E0277]

all local variables must have a statically known size [E0277]

to learn more, visit <https://doc.rust-lang.org/book/second-edition/ch19-04-advanced-types.html#dynamically-sized-types-and-the-sized-trait> [E0277]

the trait `std::marker::Sized` is not implemented for `[u8]` [E0277]

the size for values of type `[u8]` cannot be known at compilation time (doesn't have a size known at compile-time) [E0277]

Thoughts?

RustyYato · January 15, 2019, 4:14am

You can't use generic parameters in const expressions right now. That will have to wait until we get const-generics.

github.com/rust-lang/rust

Tracking issue for const generics (RFC 2000)

opened 11:17PM - 14 Sep 17 UTC

closed 03:26PM - 21 Mar 22 UTC

withoutboats

A-typesystem B-RFC-approved T-lang T-compiler A-const-fn C-tracking-issue A-const-generics requires-nightly F-const_generics S-tracking-impl-incomplete

Tracking issue for rust-lang/rfcs#2000 Updates: - 2 May 2019: https://github….com/rust-lang/rust/issues/44580#issuecomment-488819344 - 19 Oct 2019: https://github.com/rust-lang/rust/issues/44580#issuecomment-544155666 - 2 Jan 2020: https://github.com/rust-lang/rust/issues/44580#issuecomment-570191702 - 22 Jul 2020: https://github.com/rust-lang/rust/issues/44580#issuecomment-662543117 - 17 Nov 2020: https://github.com/rust-lang/rust/issues/44580#issuecomment-728913127 - 11 Dez 2021: https://github.com/rust-lang/rust/issues/44580#issuecomment-991782799 If you want to help out, take a look at the [open const generics issues](https://github.com/rust-lang/rust/labels/A-const-generics) and feel free to ping @varkor, @eddyb, @yodaldevoid, @oli-obk or @lcnr for help in getting started! --- Blocking stabilization: - [ ] Design: - [x] Resolving ordering of const and type parameters, with default parameters - [ ] Decide what the best UX / implementation cost balance is for unifying abstract const expressions. - [ ] How we determine well formedness of const expressions. - [x] Implementation - [ ] Documentation - [ ] rustc guide --- Remaining implementation issues: - [ ] Resolve various `FIXME(const_generics)` comments. - [ ] Resolve concerns with canonicalisation / lazy normalisation. - [ ] Investigate handling of const parameters in patterns. - [ ] Add more tests. - [ ] Implement defaults for const parameters (`FIXME(const_generics_defaults)`). - [ ] Fix other [A-const-generics issues](https://github.com/rust-lang/rust/labels/A-const-generics). - [ ] Audit uses of `has_infer_types`. - [x] Forbid complex expressions for const arguments involving parameters (for now), e.g. `{X * 2}`. - [ ] Audit diagnostics (e.g. https://github.com/rust-lang/rust/pull/76401#discussion_r484819320).

On another note, your function is unsafe, so it should be marked it as such.

DanielKeep · January 15, 2019, 5:32am

This isn't just unsafe, it's catastrophically unsafe. You could use this to do write_to_type::<Box<i32>>(&some_file) to construct an owned pointer to an arbitrary memory location. Or write_to_type::<fn()>(&some_file) to execute arbitrary memory.

Frankly, this function probably shouldn't exist. Even marked as unsafe, it'd be way too easy to misuse. If you're going to do something like this, I would strongly recommend using an unsafe trait to constrain exactly which types you can use this with:

pub unsafe trait FromBytes {}
unsafe impl FromBytes for u32 {}

pub fn write_to_type<T: FromBytes>(f: &mut std::fs::File) -> T {
    // ...
}

This way, you can individually enable the trait for the small set of types for which it is actually safe to use... which is probably limited to the built-in integer and float types, and composites of those which do not contain any padding.

But, really, you should probably just use serde plus bincode (for automated [de]serialisation) or byteorder (for manual [de]serialisation).

Blastula · January 15, 2019, 5:42am

Thanks for the reply to both KrishnaSannasi and DanielKeep. I appreciate your concern in the safety of code like this. Like I mentioned above I'm just fiddling around, really just trying to learn the language and get a grasp of things. I am aware of byteorder but really am just figuring out ways of messing with memory.

I would plan on constraining this type of code to very specific types.

I am certainly still open to more thoughts and opinions.

Blastula · January 17, 2019, 4:36am

Sorry to bring this up again, but I'm still confused. Fiddling around a bit more I have the following code, and complaint from the compiler. How is it possible a primitive type can have a varying size?

RustyYato · January 17, 2019, 5:00am

You used u32 as a type parameter. This shadows the u32 type, so whenever it sees u32 in the function, it thinks that it is a generic parameter.

I think you meant to write

pub trait FromBytes {
    fn write_to_type(f: &mut std::fs::File) -> Self;
}

impl FromBytes for u32 {
    fn write_to_type(f: &mut std::fs::File) -> u32 {
        let mut bytes = [0u8; 4];
        let bytes_read = f.read_exact(&mut bytes);
        unsafe { std::mem::transmute::<[u8; 4], u32>(bytes) }
    }
}

Note that I got rid of the generic type and used the Self alias. Self is a way to talk about the implementing type when writing a trait.

next time, please post code inside a formatted block, like so

```rust
// your code here
```

Blastula · January 17, 2019, 5:08am

I see that. Makes sense. Thanks for taking a few moments to answer my question. You have been great Krishna.

Blastula · January 17, 2019, 11:19pm

Funny enough. Announcing Rust 1.32.0 just gave me what I was trying to do painfully for free. LOL at me.

Topic		Replies	Views
Bug with `Sized` with generic type? help	3	368	July 12, 2020
Incorrect error msg regarding const fn generic usage	6	535	January 12, 2023
Having trouble defining a generic type function	6	370	January 1, 2023
Converting from generic unsized parameter to trait object help	3	487	May 29, 2022
`T` cannot be known at compilation time with `Sized` bound help	3	1109	June 19, 2020

Generic function using variable of unknown size

Related Topics