Removing the libc calls

anon80458984 · July 14, 2021, 12:04am

pub struct Jit64_Memory {
    addr: *mut u8,
    size: usize,
    offset: usize,}

impl Jit64_Memory {
    pub fn new(num_pages: usize) -> Jit64_Memory {
        let size: usize = num_pages * G.page_size;
        let addr: *mut u8;
        unsafe {
            let mut raw_addr: *mut libc::c_void = std::mem::uninitialized();
            libc::posix_memalign(&mut raw_addr, G.page_size, size); // allocate aligned to page size
            libc::mprotect(raw_addr, size, libc::PROT_READ | libc::PROT_WRITE); // read write
            libc::memset(raw_addr, 0xc3, size); // fill every byte with 0xc3 (x86 `ret`)
            addr = std::mem::transmute(raw_addr);}
        Jit64_Memory { addr, size, offset: 0}}}

G.page_size is 4096. Context: we allocate some pages, write raw x86_64 machine code into them, then later make the pages executable and run the code.

Question: is there a way to do this without the libc dependency? I am on Linux x86_64 and do not care about other platforms.

jschievink · July 14, 2021, 12:20am

Sure, you can just inline the PROT_ constants and function definitions from the libc crate.

Also note that your usage of mem::uninitialized() is UB, you might want to use MaybeUninit instead.

anon80458984 · July 14, 2021, 12:49am

I'm not familiar with this. Is what you are suggesting equivalent to: (1) libc has a bunch of extern "C" defs, (2) copy over the ones you are using?

bjorn3 · July 14, 2021, 6:22am

posix_memalign can be replaced with std::alloc::Global.alloc. memset can use slice::fill I think.

Yes. Note that even on Linux, glibc and musl libc may use different definitions for the functions. You should also probably add a compile_error!() if the target doesn't match what you expect.
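A minimal sketch of what "copying the definitions over" might look like, assuming Linux with glibc or musl. The names PAGE_SIZE, alloc_rw_pages and free_pages are made up for illustration (PAGE_SIZE stands in for G.page_size). The `unsafe extern` spelling needs Rust 1.82 or newer; older toolchains take a plain `extern "C"` block instead.

```rust
use std::io;
use std::os::raw::{c_int, c_void};
use std::ptr;

// Refuse to build on targets the hand-copied values were not checked against.
// Tighten with `target_arch = "x86_64"` if that is the only target you expect.
#[cfg(not(target_os = "linux"))]
compile_error!("these hand-written bindings were only checked against Linux");

pub const PAGE_SIZE: usize = 4096; // assumption: stands in for G.page_size
pub const PROT_READ: c_int = 0x1;
pub const PROT_WRITE: c_int = 0x2;

// Declarations copied by hand; `size_t` is written as `usize`.
unsafe extern "C" {
    fn posix_memalign(memptr: *mut *mut c_void, alignment: usize, size: usize) -> c_int;
    fn mprotect(addr: *mut c_void, len: usize, prot: c_int) -> c_int;
    fn free(ptr: *mut c_void);
}

/// Allocates `num_pages` page-aligned, read-write pages filled with 0xc3.
pub fn alloc_rw_pages(num_pages: usize) -> io::Result<*mut u8> {
    let size = num_pages
        .checked_mul(PAGE_SIZE)
        .ok_or_else(|| io::Error::new(io::ErrorKind::InvalidInput, "size overflow"))?;
    let mut raw: *mut c_void = ptr::null_mut();
    // posix_memalign returns the error number directly instead of setting errno.
    let rc = unsafe { posix_memalign(&mut raw, PAGE_SIZE, size) };
    if rc != 0 {
        return Err(io::Error::from_raw_os_error(rc));
    }
    // mprotect returns -1 and sets errno on failure.
    if unsafe { mprotect(raw, size, PROT_READ | PROT_WRITE) } != 0 {
        let err = io::Error::last_os_error();
        unsafe { free(raw) };
        return Err(err);
    }
    // Equivalent of memset(raw, 0xc3, size).
    unsafe { ptr::write_bytes(raw as *mut u8, 0xc3, size) };
    Ok(raw as *mut u8)
}

/// Releases memory obtained from `alloc_rw_pages`.
pub unsafe fn free_pages(addr: *mut u8) {
    unsafe { free(addr as *mut c_void) };
}
```

This works without the libc crate because std already links against the system C library on Linux, so the symbols resolve at link time. The cost is that nobody checks the signatures and constants for you.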

anon80458984 · July 14, 2021, 4:11pm

Does this fix the UB?

pub struct Jit64_Memory {
    addr: *mut u8,
    size: usize,
    offset: usize,}

impl Jit64_Memory {
    pub fn new(num_pages: usize) -> Jit64_Memory {
        let size: usize = num_pages * G.page_size;
        let addr: *mut u8;
        unsafe {
            let mut raw_addr: MaybeUninit<*mut libc::c_void> = std::mem::uninitialized();
            libc::posix_memalign(raw_addr.as_mut_ptr(), G.page_size, size); // allocate aligned to page size
            libc::mprotect(raw_addr.assume_init(), size, libc::PROT_READ | libc::PROT_WRITE); // read write
            libc::memset(raw_addr.assume_init(), 0xc3, size); // fill every byte with 0xc3 (x86 `ret`)
            addr = std::mem::transmute(raw_addr);}
        Jit64_Memory { addr, size, offset: 0}}}

(not sure if the usage of assume_init and as_mut_ptr are correct)

Given that libc appears to just be wrappers, is there any real benefit to removing libc as a dependency? Doing so seems to just duplicate the work of writing bindings and be error-prone.

slamb · July 14, 2021, 4:26pm

Rust's allocators don't expose mprotect, though, and a JIT needs that.

I was about to ask you that! I don't think libc is costing you much, just a little bit of build time. And if your program as a whole uses many libraries, it's quite likely one of them will pull in libc anyway. So unless you have strong evidence to the contrary, I'd say it's not worth avoiding libc.
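For reference, the std-only part of the earlier suggestion (replacing posix_memalign and memset) might look roughly like the sketch below. PageBuf and PAGE_SIZE are hypothetical names, and this deliberately stops short of mprotect, which still needs a binding from somewhere. It uses alloc_zeroed rather than alloc so that the bytes are initialized before a slice is formed over them.

```rust
use std::alloc::{alloc_zeroed, dealloc, Layout};
use std::slice;

pub const PAGE_SIZE: usize = 4096; // assumption: stands in for G.page_size

/// Hypothetical page-aligned buffer owned through the global allocator.
pub struct PageBuf {
    ptr: *mut u8,
    layout: Layout,
}

impl PageBuf {
    pub fn new(num_pages: usize) -> Option<PageBuf> {
        let size = num_pages.checked_mul(PAGE_SIZE)?;
        if size == 0 {
            return None; // allocating a zero-sized layout is not allowed
        }
        let layout = Layout::from_size_align(size, PAGE_SIZE).ok()?;
        // Zeroed so the memory is initialized before we view it as a slice.
        let ptr = unsafe { alloc_zeroed(layout) };
        if ptr.is_null() {
            return None;
        }
        // slice::fill plays the role of memset(addr, 0xc3, size).
        unsafe { slice::from_raw_parts_mut(ptr, size) }.fill(0xc3);
        Some(PageBuf { ptr, layout })
    }

    pub fn as_ptr(&self) -> *const u8 {
        self.ptr
    }

    pub fn as_slice(&self) -> &[u8] {
        unsafe { slice::from_raw_parts(self.ptr, self.layout.size()) }
    }
}

impl Drop for PageBuf {
    fn drop(&mut self) {
        // Must use the same layout that was passed to the allocator.
        unsafe { dealloc(self.ptr, self.layout) };
    }
}
```

The memory comes back page-aligned and filled, but it is ordinary heap memory: turning it executable later is exactly the step the allocator API does not cover.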

arnaudgolfouse · July 14, 2021, 4:56pm

Nope: from what I remember, std::mem::uninitialized is always insta-UB, no matter the context.

If you wanted to use MaybeUninit, you should write

let mut raw_addr: MaybeUninit<*mut libc::c_void> = MaybeUninit::uninit();
libc::posix_memalign(raw_addr.as_mut_ptr(), G.page_size, size);
// Also needs error handling for `posix_memalign`, etc...
let raw_addr = raw_addr.assume_init();

However, reading the documentation of posix_memalign, I think the initial value of raw_addr does not matter, so a simple

let mut raw_addr: *mut libc::c_void = std::ptr::null_mut();
libc::posix_memalign(&mut raw_addr, ...);

should be good?

bjorn3 · July 14, 2021, 5:05pm

As far as I know mem::uninitialized() is fine when undef is a valid value for the type. MaybeUninit is one of the few types for which undef is a valid value.

arnaudgolfouse · July 14, 2021, 5:06pm

Ohhh, sweet

cole-miller · July 14, 2021, 5:41pm

std::mem::uninitialized is still deprecated though, so even in this case where it'd be sound I'd prefer MaybeUninit::uninit.

anon80458984 · July 14, 2021, 6:14pm

Thanks for everyone's explanations. Looks like "UB awareness" is learned one scar/skeleton at a time.

chrefr · July 14, 2021, 7:47pm

It is the only one.

chrefr · July 14, 2021, 7:53pm

While it is valid, I don't see why you need it. MaybeUninit exists to skip initialization that would be expensive but redundant. Here, you can just initialize the pointer with null.

Also, the usual pattern is to initialize the pointer, then shadow the MaybeUninit with the .assume_init()ed value, so you don't need multiple .assume_init(). For example:

        let addr = unsafe {
            let mut raw_addr = MaybeUninit::uninit();
            libc::posix_memalign(raw_addr.as_mut_ptr(), G.page_size, size); // allocate aligned to page size
            let raw_addr = raw_addr.assume_init();
            libc::mprotect(raw_addr, size, libc::PROT_READ | libc::PROT_WRITE); // read write
            libc::memset(raw_addr, 0xc3, size); // fill every byte with 0xc3 (x86 `ret`)
            raw_addr as *mut u8
        };
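One detail the shadowing pattern depends on: assume_init is only sound once the callee has actually written the value, so the return code has to be checked first. A small sketch of that ordering, using a hypothetical get_value function as a stand-in for a C-style API with an out-parameter (it is not a real libc call):

```rust
use std::mem::MaybeUninit;
use std::os::raw::c_int;

/// Hypothetical stand-in for a C-style function with an out-parameter.
/// Like posix_memalign, it returns 0 on success and an error number otherwise,
/// and it leaves `out` untouched on failure.
unsafe fn get_value(out: *mut u64, should_fail: bool) -> c_int {
    if should_fail {
        return 22; // EINVAL on Linux; `out` is not written
    }
    unsafe { out.write(4096) };
    0
}

pub fn checked_get(should_fail: bool) -> Result<u64, c_int> {
    let mut slot = MaybeUninit::<u64>::uninit();
    let rc = unsafe { get_value(slot.as_mut_ptr(), should_fail) };
    if rc != 0 {
        // The slot may still be uninitialized, so bail out before assume_init.
        return Err(rc);
    }
    // Shadow the MaybeUninit with the initialized value exactly once.
    let slot = unsafe { slot.assume_init() };
    Ok(slot)
}
```

With a null-initialized pointer the same check is still needed, but forgetting it produces a null pointer rather than undefined behavior.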

anon80458984 · July 14, 2021, 7:58pm

I agree that in this particular case, we can just use std::ptr::null_mut(). I have been playing around with MaybeUninit as I'm trying to understand its power and limitations in full.

programmerjake · July 15, 2021, 3:16am

That's not entirely correct, there are other types where undef is a valid value:

union MyType { // basically how `MaybeUninit` is actually defined
    a: (),
    b: &'static str,
}

MaybeUninit::<MyType>::uninit().assume_init(); // valid since MyType::a doesn't need any valid bytes
MaybeUninit::<[MaybeUninit<u8>; 5]>::uninit().assume_init(); // valid since every byte is wrapped in `MaybeUninit`
MaybeUninit::<()>::uninit().assume_init(); // valid since zero-size types have no bytes that could be uninit
// Important exception:
// MaybeUninit::<!>::uninit().assume_init(); // not valid even though `!` is a zero-sized type, it is uninhabited -- it's not valid to ever create a value of an uninhabited type through any means.
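The array case is the one that tends to show up in practice, and the standard library documentation for MaybeUninit uses the same construction. A short sketch, where the function name squares is made up for illustration: create the array of MaybeUninit slots, initialize each slot, and only then convert to plain values.

```rust
use std::mem::MaybeUninit;

/// Builds [0, 1, 4, 9, 16] by initializing an array one element at a time.
pub fn squares() -> [u8; 5] {
    // Sound: the outer array contains only MaybeUninit<u8> elements,
    // and those are allowed to be uninitialized.
    let mut buf: [MaybeUninit<u8>; 5] = unsafe { MaybeUninit::uninit().assume_init() };

    for (i, slot) in buf.iter_mut().enumerate() {
        slot.write((i * i) as u8);
    }

    // Every slot was written in the loop above, so each assume_init is sound.
    buf.map(|slot| unsafe { slot.assume_init() })
}
```

Calling assume_init directly on a `MaybeUninit<[u8; 5]>` before writing the elements would not be sound, because a plain u8 does not permit uninitialized bytes.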

