(Why) is this safe rust?

beddy · July 8, 2015, 8:45am

Hi,

I started to use the ffi function of rust and got confused. This function is callable by c and works as expected. My question is: What is the lifetime of the return value?

extern crate libc;
use std::ffi::CString;

#[no_mangle]
pub extern fn call_me_from_c() ->* const libc::c_char
{
   let thecstring = CString::new("Hello, world!").unwrap();
   thecstring.as_ptr()
}

It is totally clear that C does not know of rusts lifetimes so I assume I need to free the value from C. But If this is so why is this code safe? I thought I need to mark code unsafe if I avoid rusts lifetime system.

gkoz · July 8, 2015, 9:42am

The lifetime of the pointer is the function body because thecstring gets destroyed at the end of it, so the function returns an invalid pointer. This is similar to the following which the compiler knows is wrong:

fn foo() -> &str {
    let s = String::new();
    let r: &str = &*s;
    r
}

It was argued that as_ptr functions don't need to be marked unsafe because you can't exploit any unsafety of the raw pointer in safe code. Your case seems like a counterexample to that.

Just like the example above can be fixed by making it return an owned String, you can get an owned pointer with unstable CString::into_ptr. That pointer needs to be returned back to Rust after it's not needed and freed by converting it back into CString with from_ptr.

BurntSushi · July 8, 2015, 11:33am

Can you explain a bit more? The code in the OP seems perfectly safe. A consumer of that function gets a raw pointer, and the only way to dereference that raw pointer is by using unsafe. But the mere presence of an invalid pointer isn't in and of itself unsafe.

BurntSushi · July 8, 2015, 11:49am

That is, IMO, a good intuition. But to answer your question, we need to be a bit more precise. I think the thing you're looking for are the "unsafe superpowers". Namely, unsafe permits you to:

Access or update a static mutable variable.

Dereference a raw pointer.

Call unsafe functions. This is the most powerful ability.

Notice that creating a raw pointer is not one of unsafe's powers. In fact, you can do it in safe code just like your example shows. It is only the dereferencing of a raw pointer that is unsafe. If your program that includes your example function never invokes unsafe, then it must also never dereference the pointer returned by call_me_from_c (assuming you don't pass it to some other library that does). Therefore, even if the returned pointer is dangling, you never actually observe unsafe behavior!

tomaka · July 8, 2015, 11:54am

C code calling Rust code can compared to unsafe Rust code calling safe Rust code.
Just because the safe Rust code is used by unsafe code doesn't mean that it must be unsafe itself.

In your example, the call_me_from_c function does absolutely nothing dangerous.

beddy · July 8, 2015, 1:09pm

ok,

I think I understand this.
Basically:

creating a raw pointer is not unsafe
dereferencing a raw pointer is unsafe
my function just creates an unsafe pointer but does not dereference it

-> But if I call this function from C and dereference the pointer this will result in undefined behaviour (?)

If I want to return a valid pointer I would use forget?

#[no_mangle]
pub extern fn call_me_from_c() ->* const libc::c_char {
    let thecstring = CString::new("Hello, world!").unwrap().as_ptr();
    forget(thecstring);
    thecstring
}

gkoz · July 8, 2015, 1:19pm

[quote="BurntSushi, post:3, topic:2042"]
Can you explain a bit more? The code in the OP seems perfectly safe. A consumer of that function gets a raw pointer, and the only way to dereference that raw pointer is by using unsafe. But the mere presence of an invalid pointer isn't in and of itself unsafe.
[/quote]Since the consumer of that extern fn is most likely not Rust (and Rust couldn't call it without unsafe), I guess we could just say: the code on the other side is unsafe by definition so anything goes. But the ability to pass a dangling pointer across the language (library) boundary so easily without a single unsafe seems too foot-gunny to me.

bluss · July 8, 2015, 1:23pm

Dereferencing the raw pointer is memory unsafe. Calling the function is not. The code that is dereferencing the pointer is at fault, which sounds like it happens in your C program. It would not be allowed in safe Rust.

The correct way may be to use CString::into_ptr, which is unfortunately quite new. I say maybe because how to handle resource ownership across ffi boundaries like this requires some careful choices, not sure what's the best.

gkoz · July 8, 2015, 1:24pm

You need into_ptr:

#[no_mangle]
pub extern fn call_me_from_c() -> *const libc::c_char {
    let thecstring = CString::new("Hello, world!").unwrap();
    thecstring.into_ptr()
}

But either way this is a memory leak unless you take care of cleaning up somehow.

However as @bluss said, in some circumstances you may actually not want to return an owned pointer, depending on the actual needs and choices.

gkoz · July 8, 2015, 2:17pm

[quote="bluss, post:8, topic:2042"]
The code that is dereferencing the pointer is at fault
[/quote]I find it difficult to accept this unless call_me_from_c is expected to return an invalid pointer.

bluss · July 8, 2015, 2:27pm

It is how Rust works internally. We can create arbitrary raw pointers in safe Rust but only use raw pointers (offset or deref, or pass to ffi) by using unsafe blocks. We've simply decided that raw pointers have no guarantee of being vaild pointers.

In Rust we have a ways to express that a pointer is always valid: &T, &mut T, Box<T>, Rc<T> and so on. Unfortunately we can't use these across the ffi boundary. We also can't enforce Rust's rules in programs that are not written in Rust..

oli_obk · July 8, 2015, 2:50pm

A raw pointer is just a strong typedef of a usize. There are no guarantees about the value of a number. If the function's doc says it returns a number that's actually the address of a valid object, then it's up to the implementor of the function to make it so. The implementor could just as well return 42 as *const libc::c_char, but he'd be breaking his own contract. As far as Rust is concerned, the function's contract says it returns a number between 0 and 2^32 or 2^64 depending on your platform.

Topic		Replies	Views
You should stop telling people that safe rust is always safe help	9	2932	June 6, 2016
Missing lifetime error? help	6	179	August 25, 2024
C function parameters, pointers and unsafe	2	832	August 8, 2017
Converting *const c_char to &str help	6	14090	February 21, 2018
Lifetimed raw pointers? help	42	2496	April 11, 2022

(Why) is this safe rust?

Related topics