Catching panic through C code, how bad is it?

kornel · July 15, 2017, 11:05am

The documentation for catch_panic says:

It is currently undefined behavior to unwind from Rust code into foreign code

but unfortunately that's exactly what I wanted to do. libjpeg expects its error handler callback to never return, so I make it panic on error via something like this:

catch_panic(|| ffi::c_function(panicking_rust_callback));

It appears to work fine on x86_64. How big risk is it, if:

I'm mixing with only plain C, not C++
setjmp is not used
catch_panic + panic are always called on the same thread, same stack (there are no async callbacks).

?

edit: extern "C-unwind" makes it possible

Dushistov · July 15, 2017, 11:26am

Side note. May be use another thread and/or global variable to catch such errors?

kornel · July 15, 2017, 12:15pm

I already do, but this is not a solution. Design of libjpeg forces error handlers to either unwind the stack or abort the program. I don't want to abort the program, so stack unwinding is the only other option I have. It's because libjpeg has lots of code like:

if (!foo) {
   error_callback(); 
}
foo[0]=1;

so if my callback ever returns, the program will crash or do unsafe things.

derekdreery · July 15, 2017, 12:58pm

Sounds like it's time for a new jpeg library.

kornel · July 15, 2017, 1:35pm

That is an unhelpful answer.

I am in fact designing a new Rust JPEG library, but to get something working first I want to leverage a few of libjpeg-turbo functions, since Rust doesn't have SIMD support yet.

Still, none of that is relevant to the question of what are the consequences of unwinding through C stack frames in a controlled situation.

Michael-F-Bryan · July 15, 2017, 2:22pm

It's a really unsatisfying answer, but your best bet will probably be to print a really descriptive error message then abort. I'm pretty sure libbacktrace can also be used to generate backtraces for any ELF executable, so that may be useful during development.

The big issue is that unwinding across the FFI boundary is UB and even if it appears to work on your machine, you're betting that the behaviour will always be the same. It's kinda like how you can return a pointer to a local variable in C and still read the variable just fine even though that stack frame has been popped. Sometimes it works, but other times you'll get garbage or leave your application in an indeterminate state (or demons could fly out of your nose).

kornel · July 15, 2017, 2:38pm

I don't doubt it's UB in general, e.g. if the FFI language has non-standard stack. And I get it would be really bad for a foreign language with destructors (maybe even if the stack goes in and out of FFI multiple times).

But what can go wrong in case of C, when the stack is Rust -> C -> Rust and C is compiled with about the same version of LLVM?

Michael-F-Bryan · July 15, 2017, 4:44pm

I'm not actually sure to be honest. I can't say I've ever tried something like that, mainly because in all the FFI resources out there say it's really bad (even across the C/C++ boundary with exception safety), and I once segfaulted when a Rust function I was calling from Python panicked.

You might want to write a dummy Rust program which calls some C which calls into Rust again and then step through the unwinding with gdb to see what actually happens. Also, is the stack layout for Rust and C guaranteed to be the same on x86_64?

Sorry I can't help you much more than that! Someone on the compiler team (like @nikomatsakis or @nrc) might be more knowledgeable than me though. Keep us informed on how your experimenting goes, I'm really curious to find out what actually happens when you unwind across the FFI boundary!

EDIT: I got curious and found an issue on the Julia language repo about unwinding from C++ into Julia. It may help answer some of your questions.

stebalien · July 15, 2017, 5:35pm

This is probably because libjpeg may now be in an inconsistent state and can't continue. So, if you do do this, you had better unload and reload libjpeg (as a dynamic library).

For the actual "unwinding", you can implement this yourself with a setjmp and longjmp in C.

jimuazu · July 15, 2017, 5:39pm

Yes, I was going to say the same -- write the error handler in C, so it is a stack of Rust->C->C->C unwinding only within the C part.

vadimcn · July 15, 2017, 5:57pm

On some targets it will work fine; on others your program will abort because Rust unwinder will not be able to locate stack unwind info for C code. As an example, I'd expect that x86_64-pc-windows-msvc is going to be among the former, while i686-pc-windows-gnu will be one of the latters.

I think that the safest option in your case (for some definition of "safe") might be C's own setjmp/longjmp. Preferably without crossing the language boundary (i.e. create C wrappers around libjpeg functions that do the setjmp and return an error code to Rust caller), though calling setjmp from Rust code might work too. Of course, this will leak all sorts of resources, but that's a given for this sort of stuff.

fweimer · July 16, 2017, 6:47pm

It depends on the distribution. Some compile all their C code with support for unwinding, others do not. It might be safer to use a small C wrapper with setjmp and longjmp, but the overhead might be prohibitive.

Another alternative would involve bunding libjpeg and recompile it with -fexceptions. If libjpeg indeed assumes that the error handler callback never returns, it should be able to cope with the stack unwinding without resource leaks.

steveklabnik · July 17, 2017, 4:34pm

Because we make no guarantee at all here. UB is UB, even if "it works" in some circumstances. Doing this is inherently playing with fire.

Dushistov · July 17, 2017, 8:12pm

Indeed, great idea. Why not compile C code of libjpeg-turbo with C++ compiler, and throw exception inside callback, and then catch it in c++ wrapper of corresponding function and return error code to rust?

kornel · July 17, 2017, 8:27pm

Oh, that's an interesting hack. Thank you for the suggstion.

notriddle · July 17, 2017, 11:15pm

Can you afford to call exit(1)?

cuviper · July 17, 2017, 11:25pm

Beware that if you catch an exception or panic, it may not be safe to call into that FFI library anymore. It usually takes some effort to maintain exception safety, which is why Rust has UnwindSafe.

kornel · July 17, 2017, 11:34pm

libjpeg is widely used with setjmp, and uses its own memory pool, so unwinding it is not a problem.

Topic		Replies	Views
What is the actual current (1.68ish) behavior of unwinding into C/C++? help	6	892	June 21, 2023
Passing callbacks to C: panic! help	26	1924	August 29, 2023
C-unwind and safety help	12	516	December 2, 2025
Handling panic in rust lib	6	1307	April 26, 2023
Is jumping over Rust stack frames UB in a context of FFI? help	4	966	January 25, 2020

Catching panic through C code, how bad is it?

Related topics