Idiomatic way for waiting for multiple children to exit on Linux?

mjguzik · April 30, 2024, 10:48am

Consider a program like xargs which may end up with multiple children at the same time and spawn more as time passes (see the -P option).

For educational purposes I'm trying to write an equivalent in Rust, but I got stumped here.

Googling around I found other toy implementations in Rust, all of which either don't support multiple processes or resort to multithreading to do it, which I consider a no-go for this particular problem.

If this was C I would just vfork to create children. The main loop would read(2), but also check for a flag from the SIGCHLD signal handler, indicating waitpid is needed. There is basically nothing to it.
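For reference, the flag-setting half of that C pattern can be sketched in Rust using only the standard library. This is a minimal sketch, not an established idiom. It declares the C library's `signal` by hand, and the `SIGCHLD` value of 17 is an assumption that holds on Linux for x86_64 and AArch64 but not on every architecture. The names `install_sigchld_flag` and `take_child_exited` are made up for illustration.

```rust
use std::io;
use std::os::raw::c_int;
use std::sync::atomic::{AtomicBool, Ordering};

// Assumption: SIGCHLD is 17 on Linux for x86, x86_64, ARM and AArch64.
const SIGCHLD: c_int = 17;

extern "C" {
    // signal(2) from the C library. The previous handler comes back as an
    // opaque address, and SIG_ERR is the all-ones pointer value.
    fn signal(signum: c_int, handler: extern "C" fn(c_int)) -> usize;
}

/// Set by the handler, cleared by the main loop.
static CHILD_EXITED: AtomicBool = AtomicBool::new(false);

extern "C" fn on_sigchld(_signum: c_int) {
    // Only an atomic store happens here, which is async-signal-safe.
    CHILD_EXITED.store(true, Ordering::SeqCst);
}

/// Hypothetical helper: installs the SIGCHLD handler.
fn install_sigchld_flag() -> io::Result<()> {
    let previous = unsafe { signal(SIGCHLD, on_sigchld) };
    if previous == usize::MAX {
        return Err(io::Error::last_os_error());
    }
    Ok(())
}

/// Hypothetical helper: returns true once per batch of SIGCHLD deliveries and
/// resets the flag, so the caller knows it is time to call waitpid.
fn take_child_exited() -> bool {
    CHILD_EXITED.swap(false, Ordering::SeqCst)
}
```

The main loop would call `take_child_exited()` on each iteration and reap children when it returns true. With glibc, `signal` installs the handler with restart semantics, so a blocking read is resumed rather than interrupted; a real loop would likely need `sigaction` without `SA_RESTART`, or a poll-based design.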

In Rust I found one is expected to use std::process::Command (which btw uses fork instead of vfork which is kind of a loss). Docs only show how to explicitly deal with that specific child with .wait, there is no explanation that I can see how to handle more processes.

I found a signal-hook crate, so I can find out I got SIGCHLD. But if I waitpid based on it and reap the child, I'm going to do it from under the respective Child object. If I don't waitpid I don't know what PID it is so I don't know which object to .wait on anyway. (I could waitpid with WNOWAIT as a hack, but there is no way this is how it should be done)

Rust code which I did find cops out of the problem by having a dedicated thread for each child, which is a waste of resources and can actively hurt real-world usage if one was to implement things like that in the real xargs.

So what's the expected way to structure this in Rust?

KillTheMule · April 30, 2024, 10:51am

Are you maybe looking for spawn?

mjguzik · April 30, 2024, 10:55am

I am using .spawn, and as described in my question I don't see how to handle multiple children at the same time.

KillTheMule · April 30, 2024, 11:01am

Ok, so I might have misunderstood, and I might also be out of my depth here, but let me just ask: you can spawn multiple processes, so what do you want to do with them that you can't see how to do? Do you want to check their status, their PID, or...?

mjguzik · April 30, 2024, 11:06am

See xargs -P.

The program would have an event loop waiting to parse more arguments coming in from stdin along with spawning and reaping children as needed. For example with, say, -P 20 I might have 16 children, parse some extra input from stdin and find that some of them exited. How do I even find out which Child objects are affected?

KillTheMule · April 30, 2024, 11:15am

Spawn gives you a Child object, how about sticking those in a Vec<Child>, and then loop over that, call try_wait on them and remove if necessary?
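A minimal sketch of that suggestion, using only `std::process`. The function names are hypothetical, and the demo children are just `sh -c "exit N"`, so `sh` is assumed to be on the PATH.

```rust
use std::io;
use std::process::{Child, Command, ExitStatus};

/// Polls every tracked child once with `try_wait`, removes the ones that have
/// exited, and returns their PIDs and exit statuses.
fn reap_finished(children: &mut Vec<Child>) -> io::Result<Vec<(u32, ExitStatus)>> {
    let mut finished = Vec::new();
    let mut i = 0;
    while i < children.len() {
        match children[i].try_wait()? {
            Some(status) => {
                // swap_remove moves the last child into slot i, so i is not
                // advanced and the moved child is checked next.
                let child = children.swap_remove(i);
                finished.push((child.id(), status));
            }
            None => i += 1,
        }
    }
    Ok(finished)
}

/// Hypothetical demo helper: spawns `count` children that exit with codes 0..count.
fn spawn_demo_children(count: u32) -> io::Result<Vec<Child>> {
    (0..count)
        .map(|n| Command::new("sh").arg("-c").arg(format!("exit {}", n)).spawn())
        .collect()
}
```

Each pass costs one `try_wait` call per live child, whether or not that child has exited.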

farnz · April 30, 2024, 11:16am

This is almost certainly sub-optimal, but will work without dragging in a runtime like Tokio.

You could keep a BTreeMap<u32, Child> to track your children, where you get the u32 from Child::id(). Then, when you get SIGCHLD, you can use BTreeMap::remove() to remove the referenced Child from the map, and call Child::try_wait; if it's not finished, put it back in the BTreeMap for a later iteration to find.
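Roughly, that bookkeeping could look like the sketch below. The `Children` type and its methods are hypothetical names. How the PID is learned (from the signal, or otherwise) is left to the caller.

```rust
use std::collections::BTreeMap;
use std::io;
use std::process::{Child, Command, ExitStatus};

/// Running children keyed by PID, as returned by `Child::id()`.
struct Children {
    by_pid: BTreeMap<u32, Child>,
}

impl Children {
    fn new() -> Self {
        Children { by_pid: BTreeMap::new() }
    }

    /// Spawns the command and starts tracking the child under its PID.
    fn spawn(&mut self, cmd: &mut Command) -> io::Result<u32> {
        let child = cmd.spawn()?;
        let pid = child.id();
        self.by_pid.insert(pid, child);
        Ok(pid)
    }

    /// Checks the child with the given PID. Returns `Ok(Some(status))` if it
    /// has exited (and stops tracking it), or `Ok(None)` if the PID is unknown
    /// or the child is still running.
    fn reap(&mut self, pid: u32) -> io::Result<Option<ExitStatus>> {
        let mut child = match self.by_pid.remove(&pid) {
            Some(child) => child,
            None => return Ok(None),
        };
        match child.try_wait() {
            Ok(Some(status)) => Ok(Some(status)),
            Ok(None) => {
                // Not finished yet: put it back for a later iteration.
                self.by_pid.insert(pid, child);
                Ok(None)
            }
            Err(err) => {
                // Keep tracking the child so a later attempt can retry.
                self.by_pid.insert(pid, child);
                Err(err)
            }
        }
    }
}
```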

mjguzik · April 30, 2024, 11:26am

That would be a waitpid call for every process, every time. That's even slower than the hack I described where I waitpid(WNOWAIT) and then I know which child explicitly to wait on.

mjguzik · April 30, 2024, 11:27am

As I noted earlier, as is I don't know which PID exited, but I could hack my way around that with waitpid(WNOWAIT).

However, I am not looking for hacky ways out, I am asking how to sort this out in a clean manner. Ultimately this is a rather basic problem.

2e71828 · April 30, 2024, 11:55am

I've not worked in this space at all, but it seems like you can get a select/epoll compatible file descriptor that will become "readable" when the process exits.

As this is an experimental feature I'm unfamiliar with, it will require nightly Rust, and I can't vouch for its completeness in any way.

farnz · April 30, 2024, 12:02pm

This is not a hacky solution - it's tracking the Child objects by PID (which may be sub-optimal), and using the knowledge you have (from siginfo_t::si_pid() or from waitpid(WNOWAIT), both work for this) to wait only on the PIDs that you expect to see information about.

If you want a solution that doesn't require looping over all the Child objects, and doesn't require you to track the Child objects by PID, then you will need to go "underneath" the Child abstraction, and do things the way you'd do them in C - drop the Child objects once the process is started (which leaves the process running in the background), and wait on the PIDs when you're notified that there's a reason to wait.
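A minimal sketch of that second option: spawn through `Command`, keep only the PID, and reap with `waitpid(-1, ..., WNOHANG)` when notified. It declares `waitpid` by hand instead of using a crate, and assumes the Linux value 1 for `WNOHANG`. The helper names are hypothetical.

```rust
use std::collections::HashSet;
use std::io;
use std::os::raw::c_int;
use std::os::unix::process::ExitStatusExt;
use std::process::{Command, ExitStatus};

// Assumption: WNOHANG is 1 on Linux.
const WNOHANG: c_int = 1;

extern "C" {
    // waitpid(2) from the C library.
    fn waitpid(pid: c_int, status: *mut c_int, options: c_int) -> c_int;
}

/// Spawns the command, records its PID, and drops the `Child`.
/// Dropping a `Child` neither kills nor reaps the process.
fn spawn_detached(cmd: &mut Command, running: &mut HashSet<c_int>) -> io::Result<c_int> {
    let child = cmd.spawn()?;
    let pid = child.id() as c_int;
    running.insert(pid);
    drop(child);
    Ok(pid)
}

/// Reaps every child that has already exited, without blocking.
fn reap_exited(running: &mut HashSet<c_int>) -> Vec<(c_int, ExitStatus)> {
    let mut reaped = Vec::new();
    loop {
        let mut status: c_int = 0;
        let pid = unsafe { waitpid(-1, &mut status, WNOHANG) };
        if pid <= 0 {
            // 0: remaining children are still running.
            // -1: no children left, or another error.
            break;
        }
        running.remove(&pid);
        reaped.push((pid, ExitStatus::from_raw(status)));
    }
    reaped
}
```

Because `waitpid(-1, ...)` reaps any child of the process, this only works if nothing else in the program is holding a `Child` it expects to wait on.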

mjguzik · April 30, 2024, 12:10pm

Thanks for the hint.

Indeed having a fd for every child and plugging that into an event loop would do the trick very cleanly, it is a bummer the feature is in Nightly only for the moment. With the assumption this is going to get fully beaten into shape and available in regular builds going forward, I would say this is the way to go.
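Until the std feature stabilizes, the same idea can be sketched on stable Rust by calling `pidfd_open` directly. This is only an illustration under several assumptions: Linux 5.3 or newer, syscall number 434 for `pidfd_open`, hand-written declarations of the C library's `syscall` and `poll`, and the value 1 for `POLLIN`. The helper names are hypothetical.

```rust
use std::io;
use std::os::raw::{c_int, c_long, c_short, c_ulong};
use std::os::unix::io::{AsRawFd, FromRawFd, OwnedFd};
use std::process::{Child, ExitStatus};

// Assumptions: Linux syscall number for pidfd_open and the POLLIN bit.
const SYS_PIDFD_OPEN: c_long = 434;
const POLLIN: c_short = 0x001;

/// Mirrors `struct pollfd` from <poll.h>.
#[repr(C)]
struct PollFd {
    fd: c_int,
    events: c_short,
    revents: c_short,
}

extern "C" {
    fn syscall(number: c_long, ...) -> c_long;
    fn poll(fds: *mut PollFd, nfds: c_ulong, timeout: c_int) -> c_int;
}

/// Opens a pidfd for a child that has not been reaped yet, so its PID cannot
/// have been reused by another process.
fn pidfd_for(child: &Child) -> io::Result<OwnedFd> {
    let fd = unsafe { syscall(SYS_PIDFD_OPEN, child.id() as c_long, 0 as c_long) };
    if fd < 0 {
        return Err(io::Error::last_os_error());
    }
    Ok(unsafe { OwnedFd::from_raw_fd(fd as c_int) })
}

/// Blocks until one tracked child exits, reaps it, and returns its PID and status.
fn wait_any(children: &mut Vec<(Child, OwnedFd)>) -> io::Result<(u32, ExitStatus)> {
    if children.is_empty() {
        return Err(io::Error::new(io::ErrorKind::InvalidInput, "no children to wait for"));
    }
    let mut fds: Vec<PollFd> = children
        .iter()
        .map(|(_, fd)| PollFd { fd: fd.as_raw_fd(), events: POLLIN, revents: 0 })
        .collect();
    loop {
        let ready = unsafe { poll(fds.as_mut_ptr(), fds.len() as c_ulong, -1) };
        if ready < 0 {
            let err = io::Error::last_os_error();
            if err.kind() == io::ErrorKind::Interrupted {
                continue; // interrupted by a signal, poll again
            }
            return Err(err);
        }
        if let Some(i) = fds.iter().position(|p| p.revents != 0) {
            // A pidfd becomes readable once its process has exited.
            let (mut child, _pidfd) = children.swap_remove(i);
            let status = child.wait()?;
            return Ok((child.id(), status));
        }
    }
}
```

In an xargs-like loop the stdin descriptor would presumably go into the same `poll` call, so input and child exits are handled from one place.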

mjguzik · April 30, 2024, 12:12pm

Well the waitpid(WNOWAIT) thing is just another syscall trip which should not be necessary. I don't see how to extract the info from the signal on Rust, the one crate I found which does not spawn threads behind my back only allows to set a flag.

Anyhow, The Right Way(tm) (anyhow as far as I'm concerned) was linked by 2e71828 above.

farnz · April 30, 2024, 12:34pm

Using the signal-hook crate, you'd use an Exfiltrator that gets the origin of the signal. Then you have the PID, and you can look it up in your data structure of choice, or just wait for the child directly if you've already dropped the Child.

kpreid · April 30, 2024, 2:54pm

You could use Tokio, which does its own syscalls and so doesn't have to wait for std stabilization of anything. (I checked and it seems that its implementation uses pidfd on Linux and polling all children on other Unix, which seems reasonable.)

quinedot · April 30, 2024, 6:11pm

You could presumably use mio with the file descriptor approach (but this is also an idea I haven't implemented myself and thus can't vouch for).

mjguzik · May 1, 2024, 2:14am

I was not aware of the feature.

Poking around I don't think it does the job though. To my reading there is only space for one siginfo per signal number, meaning if I get 2 SIGCHLDs before I manage to read it, I'm going to lose one of them.

mjguzik · May 1, 2024, 2:21am

I would argue there should be an easy way to do this without pulling in any big crates. That aside, tokio's own description of their handling of the matter basically discourages it. They make a fishy claim that pid fds are not pollable; I'm going to have to look into it.

greppety-grep and I found that poll is supported for pidfd, so looks like their commentary is just stale(?). Interested parties can find the implementation here: linux/fs/pidfs.c at master · torvalds/linux · GitHub

I'm going to try to use it later.

kpreid · May 1, 2024, 3:26am

I think the comment is stale in that it describes only the non-Linux implementation which doesn't use pid fds, not the Linux-only implementation which does.

elichai2 · May 6, 2024, 4:35am

I actually implemented exactly this in a small tool I wrote for personal use.
It seems to work well, but I can't promise it doesn't have UB or that the Windows impl works correctly.

Feel free to read the code here. If I remember correctly, I created a process group, added every subprocess to it, and then called waitpid on the group's ID (the pgid).

github.com/elichai/code-clean/blob/main/src/main.rs#L340

          impl RegisterChild for Command {
              #[inline(always)]
              fn register_child(&mut self) -> &mut Self {
                  self.process_group(get_pgid())
              }
          }
          
          /// Returns the exit status and the index of the child process that exited.
          #[inline(always)]
          pub(super) fn wait_on_children(processes: &[ChildProcess]) -> Result<(ExitStatus, usize)> {
              let mut status: c_int = 0;
              let pid = match unsafe { waitpid(-get_pgid(), &mut status, 0) } {
                  -1 => return Err(std::io::Error::last_os_error()),
                  pid if pid.is_positive() => pid,
                  _ => abort(),
              };
              let pid_u32 = u32::try_from(pid).expect("pid should fit in u32");
              let index = processes.iter().position(|p| p.child.id() == pid_u32).unwrap();
              Ok((ExitStatus::from_raw(status), index))
          }
