How can a loop create deadlock on some machines and not locally

sariug · April 21, 2022, 12:55pm

Hello,
I am new to Rust. Sorry for stupid question.
Ive been trying to run a multithreaded code, where the threads wait for some queue having some tasks, and executing that task if there are any.
I have a complex version of the following code:

use std::thread;
pub async fn do_something() {
    println!("Started!");
    loop { // <- If i comment this out, I see 4 Started
        thread::sleep(std::time::Duration::from_secs(1));
    }
}
#[tokio::main]
async fn main() {
    let mut workers = vec![];
    for _ in 0..4 {
        println!("Go!");
        workers.push(tokio::task::spawn(do_something()));
    }
    thread::sleep(std::time::Duration::from_secs(2));
    println!("Finished!");
}

Locally, it worked for every sort of PC. But it fails on places such as Azure pipeline.
I realized that instead of 4 "Started" I see 2 "Started". The playground then gives time-out while devops servers wait first for like 1 hour.
There exists also no problem if my number of threads is 1; therefore this issue happens just with >1 number of threads.

I am sort of lost looking at a really short code. Can anyone maybe see anything?

alice · April 21, 2022, 1:23pm

Check out this blog post:

tl;dr don't use thread::sleep

simonbuchan · April 21, 2022, 10:49pm

The answer to the literal question is that tokio will start a number of real threads based on the number of CPU cores, so you're probably seeing it run on a one or two core machine on Azure, with two allocated threads.

simonbuchan · April 21, 2022, 10:57pm

I do find it interesting that it doesn't hang with one thread though. That implies that tokio is coopting the main thread to run tasks after returning from main, killing the worker threads silently after some time, but gets starved if there's number of worker threads+1 blocking tasks. Tricky!

sariug · April 22, 2022, 6:00am

I also discovered this probably after 4-5 hours writing this. hope this will be useful.

It's really tricky that for the task, it hijacks the main thread!

system · July 21, 2022, 6:01am

This topic was automatically closed 90 days after the last reply. We invite you to open a new topic if you have further questions or comments.

Topic		Replies	Views
Async/await and multi-thread Tokio runtime help	10	195	April 18, 2024
Help needed to debug tokio thread/async model help	4	863	October 19, 2022
How to make tokio spawn execute asynchronously? help	8	852	October 6, 2023
Tokio timeout seems not working as the thread hangs up help	3	975	October 20, 2022
Rust threading guidelines help	5	1317	May 8, 2021

How can a loop create deadlock on some machines and not locally

Related Topics