Async: What Is Blocking? (2022)

Async: What is blocking? – Alice Ryhl

The async/await feature in Rust is implemented using a mechanism known as cooperative scheduling, and this has some important consequences for people who write asynchronous Rust code.

The intended audience of this blog post is new users of async Rust. I will be using the Tokio runtime for the examples, but the points raised here apply to any asynchronous runtime.

If you remember only one thing from this article, this should be it:

Async code should never spend a long time without reaching an .await.

Translations: Chinese

Blocking vs. non-blocking code

The naive way to write an application that works on many things at the same time is to spawn a new thread for every task. If the number of tasks is small, this is a perfectly fine solution, but as the number of tasks becomes large, you will eventually run into problems due to the large number of threads. There are various solutions to this problem in different programming languages, but they all boil down to the same thing: very quickly swap out the currently running task on each thread, such that all of the tasks get an opportunity to run. In Rust, this swapping happens when you .await something.

When writing async Rust, the phrase “blocking the thread” means “preventing the runtime from swapping the current task”. This can be a major issue because it means that other tasks on the same runtime will stop running until the thread is no longer being blocked. To prevent this, we should write code that can be swapped quickly, which you do by never spending a long time away from an .await.

Let's take an example:

    use std::time::Duration;

    #[tokio::main]
    async fn main() {
        println!("Hello World!");

        // No .await here!
        std::thread::sleep(Duration::from_secs(5));

        println!("Five seconds later...");
    }

The above code looks correct, and if you run it, it will appear to work. But it has a fatal flaw: it is blocking the thread. In this case, there are no other tasks, so it's not a problem, but this won't be the case in real programs. To illustrate this point, consider the following example:

    use std::time::Duration;

    async fn sleep_then_print(timer: i32) {
        println!("Start timer {}.", timer);

        // No .await here!
        std::thread::sleep(Duration::from_secs(1));

        println!("Timer {} done.", timer);
    }

    #[tokio::main]
    async fn main() {
        // The join! macro lets you run multiple things concurrently.
        tokio::join!(
            sleep_then_print(1),
            sleep_then_print(2),
            sleep_then_print(3),
        );
    }

    Start timer 1.
    Timer 1 done.
    Start timer 2.
    Timer 2 done.
    Start timer 3.
    Timer 3 done.

The example will take three seconds to run, and the timers will run one after the other with no concurrency whatsoever. The reason is simple: the Tokio runtime was not able to swap one task for another, because such a swap can only happen at an .await. Since there is no .await in sleep_then_print, no swapping can happen while it is running.

However, if we instead use Tokio's sleep function, which uses an .await to sleep, the function will behave correctly:

    use tokio::time::Duration;

    async fn sleep_then_print(timer: i32) {
        println!("Start timer {}.", timer);

        tokio::time::sleep(Duration::from_secs(1)).await;
        //                                         ^ execution can be paused here

        println!("Timer {} done.", timer);
    }

    #[tokio::main]
    async fn main() {
        // The join! macro lets you run multiple things concurrently.
        tokio::join!(
            sleep_then_print(1),
            sleep_then_print(2),
            sleep_then_print(3),
        );
    }

    Start timer 1.
    Start timer 2.
    Start timer 3.
    Timer 1 done.
    Timer 2 done.
    Timer 3 done.

The code runs in just one second, and properly runs all three functions at the same time as desired.

Be aware that it is not always this obvious. By using tokio::join!, all three tasks are guaranteed to run on the same thread, but if you replace it with tokio::spawn and use a multi-threaded runtime, you will be able to run multiple blocking tasks until you run out of threads. The default Tokio runtime spawns one thread per CPU core, and you will typically have around 8 CPU cores. This is enough that you can miss the issue when testing locally, but sufficiently few that you will very quickly run out of threads when running the code for real.

To give a sense of scale of how much time is too much, a good rule of thumb is no more than 10 to 100 microseconds between each .await. That said, this depends on the kind of application you are writing.

What if I want to block?

Sometimes we just want to block the thread. This is completely normal. There are two common reasons for this:

1. Expensive CPU-bound computation.
2. Synchronous IO.

In both cases, we are dealing with an operation that prevents the task from reaching an .await for an extended period of time. To solve this issue, we must move the blocking operation to a thread outside of Tokio's thread pool. There are three variations on this:

1. Use the tokio::task::spawn_blocking function.
2. Use the rayon crate.
3. Spawn a dedicated thread with std::thread::spawn.

Let us go through each solution to...
