Peakload Benchmarks for Operating Systems

Peakload benchmarks for operating systems - Showcase - LinuxCommunity.io

Showcase

hardware, virtualization

Dima

June 21, 2026, 11:44am

Peak-load benchmarks for operating systems: Linux runs over 5,000× slower than our virtual machines

We applied place-transition nets (PTNs) defined by System V semaphores (https://doi.org/10.1080/17445760.2026.2615010) to benchmark Linux (Ubuntu 24.04.4 LTS, kernel 6.17.0-35).

Using PTNs for matrix multiplication and arrays of concurrent multiplications, we compared Linux kernel performance with that of our virtual machines (https://doi.org/10.1080/17445760.2025.2490148).

A PTN executing 1,024 parallel multiplications of 6-bit data completed in 0.912 seconds on our VM, compared with 5,673.597 seconds on Linux running on the same hardware (AMD Ryzen 7 6800H @ 4.8 GHz, 32 GB RAM). The application contains 9,216 semaphores (places) and 8,192 processes (transitions).

On Linux the run takes more than 5,000 times longer than on our VM (5,673.597 s versus 0.912 s, a factor of about 6,200). We believe this gap cannot be explained solely by system-call and context-switching overhead. Instead, it points to inefficiency in the System V semaphore implementation in the Linux kernel's ipc/sem.c.

We are interested in collaborating on projects aimed at implementing semaphores with wait-for-all semantics in Linux, both at the kernel level for processes and as a runtime mechanism for fast, futex-like thread synchronization. While futex_waitv provides wait-for-any semantics, wait-for-all semantics could help eliminate many deadlocks caused by sequential resource acquisition.

For modeling, we use Tina (TIme petri Net Analyzer, by LAAS/CNRS) as an IDE and generate large PTN models with our own toolchains. Models are exported through our NDRtoALL plugin as .h files for the PVZ machine, then recompiled and executed as Linux applications.

Our basic tools are available on GitHub:

GitHub - dimazaitsev/SNCtools

hydn (Hayden James)

June 21, 2026, 11:45am

@Dima Welcome to our forums. Thanks for sharing.

ericmarceau (Eric Marceau)

June 21, 2026, 4:56pm

Is your project strictly focused on the “scheduling” part of the Linux kernel?

Or are you also looking at the actual “net” benefits, acknowledging that not all benefits can be realized because of context constraints?

I think that summary is comparing apples and oranges!

You need to provide some context parameters before putting out such extremely one-sided statements.

Is that a single computer setup or is it a network of peer computers sharing the workload?

How much memory is dedicated to the “engine” (the whole network) performing the work?

How many independent tasks are being performed simultaneously?

Are the tasks compute-oriented, memory-oriented, or I/O-oriented?

What is the intended market for such a compute paradigm: super-computing? multi-national multi-server computing? SME-server computing? desktop computing?

“Interesting” results … but … LOTS of unknowns!

Dima

June 21, 2026, 6:35pm

Thank you, dear Eric, for your interest in our R&D. We use PTNs as a graphical language for concurrent programming and develop virtual machines for MCU/GPU to run our applications, with a view to dedicated hardware (chip) implementation. The Sleptsov and Salwicki amendments to Petri nets make them fast and massively parallel.

Since I recently proved that System V semaphores are equivalent to inhibitor Petri nets, we compile our nets into data for a program called pvzm.c, which uses semop() and fork() to run the net as a Linux application. Today, to supplement our toolchains, I uploaded benchmark nets and applications to my GitHub, so you can run them on your own Linux system. At present we benchmark a single computer, though we can extend our benchmarks to clusters.

The point is that System V semaphores can be implemented much more efficiently, both in the kernel and partially in the runtime, like futexes. They resolve many deadlocks, and we can think of deadlock detection on the fly, say with a system application started by the kernel. We have also proposed extended greedy semaphores, which have many advantages, and we are considering an implementation of the corresponding system call, semopk(). We have worked in the kernel before, implementing our novel protocol stack e6: https://www.ietf.org/archive/id/draft-zaitsev-e6-network-00.txt

With warm regards, Dima, https://dimazaitsev.github.io/

Dima

June 21, 2026, 6:38pm

Thank you for your kind welcome, dear Hydn – Dima

ericmarceau (Eric Marceau)

June 21, 2026, 6:40pm

Still trying to wrap my head around what is done.

Are you creating this “scheduler” as a captive, single-task component scheduler, albeit for a massive, complex, and parallel task?

Dima

June 21, 2026, 6:47pm

It is an abstract machine based on a place-transition net. A place...
