Porting Btrfs-Progs to Rust

Porting btrfs-progs to Rust · xfbs's blog↓Skip to main content Last weekend, I was itching to write some code. But finding a good project can be difficult. What I try to look for is something challenging enough to learn from, yet self-contained so I know what success looks like. The idea that came to mind was porting btrfs-progs to Rust. For the uninitiated, btrfs is a copy-on-write (CoW) filesystem in Linux popular enough that Fedora has used it as the default filesystem since Fedora 33. Btrfs has several neat features. If you are familiar with ZFS, you may recognize that some of their feature sets overlap. In my mind, Btrfs is like ZFS, but for everyday usage. Copy-on-write : Writes are never in-place. Modified data goes to new blocks while old blocks remain intact. This enables cheap snapshots and atomic operations. You can take atomic snapshots of your entire filesystem and back them up, even incrementally (with btrfs send / btrfs receive). Integrity : Btrfs checksums all data and metadata, so it can detect silent data corruption (bit rot). When combined with redundancy, it can automatically repair corrupted blocks. Subvolumes : It supports subvolumes, which are lightweight, independently snapshottable directory trees that share the same underlying storage. Multi-device : Btrfs filesystems can span multiple devices, which you can add and remove on-the-fly. It has support for different data redundancy profiles built in (such as single, RAID0, RAID1, RAID10), so you don’t need to use things like LVM. Compression : Built-in transparent compression and deduplication. Online maintenance : You can defragment and resize a btrfs filesystem while it is mounted and in use. Since btrfs has capabilities that go beyond traditional filesystems, it comes with a userspace utility called btrfs. This command-line tool lets you interact with the features that are specific to it. # Take a read-only snapshot of the root filesystem btrfs subvolume snapshot -r / /snapshot

# Write out the entire snapshot to the file `snapshot.bin` btrfs send /snapshot > snapshot.bin This tool is part of btrfs-progs, and is unsurprisingly written in C. The tools work well and don’t have significant attack surface (they’re not exposed to the network or anything), so there’s no pressing need to rewrite them in a memory-safe language. But I wanted to do it anyway: it would help me understand how these tools actually work, and I wanted to see if I could create a simpler implementation that might be easier to maintain and test. Making a plan # Before starting this rewrite, I put some thought into how I wanted to approach this. When doing a rewrite of an existing codebase, there are two strategies: a “clean-room” rewrite, where you look only at the interface and effects of the tool but not its code, or a source-informed rewrite, where you study and translate the original code directly. The advantage of a clean-room approach is that your rewritten code is original work that you can license however you want. I chose the latter approach. I thought it would be easier if I actually studied the existing code. However, that means my rewrite needs to carry the same license as the original, since studying the code would likely influence the outcome, making it a derived work. I also decided to explore how useful an LLM could be for automating tedious tasks. I’m somewhat ambivalent about LLM usage: I’ve had bad experiences where they produced low-quality, incorrect code. At the same time, LLMs can be genuinely helpful for mechanical tasks, such as translating CLI command structures into clap declarations. I wanted to see if I could find the sweet spot of maintaining control of the architecture and quality of the codebase, while accelerating the process. A first approach # Initially, I wanted to understand how the original btrfs-progs codebase was organized. What’s the architecture? Does it use loose coupling with tidy, separated modules that I could translate individually to Rust and tie together using FFI? How does the compilation process work? How are the tools tested? I started by cloning the repository and browsing around to understand what the pieces are and how they fit together. Here’s what I understood about the structure: PathDescriptionkernel-lib/Low-level data structures and algorithms extracted from the Linux kernel, intended for reuse in userspace tools.kernel-shared/Btrfs-specific kernel code synchronized with the Linux kernel’s btrfs implementation. It implements core btrfs algorithms and on-disk format handling.libbtrfs/Library for interacting with btrfs filesystems (also exposed as a Python module).common/Shared utility code for btrfs tools (parsing, formatting, device scanning, filesystem utilities, etc.).libbtrfsutil/Higher-level library for managing btrfs filesystems with official Python bindings (subvolume, filesystem, and qgroup operations).cmds/Implementation code for...

Porting Btrfs-Progs to Rust

Related Articles

The Newest Instagram "Exploit" Is the Goofiest I've Seen

Apple WWDC 2026 Livestream

Claude Fable 5

US Government directive to suspend access to Fable 5 and Mythos 5

It's Not Just X. It's Y