A structure-aware fuzzing experiment in Rust

A Structure-Aware Fuzzing Experiment

Structure-aware fuzzing can better exercise the system under test (SUT) by crafting inputs in the format expected by the SUT, rather than throwing pseudorandom bytes against it. That is, it avoids “shallow” inputs that the SUT will reject early (for example, syntactically invalid source text when fuzzing a programming language’s compiler) and only produces inputs that go “deep” into the SUT (e.g. programs that type-check and exercise the mid-end optimizer and backend code generator). The Rust fuzzing ecosystem is largely built around cargo-fuzz and the libfuzzer-sys crate, which provides two methods for structure-aware fuzzing:

Generating structured inputs from scratch with the arbitrary crate

Mutating existing inputs from the fuzzer’s corpus in a structure-aware manner, thereby producing new structured inputs, via the fuzz_mutator! hook

While the two methods are not technically mutually exclusive, combining the two can be difficult and engineering resources are finite. So:

If we are only implementing one approach, is generation or mutation better?

To help answer this question, I implemented structure-aware generation and mutation of guaranteed-valid WebAssembly (Wasm) instruction sequences. This task is small enough to be easily understandable but large enough and real enough to (hopefully) be representative and applicable to other domains, or, at the very least, interesting.1 To evaluate their effectiveness, I used Wasmtime as the SUT, libfuzzer-sys as the fuzzing engine driving everything, and then compared code coverage over time when using mutation-based fuzzing versus generation-based fuzzing.

Additionally, there are many ways we can generate pseudorandom WebAssembly instruction sequences. In this experiment, I’ve evaluated three methods:

Unconstrained instruction sequence generation followed by a fixup pass to ensure validity

Generating valid instructions in a forwards, bottom-up manner (from operands to operators)

Generating valid instructions in a backwards, top-down manner (from operators to operands)

In contrast, while there are surely many ways to mutate a given WebAssembly instruction sequence into a new, valid instruction sequence, I’ve only implemented one method: perform an arbitrary instruction insertion, deletion, or replacement, producing a new but probably-invalid instruction sequence, and then run the same fixup pass mentioned previously to ensure validity. This is the direct mutation-based equivalent of the first generation-based method.

Before continuing further, I want to disclose that I am the author of wasm-smith and mutatis, and a maintainer of Wasmtime, arbitrary, libfuzzer-sys, and cargo-fuzz. That is, while I am familiar with Wasm, fuzzing, fuzzing Wasm, and both the arbitrary and mutatis crates, I may also be propagating my own biases into these implementations.

Background

Generation-Based and Mutation-Based Fuzzing

A generation-based fuzzer uses a generator to create a pseudo-random test cases from scratch, feeds these into the system under test, and reports any failures to the user:

fn generation_based_fuzzingT>( // A test-case generator. generator: impl Fn() -> T, // A function to run the system under test with a // generated test case, returning a result that // describes whether the run was successful or // not. run_system_under_test: impl Fn(&T) -> FuzzResult, ) { loop { // Generate an input. let input = generator();

// Run the input through the system under test. let result = run_system_under_test(&input);

// If the system crashed, panicked, failed an // assertion, violated an invariant, or etc... // then report that to the user. if let Err(failure) = result { report_to_user(&input, failure);

On the other hand, mutation-based fuzzers are given an initial corpus of inputs and create new inputs by mutating existing corpus members. They run each new input through the SUT, report failures the same as before, and if the new input was “interesting” (for example, exercised new code paths in the SUT that weren’t previously covered in any other input’s execution) then the new input is added into the corpus for use in future test iterations:

fn mutation_based_fuzzingT>( // A corpus of test cases. corpus: &mut CorpusT>, // A function to pseudo-randomly mutate an existing // input into a new input. mutate: impl Fn(&T) -> T, // A function to run an input in the system under // test, returning a result that describes whether // the run was successful or not. run_system_under_test: impl Fn(&T) -> FuzzResult, ) { loop { // Choose an old test case from the corpus. let old_input = corpus.choose_one();

// Pseudo-randomly mutate that old test case, // creating a new one. let input = mutate(old_input);

// Run the input through the system under test. let result = run_system_under_test(&input);

// If the system crashed, panicked, failed an // assertion, violated an invariant, or etc... // then report that to the user. if let...

A structure-aware fuzzing experiment in Rust

Related Articles

The Newest Instagram "Exploit" Is the Goofiest I've Seen

It's Not Just X. It's Y

Amazon, Facebook, FBI have access to a private intelligence-sharing network

Show HN: GoPeek – open links in live mini browser windows without new tabs

Agent Memory: An Anatomy