Liveness Proofs in Veil, Part I: The First Step

Liveness Proofs in Veil, Part I: The First Step | Proofs and Intuitions

Safety property means “nothing bad happens during the run of a program”; liveness property means “the program eventually does something good”. In this post, we walk through a simple proof of a liveness property in Veil, using a basic consensus protocol as an example.

This will be the first post in this series where we will explore a collection of techniques for proving liveness properties of distributed protocols, with a focus on how these proofs can be carried out in Veil, a verifier for distributed protocols in Lean. If you want to know more about Veil, start by reading this blog post.

The purpose of this post is to show that liveness is within comfortable reach of a Lean-based verifier, not just safety. Many mechanised protocol developments prove safety in full and then leave liveness as future work, to be proven, fittingly, eventually. This series sets out to do that future work. We begin here with the smallest example we could find—deliberately so, to keep the protocol out of the way and put the liveness argument in full view—and follow it end to end, from an informal argument to a machine-checked proof. The argument is what carries forward; later posts keep it and grow the protocol.

Introduction

Safety properties are a central part of how we specify and reason about distributed protocols. A consensus protocol, for example, should not allow two different values to be chosen. A termination detector should not announce termination when not all processes have terminated. A mutual exclusion protocol should not let two processes enter the critical section at the same time. These are all safety properties: they say that something bad never happens.

But many correctness properties ask for more: they require the system to make progress. This brings us to liveness properties that require answering the question:

Does something good eventually happen?1

For example, does a consensus protocol eventually choose a value? Does a termination detector eventually report termination once all processes have terminated? Does a waiting process eventually enter the critical section?

In this post, we will make this question concrete using a minimal example from the TLA+ examples repository. The example is small enough that the liveness argument fits on one page, but it already contains the essential ingredients: an action that can make progress, a fairness assumption that prevents the system from ignoring that action forever, and a temporal property saying that “something good will eventually happen.” We will use this example to see how liveness can be formalized using the notations from TLA, the Temporal Logic of Actions.2 Then we will prove the property using a standard proof rule for reasoning from weak fairness to progress.

Running Example: One-Step Consensus

TLA+ has been the de-facto language and framework of distributed protocol specification for more than three decades, and it remains the natural baseline against which any new tool for verifying such protocols should be measured. We will therefore use a small example from the TLA+ examples repository throughout this post. The example is an idealised abstraction of consensus, not a real protocol: it has no processes, messages, quorums, or failures, only the effect a consensus protocol is supposed to provide—eventually some value is chosen, and at most one value is ever chosen.3

CONSTANT Value

VARIABLE chosen

Init == chosen = {}

Next == /\ chosen = {} /\ \E v \in Value : chosen' = {v}

TypeOK == /\ chosen \subseteq Value /\ IsFiniteSet(chosen)

Inv == /\ TypeOK /\ Cardinality(chosen) \leq 1

ASSUME ValueNonempty == Value # {}

Success == <>(chosen # {})

Spec == Init /\ [][Next]_chosen

LiveSpec == Spec /\ WF_chosen(Next)

The state of the model is a single set chosen ⊆ Value. Initially chosen = {}. The only transition Next is enabled while chosen is empty, and replaces it by {v} for some v ∈ Value (here, \E is existential quantification, and chosen' is the value of chosen in the next state). In other words, the protocol can take exactly one meaningful step (i.e., the first step), choosing a value, after which nothing further happens; this single step is precisely the “something good” whose inevitability we are going to prove. The safety invariant Inv says, modulo the type information TypeOK, that Cardinality(chosen) ≤ 1—the agreement property of this tiny model. We need the assumption that Value is nonempty (ASSUME ValueNonempty) for liveness. The remaining lines—Success, Spec, and LiveSpec—live at the temporal level, which we unpack next.

Encoding Temporal Properties in TLA+

To make sense of Success, Spec, and LiveSpec, we look at infinite execution traces:4

\[e = s_0, s_1, s_2, \dots\]

and at the two temporal operators TLA+ inherits from TLA, the underlying logic:

$\square P$ (“always”, ASCII []P) means...

Liveness Proofs in Veil, Part I: The First Step

Related Articles

US Government directive to suspend access to Fable 5 and Mythos 5

Is AI ruining our skills? Early results are in – and they're not good

The Anatomy of an AI-Native Org

Apertus – Open Foundation Model for Sovereign AI

How to Earn a Billion Dollars