Two Years of OCaml

The other day I saw this post on OCaml discussed in Hacker News and Lobsters.

Almost two years ago I rewrote the Austral compiler from Standard ML to OCaml, so I thought I’d share my thoughts on OCaml after using it in writing a complex software project, explaining what is good and what is bad and how it compares mainly to Haskell.

If this seems overwhelmingly negative, it’s because the things OCaml does right are really just uncontroversial. They’re obviously right and hardly worth pointing out. It’s actually a weirdly optimistic thing: that a language with so many glaring deficiencies stands far above everything else.

Contents

Syntax

Aesthetics

Declaration Order

Comments

Type Specifiers

Generic Types

Type Annotations

Semicolons Work Sometimes

Inconsistencies

Nested Match Expressions

Do Notation

Modules: Better is Worse

Modules Are Better

Modules Are Worse

Equality

Multiple Implementations Are Unnecessary

Semantics

Currying is Bad

Type Inference is Bad

Mutation

Pragmatics

PPX

Tooling

How Do I Profile?

Testing

Minor Complaints

At Least It’s Not Haskell

My OCaml Style

Should You Use OCaml?

Syntax

Yeah, yeah, de gustibus, and people spend way too much time whining about syntax and other superficial issues, rather than focusing on language semantics and pragmatics.

But I’m not a partisan about syntax. I genuinely think code written in C, Java, Lisp, Pascal, and ML can be beautiful in different ways. Some of these complaints will be personal, others will be more objective.

Aesthetics

ML was born as the implementation language of a theorem prover, so naturally the syntax is meant to look like whiteboard math.

And it does look good for math. If you’re writing something like a symbolic differentiation engine:

let rec diff (e: expr): expr = match e with (* c' = 0 *) | Const _ -> Const 0.0 (* (f + g)' = f' + g' *) | Add (f, g) -> Add (diff f, diff g) (* (f - g)' = f' - g' *) | Sub (f, g) -> Sub (diff f, diff g) (* (fg)' = f'g + fg' *) | Mul (f, g) -> Add (Mul (diff f, g), Mul (f, diff g)) (* (f/g)' = (f'g - g'f)/gg *) | Div (f, g) -> Div (Sub (Mul (diff f, g), Mul (f, diff g)), Mul (g, g))

Then it’s simply delightful. It does tend to fall apart for everything else however.

OCaml, like Haskell, is expression-oriented, meaning that there is no separation of statements (control flow, variable assignment) and expressions (evaluate to values) and instead everything is an expression. Most expressions in OCaml tend not to have terminating delimiters.

This is very vague, but ML-family (meaning Standard ML, OCaml, Haskell and derivatives) code often feels like the expressions are “hanging in the air”, so to speak. Terminating delimiters (like semicolons in C or end in Wirth-family languages) make the code feel more “solid” in a way.

And expression orientation (which most modern languages advertise as a feature) cuts both ways. The benefit is simplicity and symmetry: you don’t need both an if statement and a ternary if expression. You can have a big expression that computes a value and then assigns it to a containing let, like so:

let a: ty = match foo with | Foo a -> (* ... *) let bar = (* ... *) (* imagine deeply nested expressions *) in (* etc *)

Without having to use an uninitialized variable or refactor your code into too-small functions. However, this generality comes at a cost: you can write arbitrarily deep and complex expressions, where a statement-oriented language would force you to keep your code flatter and break it down into small functions.

It takes discipline to write good code in an expression-oriented language. I often see e.g. Common Lisp code with functions hundreds of lines long. It’s almost impossible to track the flow of data in that context. This, by the way, is why Austral is statement-oriented, despite every modern language moving towards expression-oriented syntax.

Declaration Order

In OCaml, like in C, declaration must appear in dependency order. That is, you can’t write this:

let foo _ = bar ()

let bar _ = baz ()

let baz _ = print_endline "muh one-pass compilation"

Instead you must write:

let baz _ = print_endline "muh one-pass compilation"

let bar _ = baz ()

let foo _ = bar ()

Alternatively, you can use and to chain your declarations:

let rec foo _ = bar ()

and bar _ = baz ()

and baz _ = print_endline "muh one-pass compilation"

And the same thing is true of types:

type foo = Foo of bar

and bar = Bar of baz

and baz = Baz of unit

But, you can’t interleave an and-chain of functions with one of types. So you have a choice:

You can write all of your code backwards, with the utility functions and the leaf-nodes of the call graph up front, and the important code at the bottom.

Or, you can write a big and-chain of types at the start of the file, followed by a big and-chain of functions for the remainder of...

Two Years of OCaml

Related Articles

The Newest Instagram "Exploit" Is the Goofiest I've Seen

Apple WWDC 2026 Livestream

Claude Fable 5

It's Not Just X. It's Y

Show HN: GoPeek – open links in live mini browser windows without new tabs