Wordgard Release 0.1

Marijn Haverbeke's blog

Thursday, July 2, 2026
Tags: typescript, architecture, prosemirror, wordgard

I am happy to announce that my latest project, which I've been talking about for years, is now out with a first release.

The project is called Wordgard. It is a new iteration of a ProseMirror-style rich text editor system, integrating the things I've learned since stabilizing ProseMirror, nine years ago. The architecture also takes a lot of inspiration from the version 6 redesign of the CodeMirror text editor.

Wordgard is (once again) a JavaScript library that uses the browser DOM to display its editor interface. It is released under the MIT license. The code is available on my Forgejo server.

It's a little concerning how I keep implementing new editors over and over (by my count this is the 6th non-trivial one). But doing this somehow hasn't lost its charm yet. It still feels like the designs get better every iteration. I don't expect I'll find editor-architecture nirvana before I retire, but I'm sure I'm getting closer to it.

Motivation

I'm still proud of ProseMirror, and ProseMirror isn't going anywhere—it will continue to be maintained. But there are parts of its design that make me wince every time I have to work with them, because at this point I know that I should have done them differently.

Instead of trying to change ProseMirror to incorporate these new insights, I have chosen to create a completely new system with a new name. A ProseMirror 2.0 with an incompatible interface would amount to the same thing, but it would make it ambiguous which system people mean when they refer to ProseMirror. Trying to graft stuff on in a backwards-compatible way as a 1.x version would produce a compromised win32-style mess. I'm not all that fond of the ProseMirror pun anymore either (it's CodeMirror but for prose, get it?). So: green field full rewrite! You'll find a lot of ideas from ProseMirror in Wordgard, but the programming interface is built from scratch, without concern for compatibility.

Let's look at the parts of ProseMirror that I think I improved on.

Stop Doing Steps

“Make sure you compensate for the document shift caused by the first step when adding a second.” “To figure out what range of the document was replaced, you have to iterate through the sequence of steps in both directions, mapping positions in the new document forward and positions in the old document backward.” “Yes, I'll have one replace-around-step please.” — statements dreamed up by the utterly deranged.

ProseMirror's change representation was designed by a person who was very much preoccupied with the problem of preserving semantic meaning for changes even if the changes were transformed, but who also didn't have a lot of experience with change formats. Steps break down changes into atomic parts that each do a single clear thing. A given editor update might involve any number of them, each defined to act on the document produced by the one before it. They serve their purpose, but they are seriously awkward to work with.

Wordgard uses a much simpler but arguably more powerful system based on my experience with CodeMirror's change representation, which derives from the old “delta” format from ShareJS. In CodeMirror, a change is a sequence of sections, each of which either preserves a part of the old document or replaces it with a piece of new content. So in a document of length 10, inserting an L at 4 is represented as [keep 4] [replace 0 with "L"] [keep 6], and deleting the first two characters would be [replace 2 with ""] [keep 8].
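To make the section format concrete, here is a minimal sketch that applies such a change to a plain string. The `Section` type and `applyChange` function are hypothetical names invented for this illustration. They are not the CodeMirror or Wordgard API.

```typescript
// Hypothetical types for illustration; not the actual CodeMirror or Wordgard API.
type Section =
  | { kind: "keep"; length: number }
  | { kind: "replace"; length: number; insert: string };

// Apply a change (a list of sections that together span the old document).
function applyChange(doc: string, change: Section[]): string {
  let pos = 0; // current position in the OLD document
  let out = "";
  for (const section of change) {
    if (section.kind === "keep") {
      out += doc.slice(pos, pos + section.length); // copy the old content
    } else {
      out += section.insert; // drop `length` old characters, emit new ones
    }
    pos += section.length;
  }
  if (pos !== doc.length) throw new RangeError("Change does not cover the document");
  return out;
}

const doc = "0123456789";

// [keep 4] [replace 0 with "L"] [keep 6]
const insertL: Section[] = [
  { kind: "keep", length: 4 },
  { kind: "replace", length: 0, insert: "L" },
  { kind: "keep", length: 6 },
];

// [replace 2 with ""] [keep 8]
const deleteTwo: Section[] = [
  { kind: "replace", length: 2, insert: "" },
  { kind: "keep", length: 8 },
];

console.log(applyChange(doc, insertL)); // "0123L456789"
console.log(applyChange(doc, deleteTwo)); // "23456789"
```

Note that every section length refers to the old document, so the lengths in a valid change always add up to the old document's length.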

Wordgard extends this with modification sections, which preserve the structure of a section but add marks to it or remove marks from it (marks are things like emphasis, link style, or image alt text). In the same length-10 document, making the range from 3 to 6 bold would be represented as [keep 3] [update 3 +bold] [keep 4].
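As a rough sketch of how an update section might behave, the code below models a document as a list of characters that each carry a list of mark names. This model, and the names `Char`, `Section`, and `applyChange`, are assumptions made up for the example. Wordgard's real data structures are not shown in this post.

```typescript
// Hypothetical model: each character carries the names of its marks.
type Char = { ch: string; marks: string[] };

type Section =
  | { kind: "keep"; length: number }
  | { kind: "replace"; length: number; insert: Char[] }
  | { kind: "update"; length: number; add: string[]; remove: string[] };

function applyChange(doc: Char[], change: Section[]): Char[] {
  let pos = 0; // position in the old document
  const out: Char[] = [];
  for (const section of change) {
    const old = doc.slice(pos, pos + section.length);
    if (section.kind === "keep") {
      out.push(...old);
    } else if (section.kind === "replace") {
      out.push(...section.insert);
    } else {
      // Update: same content, different marks.
      const { add, remove } = section;
      for (const c of old) {
        const kept = c.marks.filter(m => remove.indexOf(m) < 0 && add.indexOf(m) < 0);
        out.push({ ch: c.ch, marks: kept.concat(add) });
      }
    }
    pos += section.length;
  }
  if (pos !== doc.length) throw new RangeError("Change does not cover the document");
  return out;
}

const doc: Char[] = "abcdefghij".split("").map(ch => ({ ch, marks: [] as string[] }));

// [keep 3] [update 3 +bold] [keep 4]
const makeBold: Section[] = [
  { kind: "keep", length: 3 },
  { kind: "update", length: 3, add: ["bold"], remove: [] },
  { kind: "keep", length: 4 },
];

const result = applyChange(doc, makeBold);
const boldText = result
  .filter(c => c.marks.indexOf("bold") >= 0)
  .map(c => c.ch)
  .join("");
console.log(boldText); // "def"
```

The update section consumes old content just like a keep section does, which is why the three lengths still add up to 10.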

Of course, unlike CodeMirror's plain text, rich text content isn't just a flat string. Because Wordgard uses a token-counting indexing system for document positions (the same system ProseMirror uses), the change format can address the document as a flat sequence of tokens (node open and close tokens, and leaf tokens), into which it splices new sequences of tokens.
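The token view can be sketched by flattening a small tree into a token array, where a position is simply an index into that array. The `TreeNode` and `Token` types are hypothetical. Counting each text character as one token follows ProseMirror's documented indexing and is assumed here to carry over to Wordgard.

```typescript
// Hypothetical tree and token types, for illustration only.
type TreeNode =
  | { kind: "node"; name: string; children: TreeNode[] }
  | { kind: "leaf"; name: string }
  | { kind: "text"; text: string };

type Token =
  | { kind: "open"; name: string }
  | { kind: "close"; name: string }
  | { kind: "leaf"; name: string }
  | { kind: "char"; ch: string };

// Flatten document content into the token sequence that positions count over.
function flatten(nodes: TreeNode[], out: Token[] = []): Token[] {
  for (const n of nodes) {
    if (n.kind === "text") {
      // Assumption: one token per character, as in ProseMirror.
      for (const ch of n.text.split("")) out.push({ kind: "char", ch });
    } else if (n.kind === "leaf") {
      out.push({ kind: "leaf", name: n.name });
    } else {
      out.push({ kind: "open", name: n.name });
      flatten(n.children, out);
      out.push({ kind: "close", name: n.name });
    }
  }
  return out;
}

// Two paragraphs: one containing "Hi", one containing an image and "!".
const content: TreeNode[] = [
  { kind: "node", name: "paragraph", children: [{ kind: "text", text: "Hi" }] },
  {
    kind: "node",
    name: "paragraph",
    children: [{ kind: "leaf", name: "image" }, { kind: "text", text: "!" }],
  },
];

const tokens = flatten(content);
console.log(tokens.length); // 8
console.log(tokens[4]); // the open token of the second paragraph
```

In this picture, position 4 sits between the two paragraphs, and a change that replaces tokens 1 to 3 swaps out the text "Hi" while leaving the paragraph's open and close tokens alone.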

These types of changes can easily be combined, so that a single transaction always has a single change associated with it, which is easy to inspect and reason about. They also support a limited form of operational transformation, making it possible to merge a bunch of changes that are all described in terms of the start document. That gives us an ergonomic way of describing transactions with multiple changes and makes it possible to implement collaborative editing and an undo history that supports undoing some changes but not others.
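The ergonomic benefit of describing everything in terms of the start document can be illustrated on plain text. The sketch below merges several replacements, all expressed in start-document coordinates, into one section list. It is a deliberate simplification: the `Replacement` type and `merge` function are hypothetical, and overlapping ranges are rejected instead of being transformed, which a real implementation would have to handle.

```typescript
// Hypothetical types for illustration; not Wordgard's actual API.
type Section =
  | { kind: "keep"; length: number }
  | { kind: "replace"; length: number; insert: string };

// A replacement of [from, to) described against the START document.
type Replacement = { from: number; to: number; insert: string };

// Merge replacements into a single change over the start document.
// Simplification: overlapping ranges throw instead of being transformed.
function merge(docLength: number, replacements: Replacement[]): Section[] {
  const sorted = replacements.slice().sort((a, b) => a.from - b.from);
  const sections: Section[] = [];
  let pos = 0; // how much of the start document is covered so far
  for (const r of sorted) {
    if (r.from < pos || r.to < r.from || r.to > docLength) {
      throw new RangeError("Overlapping or out-of-range replacement");
    }
    if (r.from > pos) sections.push({ kind: "keep", length: r.from - pos });
    sections.push({ kind: "replace", length: r.to - r.from, insert: r.insert });
    pos = r.to;
  }
  if (pos < docLength) sections.push({ kind: "keep", length: docLength - pos });
  return sections;
}

// Apply a section list to a string.
function applyChange(doc: string, change: Section[]): string {
  let pos = 0;
  let out = "";
  for (const s of change) {
    out += s.kind === "keep" ? doc.slice(pos, pos + s.length) : s.insert;
    pos += s.length;
  }
  return out;
}

const doc = "0123456789";
// Both replacements use positions in the start document. The caller does
// not have to compensate for the shift that the deletion causes.
const merged = merge(doc.length, [
  { from: 4, to: 4, insert: "L" }, // insert "L" at 4
  { from: 0, to: 2, insert: "" },  // delete the first two characters
]);

console.log(merged.length); // 4 sections: replace, keep, replace, keep
console.log(applyChange(doc, merged)); // "23L456789"
```

Compare this with a step sequence, where the insertion position would have to be rewritten from 4 to 2 if the deletion were applied first.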

But the document is not really a flat sequence of tokens. Those tokens only make sense if they combine to form a well-formed tree. If you delete a node closing token, for example, the tokens aren't balanced anymore,...
