the empty corner in brain-to-text<br>Meta released Brain2Qwerty v2: non-invasive brain-to-text at 61% word accuracy, and for its best subject, half of all sentences landing within one word of correct.1 no surgery, no electrodes in the brain. it’s a real result, and genuinely hard research.
it also can’t help the people brain-to-text is for, and that isn’t a maturity problem you close with more data. it’s structural. and the reason is more interesting than the headline.
what it actually decodes
start with what’s inside the number. Brain2Qwerty decodes typing. healthy volunteers sit in an MEG scanner — a room-sized, magnetically shielded machine that picks up the faint magnetic fields your neurons throw off, not a headset you’d wear at a desk — and physically type sentences. the model reads the brain activity driving the keystrokes and reconstructs the text.
it reads the motor cortex — the brain driving the hands as they type. not imagined speech. not the words you intend. the movement of hitting keys.
which sets up the problem the headline skips. the people who’d actually reach for a brain-to-text system — someone with ALS, a locked-in patient, anyone who can’t move or speak — are the people who can’t type. the system’s input is the exact thing they no longer have. it works because the subjects can move. the ones who can move are the ones who don’t need it. (which raises the obvious question of who this is really for. I don’t know — but a system that only works for people who can already type is useless in a clinic, and Meta has spent years building neural input for its AR glasses. I have a guess.)
what 61% buys you
then there’s what the number means. “word accuracy” sounds like a typo rate — 39% of words landing a little off. that’s not what failure looks like here.
for the worst subject, the paper reports the output can be “a coherent but entirely different sentence.”1 their own example: the model decoded “had she not fallen down the stairs” when the target was “cars are not allowed on this road.” not garbled, not a near miss — a fluent, grammatical, confident sentence the person never thought.
the authors wave it off: “successful communication relies on meaning rather than strict character matching,” which is true for autocomplete, but a coherent wrong sentence is a meaning failure — the worst kind here, because nothing marks it as an error. garbled output announces that something broke; a fluent, grammatical substitution reads as exactly what the person meant, so it gets taken at face value and passed on. for someone who has no other way to speak, that means going on record saying something they never did. a device meant to give someone back their voice instead hands them a fluent impostor of it.
the machine in the middle
Brain2Qwerty isn’t one model — it’s a three-stage pipeline that ends in a language model. the first stage reads the MEG and guesses letters. the last stage is a fine-tuned Qwen3 that takes those rough letters plus a brain-derived signal and writes the final sentence, one word at a time.
that last stage does a lot of the work, and it’s where the confident-wrong sentences come from. a language model’s job is to turn a noisy, partial input into fluent text — so when the brain signal is thin, it writes something fluent anyway, filling the gaps with whatever reads well. the paper measures the cost: adding the LLM gets more whole words and more meaning right than the raw letters, but it gets more individual letters wrong (a 31% character error rate against 28% without it). it trades literal accuracy for smooth reading. (if you want the full pipeline — encoder, word-level aligner, LLM, and the AI agents that did much of the tuning — I broke it all down separately in how brain2qwerty works.)
it’s structural, and they know it
you could read all of this as early days: more data, more subjects, patient trials, and the gap closes. the paper says otherwise, in its own words.
and it goes deeper than a patient not being able to use the finished system: you couldn’t build it for them in the first place. the whole pipeline is supervised by the key presses, which for a paralyzed person are “missing, not just during inference, but also during training.” the signal you’d learn from is the one movement they can’t make. the authors say as much — patient use “remains critical to demonstrate.”
they file this under future work. it isn’t a next step. it’s the shape of the whole problem.
two axes
to see why the gap is hard rather than just unfinished, lay the field out on two axes.
how close the sensor sits to the signal. outside the skull — MEG, EEG, non-invasive, but reading through bone and tissue that smear everything — or inside it, an electrode array on or in the cortex, invasive but sitting right on the source.
whether the action is carried out or only attempted. an executed action means muscles actually firing, which produces the strongest, cleanest neural signal there is. an attempted...