Three Measurable Failure Modes of Large Language Models

Skip to main

You are using an outdated browser. Please upgrade your browser to improve your experience.

Advanced Research in Artificial Intelligence Systems (ARAIS)

Published May 11, 2026

| Version v1

Preprint

Open

Three Measurable Failure Modes of Large Language Models

Authors/Creators

Hubka, Marek (Researcher)1

Show affiliations

On Tides of Uncertainty

Description

Human language is inherently ambiguous - not a deterministic code but an ensemble of overlapping meanings whose disambiguation depends on context that is often incomplete or absent. A system that processes natural language must therefore be probabilistic, not by architectural choice but by mathematical necessity. This paper argues that the resulting uncertainty has structure: what the field calls hallucinations is not one phenomenon but three structurally distinct failure modes of this probabilistic nature, each with a different causal origin, a different measurable signature, and a different class of solutions.

Mode 1 (autoregressive reinforcement) is the self-consistent wrong trajectory produced when an error contaminates the model's own conditioning context.

Mode 2 (confabulation) is fluent generation produced from parameter directions that received no training signal - the null space of the weight matrix.

Mode 3 (irreducible uncertainty) is the correct response of a calibrated probabilistic system to a genuinely ambiguous query.

Each mode has a computable quantitative metric: correction sensitivity $(\mathsf{CS})$, dimensional excess $(\mathsf{DE})$, and output entropy $(\mathsf{H}_{\mathrm{out}})$. The three measurements rest on a single coding-theoretic construction, the syndrome table $S = \mathcal{N}(\bar{J} \cdot V)^\top$, whose full derivation is in the companion paper "A Syndrome Algebra for Differentiable Parametric Systems".

A controlled experimental series on a synthetic LSTM ($D=256$, $L=10$, six fixed seeds) confirms the framework end to end. The three metrics separate cleanly: the $\mathsf{CS}$ gap between known and unknown domains narrows monotonically from $0.273 \pm 0.095$ at $k=1$ to $0.067 \pm 0.037$ at $k=10$. The Pearson correlation $r(\mathsf{DE}, \mathsf{CS}_{\mathrm{unknown}}) = 0.9896$ across k predicts out-of-domain failure from weight matrix alone. Causal localisation of an injected perturbation reaches $100\%$ accuracy over $180$ trials with a pre/post residual ratio of approximately $2\times 10^8$. Oracle correction is exact (cosine $1.000000$ over $36,000$ trials). A direct comparison of multicellular specialists against monolithic generalists shows the Singleton-bound multicellular advantage grows from $0.158 \pm 0.049$ at $N=5$ to $0.310 \pm 0.054$ at $N=10$ in $\mathsf{CS}$ gap, empirically justifying the modular hierarchy.

Additional notes:

This preprint is accompanied by the mathematical paper A Syndrome Algebra for Differentiable Parametric Systems (see related identifiers). Code and data are available at the linked GitHub repository. Model weights are not included due to size; they are regenerated deterministically from the provided scripts and canonical seeds.

Files

three_failure_modes_llm.pdf

Files (771.0 kB)

Name Size

Download all

three_failure_modes_llm.pdf

md5:a5c7e2afb2e3f6d4e291a055428326ac

771.0 kB

Preview

Download

Additional details

Additional titles

Subtitle

(English)

Structure of the Error Distribution in Autoregressive Stochastic Systems

Related works

Is supplement to

Preprint:

10.5281/zenodo.20127537

(DOI)

Is supplemented by

Software:

10.5281/zenodo.20290098

(DOI)

Software

Repository URL

https://github.com/MarcusSkynet/lstm2

Views

Downloads

Show more details

All versions This version

Views

Total views

Downloads

Total downloads

Data volume

Total data volume

44.8 MB 44.8 MB

More info on how stats are collected....

Versions

External resources

Indexed in

OpenAIRE

Communities

Keywords and subjects

Keywords

large language models

hallucinations

error correction

syndrome algebra

Gram metric

Jacobian variance

Singleton bound

Hamming Bound

modular architecture

ML reliability

LSTM

Details

DOI

DOI Badge

DOI

10.5281/zenodo.20127318

Markdown

[![DOI](https://zenodo.org/badge/DOI/10.5281/zenodo.20127318.svg)](https://doi.org/10.5281/zenodo.20127318)

reStructuredText

.. image:: https://zenodo.org/badge/DOI/10.5281/zenodo.20127318.svg :target: https://doi.org/10.5281/zenodo.20127318

HTML

Image URL

https://zenodo.org/badge/DOI/10.5281/zenodo.20127318.svg

Target URL

https://doi.org/10.5281/zenodo.20127318

Resource type Preprint

Publisher Zenodo

Languages

English

Rights

License

Creative Commons Attribution 4.0 International

The Creative Commons Attribution license allows re-distribution and re-use of a licensed work on the condition that the creator is appropriately credited.

Three Measurable Failure Modes of Large Language Models

Related Articles

The Newest Instagram "Exploit" Is the Goofiest I've Seen

It's Not Just X. It's Y

Amazon, Facebook, FBI have access to a private intelligence-sharing network

Show HN: GoPeek – open links in live mini browser windows without new tabs

Agent Memory: An Anatomy