The biggest problems in using AI

The biggest problems in using AI | Dan ShearerTable of ContentsThe Biggest Problems in Using AIHallucination and nonsense Context indexing is poor AI is amnesiac and lacks self-awareness Agentic AI doesn’t do permissions The composition architecture The status today Further Reading

There are many problems with the AI billions of people use in 2026, discussed endlessly at all levels of society. From the end of 2025 I became interested in the particular problems of ethics and reliability, and why the approaches taken by all of the large AI companies are not good enough. Predictability, or ‘alignment’ as they call it, is just not something we can expect from this type of AI. A colleague started working on a very different approach from these companies, and from February 2026 I have been contributing to and using prototype versions of the Artificial Organisations ↗ concept. This article explains why I believe Artificial Organisations are a promising new direction. Multi-agent Agentic AI is pretty important, as described here by the UK government ↗ , but it is rarely done well. If you want to try for yourself, you can use the core research code ↗ , as I do daily.

The Biggest Problems in Using AI# The Perseverance Composition Engine ↗ (PCE) uses Artificial Organisations to solve these pressing AI problems. PCE does not try to make LLMs behave better, but is designed instead so that their inevitable misbehaviour is detected and corrected. And regardless of the computer science, I found these ideas in Iain M Banks’ novels and the Mass Effect video game PCE works by assigning a task to LLM agents who each have a carefully enforced role to play. The agents iterate between each other until either the task is completed to specifications, or it fails by honestly saying “I can’t do this, the task is impossible for me.” So far, this arrangement seems effective at detecting and correcting common problems such as confident false assertions, hallucinations, or dangerous advice. With PCE, nobody needs to trust an AI, only the structure. The structure is recognisable by most people, since it is closely modelled on ones tried and tested for centuries. Like any organisation, Artificial Organisations have separation of duties, independent checks, and agents who can only see what they need to see. It works rather well. This design addresses three failure modes that the usual training and instruction cannot fully fix: hallucination, context issues, and memory issues. 💡 The other biggest problem Many harms can be caused by AI including death, but we should expect AI to harvest our personal data and use it without consent. That’s why I am interested in things such as offline AI, and on-device small language models, and why I try to use PCE such that the robots in the sky don’t learn any more about me than they already do.

Hallucination and nonsense# Language models generate text according to probability, where the next piece of text (a ’token’) is selected based on patterns, not by retrieving facts from a database. If a model does not have a pool of highly relevant text to select from (the ‘context’) it will probabilistically generate text anyway because that it what it is programmed to do. The result is confabulation, where the model sounds confident while making a false or misleading claim. The better the AIs become at expressing themselves, the more convincing these hallucinations can become. Research keeps concluding ↗ that training does not eliminate hallucination, and newer surveys ↗ describe hallucinations as potentially “fundamental mathematical inevitabilities inherent to [the model’s] architecture.” The AI companies are trying to solve this by giving better instruction and training, but if hallucination is indeed inevitable then this will never be reliable. I am persuaded the architecture needs to change for AI to become more trustworthy. Context input to a model is called the ‘prior’. A quality prior comprises the best available documents, previous relevant decisions, germane background, and from it AI generates much better output. Just like a human organisation, Artificial Organisations strive to deal with the best quality input documents in order to improve decisionmaking, and to carefully label or even reject guesswork. This is the first structural way we can tackle hallucinations. A second technique is also familiar: have someone else check the work. PCE has an agent called the Corroborator whose only job is to read what the Composer agent wrote, and to verify every claim against the source documents. The Corroborator has the sources right in front of it, so if the Composer invented a claim, the Corroborator will see it is unsupported. Corroborator is unmoved by plausible confabulation, because it is instructed to only accept what can be proven from the sources to hand, including references on the internet if...

The biggest problems in using AI

Related Articles

US Government directive to suspend access to Fable 5 and Mythos 5

Is AI ruining our skills? Early results are in – and they're not good

The Anatomy of an AI-Native Org

Apertus – Open Foundation Model for Sovereign AI

Britain Became as Poor as Mississippi