The AI Agent Architecture Is Published. The Thresholds Are Not. - Electronic Trading Hub
Skip to content
Search for:
infrastructurehedge fundsMain NewsQuant
Ariel Silahian<br>Ariel Silahian is a senior technology executive in institutional electronic trading, with 30+ years across the buy and sell side (New York, Miami, London, Hong Kong). He is the author of "C++ High Performance for Financial Systems" (Packt) and the creator of VisualHFT, the open-source microstructure analytics stack. He writes on exchange architecture, market microstructure, and execution quality, and advises a select number of trading firms on infrastructure decisions that move P&L. Talk architecture: https://hftadvisory.com
Table of Contents
The Pattern Is Now Public
What Each Firm Actually Built
The Crypto-Native Proof of Concept
What Nobody Is Publishing: The Thresholds
The Governance Deadline
The Bloomberg Problem
What a CTO Should Audit Right Now
The Pattern Is Now Public
Four firms have published enough about their AI agent architectures that the shape is now legible to anyone reading carefully. The pattern is not accidental. It is not the product of shared tooling or a common vendor stack. It converged independently, and that convergence tells you something.
The architecture has four recognizable elements: a containment layer that limits what agents can access and do, an audit trail that makes every LLM call traceable after the fact, a sequential veto structure that routes output through human review before anything enters live trading, and a federated guardrail model that distributes deployment while centralizing standards.
Reading the March 2026 OpenAI case study on Balyasny, the Two Sigma AI outlook published March 30, 2026, the Bloomberg coverage of Man Group’s AlphaGPT, and industry analyst reporting on D.E. Shaw’s internal stack, the same four elements appear across all of them. The firms differ in size, strategy, and culture. The architecture looks the same.
The AIMA 2025 survey of 150 fund managers representing $788 billion in AUM found that 95 percent are now using generative AI, up from 86 percent in 2023. Adoption is not the hard question anymore. The hard question is: what does a deployment look like when it is built to survive contact with production? These four firms answer that question in enough public detail to learn from. What they do not publish, the calibration values, the budget thresholds, the trigger logic, is another matter.
Two Sigma named the strategic shift precisely: "Large language models (LLMs) are widening the top, shifting the bottleneck from ‘we need more ideas’ to ‘we need to evaluate ideas faster.'" That is not a technology observation. It is an organizational architecture observation. The constraint has moved.
What Each Firm Actually Built
D.E. Shaw: Gateway and audit hashes
Public coverage of D.E. Shaw’s internal stack describes an LLM Gateway that logs every call, strips PII before it reaches any model, and throttles usage per-desk with budget controls. A component called DocLab adds cryptographic audit hashes to every document retrieval, creating a timestamped chain of what the model saw when it generated an output. Quants build tools against these interfaces in approximately ten lines of code. The productivity gain is real, but the envelope is enforced. Every call goes through the gateway. Nothing goes around it.
Thank you for reading this post, don't forget to subscribe!
Subscribe by Email
I want to be precise about the sourcing here: these details trace to industry analyst coverage (notably Resonanz Capital’s November 2025 synthesis of hedge-fund AI deployments) rather than a D.E. Shaw primary publication. D.E. Shaw does not publish internal stack specifications. The pattern, gateway, PII filter, per-desk budget, retrieval audit, is consistent with how every firm in this cohort has approached the same problem.
Man Group: AlphaGPT and the three-agent chain
Man Group’s AlphaGPT is the most publicly documented AI agent system in buy-side trading. Bloomberg has reported its structure: a three-agent chain handling ideation, implementation, and evaluation. The chain is sequential. A hypothesis-generating agent produces research directions. An implementing agent converts those into executable code. An evaluating agent applies statistical scrutiny before anything advances.
What matters architecturally is not the chain itself, multi-agent pipelines are common, but the placement of human review. Humans review every step before any signal enters live trading. Man Group has also been explicit about the failure mode: hallucination "remains a big issue," per Bloomberg’s July 2025 coverage, and the firm ships the system anyway. The architecture is designed to absorb hallucination rather than prevent it. The containment layer is the answer to the failure mode, not the elimination of the failure mode.
Balyasny: Federated deployment, central guardrails
Balyasny established a...