Building DeepSeek's Answer to Claude Code

DeepSeek Agent Harness: Technical deep-dive & the open-source blueprint | Hacker News style

🧠 DeepSeek Harness Agent Engineering & Open‑Source Analysis

6 min read

DEEPSEEK · AGENT HARNESS

Model + Harness = Agent: building DeepSeek’s answer to Claude Code

📅 May 21, 2026 🎯 10+ technical references 🐚 deepseek-tui deep dive

#AgentLoop #MCP #RLM #RustLang #HarnessEngineering

📖 Table of contents

1. The vision behind Agent Harness

2. Job requirements & technical stack

3. How it compares to Claude Code

4. DeepSeek-TUI: the accidental open‑source blueprint

5. Core engineering layers: loop, memory, MCP, desktop

6. Resources & further reading

⚙️ The “Model + Harness” equation

DeepSeek is hiring an Agent Harness R&D Engineer to build the missing layer between their frontier models and production‑ready agents. The team’s mission is explicit: Model + Harness = Agent. Everything except the model itself (tooling, memory, agent loop, desktop integration, MCP, and feedback systems) belongs to the Harness. This is not just a wrapper — it’s a co‑design effort where the harness and the model evolve together.

“We are turning DeepSeek’s cutting-edge models into leading Agent products. Harness encompasses all the infrastructure, from orchestration to desktop UX.” — official job description.

📋 What the role demands: engineering stack

The job posting (Beijing, full‑time) requires 2+ years of software development, deep LLM agent knowledge, and hands‑on expertise with Agent Loop, KV cache, MCP, Memory, Multi‑Agent, Planning, Skills . Below is the inferred technical stack based on the “Harness Engineering” keywords:

LayerTechnologies / patternsWhy it matters for Harness

Agent LoopAsync Rust/Python, state machine, turn‑based orchestrationDecides when to call tools, reason, replan – core runtime Context EngineeringKV cache optimization, progressive disclosure, summarizationManage cost & latency; reuse prefix caches across agent turns MemorySQLite + vector extensions, LanceDB, episodic & semantic memoryCross‑session learning, personalization Tool Use / MCPModel Context Protocol, plugin system, typed tool registryExtensibility & interoperability (files, shell, web, APIs) Desktop UITauri (Rust + web frontend) or Electron, OS accessibility hooksNative desktop Agent product beyond CLI Model‑Harness co‑designFeedback loops, reasoning traces, fine‑tuning data collectionImprove model behavior for agentic tasks (planning, tool use)

The role explicitly mentions Prompt Engineering, Context Engineering, and Harness Engineering as distinct disciplines — a signal that DeepSeek is investing in agentic infrastructure as a first‑class product.

🤖 Claude Code for DeepSeek? Not quite — broader

On the surface, the Harness looks like DeepSeek’s version of Anthropic’s Claude Code: an AI‑powered dev tool with tool use, planning, and loops. But the job description reveals significant differences:

AspectClaude CodeDeepSeek Harness (vision)

Primary interfaceTerminal (CLI)Desktop Agent (GUI + TUI) – multi‑modal Model integrationBlack‑box Claude modelsCo‑design with DeepSeek’s own training team; model & harness co‑evolve ScopeSoftware development focusedGeneralist agent harness (coding as first use‑case, but extensible to arbitrary tasks) Memory & personalizationLimited conversation memoryLong‑term vector memory + session replay Open ecosystemClosed sourceMCP‑compatible, potential community skills

You’d be building a platform for agentic tools where coding is the first application, but the architecture aims for much more: desktop automation, research assistance, and “Harness” as a product.

🦀 DeepSeek-TUI: the Rust open‑source blueprint

Ironically, the most concrete implementation of the “Harness” philosophy isn’t internal — it’s an external MIT‑licensed project: deepseek-tui (2.3k+ stars, built by Hunter Bown). The tool is a terminal‑based agent written in Rust that demonstrates almost every technical requirement listed in the job posting:

Agent loop – engine/turn_loop.rs with streaming and tool orchestration.

MCP client – full stdio transport, dynamic tool discovery.

RLM (Recursive Language Model) – subagent fan‑out: main V4‑Pro delegates to 1–16 parallel V4‑Flash children. At $0.14/M tokens, 16 parallel agents cost less than one V4‑Pro call.

Memory – persistent SQLite event timeline, replayable sessions.

LSP integration – post‑edit diagnostics injected back into context.

Three execution modes – Plan (read‑only), Agent (approval), YOLO (auto).

# deepseek-tui dual‑binary architecture deepseek (dispatcher CLI) → deepseek-tui (runtime) → ratatui UI ↔ async engine

# Turn loop pseudo‑implementation (simplified) loop { build_messages_and_tools() call DeepSeek V4 API (stream thinking/tool_calls) if tool_calls: approval_gate() → execute → inject results else: break

Where it falls short: early stage maturity; many commands (/memory, /tasks) are reported as non‑functional, and the skills ecosystem is incomplete. This is exactly...

Building DeepSeek's Answer to Claude Code

Related Articles

Elevated error rates on requests to multiple models

Donald Trump and sons to be 'forever' exempt from tax audits

PopuLoRA: Co-Evolving LLM Populations for Reasoning Self- Play

Old Reddit Is Down

The ultimate female fantasy – A feminist critique of Beauty and the Beast