AI Can Do Anything


Clawd Cursor v0.9.7 — the local MCP server for safe desktop control

v0.9.7 — latest stable

The local MCP server that gives any agent safe desktop control

Any model. Any app. One MCP entry. Local-only. 6 compact tools, single safety chokepoint, no telemetry. v0.9: MCP-only protocol, Reflector feedback, guides marketplace.


6 Compact Tools (public surface)

97 Granular Tools (compat / debug)

3 Operating Systems

Two ways to use it

Run it yourself, or hand it to your agent.

Test from the CLI

Plain English in, actions out.

clawdcursor doctor
clawdcursor agent

Wire it into your agent

One MCP entry, desktop control appears as native tools.

Claude Code · Cursor · Windsurf · OpenClaw · Zed

Pick a mode

How will your AI talk to it?

Same tools, three entry shapes. Pick once during install.

clawdcursor mcp — recommended

AI lives in your editor (Claude Code, Cursor, Windsurf, Zed). Editor spawns clawdcursor on demand over stdio. No daemon, no port.

"mcpServers": {
  "clawdcursor": {
    "command": "clawdcursor",
    "args": ["mcp", "--compact"]
  }
}

Tools (compact / granular): 6 / 97

Transport: stdio
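If your editor keeps its MCP servers in a JSON file, the entry shown for this mode can be merged in programmatically. This is a minimal sketch, assuming the file's top-level key is `mcpServers` as in the snippet. The helper name `add_clawdcursor_entry` is hypothetical, and where each editor stores its config is not covered here.

```python
import json


def add_clawdcursor_entry(config: dict) -> dict:
    """Return a copy of an editor MCP config with the clawdcursor entry added.

    Hypothetical helper: only the entry contents come from the page.
    """
    merged = json.loads(json.dumps(config))  # cheap deep copy of JSON-like data
    servers = merged.setdefault("mcpServers", {})
    servers["clawdcursor"] = {
        "command": "clawdcursor",
        "args": ["mcp", "--compact"],
    }
    return merged


if __name__ == "__main__":
    existing = {"mcpServers": {"other-server": {"command": "other"}}}
    print(json.dumps(add_clawdcursor_entry(existing), indent=2))
```

Working on a copy leaves the original config untouched, so a failed write cannot leave the editor with a half-edited file.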

clawdcursor agent — autonomous daemon

clawdcursor brings its own LLM brain (configured via doctor). For unattended runs, scheduled tasks, multi-process orchestration.

Run clawdcursor doctor · pick a provider

Run clawdcursor agent

POST tasks to 127.0.0.1:3847/mcp

HTTP MCP port: 3847

Providers: 13+

clawdcursor agent --no-llm — BYO brain

Your agent already has a brain — you just want HTTP tools. Same daemon, no built-in pipeline.

Run clawdcursor agent --no-llm

97 tools on :3847/mcp

Stateless — no session init needed

Granular tools (compat): 97

HTTP client: any
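MCP messages are JSON-RPC 2.0, so any HTTP client can drive the daemon. This is a minimal sketch, assuming the endpoint accepts a plain JSON-RPC POST and the standard MCP `tools/list` and `tools/call` methods. The helper names and the `Accept` header are assumptions, not taken from the clawdcursor docs.

```python
import json
import urllib.request
from typing import Optional

MCP_URL = "http://127.0.0.1:3847/mcp"  # address from the page


def jsonrpc_request(method: str, params: Optional[dict] = None, request_id: int = 1) -> dict:
    """Build a JSON-RPC 2.0 message, the wire format MCP uses."""
    message = {"jsonrpc": "2.0", "id": request_id, "method": method}
    if params is not None:
        message["params"] = params
    return message


def post(message: dict, url: str = MCP_URL, timeout: float = 10.0) -> str:
    """POST one message to a running daemon and return the raw response body."""
    req = urllib.request.Request(
        url,
        data=json.dumps(message).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            # Assumption: the server may answer as JSON or as an event stream.
            "Accept": "application/json, text/event-stream",
        },
        method="POST",
    )
    with urllib.request.urlopen(req, timeout=timeout) as resp:
        return resp.read().decode("utf-8")


if __name__ == "__main__":
    # Only builds and prints the payload; call post(...) once the daemon is running.
    print(json.dumps(jsonrpc_request("tools/list")))
```

With `clawdcursor agent --no-llm` running, `post(jsonrpc_request("tools/list"))` should return the tool catalog, though the exact response shape is not documented on this page.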

How it works

Cheap paths first.

A11y tree before pixels. Vision only when needed.

1 Router

Zero LLM. Pattern-match shortcuts for common tasks. Sub-second.
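A router of this kind can be sketched as a table of regular expressions tried in order. The patterns and action names below are hypothetical, chosen only to illustrate the idea; the real shortcut table is not shown on this page.

```python
import re
from typing import Optional

# Hypothetical patterns: each maps a plain-English phrasing to an action name.
SHORTCUTS = [
    (re.compile(r"^open (?P<app>[\w .-]+)$", re.I), "launch"),
    (re.compile(r"^take a screenshot$", re.I), "screenshot"),
    (re.compile(r"^scroll (?P<direction>up|down)$", re.I), "scroll"),
]


def route(task: str) -> Optional[dict]:
    """Return an action for a recognized task, or None to hand off to the agent."""
    text = task.strip()
    for pattern, action in SHORTCUTS:
        match = pattern.match(text)
        if match:
            # Named groups become the action's parameters.
            return {"action": action, **match.groupdict()}
    return None


if __name__ == "__main__":
    print(route("Open Notepad"))
    print(route("summarize my inbox"))
```

Returning `None` for anything unrecognized is what keeps the router cheap: it never guesses, it only short-circuits tasks it is sure about.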

2 Agent

One loop, three modes. Blind reads the a11y tree. Hybrid adds on-demand screenshots. Vision is the fallback for canvas UIs.
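The escalation between the three modes can be sketched as a small decision function. The inputs and the heuristic are assumptions made for illustration; the page does not say how the agent actually decides.

```python
from enum import Enum


class Mode(Enum):
    BLIND = "blind"    # a11y tree only
    HYBRID = "hybrid"  # a11y tree plus on-demand screenshots
    VISION = "vision"  # screenshots only, for canvas UIs


def pick_mode(a11y_node_count: int, target_found: bool) -> Mode:
    """Choose the cheapest mode that can still see the target.

    Hypothetical heuristic: both parameters are illustrative.
    """
    if a11y_node_count == 0:
        return Mode.VISION  # a canvas-style UI exposes no usable tree
    if target_found:
        return Mode.BLIND   # the tree alone is enough
    return Mode.HYBRID      # a tree exists, but a screenshot is needed to disambiguate


if __name__ == "__main__":
    print(pick_mode(120, True))
    print(pick_mode(0, False))
```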

3 Safety

Single chokepoint. Every tool call gates through one safety layer. Destructive actions need confirmation.
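A single chokepoint means every handler is invoked through one function, so the confirmation rule cannot be bypassed by adding a new tool. This is a minimal sketch; the set of destructive actions and all names are hypothetical, since the page does not list which actions need confirmation.

```python
from typing import Callable

# Hypothetical classification, for illustration only.
DESTRUCTIVE = {"delete_file", "close_window", "kill_process"}


class ConfirmationRequired(Exception):
    """Raised when a destructive action has not been confirmed."""


def gate(action: str, handler: Callable[[], str], confirmed: bool = False) -> str:
    """Single entry point: every tool call passes through here before it runs."""
    if action in DESTRUCTIVE and not confirmed:
        raise ConfirmationRequired(action)
    return handler()


if __name__ == "__main__":
    print(gate("click", lambda: "clicked"))
    try:
        gate("delete_file", lambda: "deleted")
    except ConfirmationRequired as exc:
        print(f"needs confirmation: {exc}")
```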

🎯

Compact tool surface

6 compound tools — computer, accessibility, window, system, browser, task. ~12× smaller catalog than the granular surface.

🧩

One adapter per OS

Windows, macOS, Linux behind a single interface. Linux covers X11 and Wayland.

Features

Any OS. Any model.

🍎

macOS

TCC-safe. clawdcursor grant handles Accessibility + Screen Recording.

🪟

Windows

Native UIA + Windows.Media.Ocr. x64 and ARM64.

🐧

Linux

X11 and Wayland. AT-SPI for a11y, Tesseract for OCR.

🖱️

Smart tools

Click by name, type by label, read screen. A11y first, OCR as fallback.

⌨️

Shortcuts engine

Platform-aware key combos — Cmd on macOS, Ctrl elsewhere. No LLM cost.
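The Cmd-versus-Ctrl rule can be sketched with `sys.platform`, which is `"darwin"` on macOS. The function names and the `modifier+key` string format are assumptions for illustration.

```python
import sys


def primary_modifier(platform: str = sys.platform) -> str:
    """Cmd on macOS, Ctrl elsewhere."""
    return "cmd" if platform == "darwin" else "ctrl"


def combo(key: str, platform: str = sys.platform) -> str:
    """Build a combo string such as 'cmd+c'. The string format is an assumption."""
    return f"{primary_modifier(platform)}+{key}"


if __name__ == "__main__":
    print(combo("c"))           # depends on the machine running this
    print(combo("c", "darwin"))
```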

📖

Guides marketplace

Community shortcuts & workflows for popular apps. Fetched on demand. Contribute via PR.

Guides Marketplace

Community knowledge for popular apps.

The agent reasons from scratch on unfamiliar apps. For popular ones, fetch the shortcuts.

5–10× faster on known apps

Guides ship keyboard shortcuts, workflow patterns, layout cues, and failure modes. The agent skips the discovery loop.

🌐

Fetched on demand

Cached locally for 7 days, LRU at 50 entries. Agent never blocks on the network — offline falls back to first-principles reasoning.
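A cache with a 7-day lifetime and a 50-entry LRU limit can be sketched with `collections.OrderedDict`. The class below is a hypothetical in-memory stand-in; how clawdcursor actually stores guides on disk is not described here.

```python
import time
from collections import OrderedDict
from typing import Optional

TTL_SECONDS = 7 * 24 * 60 * 60  # 7 days, from the page
MAX_ENTRIES = 50                # LRU capacity, from the page


class GuideCache:
    """Hypothetical in-memory stand-in for the local guide cache."""

    def __init__(self, ttl: float = TTL_SECONDS, capacity: int = MAX_ENTRIES, clock=time.time):
        self._ttl = ttl
        self._capacity = capacity
        self._clock = clock  # injectable so expiry can be tested without waiting
        self._entries = OrderedDict()  # app name -> (stored_at, guide)

    def get(self, app: str) -> Optional[dict]:
        entry = self._entries.get(app)
        if entry is None:
            return None
        stored_at, guide = entry
        if self._clock() - stored_at > self._ttl:
            del self._entries[app]      # expired: caller refetches or reasons from scratch
            return None
        self._entries.move_to_end(app)  # mark as most recently used
        return guide

    def put(self, app: str, guide: dict) -> None:
        self._entries[app] = (self._clock(), guide)
        self._entries.move_to_end(app)
        while len(self._entries) > self._capacity:
            self._entries.popitem(last=False)  # evict the least recently used entry
```

A miss returns `None` rather than raising, which matches the offline behavior described above: the caller simply proceeds without a guide.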

🛡️

Linter every load

Schema check + prompt-injection patterns + dangerous-prose detection on every guide before injection. Defense-in-depth.
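The three checks can be sketched as one function that returns a list of problems. The required keys and every pattern below are hypothetical; the real schema and pattern lists are not shown on this page, and a production linter would need far broader coverage.

```python
import re

REQUIRED_KEYS = {"app", "shortcuts"}  # hypothetical schema

# Hypothetical patterns, for illustration only.
INJECTION_PATTERNS = [
    re.compile(r"ignore (all )?(previous|prior) instructions", re.I),
    re.compile(r"you are now\b", re.I),
]
DANGEROUS_PROSE = [
    re.compile(r"\brm -rf\b", re.I),
    re.compile(r"disable (the )?safety", re.I),
]


def _strings(value):
    """Yield every string nested anywhere inside a JSON-like value."""
    if isinstance(value, str):
        yield value
    elif isinstance(value, dict):
        for item in value.values():
            yield from _strings(item)
    elif isinstance(value, list):
        for item in value:
            yield from _strings(item)


def lint_guide(guide: dict) -> list:
    """Return a list of problems; an empty list means the guide passed."""
    problems = [f"missing key: {key}" for key in sorted(REQUIRED_KEYS - set(guide))]
    for text in _strings(guide):
        if any(p.search(text) for p in INJECTION_PATTERNS):
            problems.append(f"prompt-injection pattern: {text!r}")
        if any(p.search(text) for p in DANGEROUS_PROSE):
            problems.append(f"dangerous prose: {text!r}")
    return problems
```

Running the check on every load, not only at submission, means a guide that was tampered with in the cache is still caught before it reaches the model.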

10 verified guides at launch

discord · excel · figma · gmail · mspaint · olk (new Outlook) · outlook · slack · spotify · youtube (19 workflows, 36 shortcuts, 13 tips)

# browse + manage cache
clawdcursor guides available
clawdcursor guides install youtube
clawdcursor guides list

# submit your own
clawdcursor guides lint my-app.json
clawdcursor guides submit my-app.json

Source: github.com/AmrDab/clawdcursor-guides · Served at clawdcursor.com/app-guides · PRs via GitHub · Ratings via 👍/👎 on vote: issues

Tools

6 compact tools + 97 granular

The 6 compact compounds are the recommended public surface. Each row lists the actions you pass via { "action": "…" }. The 97 granular tools (one schema per verb) are listed below for compatibility and debugging. Use them when your runtime requires every primitive as a top-level MCP tool.

| Compound | Purpose | Actions |
| --- | --- | --- |
| computer | Mouse, keyboard, screenshots. The raw I/O surface. | screenshot · click · double_click · right_click · triple_click · hover · scroll · scroll_horizontal · drag · drag_path · type · key · wait |
| accessibility | Drive UI by element name, not by pixel. Survives DPI, resize, layout shifts. | read_tree · find · get_element · focused · invoke · focus · set_value · get_value · expand · collapse · toggle · select · state · list_children · wait_for |
| window | Launch, focus, resize. App-level state management. | list · active · focus · maximize ·... |
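A client can check an action against the listed names before sending a compact tool call. This is a minimal sketch: the action sets are copied from the `computer` and `accessibility` rows, `window` is left out because its row is truncated, and the `x`/`y` parameter names in the usage are hypothetical. Placing extra parameters next to `"action"` in the same object is also an assumption.

```python
# Action names copied from the table; only fully listed rows are included.
COMPACT_ACTIONS = {
    "computer": {
        "screenshot", "click", "double_click", "right_click", "triple_click",
        "hover", "scroll", "scroll_horizontal", "drag", "drag_path",
        "type", "key", "wait",
    },
    "accessibility": {
        "read_tree", "find", "get_element", "focused", "invoke", "focus",
        "set_value", "get_value", "expand", "collapse", "toggle", "select",
        "state", "list_children", "wait_for",
    },
}


def build_call(compound: str, action: str, **arguments) -> dict:
    """Build `tools/call` params for a compact tool, rejecting unknown actions."""
    allowed = COMPACT_ACTIONS.get(compound)
    if allowed is None:
        raise ValueError(f"unknown compound: {compound}")
    if action not in allowed:
        raise ValueError(f"{compound} has no action {action!r}")
    return {"name": compound, "arguments": {"action": action, **arguments}}


if __name__ == "__main__":
    # x and y are hypothetical parameter names.
    print(build_call("computer", "click", x=10, y=20))
```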
