CueBench for Developers is live: score how well you drive coding agents

Your AI Fluency

See how you're using AI — and level up.

Drop in your AI coding session logs and get scored on the four AI Fluency skills. Track your growth, spot your habits, and get concrete coaching from your own real sessions. Built for developers, about your own work — nobody's watching over your shoulder.

AI fluency skills

0–100

Score per session

Drop in

Upload → scored

Get your AI fluency score

Continue with Google

Your scores are yours. Session files are scored, then deleted — never stored.

New session scored

View session

Coach walkthrough

Your coach is reviewing this session…

Reading the trace, your scores, and your recent habits. A few seconds.

{{ wtStepT }} {{ wtStepSigLabel }}

What happened

The coaching

Try instead

Session takeaway

✓ Keep doing

→ Change

Want to make it stick? Take on a measurable focus challenge for your next sessions.

← Back

Developer beta

1. Acceptance. By using CueBench you agree to these Terms and our Privacy Policy.

2. The Service. CueBench analyzes your AI coding sessions and provides scores and coaching. This is a developer beta: features may change or break.

3. Your Content. You keep all rights to your session logs. Uploaded files are deleted after scoring; we keep the derived scores, insights, and timelines (incl. short prompt excerpts).

4. Data Use & Model Improvement. As a beta participant, you agree we may use your usage data — scores, telemetry, and short prompt excerpts — to operate, evaluate, improve, and train the models powering the Service. Raw uploaded files are never used for training. "Delete my data" in Settings removes your data at any time.

5. Acceptable Use. No uploading content you lack rights to, probing others' data, or reverse-engineering the models.

6. Beta Disclaimer. Provided "as is", no warranties; scores are informational only.

Full text: Terms of Use · Privacy Policy

I agree to the Terms of Use and Privacy Policy, including the use of my beta usage data to improve and train CueBench's models.

{{ tosError }} {{ tosSubmitLabel }}

30 seconds, once

Tell us who you are

How would you describe yourself?

Where do you work (optional)

Your role (optional)

I manage a team (optional)

Where did you find CueBench? *

What insights would you want from your sessions?

{{ svError }} {{ svSubmitLabel }}

Feedback & bug reports

Found a bug, hit an error, or have an idea? Tell us — it goes straight to the founders.

{{ fbStatus }} {{ fbSendLabel }}

Recommended setup

Connect your sessions permanently

Skip manual uploads: a tiny background agent watches for finished Claude Code, Cursor, and Codex sessions and sends each one to your dashboard automatically. It only looks for session logs — it never sees your code, your files, or anything else on your machine. Scoring happens on our servers and session files are deleted right after.

curl -fsSL {{ agentInstallUrl }} | bash -s -- --key {{ orgApiKey }} --api {{ agentApiUrl }} --origin {{ agentOrigin }}

We'd recommend doing it now — but you can always install it later from Settings. {{ agentStatusLabel }}

✎Feedback / report a bug

✦Contact for Enterprise

Log out

Updated {{ updatedAt }}

Cost & efficiency

Total spend

Avg per session

Model usage

No session data yet — cost metrics will appear once sessions are recorded.

Risk signals

0 ? '#D97706' : '#A1A1AA' }};">{{ riskLoopsDisplay }}

Sessions with loops

of total sessions flagged

0 ? '#DC2626' : '#A1A1AA' }};">{{ riskBelowThreshold }}

Below threshold

developers below {{ alertBelow }}-point target

Review developers →

People & Skills

Developers

Updated {{ updatedAt }}

! ALERT {{ alertText }} Review {{ alertCount }} developers

Side-by-side comparison

Clear ×

{{ ce.role }} · {{ ce.dept }}

Overall

All roles Staff Engineer Engineering Lead Senior SWE SWE II SWE I ML Engineer Data Engineer

All scores Top performers (≥ 80) On track (65–79) Needs attention (

Clear filters

{{ filteredCount }} developers

{{...

CueBench for Developers is live: score how well you drive coding agents

Related Articles

(no title)

Is AI ruining our skills? Early results are in – and they're not good

The Anatomy of an AI-Native Org

ZCode – Harness for GLM-5.2

Apertus – Open Foundation Model for Sovereign AI