CueBench for Developers is live: score how well you drive coding agents

DillonMehta1 pts0 comments

Your AI Fluency

See how you're using AI — and level up.

Drop in your AI coding session logs and get scored on the four AI Fluency skills. Track your growth, spot your habits, and get concrete coaching from your own real sessions. Built for developers, about your own work — nobody's watching over your shoulder.

AI fluency skills

0–100

Score per session

Drop in

Upload → scored

Get your AI fluency score

Sign in, drop in a session log, and see your first score in under a minute.

Continue with Google

{{ loginError }}

Your scores are yours. Session files are scored, then deleted — never stored.

Privacy Policy · Terms of Use

New session scored

{{ toastTask }}

{{ toastScore }}

View session

Coach walkthrough

{{ wtTitle }}

{{ wtCounter }}

Your coach is reviewing this session…

Reading the trace, your scores, and your recent habits. A few seconds.

{{ wtErrorMsg }}

Close

{{ wtStepT }}<br>{{ wtStepSigLabel }}

{{ wtStepHeadline }}

What happened

{{ wtStepHappened }}

The coaching

{{ wtStepCoaching }}

Try instead

{{ wtStepTry }}

Session takeaway

{{ wtSummaryHeadline }}

✓ Keep doing

{{ wtSummaryKeep }}

→ Change

{{ wtSummaryChange }}

Want to make it stick? Take on a measurable focus challenge for your next sessions.

{{ wtChallengeLabel }}

← Back

{{ wtNextLabel }}

Developer beta

Terms of Use

1. Acceptance. By using CueBench you agree to these Terms and our Privacy Policy.

2. The Service. CueBench analyzes your AI coding sessions and provides scores and coaching. This is a developer beta: features may change or break.

3. Your Content. You keep all rights to your session logs. Uploaded files are deleted after scoring; we keep the derived scores, insights, and timelines (incl. short prompt excerpts).

4. Data Use & Model Improvement. As a beta participant, you agree we may use your usage data — scores, telemetry, and short prompt excerpts — to operate, evaluate, improve, and train the models powering the Service. Raw uploaded files are never used for training. "Delete my data" in Settings removes your data at any time.

5. Acceptable Use. No uploading content you lack rights to, probing others' data, or reverse-engineering the models.

6. Beta Disclaimer. Provided "as is", no warranties; scores are informational only.

Full text: Terms of Use · Privacy Policy

{{ tosBoxMark }}

I agree to the Terms of Use and Privacy Policy, including the use of my beta usage data to improve and train CueBench's models.

{{ tosError }}<br>{{ tosSubmitLabel }}

30 seconds, once

Tell us who you are

How would you describe yourself?

{{ lv.label }}

Where do you work (optional)

Your role (optional)

{{ svManagesMark }}

I manage a team (optional)

Where did you find CueBench? *

What insights would you want from your sessions?

{{ svError }}<br>{{ svSubmitLabel }}

Feedback & bug reports

Found a bug, hit an error, or have an idea? Tell us — it goes straight to the founders.

{{ k.label }}

{{ fbStatus }}<br>{{ fbSendLabel }}

Recommended setup

Connect your sessions permanently

Skip manual uploads: a tiny background agent watches for finished Claude Code, Cursor, and Codex sessions and sends each one to your dashboard automatically. It only looks for session logs — it never sees your code, your files, or anything else on your machine. Scoring happens on our servers and session files are deleted right after.

curl -fsSL {{ agentInstallUrl }} | bash -s -- --key {{ orgApiKey }} --api {{ agentApiUrl }} --origin {{ agentOrigin }}

We'd recommend doing it now — but you can always install it later from Settings. {{ agentStatusLabel }}

{{ connectCopyLabel }}

✎Feedback / report a bug

✦Contact for Enterprise

Log out

{{ dashKicker }}

{{ dashTitle }}

Updated {{ updatedAt }}

{{ ac.label }}

{{ ac.value }}

{{ ac.sub }}

{{ heroLabel }}

{{ heroBody }}

{{ v.label }}

{{ v.delta }}

{{ v.score }}

{{ v.spark }}

Cost & efficiency

{{ costTotal }}

Total spend

{{ costAvg }}

Avg per session

Model usage

{{ mr.model }}

{{ mr.count }}

No session data yet — cost metrics will appear once sessions are recorded.

Risk signals

0 ? '#D97706' : '#A1A1AA' }};">{{ riskLoopsDisplay }}

Sessions with loops

of total sessions flagged

0 ? '#DC2626' : '#A1A1AA' }};">{{ riskBelowThreshold }}

Below threshold

developers below {{ alertBelow }}-point target

Review developers →

{{ c.tag }}

{{ c.initials }}

{{ c.name }}

{{ c.role }}

{{ c.value }}

{{ c.valueSub }}

People & Skills

Developers

Updated {{ updatedAt }}

! ALERT<br>{{ alertText }}<br>Review {{ alertCount }} developers

Side-by-side comparison

Clear ×

{{ ce.name }}

{{ ce.role }} · {{ ce.dept }}

{{ cv.label }}

{{ cv.score }}

Overall

{{ ce.overall }}

{{ tableTitle }}

{{ tableMeta }}

All roles<br>Staff Engineer<br>Engineering Lead<br>Senior SWE<br>SWE II<br>SWE I<br>ML Engineer<br>Data Engineer

All scores<br>Top performers (≥ 80)<br>On track (65–79)<br>Needs attention (

Clear filters

{{ filteredCount }} developers

{{ col.label }}

{{ r.chkCell }}

{{ r.rank }}

{{ r.initials }}

{{...

session sessions data developers scores score

Related Articles