Google Gemini-SQL2 tops text-to-SQL benchmarks

Google Research's Gemini-SQL2 tops text-to-SQL benchmarks by a wide margin

The Decoder

AI, Menschen, Wirtschaft

-->

Subscribe Now

The Decoder

Opens discord in a new tab Opens LinkedIn in a new tab

Google Research's Gemini-SQL2 tops text-to-SQL benchmarks by a wide margin

Matthias Bastian

View the LinkedIn Profile of Matthias Bastian

Jun 13, 2026

Google Research unveiled Gemini-SQL2, a new text-to-SQL system built on Gemini 3.1 Pro. It translates natural language into executable SQL database queries. On the BIRD benchmark, which measures how accurately these translations work, Gemini-SQL2 hits an execution accuracy of 80.04 percent, putting it in first place, according to Google. OpenAI's GPT-5.5-xhigh scores about 72.8 percent, and Anthropic's Claude Opus 4.6 lands around 70.9 percent. Models from Databricks, AWS, Tencent, and Alibaba all trail well behind.

Gemini-SQL2 leads the BIRD text-to-SQL leaderboard with 80.04 percent execution accuracy, outpacing competitors from OpenAI, Anthropic, Databricks, and others. | Image: Google Google Research points out that turning natural language into correct SQL is especially hard because data is often layered and queries need to account for complex business logic. The generated SQL queries both look correct and execute successfully, the company says.

Better SQL understanding could improve natural language features across Google's data services more broadly, according to Google. The research team hasn't said anything about a public release of the model, and there's no paper yet either.Ad DEC_D_Incontent-1

AI News Without the Hype – Curated by Humans

Subscribe to THE DECODER for ad-free reading, a weekly AI newsletter, our exclusive "AI Radar" frontier report six times a year, full archive access, and access to our comment section.

Subscribe now

Source: Google Research

Ask about this article…

Top Stories

Landmark German ruling declares Google's AI Overviews are Google's own words and makes it liable for false answers

Anthropic releases Claude Fable 5 and Mythos 5 with major gains in coding and science

Researchers pinpoint why larger language models pick up skills that small ones miss

OpenAI now says "entirely automating everything is not the future we want"

OpenAI says "chat is dead" and plans to rebuild ChatGPT as a full-blown agent app

Don't Miss What Matters

Stay in the loop on AI. Clear, useful, no fluff.

Your email address

Subscribe to our Newsletter

Most Popular

Landmark German ruling declares Google's AI Overviews are Google's own words and makes it liable for false answers

For $1.3 million a month, OpenClaw founder Peter Steinberger runs 100 AI agents that code, review PRs, and find bugs

Google launches a tiny board that runs Gemma 3 locally

Fields Medalist says ChatGPT 5.5 Pro delivered "PhD-level" math research in under two hours with zero human help

Researchers let Claude Code discover AI scaling algorithms that humans probably wouldn't have designed

Claude Mythos reportedly solves OpenAI's landmark Erdős problem with a "cute, simple proof"

AI Community & Insights

The Decoder

Follow The Decoder for AI news, background stories and expert analyses.

Opens LinkedIn in a new tab

Opens Discord in a new tab

BETA-TEST

Start new search

Send

wpDiscuz

Insert

BETA-TEST

Start new search

Send

wpDiscuz

Insert

Google Gemini-SQL2 tops text-to-SQL benchmarks

Related Articles

The Newest Instagram "Exploit" Is the Goofiest I've Seen

Apple WWDC 2026 Livestream

Claude Fable 5

US Government directive to suspend access to Fable 5 and Mythos 5

It's Not Just X. It's Y