Gemini 3.5 Flash: more expensive, but Google plan to use it for everything

flyaway1231 pts0 comments

Gemini 3.5 Flash: more expensive, but Google plan to use it for everything

Simon Willison’s Weblog

Subscribe

Sponsored by: Datadog — Ship reliable AI faster with LLM Observability. Read the best practices guide

Gemini 3.5 Flash: more expensive, but Google plan to use it for everything

19th May 2026

Today at Google I/O, Google released Gemini 3.5 Flash. This one skipped the -preview modifier and went straight to general availability, and Google appear to be using it for a whole lot of their key products:

3.5 Flash is available today to billions of people globally:

For everyone via the Gemini app and AI Mode in Google Search

For developers in our agent-first development platform Google Antigravity and Gemini API in Google AI Studio and Android Studio

For enterprises in Gemini Enterprise Agent Platform and Gemini Enterprise.

As usual with Gemini, the most interesting details are tucked away in the What’s new in Gemini 3.5 Flash developer documentation. It mostly has the same set of platform features as the previous Gemini 3.x series, albeit with no computer use. The model ID is gemini-3.5-flash. The knowledge cut-off is January 2025, and it supports 1,048,576 input tokens and 65,536 maximum output tokens.

Google are also pushing a new Interactions API, currently in beta, which looks to me like their version of the patterns introduced by OpenAI Responses—in particular server-side history management.

The price has gone up

Gemini 3.5 Flash is accompanied by a notable price bump. The previous models in the “Flash” family were Gemini 3 Flash Preview and Gemini 3.1 Flash-Lite. The new 3.5 Flash is 3x the price of 3 Flash Preview and 6x the price of 3.1 Flash-Lite (see price comparison here).

At $1.50/million input and $9/million output it’s getting close in price to Google’s Gemini 3.1 Pro, which is $2 and $12.

The Gemini team promise that 3.5 Pro will roll out “next month”—presumably at an even higher price.

This fits a trend: OpenAI’s GPT-5.5 was 2x the price of GPT-5.4, and Claude Opus 4.7 is around 1.46x the price of 4.6 when you take the new tokenizer into account.

Given the price increase it’s interesting to see Google roll it out for so many of their own free-to-consumer products. It feels like all three of the major AI labs are starting to probe the price tolerance of their API customers.

Artificial Analysis publish the cost to run their proprietary benchmark against models, which is a useful way to take things like tokenization and increased volume of reasoning tokens into account. Some numbers worth comparing:

Gemini 3.5 Flash (high): $1,551.60

Gemini 3.1 Pro Preview: $892.28

Gemini 3 Flash Preview (Reasoning): $278.26

Gemini 3.1 Flash-Lite Preview: $93.60

Running the benchmark for 3.5 Flash (high) cost significantly more than 3.1 Pro Preview!

Here are some numbers from other vendors:

Claude Opus 4.7 (Adaptive Reasoning, Max Effort): $5,117.14

Claude Opus 4.7 (Non-reasoning, High Effort): $1,217.23

GPT-5.5 (xhigh): $3,357.00

GPT-5.5 (medium): $1,199.14

A pelican on a bicycle

I ran “Generate an SVG of a pelican riding a bicycle” against the Gemini API and got back this pelican, which is a lot:

From the code comments:

hedgehog on Hacker News:

That pelican looks like it’s in Miami for a crypto conference.

That one cost me 11 input tokens and 14,403 output tokens, for a total cost of just under 13 cents.

Posted 19th May 2026 at 10:40 pm · Follow me on Mastodon, Bluesky, Twitter or subscribe to my newsletter

More recent articles

The last six months in LLMs in five minutes - 19th May 2026

Notes on the xAI/Anthropic data center deal - 7th May 2026

This is Gemini 3.5 Flash: more expensive, but Google plan to use it for everything by Simon Willison, posted on 19th May 2026.

google<br>408

ai<br>2026

generative-ai<br>1793

llms<br>1759

gemini<br>188

llm-pricing<br>73

pelican-riding-a-bicycle<br>115

llm-release<br>200

Previous: The last six months in LLMs in five minutes

Monthly briefing

Sponsor me for $10/month and get a curated email digest of the month's most important LLM developments.

Pay me to send you less!

Sponsor & subscribe

Disclosures

Colophon

&copy;

2002

2003

2004

2005

2006

2007

2008

2009

2010

2011

2012

2013

2014

2015

2016

2017

2018

2019

2020

2021

2022

2023

2024

2025

2026

gemini flash google price preview tokens

Related Articles