Skip to main content

Google announces Gemini 3 Flash with Pro-level performance, rolling out now

Following last month’s launch of Gemini 3 Pro, Google today announced Gemini 3 Flash for consumers and developers.

The tagline, of sorts, for Gemini 3 Flash is “frontier intelligence built for speed at a fraction of the cost.” It retains Gemini 3’s complex reasoning, multimodal/vision understanding, and performance in agentic/vibe coding tasks, but adds “Flash-level latency, efficiency and cost.” The Flash model series is Google’s most popular offering.

It unsurprisingly surpasses 2.5 Flash across the board and “significantly” outperforms 2.5 Pro across several benchmarks. 3 Flash is comparable to 3 Pro, while even beating it in some areas, including MMMU Pro, Toolathlon, and MPC Atlas.

  • GPQA Diamond (scientific knowledge): 90.4%
  • Humanity’s Last Exam (academic reasoning): 33.7% without tools
  • MMMU Pro (multimodal understanding/reasoning): 81.2%
  • SWE-Bench Verified (agentic coding): 78%
  • Toolathlon (long horizon real-world software tasks): 49.4%
  • MCP Atlas (multi-step workflows using MCP): 57.4%

Google touts how 3 Flash “outperforms 2.5 Pro while being 3x faster at a fraction of the cost.” This strong reasoning, tool use, and multimodal capabilities can translate to “more complex video analysis, data extraction and visual Q&A” for third-party developers building customer support agents or in-game assistants.

Advertisement - scroll for more content

…priced at $0.50/1M input tokens and $3/1M output tokens (audio input remains at $1/1M input tokens).

Gemini app + AI Mode

Gemini 3 Flash is rolling out to the Gemini app now and replaces 2.5 Flash as the default model. It’s billed as a “major upgrade to your everyday AI” for delivering “smarts and speed.”  For example:

…you can quickly build fun, useful apps from scratch without prior coding knowledge. Just ask Gemini to help you iterate on an idea. You can dictate stream-of-consciousness thoughts on-the-go and turn those into a prototype.

Notably, Gemini 3 Flash will be available in the model picker as two options: “Fast” for quick answers and “Thinking” for complex problems. Gemini 3 Pro will appear as “Pro” for advanced math and code prompts.

It’s also rolling out globally as the default model in AI Mode. Gemini 3 Flash lets you ask more nuanced questions. Meanwhile, Google is making Gemini 3 Pro with its generative UI and Nano Banana Pro available for everyone in the US.  

… excels at grasping the nuance of your question to serve thoughtful, comprehensive responses that are visually digestible – pulling real-time local information and helpful links from across the web.

For developers, Gemini 3 Flash is available in preview via AI Studio, Google Antigravity, Gemini CLI, and Android Studio. It joins Gemini 3 Pro and Gemini 3 Deep Think from earlier this month.

FTC: We use income earning auto affiliate links. More.

You’re reading 9to5Google — experts who break news about Google and its surrounding ecosystem, day after day. Be sure to check out our homepage for all the latest news, and follow 9to5Google on Twitter, Facebook, and LinkedIn to stay in the loop. Don’t know where to start? Check out our exclusive stories, reviews, how-tos, and subscribe to our YouTube channel

Comments

Author

Avatar for Abner Li Abner Li

Editor-in-chief. Interested in the minutiae of Google and Alphabet. Tips/talk: abner@9to5g.com