Gemini 3 Flash

Gemini 3 Flash delivers Gemini 3's pro-grade reasoning at flash-level latency and cost, outperforming Gemini 2.5 Pro across most benchmarks with meaningful gains in token efficiency. Your use is subject to Google's Terms and Privacy Policies.

Reasoning, Tool Use, File Input, Vision (Image), Web Search, Tiered Cost, Implicit Caching

Use with AI Gateway

index.ts

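import { streamText } from 'ai'

const result = streamText({
  model: 'google/gemini-3-flash',
  prompt: 'Why is the sky blue?'
})

// Print the response as it streams in
for await (const textPart of result.textStream) {
  process.stdout.write(textPart)
}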


Providers

Route requests across multiple providers. Copy a provider slug to set your preference. Visit the docs for more info. Using a provider means you agree to their terms, listed under Legal.
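As a rough illustration of setting a provider preference, the sketch below builds a `providerOptions` object that asks the gateway to try providers in a given order. The helper `gatewayPreference` is hypothetical, and the slugs `'google'` and `'vertex'` are assumptions; copy the actual slugs from the provider list and check the docs for the exact option names.

```typescript
// Assumed provider slugs; copy the real ones from the provider list.
type ProviderSlug = 'google' | 'vertex'

// Hypothetical helper: wraps a preferred provider order in the shape
// assumed here for AI Gateway provider options.
function gatewayPreference(order: ProviderSlug[]) {
  return { gateway: { order } }
}

// Prefer Google first, then fall back to Google Vertex AI.
const providerOptions = gatewayPreference(['google', 'vertex'])

// Possible usage with the snippet above (requires the 'ai' package):
// const result = streamText({
//   model: 'google/gemini-3-flash',
//   prompt: 'Why is the sky blue?',
//   providerOptions
// })
```

Listing Google first reflects the lower latency and higher throughput shown for it on this page; reverse the order if Vertex AI suits your setup better.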

Provider	Context	Max Output	Latency	Throughput	Input	Output	Cache	Web Search	Capabilities	ZDR	No Training	Release Date
Google Vertex AI (Legal: Terms • Privacy)		65K	0.9s	152 tps	$0.50/M	$3/M	Read: $0.05/M; Write: —	$14/K + input costs				12/17/2025
Google (Legal: Terms • Privacy)		64K	0.7s	176 tps	$0.50/M	$3/M	Read: $0.05/M; Write: —	$14/K + input costs				12/17/2025
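To make the listed prices concrete, here is a minimal cost-estimate sketch using the rates shown above ($0.50/M input, $3/M output, $0.05/M cache read, $14 per 1,000 web searches). The function `estimateCostUSD` and the `Usage` shape are hypothetical. It assumes flat rates (the model is tagged tiered-cost, so real pricing may vary with prompt size), that cached input tokens are billed at the cache read rate instead of the input rate, and that any tokens added by web search results are already counted in `inputTokens`.

```typescript
// Hypothetical usage record; field names are illustrative only.
interface Usage {
  inputTokens: number       // total prompt tokens, including cached ones
  cachedInputTokens: number // portion of inputTokens served from cache
  outputTokens: number
  webSearches: number       // number of web search requests
}

// Rates copied from the provider listing (USD).
const RATES = {
  inputPerMillion: 0.5,
  cacheReadPerMillion: 0.05,
  outputPerMillion: 3,
  webSearchPerThousand: 14
}

function estimateCostUSD(usage: Usage): number {
  const uncachedTokens = usage.inputTokens - usage.cachedInputTokens
  const inputCost = (uncachedTokens * RATES.inputPerMillion) / 1e6
  const cacheCost = (usage.cachedInputTokens * RATES.cacheReadPerMillion) / 1e6
  const outputCost = (usage.outputTokens * RATES.outputPerMillion) / 1e6
  const searchCost = (usage.webSearches * RATES.webSearchPerThousand) / 1000
  return inputCost + cacheCost + outputCost + searchCost
}

// Example: 1M input tokens (200K cached), 100K output tokens, 10 searches.
const exampleCost = estimateCostUSD({
  inputTokens: 1000000,
  cachedInputTokens: 200000,
  outputTokens: 100000,
  webSearches: 10
})
```

Under these assumptions the example works out to roughly $0.85: $0.40 for uncached input, $0.01 for cache reads, $0.30 for output, and $0.14 for web searches.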
