Gemini 3 Flash

Gemini 3 Flash delivers Gemini 3's pro-grade reasoning at flash-level latency and cost, outperforming Gemini 2.5 Pro across most benchmarks with meaningful gains in token efficiency. Your use subject to Google's Terms & Privacy Policies.

ReasoningTool UseFile InputVision (Image)Web Searchtiered-costImplicit Caching

Use with AI Gateway View docs

index.ts

import { streamText } from 'ai'

const result = streamText({
  model: 'google/gemini-3-flash',
  prompt: 'Why is the sky blue?'
})

Overview About Providers Throughput Latency Uptime Status Similar FAQ

More models by Google

Model

Context	Latency	Throughput	Input	Output	Cache	Web Search	Capabilities	Providers	ZDR	No Training	Release Date

google/gemini-3.6-flash

2.2s

173tps

$1.50/M

$7.50/M

Read:$0.15/M

Write:—

$14/K

+ input costs

07/21/2026

google/gemini-3.5-flash-lite

0.7s

255tps

$0.30/M

$2.50/M

Read:$0.03/M

Write:—

$14/K

+ input costs

07/21/2026

google/gemini-3.5-flash

2.5s

160tps

$1.50/M

$9/M

Read:$0.15/M

Write:—

$14/K

+ input costs

05/19/2026

google/gemini-3.1-flash-lite

0.6s

287tps

$0.25/M

$1.50/M

Read:$0.03/M

Write:—

$14/K

+ input costs

05/07/2026

google/gemini-3.1-pro-preview

4.3s

99tps

$2/M

$12/M

Read:

$0.2/M

Write:

—

$14/K

+ input costs

02/19/2026

google/gemini-2.5-flash-lite

0.3s

228tps

$0.10/M

$0.40/M

Read:$0.01/M

Write:—

$35/K

+ input costs

06/17/2025

Agent Stack

Core Platform

Tools

Learn

Build

Explore

Gemini 3 Flash

More models by Google