MiniMax M3

MiniMax M3 is MiniMax's first model with a 1.0M tokens context window and native multimodal input. It targets software engineering, terminal-based tool use, and agentic web browsing, with a max output of 1.0M tokens per request. Your use subject to MiniMax's Terms & Privacy Policies.

Implicit CachingReasoningTool UseVision (Image)

Use with AI Gateway View docs

TypeScript

Python

import { streamText } from 'ai'

const result = streamText({
  model: 'minimax/minimax-m3',
  prompt: 'Why is the sky blue?'
})

Read docs

Overview About Providers Throughput Latency Uptime Status Similar FAQ

Throughput24 hours

P50 throughput on live AI Gateway traffic, in tokens per second (TPS). Visit the docs for more info.

Agent Stack

Core Platform

Tools

Learn

Build

Explore

MiniMax M3