MiniMax M3
MiniMax M3 is MiniMax's first model with a 1M-token context window and native multimodal input. It targets software engineering, terminal-based tool use, and agentic web browsing, and supports up to 1M output tokens per request.
import { streamText } from 'ai'

const result = streamText({
  model: 'minimax/minimax-m3',
  prompt: 'Why is the sky blue?',
})

Throughput chart: P50 throughput on live AI Gateway traffic, in tokens per second (TPS).
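The streamText snippet above creates a result but does not read from it. In the AI SDK, the result exposes a textStream that can be iterated with for await, so one way to consume it is sketched below. The helper name collectText and the fakeStream stand-in are hypothetical, introduced only so the sketch runs without network access or credentials; they are not part of the AI SDK.

```typescript
// Hypothetical helper (not part of the AI SDK): reads every chunk from an
// async iterable of strings, optionally reports each chunk as it arrives,
// and returns the concatenated text.
async function collectText(
  chunks: AsyncIterable<string>,
  onChunk?: (chunk: string) => void,
): Promise<string> {
  let full = '';
  for await (const chunk of chunks) {
    if (onChunk) onChunk(chunk); // e.g. print the partial text
    full += chunk;
  }
  return full;
}

// Stand-in for result.textStream (an assumption for illustration), so the
// sketch can run offline. A real stream yields model output chunk by chunk.
async function* fakeStream(): AsyncGenerator<string> {
  yield 'Sunlight scatters ';
  yield 'off air molecules.';
}
```

With a real request, the same helper would be called as collectText(result.textStream, (chunk) => process.stdout.write(chunk)), assuming a Node.js runtime and configured AI Gateway credentials.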