Model Overview
Grok-3-Mini-Fast is the high-speed variant of Grok-3-Mini, offering the same lightweight reasoning capabilities with enhanced response speed for time-critical applications.
Key Features
- High intelligence (3/4 dots rating)
- Very fast speed (5/5 lightning bolts rating)
- 131,072 context window
- Medium max output tokens (estimated 4,096+)
- November 17, 2024 knowledge cutoff
- Text input support
- Text output with thinking traces
Technical Specifications
- Pricing: $0.60 per 1M tokens (input), $4.00 per 1M tokens (output)
- Supports: Input: text; Output: text with reasoning traces
- Features: Low-latency reasoning, accessible thinking traces, real-time applications
Snapshots
- grok-3-mini-fast (alias for grok-3-mini-fast-latest)
- grok-3-mini-fast-latest
Positioning and Use Cases
Grok-3-Mini-Fast combines the reasoning capabilities of Grok-3-Mini with enhanced response speed. It's perfect for interactive educational applications, real-time tutoring systems, and live problem-solving scenarios where both transparency in reasoning and quick response times are essential. The model maintains the same thinking trace accessibility while delivering faster performance for time-sensitive logic-based tasks.
Rate Limits
- Information not publicly available
Additional Notes
- Knowledge Cutoff: All Grok-3 family models have a knowledge cutoff of November 17, 2024
- No Internet Access: Unlike grok.com and Grok in X, API models are not connected to the internet
- Flexible Role Order: No role order limitation - you can mix system, user, or assistant roles in any sequence
- Model Aliases: Latest versions are automatically updated through aliases for seamless upgrades
- Fast vs Standard: Fast variants offer identical quality with reduced latency at higher cost
Documentation
Official Documentation