Model Overview
GPT-4o ("o" for "omni") is our versatile, high-intelligence flagship model. It accepts both text and image inputs, and produces text outputs (including Structured Outputs). It is the best model for most tasks, and is our most capable model outside of our o-series models.
Key Features
- 128,000-token context window
- 16,384 max output tokens
- Oct 01, 2023 knowledge cutoff
- High intelligence with medium speed
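The two token limits above can be combined into a quick budget check before sending a request. The following is a minimal sketch, not an official utility: the function name `max_completion_budget` is hypothetical, and it assumes that prompt tokens and output tokens share the same 128,000-token context window, which this page does not state explicitly.

```python
# Limits taken from the Key Features list above.
CONTEXT_WINDOW = 128_000
MAX_OUTPUT_TOKENS = 16_384


def max_completion_budget(prompt_tokens: int,
                          requested_output: int = MAX_OUTPUT_TOKENS) -> int:
    """Return how many output tokens can be requested for a given prompt size.

    Assumption (not confirmed by this page): the prompt and the output
    both count against the same context window.
    """
    if prompt_tokens < 0 or requested_output < 0:
        raise ValueError("token counts must be non-negative")
    remaining = CONTEXT_WINDOW - prompt_tokens
    if remaining <= 0:
        return 0  # the prompt alone fills (or exceeds) the window
    # The budget is capped by the request, the output limit, and the room left.
    return min(requested_output, MAX_OUTPUT_TOKENS, remaining)


if __name__ == "__main__":
    print(max_completion_budget(1_000))
    print(max_completion_budget(120_000))
```

Under that assumption, a short prompt is limited by the 16,384-token output cap, while a very long prompt is limited by whatever room is left in the context window.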
Technical Specifications
- Input price: $2.50 per million tokens
- Cached input price: $1.25 per million tokens
- Output price: $10.00 per million tokens
- Modalities: text and image input; text-only output
- Supported features: streaming, function calling, Structured Outputs, fine-tuning, distillation, and predicted outputs
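As a worked example of the per-million-token prices listed above, the sketch below estimates the cost of a single request. The function name `estimate_cost` is hypothetical, and the code assumes that cached input tokens are a subset of the total input tokens and are billed at the cached rate instead of the standard input rate; check the billing details before relying on it.

```python
from decimal import Decimal

# Prices in USD per one million tokens, from the list above.
INPUT_PRICE = Decimal("2.50")
CACHED_INPUT_PRICE = Decimal("1.25")
OUTPUT_PRICE = Decimal("10.00")
TOKENS_PER_UNIT = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int,
                  cached_input_tokens: int = 0) -> Decimal:
    """Estimate the USD cost of one request.

    Assumption: cached_input_tokens is a subset of input_tokens and is
    billed at the cached rate instead of the standard input rate.
    """
    if min(input_tokens, output_tokens, cached_input_tokens) < 0:
        raise ValueError("token counts must be non-negative")
    if cached_input_tokens > input_tokens:
        raise ValueError("cached tokens cannot exceed total input tokens")
    uncached = input_tokens - cached_input_tokens
    total = (uncached * INPUT_PRICE
             + cached_input_tokens * CACHED_INPUT_PRICE
             + output_tokens * OUTPUT_PRICE)
    return total / TOKENS_PER_UNIT


if __name__ == "__main__":
    # 1,000,000 input tokens and 100,000 output tokens, nothing cached.
    print(estimate_cost(1_000_000, 100_000))
```

For 1,000,000 input tokens and 100,000 output tokens with no caching, this works out to $2.50 + $1.00 = $3.50. `Decimal` is used rather than floats to avoid rounding drift when summing many small charges.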
Snapshots
- gpt-4o
- gpt-4o-2024-11-20
- gpt-4o-2024-08-06
- gpt-4o-2024-05-13
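The dated names above follow a visible `gpt-4o-YYYY-MM-DD` pattern, so they can be sorted programmatically when choosing a specific snapshot to pin. This is a small sketch under that assumption; the helper names `snapshot_date` and `latest_dated_snapshot` are hypothetical, and treating the date suffix as a release date is an inference from the names, not something this page states.

```python
from datetime import date
from typing import Optional

# Snapshot names copied from the list above.
SNAPSHOTS = [
    "gpt-4o",
    "gpt-4o-2024-11-20",
    "gpt-4o-2024-08-06",
    "gpt-4o-2024-05-13",
]

PREFIX = "gpt-4o-"


def snapshot_date(name: str) -> Optional[date]:
    """Parse the assumed YYYY-MM-DD suffix; return None for undated names."""
    if not name.startswith(PREFIX):
        return None
    try:
        return date.fromisoformat(name[len(PREFIX):])
    except ValueError:
        return None


def latest_dated_snapshot(names: list[str]) -> Optional[str]:
    """Return the dated snapshot with the most recent suffix, if any."""
    dated = [(d, n) for n in names if (d := snapshot_date(n)) is not None]
    return max(dated)[1] if dated else None


if __name__ == "__main__":
    print(latest_dated_snapshot(SNAPSHOTS))
```

Pinning a dated name rather than the bare `gpt-4o` name is a common way to keep behavior stable across releases, though which snapshot the bare name resolves to is not specified here.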
Official Documentation