Model Overview
GPT Image 1 is an image generation model that accepts both text and image inputs and produces image outputs.
Key Features
- High-performance image generation
- Multimodal capabilities (accepts text and image inputs)
- Multiple quality tiers and resolution options
- Inpainting support
Technical Specifications
- Text input price: $5.00 per million tokens
- Text cached input price: $1.25 per million tokens
- Image input price: $10.00 per million tokens
- Image cached input price: $2.50 per million tokens
- Image output price: $40.00 per million tokens
- Per-image generation price: varies by quality and size, from $0.011 to $0.25 per image
- Supported input: text and image
- Supported output: image
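As a rough illustration of how the per-million-token prices above translate into a request cost, the following sketch multiplies token counts by the listed rates. The category names and the `token_cost` helper are hypothetical labels chosen for this example; only the dollar figures come from the list above.

```python
# Per-million-token prices in USD, copied from the specification list.
# The dictionary keys are illustrative names, not official identifiers.
PRICE_PER_MILLION = {
    "text_input": 5.00,
    "text_cached_input": 1.25,
    "image_input": 10.00,
    "image_cached_input": 2.50,
    "image_output": 40.00,
}


def token_cost(usage: dict) -> float:
    """Return the USD cost for a mapping of token category -> token count."""
    total = 0.0
    for category, tokens in usage.items():
        if category not in PRICE_PER_MILLION:
            raise KeyError(f"unknown token category: {category}")
        # Prices are quoted per one million tokens.
        total += tokens * PRICE_PER_MILLION[category] / 1_000_000
    return total
```

For example, `token_cost({"text_input": 200, "image_output": 1000})` adds 200 text input tokens at $5.00 per million to 1,000 image output tokens at $40.00 per million.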
Image Generation Pricing
Pricing varies based on quality level and dimensions:
Low Quality
- 1024×1024: $0.011
- 1024×1536: $0.016
- 1536×1024: $0.016
Medium Quality
- 1024×1024: $0.042
- 1024×1536: $0.063
- 1536×1024: $0.063
High Quality
- 1024×1024: $0.167
- 1024×1536: $0.25
- 1536×1024: $0.25
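The per-image prices listed above can be captured in a small lookup table for budgeting. This is a minimal sketch: the `image_cost` helper and the `"1024x1024"`-style size keys (with an ASCII `x`) are assumptions made for this example, while the prices are taken from the lists above.

```python
# Per-image prices in USD by quality tier and size, from the pricing lists.
# Size keys use an ASCII "x"; this spelling is an assumption of the sketch.
IMAGE_PRICE_USD = {
    "low": {"1024x1024": 0.011, "1024x1536": 0.016, "1536x1024": 0.016},
    "medium": {"1024x1024": 0.042, "1024x1536": 0.063, "1536x1024": 0.063},
    "high": {"1024x1024": 0.167, "1024x1536": 0.25, "1536x1024": 0.25},
}


def image_cost(quality: str, size: str, count: int = 1) -> float:
    """Return the USD cost of generating `count` images at a quality and size."""
    if quality not in IMAGE_PRICE_USD:
        raise ValueError(f"unknown quality: {quality}")
    if size not in IMAGE_PRICE_USD[quality]:
        raise ValueError(f"unknown size: {size}")
    # Round to tenths of a cent to hide floating-point noise.
    return round(IMAGE_PRICE_USD[quality][size] * count, 3)
```

Under these prices, a high-quality 1024×1536 image costs roughly 23 times as much as a low-quality 1024×1024 image, so the quality tier dominates the bill far more than the aspect ratio does.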
Endpoints
- Image generation: v1/images/generations
- Image edit: v1/images/edits
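To show how the generation endpoint path might be used, the sketch below builds (but does not send) an HTTP request with Python's standard library. Only the path `v1/images/generations` comes from this page. The base URL, the bearer-token header, the model identifier `gpt-image-1`, and the `prompt`, `size`, and `quality` payload fields are all assumptions that should be checked against the official API reference.

```python
import json
import urllib.request

BASE_URL = "https://api.openai.com/"        # assumption: not stated on this page
GENERATIONS_PATH = "v1/images/generations"  # from the Endpoints list


def build_generation_request(api_key: str, prompt: str,
                             size: str = "1024x1024",
                             quality: str = "low") -> urllib.request.Request:
    """Build a POST request for the image generation endpoint without sending it."""
    # Assumed payload shape; field names and model id are not given on this page.
    payload = {
        "model": "gpt-image-1",
        "prompt": prompt,
        "size": size,
        "quality": quality,
    }
    return urllib.request.Request(
        BASE_URL + GENERATIONS_PATH,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",  # assumed auth scheme
            "Content-Type": "application/json",
        },
        method="POST",
    )
```

Passing the returned object to `urllib.request.urlopen` would send the request. Building it separately keeps the payload easy to inspect before any billable call is made.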
Rate Limits
Rate limits vary by usage tier, ranging from 5 images per minute (Tier 1) to 250 images per minute (Tier 5).
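A client can stay under an images-per-minute limit by tracking its own request times. The following is a minimal client-side sketch using a 60-second sliding window. The `ImagesPerMinuteLimiter` class is hypothetical, and the way the service itself measures the window is not described here, so treat the windowing as an assumption.

```python
import time
from collections import deque


class ImagesPerMinuteLimiter:
    """Client-side sliding-window limiter (assumed 60-second window)."""

    def __init__(self, images_per_minute: int, clock=time.monotonic):
        self.limit = images_per_minute
        self.clock = clock   # injectable so the limiter can be tested
        self.sent = deque()  # timestamps of recent image requests

    def wait_time(self) -> float:
        """Return seconds to wait before another image request fits the window."""
        now = self.clock()
        # Drop timestamps that have aged out of the 60-second window.
        while self.sent and now - self.sent[0] >= 60.0:
            self.sent.popleft()
        if len(self.sent) < self.limit:
            return 0.0
        # The oldest request must leave the window before a new one fits.
        return 60.0 - (now - self.sent[0])

    def record(self) -> None:
        """Record that one image request was just issued."""
        self.sent.append(self.clock())
```

A Tier 1 caller would construct `ImagesPerMinuteLimiter(5)`, sleep for `wait_time()` seconds before each request, and call `record()` after sending it.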