Model Overview
Grok-2-Image is xAI's specialized image generation model, capable of creating high-quality images from text descriptions.
Key Features
- High creativity (4/4 dots rating)
- Medium speed (3/5 lightning bolts rating)
- 131,072 context window
- Image generation capability
- 2024 knowledge cutoff (estimated)
- Text input support
- Image output support
Technical Specifications
- Pricing: $0.07 per generated image
- Supports: Input: text prompts; Output: generated images
- Features: High-quality image synthesis, creative generation, prompt-based control
Snapshots
- grok-2-image-1212
- grok-2-image (alias for grok-2-image-latest)
- grok-2-image-latest
Positioning and Use Cases
Grok-2-Image is designed for creative applications requiring high-quality image generation from text descriptions. It excels at artistic creation, concept visualization, marketing materials, product mockups, and creative content generation. The model offers cost-effective image synthesis with competitive quality, making it suitable for both professional creative workflows and experimental artistic projects.
Rate Limits
- Information not publicly available
Additional Technical Notes
- Image Input Specifications: Maximum 10MiB per image, unlimited number of images, supports JPG/JPEG and PNG formats
- Flexible Input Order: Text and image inputs can be mixed in any order within conversations
- Model Versioning: Date-specific versions (e.g., -1212) provide consistency, while aliases auto-update to latest versions
- Context Limitations: Grok-2-Vision has smaller context window (8K) compared to other models (131K)
- Pricing Structure: Image generation uses per-image pricing, while text models use token-based pricing
Documentation
Official Documentation