GPT-4o mini

Model Overview

GPT-4o mini ("o" for "omni") is a fast, affordable small model for focused tasks.

Key Features

  • 128,000 context window
  • 16,384 max output tokens
  • Oct 01, 2023 knowledge cutoff
  • Average intelligence with fast speed

Technical Specifications

  • Input price: $0.15 per million tokens
  • Cached input price: $0.075 per million tokens
  • Output price: $0.60 per million tokens
  • Supports: Input: text and image, Output: text only
  • Features: Streaming, function calling, structured outputs, fine-tuning supported

Snapshots

  • gpt-4o-mini
  • gpt-4o-mini-2024-07-18

Positioning and Use Cases

Fast, affordable small model for focused tasks. It accepts both text and image inputs, and produces text outputs (including Structured Outputs). It is ideal for fine-tuning, and model outputs from a larger model like GPT-4o can be distilled to GPT-4o-mini to produce similar results at lower cost and latency.

Official Documentation

OpenAI

Pioneer in AI, globally renowned for GPT series models

GPT-4o mini

Parameters Unknow
Output tokens 16,384 tokens

GPT-4o mini Fast, affordable small model for focused tasks

Official: $0.15 • $0.6 Our Price: $0.12 • $0.48 Save 20%

Frequently Asked Questions

What is the uptime guarantee?
We guarantee 99.9% uptime with our enterprise-grade infrastructure and redundant systems.
How is pricing calculated?
Pricing is based on the number of tokens processed. Both input and output tokens are counted in the final cost.
What is the difference between GPT-4 and GPT-4 Turbo?
GPT-4 Turbo is the latest version with improved performance, longer context window, and more recent knowledge cutoff date.