GPT-4o mini

Model Overview

GPT-4o mini ("o" for "omni") is a fast, affordable small model for focused tasks.

Key Features

128,000 context window
16,384 max output tokens
Oct 01, 2023 knowledge cutoff
Average intelligence with fast speed

Technical Specifications

Input price: $0.15 per million tokens
Cached input price: $0.075 per million tokens
Output price: $0.60 per million tokens
Supports: Input: text and image, Output: text only
Features: Streaming, function calling, structured outputs, fine-tuning supported

Snapshots

gpt-4o-mini
gpt-4o-mini-2024-07-18

Positioning and Use Cases

Fast, affordable small model for focused tasks. It accepts both text and image inputs, and produces text outputs (including Structured Outputs). It is ideal for fine-tuning, and model outputs from a larger model like GPT-4o can be distilled to GPT-4o-mini to produce similar results at lower cost and latency.

Official Documentation

OpenAI

Pioneer in AI, globally renowned for GPT series models

GPT-4o mini

Parameters Unknow

Output tokens 16,384 tokens

GPT-4o mini Fast, affordable small model for focused tasks

Official: $0.15 • $0.6 Our Price: $0.12 • $0.48 Save 20%

Back To List Try Now

Frequently Asked Questions

What is the uptime guarantee?

We guarantee 99.9% uptime with our enterprise-grade infrastructure and redundant systems.

How is pricing calculated?

Pricing is based on the number of tokens processed. Both input and output tokens are counted in the final cost.

What is the difference between GPT-4 and GPT-4 Turbo?

GPT-4 Turbo is the latest version with improved performance, longer context window, and more recent knowledge cutoff date.