o1-mini

Model Overview

The o1-mini model is a small model alternative to o1, designed to provide reasoning capabilities with faster speed and at a more affordable price point.

Key Features

  • 128,000 context window
  • 65,536 max output tokens
  • Oct 01, 2023 knowledge cutoff
  • Reasoning token support
  • High reasoning capabilities with slower speed

Technical Specifications

  • Input price: $1.10 per million tokens
  • Cached input price: $0.55 per million tokens
  • Output price: $4.40 per million tokens
  • Supports: Input: text only, Output: text only
  • Features: Streaming supported; Function calling, structured outputs, fine-tuning, distillation and predicted outputs not supported

Snapshots

  • o1-mini
  • o1-mini-2024-09-12

Positioning and Use Cases

The o1 reasoning model is designed to solve hard problems across domains. o1-mini is a faster and more affordable reasoning model, but OpenAI recommends using the newer o3-mini model that features higher intelligence at the same latency and price as o1-mini.

Official Documentation

OpenAI

Pioneer in AI, globally renowned for GPT series models

o1-mini

Parameters Unknow
Qutput tokens 65,536

A small model alternative to o1

Official: $1.1 • $4.4 Our Price: $0.88 • $3.52 Save 20%

Frequently Asked Questions

What is the uptime guarantee?
We guarantee 99.9% uptime with our enterprise-grade infrastructure and redundant systems.
How is pricing calculated?
Pricing is based on the number of tokens processed. Both input and output tokens are counted in the final cost.
What is the difference between GPT-4 and GPT-4 Turbo?
GPT-4 Turbo is the latest version with improved performance, longer context window, and more recent knowledge cutoff date.