GPT-4o mini Realtime

Model Overview

GPT-4o mini Realtime is a smaller realtime model capable of text and audio inputs and outputs.

Key Features

  • 128,000-token context window
  • 4,096 max output tokens
  • Oct 01, 2023 knowledge cutoff
  • Average intelligence with very fast speed

Technical Specifications

  • Text input price: $0.60 per million tokens
  • Text cached input price: $0.30 per million tokens
  • Text output price: $2.40 per million tokens
  • Audio input price: $10.00 per million tokens
  • Audio cached input price: $0.30 per million tokens
  • Audio output price: $20.00 per million tokens
  • Supported modalities: text and audio input; text and audio output
  • Features: Realtime API, function calling supported
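
As a rough illustration of how the per-million-token prices above combine, the following Python sketch estimates the cost of one session. The function name estimate_cost, the category keys, and the example token counts are hypothetical and chosen only for illustration; the prices themselves are copied from the list above.

```python
from decimal import Decimal

# USD per one million tokens, copied from the price list above.
# The dictionary keys are illustrative names, not official API fields.
PRICES_PER_MILLION = {
    "text_input": Decimal("0.60"),
    "text_cached_input": Decimal("0.30"),
    "text_output": Decimal("2.40"),
    "audio_input": Decimal("10.00"),
    "audio_cached_input": Decimal("0.30"),
    "audio_output": Decimal("20.00"),
}


def estimate_cost(usage: dict[str, int]) -> Decimal:
    """Return the estimated USD cost for a mapping of category -> token count."""
    total = Decimal("0")
    for category, tokens in usage.items():
        if category not in PRICES_PER_MILLION:
            raise KeyError(f"unknown token category: {category}")
        if tokens < 0:
            raise ValueError(f"token count must be non-negative: {tokens}")
        # Price is quoted per million tokens, so scale the count down.
        total += PRICES_PER_MILLION[category] * tokens / Decimal(1_000_000)
    return total


# Hypothetical session: a short text prompt plus audio in both directions.
example_usage = {"text_input": 2_000, "audio_input": 50_000, "audio_output": 30_000}
print(f"Estimated cost: ${estimate_cost(example_usage)}")
```

Decimal is used instead of float so that small per-token amounts do not pick up binary rounding error. Audio tokens dominate the total in this example because their rates are more than ten times the text rates.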

Snapshots

  • gpt-4o-mini-realtime-preview
  • gpt-4o-mini-realtime-preview-2024-12-17

Positioning and Use Cases

This is a preview release of the GPT-4o mini Realtime model, capable of responding to audio and text inputs in real time over a WebRTC or WebSocket interface.
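
A minimal Python sketch of what a WebSocket client might prepare before connecting. The endpoint URL, the OpenAI-Beta header, and the response.create event shape are assumptions based on OpenAI's Realtime API documentation during the preview period and may have changed; the helper names are hypothetical. The sketch only builds the request data and does not open a connection.

```python
import json
from urllib.parse import urlencode

# Assumed values; verify against the current Realtime API reference.
REALTIME_BASE_URL = "wss://api.openai.com/v1/realtime"
DEFAULT_MODEL = "gpt-4o-mini-realtime-preview-2024-12-17"  # snapshot listed above


def build_connection(api_key: str, model: str = DEFAULT_MODEL) -> tuple[str, dict[str, str]]:
    """Return the WebSocket URL and headers a client would connect with."""
    url = f"{REALTIME_BASE_URL}?{urlencode({'model': model})}"
    headers = {
        "Authorization": f"Bearer {api_key}",
        "OpenAI-Beta": "realtime=v1",  # assumed preview-era header
    }
    return url, headers


def text_response_request(instructions: str) -> str:
    """Serialize a client event asking the model for a text-only response."""
    event = {
        "type": "response.create",
        "response": {"modalities": ["text"], "instructions": instructions},
    }
    return json.dumps(event)


url, headers = build_connection(api_key="YOUR_API_KEY")
print(url)
print(text_response_request("Greet the user briefly."))
```

The returned URL and headers can be passed to any WebSocket client library, and the serialized event is what such a client would send once the socket is open. Browser clients would more likely use the WebRTC path, which this sketch does not cover.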

Official Documentation

OpenAI

Pioneer in AI, globally renowned for GPT series models

Parameters: Unknown

Official price (text, per million tokens): $0.60 input, $2.40 output. Our price: $0.48 input, $1.92 output (save 20%).

Frequently Asked Questions

What is the uptime guarantee?
We guarantee 99.9% uptime with our enterprise-grade infrastructure and redundant systems.
How is pricing calculated?
Pricing is based on the number of tokens processed. Both input and output tokens are counted in the final cost.
What is the difference between GPT-4 and GPT-4 Turbo?
GPT-4 Turbo is a newer version of GPT-4 with improved performance, a longer context window, and a more recent knowledge cutoff date.