GPT-4o mini Realtime

Model Overview

GPT-4o mini Realtime is a smaller realtime model capable of text and audio inputs and outputs.

Key Features

128,000 context window
4,096 max output tokens
Oct 01, 2023 knowledge cutoff
Average intelligence with very fast speed

Technical Specifications

Text input price: $0.60 per million tokens
Text cached input price: $0.30 per million tokens
Text output price: $2.40 per million tokens
Audio input price: $10.00 per million tokens
Audio cached input price: $0.30 per million tokens
Audio output price: $20.00 per million tokens
Supports: Input: text and audio, Output: text and audio
Features: Realtime API, function calling supported

Snapshots

gpt-4o-mini-realtime-preview
gpt-4o-mini-realtime-preview-2024-12-17

Positioning and Use Cases

This is a preview release of the GPT-4o-mini Realtime model, capable of responding to audio and text inputs in realtime over WebRTC or a WebSocket interface.

Official Documentation

OpenAI

Pioneer in AI, globally renowned for GPT series models

GPT-4o mini Realtime

Parameters Unknow

Output tokens 4,096 tokens

GPT-4o mini Realtime Smaller realtime model for text and audio inputs and outputs

Official: $0.6 • $2.4 Our Price: $0.48 • $1.92 Save 20%

Back To List Try Now

Frequently Asked Questions

What is the uptime guarantee?

We guarantee 99.9% uptime with our enterprise-grade infrastructure and redundant systems.

How is pricing calculated?

Pricing is based on the number of tokens processed. Both input and output tokens are counted in the final cost.

What is the difference between GPT-4 and GPT-4 Turbo?

GPT-4 Turbo is the latest version with improved performance, longer context window, and more recent knowledge cutoff date.