GPT-4o Audio

Model Overview

This is a preview release of the GPT-4o Audio models. These models accept audio inputs and outputs, and can be used in the Chat Completions REST API.

Key Features

128,000 context window
16,384 max output tokens
Oct 01, 2023 knowledge cutoff
High intelligence with medium speed

Technical Specifications

Text Input price: $2.50 per million tokens
Text Output price: $10.00 per million tokens
Audio Input price: $40.00 per million tokens
Audio Output price: $80.00 per million tokens
Supports: Input: text and audio, Output: text and audio
Features: Streaming, function calling supported

Snapshots

gpt-4o-audio-preview
gpt-4o-audio-preview-2024-12-17
gpt-4o-audio-preview-2024-10-01

Positioning and Use Cases

GPT-4o Audio models capable of audio inputs and outputs. These models accept audio inputs and outputs, and can be used in the Chat Completions REST API.

Official Documentation

OpenAI

Pioneer in AI, globally renowned for GPT series models

GPT-4o Audio

Parameters Unknow

Output tokens 16,384 tokens

GPT-4o Audio GPT-4o models capable of audio inputs and outputs

Official: $2.5 • $10 Our Price: $2 • $8 Save 20%

Back To List Try Now

Frequently Asked Questions

What is the uptime guarantee?

We guarantee 99.9% uptime with our enterprise-grade infrastructure and redundant systems.

How is pricing calculated?

Pricing is based on the number of tokens processed. Both input and output tokens are counted in the final cost.

What is the difference between GPT-4 and GPT-4 Turbo?

GPT-4 Turbo is the latest version with improved performance, longer context window, and more recent knowledge cutoff date.