GPT-4o Audio

Model Overview

This is a preview release of the GPT-4o Audio models. These models accept audio inputs and outputs, and can be used in the Chat Completions REST API.

Key Features

  • 128,000 context window
  • 16,384 max output tokens
  • Oct 01, 2023 knowledge cutoff
  • High intelligence with medium speed

Technical Specifications

  • Text Input price: $2.50 per million tokens
  • Text Output price: $10.00 per million tokens
  • Audio Input price: $40.00 per million tokens
  • Audio Output price: $80.00 per million tokens
  • Supports: Input: text and audio, Output: text and audio
  • Features: Streaming, function calling supported

Snapshots

  • gpt-4o-audio-preview
  • gpt-4o-audio-preview-2024-12-17
  • gpt-4o-audio-preview-2024-10-01

Positioning and Use Cases

GPT-4o Audio models capable of audio inputs and outputs. These models accept audio inputs and outputs, and can be used in the Chat Completions REST API.

Official Documentation

OpenAI

Pioneer in AI, globally renowned for GPT series models

GPT-4o Audio

Parameters Unknow
Output tokens 16,384 tokens

GPT-4o Audio GPT-4o models capable of audio inputs and outputs

Official: $2.5 • $10 Our Price: $2 • $8 Save 20%

Frequently Asked Questions

What is the uptime guarantee?
We guarantee 99.9% uptime with our enterprise-grade infrastructure and redundant systems.
How is pricing calculated?
Pricing is based on the number of tokens processed. Both input and output tokens are counted in the final cost.
What is the difference between GPT-4 and GPT-4 Turbo?
GPT-4 Turbo is the latest version with improved performance, longer context window, and more recent knowledge cutoff date.