GPT-4o mini TTS

Model Overview

GPT-4o mini TTS is a text-to-speech model built on GPT-4o mini, a fast and powerful language model. Use it to convert text to natural sounding spoken text. The maximum number of input tokens is 2000.

Key Features

  • Higher performance (4/4 dots rating)
  • Fast speed (4/5 lightning bolts rating)
  • Text-to-speech model powered by GPT-4o mini
  • Accepts text input and produces audio output
  • Maximum input token limit: 2000 tokens

Technical Specifications

  • Pricing: $0.60 per 1M input tokens, $12.00 per 1M output tokens
  • Supports: Input: text only, Output: audio only
  • Features: Speech generation supported via v1/audio/speech endpoint

Snapshots

  • gpt-4o-mini-tts

Positioning and Use Cases

As a text-to-speech model powered by GPT-4o mini, this model is designed for converting text to natural sounding spoken text with high performance and fast speed.

Rate Limits

  • Free tier: Not supported
  • Tier 1: 500 RPM, 50,000 TPM
  • Tier 2: 2,000 RPM, 150,000 TPM
  • Tier 3: 5,000 RPM, 600,000 TPM
  • Tier 4: 10,000 RPM, 2,000,000 TPM
  • Tier 5: 10,000 RPM, 8,000,000 TPM

Documentation

Official Documentation

OpenAI

Pioneer in AI, globally renowned for GPT series models

GPT-4o mini TTS

Parameters Unknow

GPT-4o mini TTS Text-to-speech model powered by GPT-4o mini

Official: $0.6 • $12 Our Price: $0.48 • $096 Save 20%

Frequently Asked Questions

What is the uptime guarantee?
We guarantee 99.9% uptime with our enterprise-grade infrastructure and redundant systems.
How is pricing calculated?
Pricing is based on the number of tokens processed. Both input and output tokens are counted in the final cost.
What is the difference between GPT-4 and GPT-4 Turbo?
GPT-4 Turbo is the latest version with improved performance, longer context window, and more recent knowledge cutoff date.