Model Overview
TTS-1 HD is a text-to-speech model optimized for quality. Use it to convert text to natural sounding spoken text with the Speech endpoint in the Audio API.
Key Features
- High performance (3/4 dots rating)
- Medium speed (3/5 lightning bolts rating)
- Text-to-speech model optimized for quality
- Accepts text input and produces audio output
Technical Specifications
- Pricing: $30.00 per 1M tokens
- Supports: Input: text only, Output: audio only
- Features: Speech generation supported via v1/audio/speech endpoint
Snapshots
Positioning and Use Cases
TTS is a model that converts text to natural sounding spoken text. The tts-1-hd model is optimized for high quality text-to-speech use cases. Use it with the Speech endpoint in the Audio API.
Rate Limits
- Free tier: Not supported
- Tier 1: 500 RPM
- Tier 2: 2,500 RPM
- Tier 3: 5,000 RPM
- Tier 4: 7,500 RPM
- Tier 5: 10,000 RPM
Documentation
Official Documentation