Gemini 2.5 Flash Preview TTS

Model Overview

Gemini 2.5 Flash Preview TTS is Google's price-performant text-to-speech model, delivering high control and transparency for structured workflows.

Key Features

High quality TTS capabilities
Low latency audio generation
8,000 input token limit
16,000 output token limit
Text input support
Audio output support

Technical Specifications

Model Code: gemini-2.5-flash-preview-tts
Supports: Input: text; Output: audio
Features: Audio generation, controllable single- and multi-speaker text-to-speech
Pricing:
- Input: $0.50 per 1M tokens (text)
- Output: $10.00 per 1M tokens (audio)
Free Tier: Not available

Snapshots

gemini-2.5-flash-preview-tts

Positioning and Use Cases

Perfect for structured workflows like podcast generation, audiobooks, customer support, and other applications requiring high-quality text-to-speech conversion with control and transparency.

Rate Limits

More restricted rate limits since it is an experimental/preview model

Documentation

Official Documentation

Google

Next-generation AI models backed by powerful technical expertise

Gemini 2.5 Flash Preview TTS

Parameters Unknow

Output tokens 16,000 tokens

Gemini 2.5 Flash Preview TTS is Google's price-performant text-to-speech model, delivering high control and transparency for structured workflows.

Official: $0.50 • $10.00 Our Price: $0.40 • $8.00 Save 20%

Back To List Try Now

Frequently Asked Questions

What is the uptime guarantee?

We guarantee 99.9% uptime with our enterprise-grade infrastructure and redundant systems.

How is pricing calculated?

Pricing is based on the number of tokens processed. Both input and output tokens are counted in the final cost.

What is the difference between GPT-4 and GPT-4 Turbo?

GPT-4 Turbo is the latest version with improved performance, longer context window, and more recent knowledge cutoff date.