Gemini 2.5 Flash Preview TTS

Model Overview

Gemini 2.5 Flash Preview TTS is Google's price-performant text-to-speech model, delivering high control and transparency for structured workflows.

Key Features

  • High quality TTS capabilities
  • Low latency audio generation
  • 8,000 input token limit
  • 16,000 output token limit
  • Text input support
  • Audio output support

Technical Specifications

  • Model Code: gemini-2.5-flash-preview-tts
  • Supports: Input: text; Output: audio
  • Features: Audio generation, controllable single- and multi-speaker text-to-speech
  • Pricing:
    • Input: $0.50 per 1M tokens (text)
    • Output: $10.00 per 1M tokens (audio)
  • Free Tier: Not available

Snapshots

  • gemini-2.5-flash-preview-tts

Positioning and Use Cases

Perfect for structured workflows like podcast generation, audiobooks, customer support, and other applications requiring high-quality text-to-speech conversion with control and transparency.

Rate Limits

  • More restricted rate limits since it is an experimental/preview model

Documentation

Official Documentation

Google

Next-generation AI models backed by powerful technical expertise

Gemini 2.5 Flash Preview TTS

Parameters Unknow
Output tokens 16,000 tokens

Gemini 2.5 Flash Preview TTS is Google's price-performant text-to-speech model, delivering high control and transparency for structured workflows.

Official: $0.50 • $10.00 Our Price: $0.40 • $8.00 Save 20%

Frequently Asked Questions

What is the uptime guarantee?
We guarantee 99.9% uptime with our enterprise-grade infrastructure and redundant systems.
How is pricing calculated?
Pricing is based on the number of tokens processed. Both input and output tokens are counted in the final cost.
What is the difference between GPT-4 and GPT-4 Turbo?
GPT-4 Turbo is the latest version with improved performance, longer context window, and more recent knowledge cutoff date.