Ultra-realistic, low-latency speech generation
Build with high-quality, controllable speech generation for real-time and bulk applications. Models optimized for latency, fidelity, and long-form consistency.
Built on the most powerful Voice AI models
Choose the right model for your use case: from ultra-low latency agents to expressive, long-form narration.
Nova
Lowest latency speech synthesis
- ✓ Ultra-low latency (~25 ms)
- ✓ 32 languages supported
- ✓ 40,000 character limit
- ✓ ~$0.06 per minute
Sonic
Balanced quality and latency
- ✓ Low latency (~250-300 ms)
- ✓ High quality voice generation
- ✓ 32 languages supported
- ✓ 40,000 character limit
- ✓ ~$0.06 per minute
Opus
Most emotionally rich model
- ✓ Natural-sounding output
- ✓ 70+ languages supported
- ✓ 3,000 character limit
- ✓ Multi-speaker dialogue
- ✓ ~$0.12 per minute
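The character limits and per-minute prices listed for Nova, Sonic, and Opus suggest two small planning steps before sending text for synthesis: splitting long input so each request stays under the model's limit, and estimating cost from the expected audio duration. The following is a minimal Python sketch of both, not part of any official SDK. The names `ModelSpec`, `MODELS`, `split_for_model`, and `estimate_cost` are hypothetical, the prices are the approximate figures quoted above, and the assumption that the character limit applies per request is ours.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelSpec:
    name: str
    char_limit: int        # character limit quoted above (assumed to be per request)
    usd_per_minute: float  # approximate price per minute of audio quoted above


# Figures copied from the model cards above; the dictionary keys are hypothetical.
MODELS = {
    "nova": ModelSpec("Nova", 40_000, 0.06),
    "sonic": ModelSpec("Sonic", 40_000, 0.06),
    "opus": ModelSpec("Opus", 3_000, 0.12),
}


def split_for_model(text: str, spec: ModelSpec) -> list[str]:
    """Split text into chunks of at most spec.char_limit characters.

    Chunks break at the last space before the limit when one exists,
    otherwise the text is cut exactly at the limit.
    """
    chunks = []
    remaining = text.strip()
    while len(remaining) > spec.char_limit:
        cut = remaining.rfind(" ", 0, spec.char_limit + 1)
        if cut <= 0:
            cut = spec.char_limit  # no space found: hard cut
        chunks.append(remaining[:cut].rstrip())
        remaining = remaining[cut:].lstrip()
    if remaining:
        chunks.append(remaining)
    return chunks


def estimate_cost(audio_minutes: float, spec: ModelSpec) -> float:
    """Approximate cost in USD for a given duration of generated audio."""
    return round(audio_minutes * spec.usd_per_minute, 2)
```

For example, `split_for_model(script, MODELS["opus"])` returns pieces that each fit within the 3,000 character limit, and `estimate_cost(10, MODELS["opus"])` gives a rough figure for ten minutes of audio at the quoted rate.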
Everything you need to build production-ready speech
Generate expressive, controllable speech with models built for real-time, long-form, and production use.
Control emotion and delivery
Create controllable, expressive speech, layered with emotion, audio events, and immersive soundscapes.
Access 10,000+ voices
Explore an ever-growing collection of expressive, lifelike voices for any use case.
Voice design & cloning
Create in over 30 languages with natural voices, expressive accents, and localized audio tailored to your audience.
Multi-speaker dialogue
Create natural multi-speaker conversations across 30+ languages with expressive, controllable voices.
Audio events and direction
Control delivery with audio tags, timing cues, and narrative direction built into the speech.
Pronunciation dictionaries
Define custom pronunciations to ensure consistent, accurate speech for names and terminology.
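To illustrate the idea behind a pronunciation dictionary, here is a minimal Python sketch that rewrites listed terms to a phonetic respelling before the text is sent for synthesis. This page does not describe the actual dictionary format or how it is uploaded, so treat this as a generic client-side illustration under that assumption. The function name `apply_pronunciations` and the example entries are hypothetical.

```python
import re


def apply_pronunciations(text: str, dictionary: dict[str, str]) -> str:
    """Replace whole-word occurrences of each term with its respelling.

    Matching is case-insensitive and never rewrites part of a longer word.
    """
    if not dictionary:
        return text
    # Try longer terms first so multi-word entries win over shorter ones.
    terms = sorted(dictionary, key=len, reverse=True)
    pattern = re.compile(
        r"(?<!\w)(" + "|".join(re.escape(t) for t in terms) + r")(?!\w)",
        re.IGNORECASE,
    )
    lookup = {term.lower(): spoken for term, spoken in dictionary.items()}
    return pattern.sub(lambda m: lookup[m.group(1).lower()], text)


# Hypothetical entries: names and terminology with a preferred spoken form.
entries = {"Nguyen": "win", "SQL": "sequel"}
print(apply_pronunciations("Ask Nguyen about SQL and SQLite.", entries))
```

Because the same mapping is applied on every request, a name or term is spoken the same way each time it appears, which is the consistency the feature describes.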
APIs built for production
Trusted by enterprise customers worldwide for mission-critical applications.
Enterprise-level data protection
Data is encrypted in transit and at rest, with support for SOC 2, HIPAA, and GDPR compliance. EU Data Residency and Zero Retention modes are available for stricter data control.
Python and TypeScript SDKs
Official SDKs for seamless integration with your existing tech stack.
Elevated support and custom deployments
Dedicated support team and custom deployment options for enterprise customers.
Powering the world's leading companies and brands
Trusted by companies building at scale
Meta
"From dubbing Reels in local languages, to generating music and character voices in Horizon, 60db platform enables global creators, businesses, and enterprises to build with voice, music, and sound at scale."
Chess.com
"Millions of people learn chess from creators like Hikaru, Levy, and Magnus every day. With 60db, we've taken a big step toward creating immersive, personal learning experiences."
Twilio
"We've integrated 60db's generative AI voice technology into our CPaaS, enhancing conversational interactions with the most expressive, human-sounding voices available."
Latest updates
Getting Started with Text-to-Speech API
Learn how to integrate our TTS API into your application in minutes.
Best Practices for Production Voice Applications
Optimize latency, quality, and cost in your voice-powered applications.
Multi-language Voice Generation at Scale
Deploy voice applications across 30+ languages with consistent quality.
Building Real-time Voice Agents with our API
Create responsive voice agents with our ultra-low latency models.
Custom Voice Cloning for Brand Voice
Create a unique brand voice with our voice cloning technology.
Emotion and Delivery Control in Speech
Master expressive speech generation with emotion tags and delivery cues.
