60db Logo

Ultra-realistic and low latency
speech generation

Build with high-quality, controllable speech generation for real-time and bulk applications. Models optimized for latency, fidelity, and long-form consistency.

73 / 100

Built on the most powerful Voice AI models

Choose the right model for your use case: from ultra-low latency agents to expressive, long-form narration.

Nova

Lowest latency speech synthesis

  • Ultra-low latency (~25ms)
  • 32 languages supported
  • 40,000 character limit
  • ~$0.06 per minute

Sonic

Balanced quality and latency

  • Low latency (~250-300ms)
  • High quality voice generation
  • 32 languages supported
  • 40,000 character limit
  • ~$0.06 per minute

Opus

Most emotionally rich model

  • Natural-sounding output
  • 70+ languages supported
  • 3,000 character limit
  • Multi-speaker dialogue
  • ~$0.12 per minute

Everything you need to build production-ready speech

Generate expressive, controllable speech with models built for real-time, long-form, and production use.

🎭

Control emotion and delivery

Create controllable, expressive speech, layered with emotion, audio events, and immersive soundscapes.

🎤

Access 10,000+ voices

Explore an ever-growing collection of expressive, lifelike voices for any use case.

🎨

Voice design & cloning

Create in over 30 languages with natural voices, expressive accents, and localized audio tailored to your audience.

👥

Multi-speaker dialogue

Create natural multi-speaker conversations across 30+ languages with expressive, controllable voices.

🎯

Audio events and direction

Control delivery with audio tags, timing cues, and narrative direction built into the speech.

📚

Pronunciation dictionaries

Define custom pronunciations to ensure consistent, accurate speech for names and terminology.

APIs built for production

Trusted by enterprise customers worldwide for mission-critical applications.

🔒

Enterprise-level data protection

Data is encrypted in transit and at rest, with support for SOC 2, HIPAA, and GDPR compliance. EU Data Residency and Zero Retention modes are available for stricter data control.

⚙️

Python and TypeScript SDKs

Official SDKs for seamless integration with your existing tech stack.

🤝

Elevated support and custom deployments

Dedicated support team and custom deployment options for enterprise customers.

Powering world's leading companies and brands

Trusted by companies building at scale

Meta

"From dubbing Reels in local languages, to generating music and character voices in Horizon, 60db platform enables global creators, businesses, and enterprises to build with voice, music, and sound at scale."

Chess.com

"Millions of people learn chess from creators like Hikaru, Levy, and Magnus every day. With 60db, we've taken a big step toward creating immersive, personal learning experiences."

Twilio

"We've integrated 60db's generative AI voice technology into our CPaaS, enhancing conversational interactions with the most expressive, human-sounding voices available."

Frequently asked questions

Find answers to common questions about our API

Latest updates

Getting Started

Getting Started with Text-to-Speech API

Learn how to integrate our TTS API into your application in minutes.

Best Practices

Best Practices for Production Voice Applications

Optimize latency, quality, and cost in your voice-powered applications.

Advanced

Multi-language Voice Generation at Scale

Deploy voice applications across 30+ languages with consistent quality.

Tutorials

Building Real-time Voice Agents with our API

Create responsive voice agents with our ultra-low latency models.

Features

Custom Voice Cloning for Brand Voice

Create a unique brand voice with our voice cloning technology.

Advanced

Emotion and Delivery Control in Speech

Master expressive speech generation with emotion tags and delivery cues.