Text to speech,
translated & spoken in any language

Text in, MP3 URL back in under 800ms. Auto-translation across 60+ languages built in — no separate translation API needed. Pay with Bitcoin, Ethereum, USDT and more. No credit card, no KYC, no monthly deadlines.

Try live demo
No credit card required
Pay with crypto
Live in < 5 minutes
Live demo • Audio expires after 24 hours

Why developers choose SayIt

No SDKs to install, no infrastructure to manage, no surprise bills. Just text in and MP3 out - at the price other APIs charge for complexity.

Fast enough to feel instant

Text in, MP3 URL back in under 800ms. No queuing, no batch delays, no polling - your audio is ready before your user notices.

Auto-translate to 60+ languages

Write in English, output in Korean. Write in German, output in Spanish. The API detects your input language and translates before synthesising - no extra steps, no separate translation API needed.
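As a rough sketch of what this looks like in a request body, the snippet below builds a payload whose text is English and whose `language` field asks for Korean output. The field names `text`, `voice_id`, and `language` come from the request example on this page. The helper `build_translated_payload` is hypothetical, and the code `ko` is an assumption modelled on the two-letter `en` used elsewhere, so check the docs for the exact language codes.

```python
import json


def build_translated_payload(text: str, target_language: str, voice_id: str = "eve") -> str:
    """Return a JSON request body asking for speech in target_language.

    No source language is sent: per the description above, the API
    detects it from the text itself.
    """
    payload = {
        "text": text,
        "voice_id": voice_id,
        "language": target_language,  # output language; "ko" is an assumed code
    }
    # ensure_ascii=False keeps non-ASCII input text readable in the body
    return json.dumps(payload, ensure_ascii=False)


body = build_translated_payload("Good morning, and welcome back.", "ko")
print(body)
```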

Fine-grained speech control

Speech tags let you add emphasis, insert pauses, and whisper sections inline in your text — fine-grained control without a separate SSML document or extra API calls. See the docs for the full tag reference.
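To make the inline tags concrete, here is a small sketch that composes tagged text with helper functions. The tag names `<emphasis>`, `[pause]`, and `<whisper>` are the ones listed on this page. The closing `</whisper>` form is assumed by analogy with `</emphasis>`, and the helper names are purely illustrative.

```python
PAUSE = "[pause]"  # inline pause marker as shown on this page


def emphasis(text: str) -> str:
    """Wrap text in the emphasis tag."""
    return f"<emphasis>{text}</emphasis>"


def whisper(text: str) -> str:
    """Wrap text in the whisper tag (closing tag is an assumption)."""
    return f"<whisper>{text}</whisper>"


line = (
    f"Your order has {emphasis('shipped')}. {PAUSE} "
    f"{whisper('Tracking details are in your inbox.')}"
)
print(line)
```

The resulting string would be sent as the `text` field, with `speech_tags` set to `true` as in the request example.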

One endpoint, no SDK needed

POST your text, get back an MP3 URL. That's it. No client libraries, no auth flows, no setup - a single curl command gets you started in seconds.

Built for crypto and Web3

Pay with Bitcoin, Ethereum, USDT and more - no bank account, no credit card, no KYC. Characters never expire, so there's no pressure to use them before a monthly deadline. Ideal for DeFi apps, NFT platforms, DAOs, and any Web3 project that wants voice without the traditional payment friction.

Usage you can actually see

Characters used, requests made, and spend to date - all in one dashboard. Set quota alerts so you find out before your app does.

Five distinct voices

Each voice is optimized for different use cases. Sample them before you build.

Eve
Female • American English

Warm, professional, and clear. Ideal for customer-facing apps, explainer videos, and corporate content.

Corporate
E-learning
IVR
Ara
Female • American English

Bright, conversational, and friendly. Perfect for consumer apps, social content, and casual narration.

Consumer apps
Podcasts
Social
Rex
Male • American English

Deep, authoritative, and measured. Great for news, documentaries, and high-stakes announcement content.

News
Documentaries
Ads
Sal
Neutral • American English

Calm, balanced, and highly intelligible. Excellent for accessibility tools, screen readers, and instructional content.

Accessibility
Instructions
Assistants
Leo
Male • American English

Energetic, expressive, and engaging. Built for gaming, entertainment, and content that needs personality.

Gaming
Entertainment
Promos

How it works

From zero to production voice synthesis in four straightforward steps.

01

Get your API key

Sign up, confirm your email, and your free API key is ready instantly. No credit card, no waiting. The free tier includes 10,000 characters.

02

Make your first request

POST your text to https://api.sayitapi.com/v1/tts with your API key in the SAYIT-API-KEY header. Specify a voice and an output language - if your text is in a different language, the API translates it before synthesis.
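For readers who prefer Python to curl, the following is a minimal sketch of the same call using only the standard library. The endpoint, header name, and body fields are taken from this page. The function names `build_tts_request` and `synthesize` are hypothetical, and error handling is left out for brevity.

```python
import json
import urllib.request

API_URL = "https://api.sayitapi.com/v1/tts"


def build_tts_request(api_key: str, text: str, voice_id: str, language: str) -> urllib.request.Request:
    """Prepare (but do not send) a POST request for the TTS endpoint."""
    body = json.dumps({"text": text, "voice_id": voice_id, "language": language})
    return urllib.request.Request(
        API_URL,
        data=body.encode("utf-8"),
        method="POST",
        headers={"SAYIT-API-KEY": api_key, "Content-Type": "application/json"},
    )


def synthesize(request: urllib.request.Request, timeout: float = 10.0) -> dict:
    """Send the request and decode the JSON response (performs a network call)."""
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)


request = build_tts_request("sk_live_your_api_key_here", "Hello from SayIt.", "eve", "en")
print(request.get_method(), request.full_url)
```

Building and sending are kept separate so the request can be inspected or logged before any characters are spent. Calling `synthesize(request)` with a real key performs the actual call.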

03

Receive your audio URL

You get back a JSON response with a signed audio URL pointing to your generated MP3 (or WAV). The URL is valid for 24 hours — download it to your own storage for permanent access, or serve it directly for short-lived use cases.
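Because the URL stops working after 24 hours, it can help to check its age before serving it. The sketch below assumes the 24-hour window starts at the `created_at` timestamp shown in the response example. That starting point is an assumption, not something this page states, and the helper names are illustrative.

```python
from datetime import datetime, timedelta, timezone

URL_LIFETIME = timedelta(hours=24)  # validity period stated above


def parse_created_at(value: str) -> datetime:
    """Turn a timestamp like '2026-03-29T14:23:11Z' into an aware UTC datetime."""
    return datetime.strptime(value, "%Y-%m-%dT%H:%M:%SZ").replace(tzinfo=timezone.utc)


def url_expired(created_at: str, now: datetime) -> bool:
    """True once the assumed 24-hour window after created_at has passed."""
    return now >= parse_created_at(created_at) + URL_LIFETIME


created = "2026-03-29T14:23:11Z"
print(url_expired(created, datetime(2026, 3, 30, 9, 0, tzinfo=timezone.utc)))   # False
print(url_expired(created, datetime(2026, 3, 30, 15, 0, tzinfo=timezone.utc)))  # True
```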

04

Serve or download

Stream the URL directly to your users, download it to your server, embed it in your app, or pipe it into your video/podcast production workflow.
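If you go the download route, one possible approach is to stream the file to disk rather than holding it in memory. This is a sketch using the standard library. `download_audio` is a hypothetical helper, and the destination path is whatever suits your storage layout.

```python
import shutil
import urllib.request
from pathlib import Path


def download_audio(audio_url: str, destination: Path, timeout: float = 30.0) -> Path:
    """Copy the audio behind audio_url into destination and return the path."""
    destination.parent.mkdir(parents=True, exist_ok=True)
    with urllib.request.urlopen(audio_url, timeout=timeout) as response, \
            open(destination, "wb") as out:
        # Stream in chunks so large files never sit fully in memory
        shutil.copyfileobj(response, out)
    return destination


# Example (performs a network call), using the audio_url from a response:
# download_audio(result["audio_url"], Path("audio/welcome.mp3"))
```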

Straightforward pricing

Start free. Top up with crypto when you need more. No card, no KYC, no monthly deadlines - characters never expire.

Free
Try it, no card needed
$0 to start
10,000 chars
  • 10,000 characters
  • 5 voices
  • 60+ languages
  • MP3 output
  • Auto-translation
  • Usage dashboard
  • WAV output
  • Webhooks
  • Batch processing API
  • CSV export
Basic
Indie devs, bots & Web3 projects
$10 one-time
1,000,000 chars
  • 1,000,000 characters
  • 5 voices
  • 60+ languages
  • MP3 + WAV output
  • Auto-translation
  • Usage dashboard
  • Webhooks
  • Batch processing API
  • CSV export
Best value
Standard
DeFi apps, SaaS & content creators
$30 one-time
5,000,000 chars
  • 5,000,000 characters
  • 5 voices
  • 60+ languages
  • MP3 + WAV output
  • Auto-translation
  • Usage dashboard
  • Webhooks
  • Batch processing API
  • CSV export
Premium
Agencies, DAOs & high-volume apps
$100 one-time
20,000,000 chars
  • 20,000,000 characters
  • 5 voices
  • 60+ languages
  • MP3 + WAV output
  • Auto-translation
  • Usage dashboard
  • Webhooks
  • Batch processing API
  • CSV export
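To compare the paid tiers on a like-for-like basis, the short calculation below derives the price per million characters from the figures listed above. The plan names, prices, and character counts are taken from this page. Nothing else is assumed.

```python
# (price in USD, characters included) for each paid tier listed above
PLANS = {
    "Basic": (10, 1_000_000),
    "Standard": (30, 5_000_000),
    "Premium": (100, 20_000_000),
}


def usd_per_million(price_usd: int, characters: int) -> float:
    """Price of one million characters at the given tier."""
    return price_usd * 1_000_000 / characters


for name, (price, chars) in PLANS.items():
    print(f"{name}: ${usd_per_million(price, chars):.2f} per million characters")
# Basic: $10.00 per million characters
# Standard: $6.00 per million characters
# Premium: $5.00 per million characters
```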

What happens when you run out of characters?

API requests return a quota_exceeded error: no silent failures, no overage charges. We'll email you at 80% and again at 100% of your quota, and you can top up any time from your dashboard.
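On the client side, a sketch like the following could mirror those thresholds and recognise the error. The 80% and 100% marks and the `quota_exceeded` code come from the text above. The shape of the error body is not documented on this page, so the two layouts checked in `is_quota_exceeded` are assumptions, and both function names are hypothetical.

```python
def quota_status(used: int, total: int) -> str:
    """Classify usage against the 80% and 100% thresholds described above."""
    if total <= 0:
        raise ValueError("total must be positive")
    if used >= total:
        return "exhausted"
    if used * 100 >= total * 80:  # integer maths avoids float rounding
        return "warning"
    return "ok"


def is_quota_exceeded(error_body: dict) -> bool:
    # ASSUMPTION: the code arrives as {"error": "quota_exceeded"} or as
    # {"error": {"code": "quota_exceeded"}}; the real shape may differ.
    error = error_body.get("error")
    if isinstance(error, dict):
        error = error.get("code")
    return error == "quota_exceeded"


print(quota_status(7_500, 10_000))   # ok
print(quota_status(8_000, 10_000))   # warning
print(quota_status(10_000, 10_000))  # exhausted
print(is_quota_exceeded({"error": "quota_exceeded"}))  # True
```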

Need something bigger?

Volume pricing, dedicated infrastructure, custom SLAs, and 24/7 support — email us at enterprise@sayitapi.com.

Get in touch

Pay with Bitcoin, Ethereum, USDT, and 100+ other cryptocurrencies via NowPayments. No bank account required. Characters never expire — top up when you need to, not on a schedule.

One endpoint. That's it.

Built for developers

A clean, predictable REST API. POST text in, get a signed audio URL back. No SDKs required. No proprietary formats. No streaming setup.

1
One endpoint
POST /v1/tts - no separate voice listing, no streaming endpoints to manage.
2
Speech tags
Use <emphasis>, [pause], and <whisper> for fine control.
3
Audio URL response
The response includes a direct audio URL valid for 24 hours. Serve it straight to your users or download it to your own storage.
bash - request
curl -X POST https://api.sayitapi.com/v1/tts \
  -H "SAYIT-API-KEY: sk_live_your_api_key_here" \
  -H "Content-Type: application/json" \
  -d '{
    "text": "Hello! This is <emphasis>SayIt API</emphasis>. [pause] Powered by SayIt API.",
    "voice_id": "eve",
    "language": "en",
    "output_format": "mp3",
    "speech_tags": true,
    "speed": 1.0
  }'
json - response
{
  "id": "tts_01hxyz...",
  "status": "completed",
  "audio_url": "https://api.sayitapi.com/audio/01hxyz...mp3?expires=1743260591000&sig=a3f...",
  "duration_seconds": 3.24,
  "characters_used": 82,
  "voice_id": "eve",
  "created_at": "2026-03-29T14:23:11Z"
}
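One way to consume that response in Python is to map it onto a small typed record. The field names below are copied from the response example. The `TtsResult` class, the `parse_tts_response` helper, and the placeholder values in `sample` are illustrative only.

```python
import json
from dataclasses import dataclass, fields


@dataclass(frozen=True)
class TtsResult:
    id: str
    status: str
    audio_url: str
    duration_seconds: float
    characters_used: int
    voice_id: str
    created_at: str


def parse_tts_response(raw: str) -> TtsResult:
    """Decode a JSON response body into a TtsResult."""
    data = json.loads(raw)
    # Pick only the fields shown in the example so extra keys are ignored
    return TtsResult(**{f.name: data[f.name] for f in fields(TtsResult)})


# Placeholder values, not real API output
sample = json.dumps({
    "id": "tts_example",
    "status": "completed",
    "audio_url": "https://example.invalid/audio.mp3",
    "duration_seconds": 3.24,
    "characters_used": 82,
    "voice_id": "eve",
    "created_at": "2026-03-29T14:23:11Z",
})
result = parse_tts_response(sample)
print(result.status, result.characters_used)
```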

Built for

Wherever your project is, SayIt fits without friction.

DeFi & crypto apps

Add voice alerts, narration, and audio summaries to your DeFi dashboard or crypto project. Pay with crypto, no KYC, no bank account needed.

Indie hackers

Add voice to your SaaS product over a weekend. No vendor lock-in, no complex setup, predictable one-time costs.

Game developers

Prototype NPC dialogue instantly. Generate hundreds of spoken lines without a voice actor on call.

Accessibility tools

Make your product usable by more people. Reliable, clear speech synthesis for screen readers and audio UIs.

Voice agents

Give your LLM-powered assistant a real voice. Low-latency synthesis fits neatly into agent pipelines.

SaaS products

Ship voice notifications, audio summaries, or spoken reports without building TTS infrastructure yourself.

Ready to ship voice
in minutes?

Get your free API key and have audio in any of 60+ languages in under a second. Auto-translation is built in - write in one language, output in another. No credit card. No sales call. Just text in, MP3 out.

10,000 free characters • No credit card • Characters never expire