For AI agents: a documentation index is available at the root level at /llms.txt and /llms-full.txt. Append /llms.txt to any URL for a page-level index, or .md for the markdown version of any page.
ExamplesConsole
OverviewText to SpeechAPI ReferenceChangelog
OverviewText to SpeechAPI ReferenceChangelog
  • Get Started
    • Overview
    • Quickstart
    • Authentication
    • Models
    • API Limits
    • Official SDKs
  • Features
    • Voice Cloning
    • Language Support
    • Streaming
    • Emotion Control
    • SSML
    • Speech Marks
LogoLogo
ExamplesConsole
On this page
  • Introduction
  • What you can do
  • SDKs
  • Resources
Get Started

Overview

Convert text to natural-sounding speech with a single API call

Was this page helpful?

Quickstart

Make your first text-to-speech API call in 5 minutes

Next
Built with

Introduction

The Speechify API converts text into lifelike audio. Send text, get back audio — in MP3, OGG, AAC, WAV, or raw PCM.

Python
TypeScript
cURL
1from speechify import Speechify
2
3client = Speechify() # uses SPEECHIFY_API_KEY env var
4
5response = client.tts.audio.speech(
6 input="Welcome to Speechify!",
7 voice_id="george",
8 audio_format="mp3",
9)
10
11with open("output.mp3", "wb") as f:
12 f.write(response.audio_data)

What you can do

FeatureDescription
Text to SpeechConvert up to 2,000 characters per request
StreamingStream audio for up to 20,000 characters
Voice CloningClone any voice from a 10-30 second sample
Emotion ControlAdd emotions like cheerful, sad, angry to speech
50+ LanguagesFull support for 6 languages, 17 in beta, 26 coming soon
SSMLFine-grained control over pitch, rate, pauses, and emphasis

SDKs

Python

pip install speechify-api

TypeScript

npm install @speechify/api

REST API

Direct HTTP calls

Resources

  • Quickstart — Make your first API call in 5 minutes
  • API Reference — Full endpoint documentation
  • Examples Repository — End-to-end demo projects