Overview

Convert text to natural-sounding speech with a single API call

Introduction

The Speechify API converts text into lifelike audio. Send text, get back audio — in MP3, OGG, AAC, WAV, or raw PCM.

1from speechify import Speechify
2
3client = Speechify() # uses SPEECHIFY_API_KEY env var
4
5response = client.tts.audio.speech(
6 input="Welcome to Speechify!",
7 voice_id="george",
8 audio_format="mp3",
9)
10
11with open("output.mp3", "wb") as f:
12 f.write(response.audio_data)

What you can do

FeatureDescription
Text to SpeechConvert up to 2,000 characters per request
StreamingStream audio for up to 20,000 characters
Voice CloningClone any voice from a 10-30 second sample
Emotion ControlAdd emotions like cheerful, sad, angry to speech
50+ LanguagesFull support for 6 languages, 17 in beta, 26 coming soon
SSMLFine-grained control over pitch, rate, pauses, and emphasis

SDKs

Resources