API: Differentiated Rate Limits for Free and Paid Users

Rate limits now differ by plan type, to keep usage fair and performance consistent for everyone.

Rate Limits

Plan Type    Rate Limit
Free         1 request/second
Paid         20 requests/second

What This Means

  • Free users are now subject to a lower rate limit of 1 request per second
  • Paid users continue to enjoy the standard rate limit of 20 requests per second
  • When rate limited, responses include a Retry-After header indicating when to retry
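One way to avoid hitting the limit in the first place is to space requests out on the client. The sketch below is an illustration only, not part of the API or any official SDK: the Pacer class and its parameters are hypothetical names, and it assumes the limit can be respected by keeping a minimum interval between request starts (1 second on the Free plan, 0.05 seconds on the Paid plan).

```python
import time


class Pacer:
    """Hypothetical helper: spaces calls at least 1/rate_per_second apart."""

    def __init__(self, rate_per_second, clock=time.monotonic, sleep=time.sleep):
        self.min_interval = 1.0 / rate_per_second
        self._clock = clock          # injectable so the pacing can be tested
        self._sleep = sleep
        self._next_allowed = 0.0     # earliest time the next call may start

    def wait(self):
        """Block until the next call is allowed; return the seconds slept."""
        now = self._clock()
        delay = max(0.0, self._next_allowed - now)
        if delay > 0:
            self._sleep(delay)
        # Schedule the following call one interval after this one starts.
        self._next_allowed = max(now, self._next_allowed) + self.min_interval
        return delay


free_plan_pacer = Pacer(rate_per_second=1)    # Free: 1 request/second
paid_plan_pacer = Pacer(rate_per_second=20)   # Paid: 20 requests/second
```

Calling free_plan_pacer.wait() immediately before each request keeps a single client process under the Free limit. It does not coordinate across multiple processes or machines sharing one account, so handling 429 responses is still necessary.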

Affected Endpoints

  • POST /v1/audio/speech
  • POST /v1/audio/stream

If you exceed the rate limit, you’ll receive a 429 Too Many Requests response. Check the Retry-After header for guidance on when to retry your request.
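The retry behavior described above might be handled along these lines. This is a minimal sketch using only the Python standard library. The function names, the default delay, and the attempt cap are assumptions made for illustration, and the full URL, authentication headers, and request body are left as caller-supplied placeholders because this note does not specify them. This note also does not say which Retry-After format the API sends, so the parser accepts both forms that HTTP allows: a number of seconds or an HTTP date.

```python
import time
import urllib.error
import urllib.request
from datetime import datetime, timezone
from email.utils import parsedate_to_datetime


def retry_after_seconds(value, default=1.0, now=None):
    """Convert a Retry-After header value into a non-negative delay in seconds.

    Falls back to `default` (an assumed value) if the header is missing
    or cannot be parsed.
    """
    if value is None:
        return default
    value = value.strip()
    if value.isdigit():                      # delay-seconds form, e.g. "3"
        return float(value)
    try:
        when = parsedate_to_datetime(value)  # HTTP-date form
    except (TypeError, ValueError):
        return default
    if when.tzinfo is None:
        when = when.replace(tzinfo=timezone.utc)
    now = now or datetime.now(timezone.utc)
    return max(0.0, (when - now).total_seconds())


def post_with_retry(url, body, headers, max_attempts=5):
    """POST `body` (bytes) to `url`, retrying only on 429 Too Many Requests."""
    for attempt in range(max_attempts):
        request = urllib.request.Request(
            url, data=body, headers=headers, method="POST"
        )
        try:
            with urllib.request.urlopen(request) as response:
                return response.read()
        except urllib.error.HTTPError as err:
            # Re-raise anything other than 429, or a 429 on the last attempt.
            if err.code != 429 or attempt == max_attempts - 1:
                raise
            time.sleep(retry_after_seconds(err.headers.get("Retry-After")))
```

Capping the number of attempts keeps a persistently throttled client from retrying forever. Other error statuses are raised immediately, since retrying them would not help with rate limiting.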