API: Differentiated Rate Limits for Free and Paid Users
We’ve introduced differentiated rate limiting based on user plan type to ensure fair usage and optimal performance for all users.
Rate Limits
What This Means
- Free users are now subject to a lower rate limit of 1 request per second
- Paid users continue to enjoy the standard rate limit of 20 requests per second
- When rate limited, responses include a
Retry-Afterheader indicating when to retry
Affected Endpoints
POST /v1/audio/speechPOST /v1/audio/stream
If you exceed the rate limit, you’ll receive a 429 Too Many Requests response.
Check the Retry-After header for guidance on when to retry your request.