Skip to main content

Rate Limits

Rate limits are restrictions that our API enforces on how often users can access our services within a given time period. Rate limits can be identified via HTTP 429 error codes.

Rate Limits by Tier

TierRate Limit
Free1 request / sec
Paid (Developer or Enterprise)100 requests / sec
VPCUnlimited

There is an initial daily token limit of 10M tokens / day / account. If you want to raise this limit, please reach out to support@predibase.com.

Rate Limits in Headers

TierRate Limit
x-envoy-ratelimitedWhether the rate limit has been reached
x-ratelimit-limitThe max number of requests until the rate limit is reached
x-ratelimit-remainingThe remaining number of requests until the rate limit is reached
x-ratelimit-resetAmount of time (seconds) until you can query again

If you're interested in higher rate limits, please reach out to us here or via our Discord.