Rate Limits
Rate limits are restrictions that our API enforces on how often users can access our services within a given time period. Rate limits can be identified via HTTP 429 error codes.
Rate Limits by Tier
Tier | Rate Limit |
---|---|
Free | 1 request / sec |
Paid (Developer or Enterprise) | 100 requests / sec |
VPC | Unlimited |
There is an initial daily token limit of 10M tokens / day / account. If you want to raise this limit, please reach out to support@predibase.com.
Rate Limits in Headers
Tier | Rate Limit |
---|---|
x-envoy-ratelimited | Whether the rate limit has been reached |
x-ratelimit-limit | The max number of requests until the rate limit is reached |
x-ratelimit-remaining | The remaining number of requests until the rate limit is reached |
x-ratelimit-reset | Amount of time (seconds) until you can query again |
If you're interested in higher rate limits, please reach out to us here or via our Discord.