Use Predibase’s shared model deployments

Tier | Request Rate Limit | Daily Token Limit | Monthly Token Limit |
---|---|---|---|
Free | 1 request / sec | 1 million tokens / day | 10 million tokens / month |
Enterprise SaaS | 100 requests / sec | 1 million tokens / day | 10 million tokens / month |
Enterprise VPC | Does not apply | Does not apply | Does not apply |
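
To stay under a per-second request limit such as the ones in the table, a client can space its calls out before sending them. The following is a minimal sketch, not part of any Predibase SDK: the class name `MinIntervalThrottle` and its parameters are hypothetical, and the rate you pass in is whatever your tier allows.

```python
import time


class MinIntervalThrottle:
    """Client-side throttle that enforces a minimum gap between requests.

    Hypothetical helper for illustration; not a Predibase API.
    """

    def __init__(self, requests_per_second, clock=time.monotonic, sleep=time.sleep):
        self.min_interval = 1.0 / requests_per_second
        self._clock = clock      # injectable so the throttle can be tested
        self._sleep = sleep
        self._next_allowed = 0.0  # earliest time the next request may start

    def wait(self):
        """Block until the next request is allowed; return the seconds slept."""
        now = self._clock()
        delay = max(0.0, self._next_allowed - now)
        if delay > 0:
            self._sleep(delay)
        # Schedule the following request one interval after this one.
        self._next_allowed = max(now, self._next_allowed) + self.min_interval
        return delay
```

Call `wait()` immediately before each request, for example with `MinIntervalThrottle(1.0)` on a 1 request / sec limit. This only smooths the request rate; it does not track the daily or monthly token limits.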

Rate-limit status is reported through the following headers:

Header | Description |
---|---|
x-envoy-ratelimited | Whether the rate limit has been reached |
x-ratelimit-limit | The maximum number of requests allowed before the rate limit is reached |
x-ratelimit-remaining | The number of requests remaining before the rate limit is reached |
x-ratelimit-reset | Time, in seconds, until you can query again |
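
The headers above can drive a simple backoff decision on the client. The sketch below makes several assumptions: the headers arrive on the HTTP response, `x-envoy-ratelimited` carries the value `true` when the limit has been hit, and `x-ratelimit-reset` is a plain number of seconds. The function name `seconds_to_wait` and the `default_wait` fallback are hypothetical.

```python
from typing import Mapping


def seconds_to_wait(headers: Mapping[str, str], default_wait: float = 1.0) -> float:
    """Return how long to pause before the next request, given response headers.

    Hypothetical helper; header value formats are assumptions, not documented here.
    """
    # HTTP header names are case-insensitive, so normalize them first.
    h = {name.lower(): value.strip() for name, value in headers.items()}

    limited = h.get("x-envoy-ratelimited", "").lower() == "true"
    exhausted = h.get("x-ratelimit-remaining") == "0"
    if not (limited or exhausted):
        return 0.0  # budget left, no need to wait

    try:
        return max(0.0, float(h["x-ratelimit-reset"]))
    except (KeyError, ValueError):
        # Reset header missing or unparseable: fall back to a fixed pause.
        return default_wait
```

A caller would pass the response's headers to this function after each request and sleep for the returned number of seconds before retrying or sending the next one.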