Overview
Learn about Predibase VPC deployments, architecture, and deployment options
Predibase offers Virtual Private Cloud (VPC) deployments that give you complete control over your data:
- The dataplane is deployed in your own Virtual Private Cloud, so data never leaves your environment
- You maintain control while benefiting from SaaS agility
- Supported clouds: Amazon Web Services (AWS) and Microsoft Azure
Architecture
Predibase uses a dual-plane architecture consisting of:
- Controlplane: Handles business logic and orchestration
- Dataplane: Manages operations related to customer-sensitive data
Key Components
- Controlplane: Multi-tenant architecture managed by Predibase
- Dataplane: Secure environment deployed within your AWS or Azure cloud account
- Private Link: Optional direct connection between your applications and the Predibase dataplane
Deployment Options
1. Predibase Cloud (Fully Managed)
- Instant access to model training
- Complete infrastructure management by Predibase
- Includes both controlplane and dataplane operations
- Available as a free trial with limited features
2. VPC Deployment
- Dataplane runs in your Virtual Private Cloud (VPC)
- Data never leaves your environment
- Simple deployment via CloudFormation or Azure Resource Manager
- Automatic updates and upgrades
- Recommended: Create a dedicated cloud account for the Predibase dataplane
VPC Features
Direct Ingress
Direct ingress is a latency optimization for VPC customers that lets your applications connect directly to your LLM deployment during inference. By bypassing the Predibase controlplane, direct ingress removes network hops and reduces time to first token (TTFT) by tens to hundreds of milliseconds. The bypass also enhances security, because prompt requests and responses are processed only by customer-controlled infrastructure.
Predibase direct ingress uses AWS PrivateLink (Azure and GCP are also supported) to establish secure private connections between the customer-controlled VPC that hosts your application code and the Predibase-configured VPC that hosts your LLMs. PrivateLink connections remain within AWS-managed networks, which keeps traffic private. When both VPCs are located in the same AWS region, connections are established between regional hosts, minimizing network latency.
Direct ingress is available exclusively to VPC customers and requires onboarding assistance from Predibase support. To enable direct ingress for your tenant, please contact our support team.
See private deployments for more details on how to prompt using direct ingress.
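To check how much direct ingress changes TTFT for your own workload, you can time the first token of a streamed response on each path. The following is a minimal sketch, not a Predibase API: `time_to_first_token` and `fake_stream` are hypothetical names, and `fake_stream` stands in for whatever streaming iterator your client library returns.

```python
import time
from typing import Callable, Iterable, Iterator, Optional, Tuple


def time_to_first_token(
    stream: Iterable[str],
    clock: Callable[[], float] = time.perf_counter,
) -> Tuple[Optional[str], float]:
    """Return (first_token, seconds elapsed until it arrived).

    Returns (None, elapsed) if the stream ends without yielding a token.
    """
    start = clock()
    for token in stream:
        # Stop at the first token; later tokens are not consumed.
        return token, clock() - start
    return None, clock() - start


def fake_stream(delay_s: float) -> Iterator[str]:
    """Hypothetical stand-in for a streaming LLM response."""
    time.sleep(delay_s)  # simulates network and model latency before token 1
    yield "Hello"
    yield " world"


if __name__ == "__main__":
    token, ttft = time_to_first_token(fake_stream(0.05))
    print(f"first token {token!r} after {ttft * 1000:.1f} ms")
```

Running the same measurement against the standard endpoint and the direct ingress endpoint, over many requests, gives a like-for-like comparison of the two paths.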
Cloud Providers and Regions with GPU Availability
You can deploy Predibase into Amazon Web Services (AWS) or Microsoft Azure. Both cloud providers offer many regions; the curated lists below show the regions where our recommended GPUs are available.
AWS Regions
- us-east-1 (N. Virginia): Available
- us-east-2 (Ohio): Available upon Request
- us-west-2 (Oregon): Available
- ap-northeast-1 (Tokyo): Available upon Request
- eu-central-1 (Frankfurt): Available upon Request
- eu-south-2 (Spain): Available upon Request
Azure Regions
- us-east (East US): Available upon Request
- us-west-2 (West US 2): Available
- us-south-central (South Central US): Available upon Request
- europe-west (West Europe): Available upon Request
- australia-east (Australia East): Available upon Request
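If you validate deployment targets in tooling, the region lists above can be captured as a small lookup table. This is an illustrative sketch only: `REGIONS` and `region_status` are hypothetical names, the data is a snapshot copied from this page, and availability may change over time.

```python
from typing import Dict

AVAILABLE = "Available"
ON_REQUEST = "Available upon Request"
CONTACT_SUPPORT = "Contact support"  # fallback for anything not listed on this page

# Snapshot of the region lists on this page; identifiers are used as written here.
REGIONS: Dict[str, Dict[str, str]] = {
    "aws": {
        "us-east-1": AVAILABLE,
        "us-east-2": ON_REQUEST,
        "us-west-2": AVAILABLE,
        "ap-northeast-1": ON_REQUEST,
        "eu-central-1": ON_REQUEST,
        "eu-south-2": ON_REQUEST,
    },
    "azure": {
        "us-east": ON_REQUEST,
        "us-west-2": AVAILABLE,
        "us-south-central": ON_REQUEST,
        "europe-west": ON_REQUEST,
        "australia-east": ON_REQUEST,
    },
}


def region_status(cloud: str, region: str) -> str:
    """Return the listed status for a cloud/region pair, or a support fallback."""
    return REGIONS.get(cloud.lower(), {}).get(region, CONTACT_SUPPORT)


if __name__ == "__main__":
    print(region_status("aws", "us-west-2"))
    print(region_status("azure", "europe-west"))
```

Unlisted providers or regions fall through to the support fallback rather than raising an error, which mirrors the guidance to contact support for other clouds and regions.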
Need deployment in another cloud provider (e.g., Google Cloud Platform) or region? Contact us at support@predibase.com.