Overview

Predibase offers Virtual Private Cloud (VPC) deployments that give you complete control over your data:

The dataplane is deployed in your own Virtual Private Cloud, and your data never leaves your environment.
You maintain control of your data while benefiting from SaaS agility.
Supported clouds: Amazon Web Services (AWS) and Microsoft Azure.

Architecture

Predibase uses a dual-plane architecture consisting of:

Controlplane: Handles business logic and orchestration
Dataplane: Manages operations related to customer-sensitive data

Key Components

Controlplane: Multi-tenant architecture managed by Predibase
Dataplane: Secure environment deployed within your AWS or Azure cloud account
Private Link: Optional direct connection between your applications and the Predibase dataplane
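
As a rough illustration of the split described above, the following Python sketch models the two planes and a simple routing rule. The names Plane and plane_for are hypothetical and are not part of any Predibase SDK; the rule only restates the responsibilities listed above.

```python
from enum import Enum


class Plane(Enum):
    """The two planes named in the architecture description."""

    CONTROLPLANE = "controlplane"  # multi-tenant, managed by Predibase
    DATAPLANE = "dataplane"        # deployed in your AWS or Azure account


def plane_for(touches_customer_data: bool) -> Plane:
    """Hypothetical routing rule.

    Operations on customer-sensitive data belong to the dataplane;
    business logic and orchestration belong to the controlplane.
    """
    return Plane.DATAPLANE if touches_customer_data else Plane.CONTROLPLANE


if __name__ == "__main__":
    print(plane_for(True).value)   # an operation on customer data
    print(plane_for(False).value)  # an orchestration-only operation
```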

Deployment Options

1. Predibase Cloud (Fully Managed)

Instant access to model training
Complete infrastructure management by Predibase
Includes both controlplane and dataplane operations
Available as a free trial with limited features

2. VPC Deployment

Dataplane runs in your Virtual Private Cloud (VPC)
Data never leaves your environment
Simple deployment via CloudFormation or Azure Resource Manager
Automatic updates and upgrades
Recommended: Create a dedicated cloud account for the Predibase dataplane

VPC Features

Direct Ingress

Direct ingress is a latency-optimization feature for VPC customers that lets your applications connect directly to your LLM deployment during inference. By bypassing the Predibase controlplane, direct ingress removes network hops and reduces time to first token (TTFT) by tens to hundreds of milliseconds. The bypass also improves security, because prompt requests and responses are processed only by customer-controlled infrastructure.
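
To check the TTFT difference in your own environment, you can time how long a streaming request takes to produce its first chunk. The sketch below is a minimal, generic measurement helper. It does not call any Predibase API: time_to_first_token and fake_stream are hypothetical names, and fake_stream simulates a streaming response with a fixed delay.

```python
import time
from typing import Callable, Iterable, Optional, Tuple


def time_to_first_token(
    open_stream: Callable[[], Iterable[str]],
) -> Tuple[float, Optional[str]]:
    """Return (seconds until the first chunk arrives, that chunk).

    `open_stream` is any zero-argument callable that starts a request and
    returns an iterable of streamed chunks. The chunk is None if the
    stream ends without producing anything.
    """
    start = time.perf_counter()
    stream = iter(open_stream())
    first = next(stream, None)
    return time.perf_counter() - start, first


def fake_stream(delay_s: float = 0.05) -> Iterable[str]:
    """Stand-in for a streaming LLM response (hypothetical, no network)."""
    time.sleep(delay_s)  # simulated network and queueing delay
    yield "Hello"
    yield " world"


if __name__ == "__main__":
    ttft, token = time_to_first_token(fake_stream)
    print(f"first token {token!r} after {ttft * 1000:.1f} ms")
```

To compare the two request paths, pass a callable that opens the same streaming request through each endpoint, and average over repeated trials, since single measurements are noisy.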

Predibase direct ingress uses AWS PrivateLink (Azure and GCP also supported) to establish secure private connections between your customer-controlled VPC (hosting application code) and the Predibase-configured VPC (hosting LLMs). PrivateLink connections remain within AWS-managed networks, ensuring security and data privacy. When both VPCs are located in the same AWS region, connections are established between regional hosts, minimizing network latency.

Direct ingress is available exclusively to VPC customers and requires onboarding assistance from Predibase support. To enable direct ingress for your tenant, please contact our support team.

See private deployments for more details on how to prompt using direct ingress.

Cloud Providers and Regions with GPU Availability

You can deploy Predibase into Amazon Web Services (AWS) or Microsoft Azure. Both cloud providers offer many regions; the curated lists below show the regions where our recommended GPUs are available.

AWS Regions

us-east-1 (N. Virginia): Available
us-east-2 (Ohio): Available upon Request
us-west-2 (Oregon): Available
ap-northeast-1 (Tokyo): Available upon Request
eu-central-1 (Frankfurt): Available upon Request
eu-south-2 (Spain): Available upon Request

Azure Regions

us-east (East US): Available upon Request
us-west-2 (West US 2): Available
us-south-central (South Central US): Available upon Request
europe-west (West Europe): Available upon Request
australia-east (Australia East): Available upon Request
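
If you want to check a target region programmatically, for example in a provisioning script, the lists above can be captured in a small lookup table. This is a sketch only: REGIONS and availability are hypothetical names, and the data is copied by hand from the lists above, so it will not reflect later changes.

```python
from typing import Dict, Tuple

# (cloud, region) -> (display name, availability), copied from the lists above.
REGIONS: Dict[Tuple[str, str], Tuple[str, str]] = {
    ("aws", "us-east-1"): ("N. Virginia", "Available"),
    ("aws", "us-east-2"): ("Ohio", "Available upon Request"),
    ("aws", "us-west-2"): ("Oregon", "Available"),
    ("aws", "ap-northeast-1"): ("Tokyo", "Available upon Request"),
    ("aws", "eu-central-1"): ("Frankfurt", "Available upon Request"),
    ("aws", "eu-south-2"): ("Spain", "Available upon Request"),
    ("azure", "us-east"): ("East US", "Available upon Request"),
    ("azure", "us-west-2"): ("West US 2", "Available"),
    ("azure", "us-south-central"): ("South Central US", "Available upon Request"),
    ("azure", "europe-west"): ("West Europe", "Available upon Request"),
    ("azure", "australia-east"): ("Australia East", "Available upon Request"),
}


def availability(cloud: str, region: str) -> str:
    """Return the listed availability, or a pointer to support if unlisted."""
    entry = REGIONS.get((cloud.lower(), region.lower()))
    if entry is None:
        return "Not listed: contact support@predibase.com"
    return entry[1]


if __name__ == "__main__":
    print(availability("aws", "us-west-2"))
    print(availability("azure", "us-east"))
    print(availability("gcp", "us-central1"))
```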

Need deployment in another cloud provider (e.g., Google Cloud Platform) or region? Contact us at support@predibase.com.
