Data Flow to VPC Deployments
There are three ways to prompt a VPC deployment in Predibase:
UI
Requests sent via the Predibase Prompt UI go to the public endpoint serving.app.predibase.com, then pass through our SaaS controlplane and our internal network before reaching your dataplane and, ultimately, your LLM deployment.
SDK/API
SDK or API requests to serving.app.predibase.com are routed through our SaaS controlplane and internal network before reaching your deployment.
Direct Ingress
Direct ingress is a Predibase feature that uses services like AWS PrivateLink to create a direct connection between your VPC and your Predibase dataplane. Because traffic is routed through the cloud provider’s network, requests reach your deployment directly, without external hops.
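The three paths can be summarized as ordered lists of hops. The sketch below is purely illustrative: `ROUTES` and `crosses_saas_controlplane` are hypothetical names, not part of the Predibase SDK, and the hop labels simply paraphrase the descriptions above.

```python
# Illustrative model of the three request paths described above.
# ROUTES and crosses_saas_controlplane are hypothetical names, not Predibase APIs.
ROUTES = {
    "ui": [
        "serving.app.predibase.com",
        "Predibase SaaS controlplane",
        "Predibase internal network",
        "your dataplane",
        "your LLM deployment",
    ],
    "sdk_api": [
        "serving.app.predibase.com",
        "Predibase SaaS controlplane",
        "Predibase internal network",
        "your LLM deployment",
    ],
    "direct_ingress": [
        "your VPC",
        "cloud provider network (e.g. AWS PrivateLink)",
        "your Predibase dataplane",
        "your LLM deployment",
    ],
}


def crosses_saas_controlplane(route: str) -> bool:
    """Return True if requests on this route pass through the SaaS controlplane."""
    return "Predibase SaaS controlplane" in ROUTES[route]


for name in ROUTES:
    print(f"{name}: controlplane hop = {crosses_saas_controlplane(name)}")
```

Only the direct ingress path avoids the SaaS controlplane hop, which is the property that distinguishes it from the other two.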
Direct Ingress Setup
To enable direct ingress in your VPC:
- Contact Predibase support to enable the feature
- Create deployments with direct_ingress=True
- Use the VPC endpoint instead of serving.app.predibase.com
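As a rough sketch of the last step, a client can choose its base URL depending on whether the deployment uses direct ingress. The helper name `serving_base_url` and the example VPC endpoint hostname are hypothetical, and the use of HTTPS for the VPC endpoint is an assumption. Only serving.app.predibase.com comes from this page; your actual VPC endpoint address is whatever your cloud provider assigns.

```python
from typing import Optional

# Public serving endpoint named on this page.
PUBLIC_SERVING_HOST = "serving.app.predibase.com"


def serving_base_url(direct_ingress: bool, vpc_endpoint: Optional[str] = None) -> str:
    """Pick the base URL for prompts (hypothetical helper, not a Predibase API).

    Assumes the VPC endpoint is reachable over HTTPS.
    """
    if direct_ingress:
        if not vpc_endpoint:
            raise ValueError("direct ingress requires a VPC endpoint hostname")
        return f"https://{vpc_endpoint}"
    return f"https://{PUBLIC_SERVING_HOST}"


# Placeholder hostname for illustration only.
print(serving_base_url(True, "vpce-example.internal"))
print(serving_base_url(False))
```

Keeping the choice in one place means the rest of the client code does not need to know which network path a deployment uses.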

AWS PrivateLink Network Architecture
Data Storage
Fine-tuning Datasets
With the exception of files you explicitly upload to Predibase (under the ‘dataset connection page’), your data is never saved to disk on our servers.
- Credentials are stored in a secure HashiCorp Vault instance
- Credentials are read at runtime by fine-tuning workers in your VPC
- Workers use these credentials to access your datasets
- Files are saved to a private cloud storage bucket within your Predibase dataplane cloud account that you manage
Request Data
By default, Predibase never logs the prompts sent to your LLM deployments or the responses they generate.
Data Stored By Predibase
Controlplane
Predibase Metadata Database
- Model metadata
  - Example: Base model name, context length
- Adapter metadata
  - Example: Name, description, number of training steps, name of dataset
- Dataset metadata
  - Example: Number of rows, names of columns in the dataset
Predibase Observability Database
- Debug logs from deployments
  - Debug logs do not include prompts and responses by default. If we require more detailed logs to help triage issues, we will always contact you before turning them on.
- Debug logs from Predibase system components
- Metrics from LLM deployments and Predibase system-level components
  - Example: GPU utilization, CPU utilization, filesystem utilization, number of requests
Both the Predibase Metadata Database and Predibase Observability Database
encrypt data at rest. Requests going between our application and these databases
are encrypted via TLS. Both databases require secure authentication to
connect to them.