Skip to main content

pb.deployments.create

Note: This method is for creating a private serverless deployment. You can also query base or fine-tuned models via shared serverless endpoints.

pb.deployments.create

Create a new private serverless deployment

Parameters:

   name: str
Name of the private serverless deployment

   description: str, default None
Description for the deployment

   config: Deployment Config

Returns:

   Deployment

Example:

Create a new private serverless deployment

pb.deployments.create(
name="my-mistral-7b",
config=DeploymentConfig(
base_model="mistral-7b-instruct-v0-2",
# cooldown_time=3600, # Value in seconds, defaults to 43200 (12hrs)
min_replicas=0, # Auto-scales to 0 replicas when not in use
max_replicas=1
)
# description="", # Optional
)
Note regarding base_model

Use the short names provided in the list of available models.