SDK operations for managing model deployments
Set min_replicas=2 to ensure at least one replica is up to serve traffic during the update. Updating the replica counts of a deployment will not cause downtime.
lorax_image_tag="<current>" in the config parameter.
shared or private to return only deployments of the specified type.