Deployment functionality #222

vishnubob · 2023-12-25T21:22:31Z

I would like to be able to spin up and shut down deployments from the API. From looking over the API and the Python client, this doesn’t seem possible. Am I missing something, or would it be possible to add this functionality?

Thanks!

mattt · 2024-01-30T18:25:58Z

Hi, @vishnubob. You're correct that Replicate doesn't currently expose any APIs for managing deployments. However, you can configure your deployment with a min / max number of concurrent predictions to handle, and the autoscaler will spin model instances up and down based on inbound requests.

vishnubob · 2024-01-31T00:29:16Z

Hi @mattt, thanks for your response. I am using Replicate for an interactive photobooth, so my use case is a bit unusual. Since the installation is temporary, I only need the deployment while the installation is available. To reduce latency, I stand up a single-node deployment while the installation is available, and spin down the nodes when I strike. However, it's a complicated installation, and I sometimes forget to spin down the deployments during strike, so I end up paying for idle deployments. Being able to automate the deployment from the software would be a huge win.
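
A minimal sketch of the automation described here, under stated assumptions: `set_instances` is a hypothetical callable standing in for whatever call would set the deployment's instance count. As noted earlier in the thread, Replicate does not currently expose such an API, so this only illustrates the shape of the request (scale up when the installation opens, always scale down at strike).

```python
from contextlib import contextmanager
from typing import Callable, Iterator


@contextmanager
def warm_deployment(
    set_instances: Callable[[int], None],  # hypothetical: sets the instance count
    warm: int = 1,
) -> Iterator[None]:
    """Keep `warm` instances running inside the block, then scale to zero."""
    set_instances(warm)
    try:
        yield
    finally:
        # Runs on normal exit and on exceptions, so the scale-down
        # cannot be forgotten during strike.
        set_instances(0)
```

The photobooth software would wrap its main loop in `with warm_deployment(...):`, so shutting down the installation also shuts down the deployment.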

For now, I have transitioned this part of the project to Tailscale, which lets me use my own server at home, but if I could automate the deployment, I would switch back to using Replicate.

9108702032 mentioned this issue Jan 31, 2024

set up Billings? #239

Closed
