Save VM cost when using Rest API deploying models for online inference
ADB allows us to deploy the models for online inference through a REST API. To that aim ADB creates a VM dedicated to serve a specific model. Data Scientist can create and deploy several models for testing online inference, thus the cost can rapidly grow uncontroled and unnecessarily because those newly created VMs are running all the time.
It is possible to shut them down after certain period of inactivity?
2
Answers
Microsoft answered me that, in a future Azure Databricks version, Model serving will be improved – using containers instead of using virtual machines.
Yes you can enable auto scaling in Azure-Databricks(job cluster), this would terminate all except min # of workers(would be kept alive).
https://docs.databricks.com/clusters/configure.html#enable-and-configure-autoscaling