skip to Main Content

Save VM cost when using Rest API deploying models for online inference
ADB allows us to deploy the models for online inference through a REST API. To that aim ADB creates a VM dedicated to serve a specific model. Data Scientist can create and deploy several models for testing online inference, thus the cost can rapidly grow uncontroled and unnecessarily because those newly created VMs are running all the time.

It is possible to shut them down after certain period of inactivity?

2

Answers


  1. Chosen as BEST ANSWER

    Microsoft answered me that, in a future Azure Databricks version, Model serving will be improved – using containers instead of using virtual machines.


  2. Yes you can enable auto scaling in Azure-Databricks(job cluster), this would terminate all except min # of workers(would be kept alive).

    https://docs.databricks.com/clusters/configure.html#enable-and-configure-autoscaling

    Login or Signup to reply.
Please signup or login to give your own answer.
Back To Top
Search