Serving ML models at scale using Mlflow on Kubernetes – Part 2
25 October 2021 This article is the second part of a series in which we go through the process of logging models using Mlflow, serving them as an API endpoint, and finally scaling them up according to our application needs. We encourage you to read our previous article in which we show how to deploy a tracking instance on k8s and check the hands-on prerequisites (secrets, environment variables…) as we will continue to build upon them here. In the following, we show how to serve a machine learning model that is already registered in Mlflow and expose it as an API endpoint on k8s.






