What is Machine Learning Operations

If you are reading this and you are not me, you should navigate to this link: https://ml-ops.org/

There are so many basis and nice charts that explain What is Machine Learning Operations (or ml-ops). I’m doing a summary for me 🙂

Definition of MLOps

The term Machine Learning Operations is defined as “the extension of the DevOps methodology to include Machine Learning and Data Science assets as first-class citizens within the DevOps ecology.

Overview of the End-to-End Machine Learning Workflow

The main phases are:

  • Data Engineering: data acquisition & data preparation,
  • ML Model Engineering: ML model training & serving, and
  • Code Engineering :integrating ML model into the final product.
Machine Learning Engineering

Iterative-Incremental Process in MLOps

Agile ML Workflow

Automation

The level of automation of the Data, ML Model, and Code pipelines determines the maturity of the ML process.

  1. Manual process
  2. ML pipeline automation
  3. CI/CD pipeline automation

The following picture shows the automated ML pipeline with CI/CD routines:

Automated ML Pipeline

Data Version Control

If you have the lifecycle of your code operating under DevOps and you are using Git to manage all this, how do you manage the versions of data when adding machine learning projects? One possible answer is: Data Version Control.

DVC.org is an Open-source Version Control System for Machine Learning Projects. They have some nice videos where you can see how it works and how they implement so many of the best practices from ml-ops.

For instance, this introduction video: https://www.youtube.com/watch?v=UbL7VUpv1Bs&t=2s

Leave a Comment