https://entropia.de/GPN19:ML_Workflow_Tools_Overview https://pretalx.entropia.de/gpn19/talk/B3HDR3/
Modern ML workflow requires to run experiments fast at a large scale. In order to stay sane and keep an overview of what is going on, there are some tools out there.
We will demonstrate 2 tools in this workshop : ML flow and edflow. These tools focus on different parts of the development workflow : fast model iteration and monitoring (dashboard).
We will provide some basic code and instructions on how to add the tools to basic examples. We will cover a standard classification based problem to demonstrate a simple use-case.
- create new virtual environment, for example using conda
conda create --name ml_talk python=3.6
source activate ml_talk
- then install requirements
pip install -r mlflow_excersize/requirements.txt
pip install -r requirements_edflow.txt
https://github.com/pesser/edflow
edflow -t problem1_solution/train.yaml # train model
edflow -t problem1_solution/train.yaml -p project_folder # continue training model
edflow -t problem2_solution/train.yaml -e problem2_solution/validation.yaml # add validation
It consists only of adding some simple logging statements to mlflow_excersize/linear_model.py
and mlflow_excersize/linear_model_lasso.py
(compare mlflow_excersize/linear_model_mlflow.py
) and afterwards running mlflow ui --filestore mlflow_excersize/mlflow
.
The template is available under: https://github.com/LeanderK/cookiecutter-ml