This is a demonstration of how MLOps can be incorporated into an ML project, using the classic Iris dataset classification problem.
The raw dataset used in this repository was downloaded here. It is a CSV file with the following columns, from left to right:
- Sepal length (cm)
- Sepal width (cm)
- Petal length (cm)
- Petal width (cm)
- Target
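As a rough illustration of that column layout, the sketch below parses rows in this order using only the standard library. The column names, the `read_iris_rows` helper, the assumption that the file has no header row, and the two sample lines are all hypothetical and chosen for illustration; adjust them to match the actual file in the repository.

```python
import csv
import io

# Hypothetical column names, following the left-to-right order listed above.
COLUMNS = [
    "sepal_length_cm",
    "sepal_width_cm",
    "petal_length_cm",
    "petal_width_cm",
    "target",
]


def read_iris_rows(handle):
    """Parse CSV rows (assumed to have no header row) into dicts keyed by COLUMNS."""
    rows = []
    for record in csv.reader(handle):
        if not record:
            continue  # skip blank lines
        *features, target = record
        # The four measurements become floats; the target is kept as a string.
        row = dict(zip(COLUMNS[:-1], map(float, features)))
        row[COLUMNS[-1]] = target
        rows.append(row)
    return rows


# Illustrative sample lines standing in for the real CSV file.
SAMPLE = "5.1,3.5,1.4,0.2,0\n6.2,2.9,4.3,1.3,1\n"
rows = read_iris_rows(io.StringIO(SAMPLE))
print(rows[0])
```

To read the real file, pass an open file handle instead of the `io.StringIO` sample, for example `open(path, newline="")` with the path of the CSV in your checkout.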
In this project, the raw and prepared data are versioned using DVC. Because DVC is a Python package, it is listed in dev_requirements.in.
To fetch all the data needed for this project via DVC, run:
$ dvc pull