You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
An automated machine learning tool which trains, graphs performance and saves the model in SQL. Using interpretable ML models (from [interpretml](https://github.com/interpretml/interpret/)) to train models which are explainable and interpretable, so called 'glassbox' models. With the outputted model in SQL format which can be used to put a model in 'production' in an SQL environment.
23
+
An automated machine learning cli tool which trains, graphs performance and saves the model in SQL. Using interpretable ML models (from [interpretml](https://github.com/interpretml/interpret/)) to train models which are explainable and interpretable, so called 'glassbox' models. With the outputted model in SQL format which can be used to put a model in 'production' in an SQL environment.
24
24
This tool can be used by anybody, but is aimed for people who want to do a quick analysis and/or deploy a model in an SQL system.
4. Wait until all packages are installed (could take a few minutes)
72
-
5. You are ready to go (the virtual env does not need to be activated to use this tool)
52
+
1. Make sure you have python >= 3.8
53
+
2.`pip install ml2sql`
54
+
3. Run: `ml2sql init`
55
+
This will create the folders, `input/data/`, `input/configuration/ and `trained_models/`
73
56
74
57
<br>
75
58
</details>
@@ -78,9 +61,7 @@ This tool can be used by anybody, but is aimed for people who want to do a quick
78
61
<br>
79
62
80
63
1. In the terminal in the root of this folder run:
81
-
- `python3 run.py` (Mac/Linux)
82
-
- `python run.py` (Windows)
83
-
2. Follow the instructions on screen by selecting the example data and similarly named config file
64
+
`ml2sql run`, follow the instructions on screen and select the demo data and config
84
65
3. Check the output in the newly created folder
85
66
86
67
<br>
@@ -90,9 +71,7 @@ This tool can be used by anybody, but is aimed for people who want to do a quick
90
71
<br>
91
72
92
73
1. Save csv file containing target and all features in the `input/data/` folder (more info on [input data](#data))
93
-
2. In the terminal in the root of this folder run:
94
-
- `python3 run.py` (Mac/Linux)
95
-
- `python run.py` (Windows)
74
+
2. Run: `ml2sql run`
96
75
3. Select your CSV file
97
76
4. Select `Create a new config` and choose `Automatic` option (a config file will be made and can be edited later) (more info on [config json](#configuration-json))
98
77
5. Select newly created config
@@ -110,8 +89,7 @@ This tool can be used by anybody, but is aimed for people who want to do a quick
110
89
1. Make sure the new dataset has the same variables as the dataset the model was trained on (same features and target)
111
90
2. Save dataset in the `input/data/` folder (more info on [input data](#data))
112
91
3. In the terminal in the root of this folder run:
113
-
- `python3 check_model.py` (Mac/Linux)
114
-
- `python check_model.py` (Windows)
92
+
`ml2sql check-model`
115
93
4. Follow the instructions on screen
116
94
5. The output will be saved in the folder `trained_models/<selected_model>/tested_datasets/<selected_dataset>/`
117
95
@@ -256,12 +234,13 @@ Can be found in the created model's folder under `/model`
256
234
257
235
## Notes
258
236
- Limited to 3 models (EBM, linear/logistic regression, and Decision Tree).
259
-
- Data imbalance treatments (e.g., oversampling + model calibration) are not fully implemented.
237
+
- Data imbalance treatments (e.g., oversampling + model calibration) are not implemented.
260
238
- Only accepts CSV files.
261
239
- Interactions with more than 2 variables are not supported.
262
240
263
241
## TODO list
264
242
Check docs/TODO.md for an extensive list of planned features and improvements.
243
+
Feel free to open an issue in case a feature is missing or not working properly.
265
244
266
245
# Troubleshooting
267
246
If you encounter an unclear error message after following the instructions above, feel free to create an Issue on the GitHub repository.
- Compile dev requirements-dev.txt file: `python -m piptools compile --index-url=https://pypi.org/simple --extra dev -o docs/requirements-dev.txt -c docs/requirements.txt pyproject.toml`
23
-
(Making sure packages in both files have the same version, [stackoverflow source](https://stackoverflow.com/questions/76055688/generate-aligned-requirements-txt-and-dev-requirements-txt-with-pip-compile))
29
+
(Making sure packages in both files have the same version, [stackoverflow source](https://stackoverflow.com/questions/76055688/generate-aligned-requirements-txt-and-dev-requirements-txt-with-pip-compile))
0 commit comments