
Several improvements to NeuralNetBinaryClassifier #515

Merged: 9 commits into master from feature/improvements-to-binary-classifier, Sep 27, 2019

Conversation

BenjaminBossan
Collaborator

* more fit runs because of flaky output (fixes #514)
* add a classes_ attribute for things like CalibratedClassifierCV (fixes #465)
* make y_proba 2-dim, which works better with certain sklearn metrics (fixes #442)
* automatically squeeze the output if the module returns (n, 1) shape (fixes #502)

This could break backwards compatibility if someone relies on y_proba being 1-d.
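The shape change can be sketched with plain NumPy (values here are illustrative, not skorch's actual implementation):

```python
import numpy as np

# Hypothetical 1-d positive-class probabilities, as the old
# predict_proba returned them (illustrative values).
y_proba_old = np.array([0.2, 0.9, 0.6])

# The new behavior returns an (n, 2) array with P(class 0) and
# P(class 1) as columns, which sklearn metrics generally expect.
y_proba = np.stack([1 - y_proba_old, y_proba_old], axis=1)

assert y_proba.shape == (3, 2)
# The "old" 1-d y_proba is the second column:
assert np.allclose(y_proba[:, 1], y_proba_old)
```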
@BenjaminBossan
Collaborator Author

@ZaydH Since you worked with NeuralNetBinaryClassifier and ran into trouble with it, I would like to hear your opinion on this PR.

@@ -196,10 +196,10 @@ def test_predict_predict_proba(self, net, data, threshold):
        net.fit(X, y)

        y_pred_proba = net.predict_proba(X)
-       assert y_pred_proba.ndim == 1
+       assert y_pred_proba.shape[1] == 2
Member

Nit

Suggested change
assert y_pred_proba.shape[1] == 2
assert y_pred_proba.shape == (X.shape[0], 2)

    # pylint: disable=signature-differs
    def check_data(self, X, y):
        super().check_data(X, y)
        if get_dim(y) != 1:
            raise ValueError("The target data should be 1-dimensional.")

    def infer(self, x, **fit_params):
        y_infer = super().infer(x, **fit_params)
        if y_infer.dim() < 2:
Member

If we want to be really direct:

Suggested change
if y_infer.dim() < 2:
if y_infer.dim() == 1:

CHANGES.md Outdated

### Changed

- Improve numerical stability when using `NLLLoss` in `NeuralNetClassifer` (#491)
- `NeuralNetBinaryClassifier.predict_proba` now returns a 2-dim array; to access the "old" `y_proba`, take `y_proba[:, 1]`
Member

Add PR number?


    def test_with_calibrated_classifier_cv(self, net_fit, data):
        from sklearn.calibration import CalibratedClassifierCV
        cccv = CalibratedClassifierCV(net_fit, cv=3)
Member

Do we need the fitted module here?

If we care a lot about test run time, we could set cv=2?

Collaborator Author

Changed to cv=2. The net has to be initialized for this to work.
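For context, CalibratedClassifierCV relies on the base estimator exposing a classes_ attribute to map predict_proba columns back to labels, which is what this PR adds. A minimal NumPy sketch of that column-to-label mapping (names and values are illustrative):

```python
import numpy as np

# classes_ is what the PR adds to NeuralNetBinaryClassifier so that
# wrappers like CalibratedClassifierCV can interpret predict_proba.
y_train = np.array([0, 1, 1, 0, 1])
classes_ = np.unique(y_train)   # the sorted unique labels

# Column j of an (n, 2) predict_proba corresponds to classes_[j]:
proba = np.array([[0.7, 0.3],
                  [0.1, 0.9]])
predicted = classes_[proba.argmax(axis=1)]

assert classes_.tolist() == [0, 1]
assert predicted.tolist() == [0, 1]
```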

@@ -166,7 +166,7 @@ def test_not_fitted_raises(self, net_cls, module_cls, data, method):
               "before using this method.")
        assert exc.value.args[0] == msg

-    @flaky(max_runs=3)
+    @flaky(max_runs=5)
Member

That is getting pretty flaky indeed.

Collaborator Author

I ran a grid search (yay) to find better parameters. Hopefully the test is less flaky now.

@BenjaminBossan
Collaborator Author

@thomasjpfan Pls review again

@thomasjpfan (Member) left a comment

LGTM

    # pylint: disable=signature-differs
    def check_data(self, X, y):
        super().check_data(X, y)
        if get_dim(y) != 1:
            raise ValueError("The target data should be 1-dimensional.")

    def infer(self, x, **fit_params):
Member

I think a docstring that documents the automatic squeezing of module outputs would be helpful here (i.e., that it is done and why it is done).
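The squeezing being discussed can be sketched as follows (a NumPy analogue of the idea, not skorch's actual infer code):

```python
import numpy as np

def squeeze_module_output(y_infer):
    """Sketch of the automatic squeezing: if the module returns shape
    (n, 1), reduce it to (n,) so the binary loss and thresholding logic
    see a 1-d output; anything already 1-d passes through unchanged.
    (Hypothetical helper, for illustration only.)"""
    if y_infer.ndim == 2 and y_infer.shape[1] == 1:
        return y_infer.reshape(-1)
    return y_infer

assert squeeze_module_output(np.zeros((4, 1))).shape == (4,)
assert squeeze_module_output(np.zeros(4)).shape == (4,)
```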

skorch/tests/test_classifier.py (outdated, resolved)
BenjaminBossan and others added 4 commits September 17, 2019 21:11
* add a docstring
* correct error message when output dim > 2
* also works when more than 1 output is returned
@BenjaminBossan
Collaborator Author

@ottonemo I made the requested changes, plus a few more minor ones. Please review again.

@ottonemo ottonemo merged commit 0581bc6 into master Sep 27, 2019
@BenjaminBossan BenjaminBossan deleted the feature/improvements-to-binary-classifier branch October 13, 2019 11:39