
Make skorch work with sklearn 1.6.0, attempt 2 #1078

Merged
merged 2 commits into master from net-classes-inherit-from-baseestimator on Dec 18, 2024

Conversation

BenjaminBossan
Collaborator

Alternative to #1076

As described in that PR, skorch is currently not compatible with sklearn 1.6.0 or above. As per suggestion, instead of implementing __sklearn_tags__, this PR solves the issue by inheriting from BaseEstimator.

Related changes:

  • It is important to set the correct order when inheriting from BaseEstimator and, say, ClassifierMixin: BaseEstimator should come last (see the sketch below).
  • As explained in FIX Make skorch work with sklearn 1.6.0 #1076, using GridSearchCV with y being a torch tensor fails and two tests had to be adjusted.
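
For illustration, a minimal sketch of the intended ordering (simplified, not the exact skorch class hierarchy):

# Simplified sketch of the inheritance order: the mixin comes first,
# BaseEstimator comes last.
from sklearn.base import BaseEstimator, ClassifierMixin, RegressorMixin

class NeuralNetClassifier(ClassifierMixin, BaseEstimator):
    ...

class NeuralNetRegressor(RegressorMixin, BaseEstimator):
    ...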

Unrelated changes

  • Removed unnecessary imports from callbacks/base.py.

@BenjaminBossan
Collaborator Author

@adrinjalali As per your suggestion, I started inheriting from BaseEstimator and the switch was totally painless (inheritance order being the only small trap). This change should probably also play nicer with metadata routing. WDYT?


@adrinjalali adrinjalali left a comment


This is nice!

Two notes:

  • scikit-learn now supports the array API in many places, which means tensors remain tensors in cross validation etc. It's still experimental and in progress, but you might want to experiment with the config flag (see the sketch below).
  • You might want to have a set_fit_request like in this PR to properly enable metadata routing:

https://github.com/keras-team/keras/pull/20599/files#diff-88e81a2722091ada78e7d521623be2efde60a959a9ccb35cdc2c1a3de5af83faR99
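
As a minimal sketch of the config flag mentioned in the first note (this assumes the array-api-compat package is installed; LinearDiscriminantAnalysis is just one estimator that already supports the array API):

# Enable scikit-learn's experimental array API support so that torch tensors
# stay torch tensors instead of being converted to numpy arrays.
import sklearn
import torch
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis

sklearn.set_config(array_api_dispatch=True)

X = torch.randn(100, 4)
y = (X[:, 0] > 0).long()

lda = LinearDiscriminantAnalysis()
lda.fit(X, y)
print(type(lda.transform(X)))  # stays a torch.Tensor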

@BenjaminBossan
Collaborator Author

This is nice!

Okay, I'll close the other PR in favor of this one then.

  • scikit-learn now supports array API in many places, which means tensors remain tensors in cross validation etc. It's still experimental and in progress, but you might want to experiment with the config flag.

I'm not sure the error is directly related to the array API; I think it's more that sklearn is inconsistent in how it checks the input arrays. Here is an example showing that mixing numpy arrays and torch tensors works with LogisticRegression but not with GridSearchCV:

import torch
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import GridSearchCV

X, y = make_classification(1000, 20, n_informative=10, random_state=0)
yt = torch.tensor(y)

lr = LogisticRegression()
lr.fit(X, yt)  # works

gs = GridSearchCV(lr, param_grid={"C": [1, 10, 100]})
gs.fit(X, yt)  # fails

The error message is not very helpful:

ValueError: Input arrays use different devices: cpu, cpu

(both devices' reprs are "cpu", but one is the plain string "cpu" and the other torch.device("cpu"))

The reason is the check performed by lines like this:

https://github.com/scikit-learn/scikit-learn/blob/1922303a79aa776768e2ee89bbda5b6eb4dd5d8b/sklearn/metrics/_classification.py#L224

which causes an error in

https://github.com/scikit-learn/scikit-learn/blob/1922303a79aa776768e2ee89bbda5b6eb4dd5d8b/sklearn/utils/_array_api.py#L141

When this check is removed, the score is calculated correctly, but of course there is no guarantee that this holds in general.

  • You might want to have a set_fit_request like in this PR to properly enable metadata routing:

Probably. Right now it's not officially supported (i.e. there are no tests for routing), but let's leave that for another PR.

Member

@thomasjpfan thomasjpfan left a comment


LGTM

@@ -450,12 +451,13 @@ def test_grid_search_with_slds_works(
         gs = GridSearchCV(
             net, params, refit=False, cv=3, scoring='accuracy', error_score='raise'
         )
-        gs.fit(slds, y)  # does not raise
+        gs.fit(slds, to_numpy(y))  # does not raise
Member


Not supporting a mixture of NumPy arrays and CPU PyTorch tensors feels like a regression on scikit-learn's side.

Collaborator Author


For me, the question is: is it an intended change or was it introduced accidentally?

If it's unintended, one way to resolve this would be here:

https://github.com/scikit-learn/scikit-learn/blob/1922303a79aa776768e2ee89bbda5b6eb4dd5d8b/sklearn/utils/_array_api.py#L177

There would need to be an additional check: if

  1. one of the devices is an instance of torch.device,
  2. that device's device.type == "cpu",
  3. and the other device is the plain string "cpu",

then the combination should be allowed (see the sketch below).
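
Roughly like this (an illustrative sketch only, not sklearn's actual code or function names):

# Illustrative sketch of the proposed relaxation; names are made up for
# illustration and do not correspond to sklearn's implementation.
import torch

def _same_device(device_a, device_b):
    def normalize(device):
        # treat torch.device("cpu") the same as the plain string "cpu"
        if isinstance(device, torch.device) and device.type == "cpu":
            return "cpu"
        return device
    return normalize(device_a) == normalize(device_b)

assert _same_device(torch.device("cpu"), "cpu")
assert not _same_device(torch.device("cuda", 0), "cpu")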

Member

@thomasjpfan thomasjpfan Dec 16, 2024


Looks like there is already a bug report and a PR to fix it in scikit-learn:

scikit-learn/scikit-learn#29107 (comment)
scikit-learn/scikit-learn#30454

So this should be fixed when a sklearn release > 1.6 is out.
@BenjaminBossan BenjaminBossan merged commit 4f755b9 into master Dec 18, 2024
16 checks passed
@BenjaminBossan BenjaminBossan deleted the net-classes-inherit-from-baseestimator branch December 18, 2024 17:04
@antoinecollas antoinecollas mentioned this pull request Jan 6, 2025
githubnemo added a commit that referenced this pull request Jan 9, 2025
Please welcome skorch 1.1.0 - a smaller release with a few fixes, a new notebook showcasing learning rate schedulers, and, most importantly, support for scikit-learn 1.6.0.

Full list of changes:

### Added

- Added a [notebook](https://github.com/skorch-dev/skorch/blob/master/notebooks/Learning_Rate_Scheduler.ipynb) that shows how to use learning rate schedulers in skorch. (#1074)

### Changed

- All neural net classes now inherit from sklearn's [`BaseEstimator`](https://scikit-learn.org/stable/modules/generated/sklearn.base.BaseEstimator.html). This is to support compatibility with sklearn 1.6.0 and above. Classification models additionally inherit from [`ClassifierMixin`](https://scikit-learn.org/stable/modules/generated/sklearn.base.ClassifierMixin.html) and regressors from [`RegressorMixin`](https://scikit-learn.org/stable/modules/generated/sklearn.base.RegressorMixin.html). (#1078)
- When using the `ReduceLROnPlateau` learning rate scheduler, we now record the learning rate in the net history (`net.history[:, 'event_lr']` by default). It is now also possible to step per batch, not only per epoch (see the sketch after this list). (#1075)
- The learning rate scheduler `.simulate()` method now supports passing additional step args, which is useful when simulating policies such as `ReduceLROnPlateau` that expect metrics to base their schedule on. (#1077)
- Removed deprecated `skorch.callbacks.scoring.cache_net_infer` (#1088)
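
A minimal sketch of the `ReduceLROnPlateau` usage described above (parameter names follow skorch's `LRScheduler` callback; see the docs for details):

```python
import torch
from torch import nn
from skorch import NeuralNetClassifier
from skorch.callbacks import LRScheduler

class MyModule(nn.Module):
    def __init__(self):
        super().__init__()
        self.net = nn.Sequential(nn.Linear(20, 2), nn.Softmax(dim=-1))

    def forward(self, X):
        return self.net(X)

scheduler = LRScheduler(
    torch.optim.lr_scheduler.ReduceLROnPlateau,
    monitor='valid_loss',  # metric the scheduler reacts to
    step_every='epoch',    # stepping per 'batch' is now supported as well
)
net = NeuralNetClassifier(MyModule, callbacks=[scheduler])
# after calling net.fit(X, y), the recorded learning rates are available
# via net.history[:, 'event_lr']
```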

### Fixed

- Fix an issue with using `NeuralNetBinaryClassifier` with `torch.compile` (#1058)