
Fix array api in mean_absolute_percentage_error for older versions #29490


Merged · 9 commits merged into scikit-learn:main on Jul 18, 2024

Conversation

OmarManzoor
Contributor

Reference Issues/PRs

Follow up of #29300

What does this implement/fix? Explain your changes.

  • Fix an error that occurred in the doc-min-dependencies CI, which probably results from older versions of libraries like numpy and polars. Error in build log

Any other comments?


github-actions bot commented Jul 15, 2024

✔️ Linting Passed

All linting checks passed. Your pull request is in excellent shape! ☀️

Generated for commit: 87a634d. Link to the linter CI: here

@OmarManzoor
Contributor Author

OmarManzoor commented Jul 15, 2024

@lesteve @adrinjalali I did fix the error resulting from the mean_absolute_percentage_error function in the example plot_time_series_lagged_features.py, and it looks like the tests pass. However, if you check the output of ./build_tools/circle/build_doc.sh in the doc-min-dependencies CI, it seems that both mean_squared_error and mean_absolute_error are throwing the same error, but they are probably failing silently within the example, so the error is not shown in the output.

@lesteve
Member

lesteve commented Jul 15, 2024

Indeed, no idea what is happening, but https://app.circleci.com/pipelines/github/scikit-learn/scikit-learn/59983/workflows/84052e1d-6041-4a3d-89cf-fb076b09ba99/jobs/280390?invite=true#step-105-86409_95 shows a warning about the scoring failing with a similar stack trace as before. So I am guessing that the underlying issue is still there?

```
/home/circleci/project/sklearn/model_selection/_validation.py:997: UserWarning: Scoring failed. The score on this train-test partition for these parameters will be set to nan. Details:
Traceback (most recent call last):
  File "/home/circleci/project/sklearn/metrics/_scorer.py", line 139, in __call__
    score = scorer._score(
  File "/home/circleci/project/sklearn/metrics/_scorer.py", line 376, in _score
    return self._sign * self._score_func(y_true, y_pred, **scoring_kwargs)
  File "/home/circleci/project/sklearn/utils/_param_validation.py", line 213, in wrapper
    return func(*args, **kwargs)
  File "/home/circleci/project/sklearn/metrics/_regression.py", line 596, in root_mean_squared_error
    mean_squared_error(
  File "/home/circleci/project/sklearn/utils/_param_validation.py", line 186, in wrapper
    return func(*args, **kwargs)
  File "/home/circleci/project/sklearn/metrics/_regression.py", line 516, in mean_squared_error
    dtype = _find_matching_floating_dtype(y_true, y_pred, xp=xp)
  File "/home/circleci/project/sklearn/utils/_array_api.py", line 681, in _find_matching_floating_dtype
    floating_dtypes = [
  File "/home/circleci/project/sklearn/utils/_array_api.py", line 682, in <listcomp>
    a.dtype for a in dtyped_arrays if xp.isdtype(a.dtype, "real floating")
  File "/home/circleci/project/sklearn/utils/_array_api.py", line 442, in isdtype
    return isdtype(dtype, kind, xp=self)
  File "/home/circleci/project/sklearn/utils/_array_api.py", line 198, in isdtype
    return _isdtype_single(dtype, kind, xp=xp)
  File "/home/circleci/project/sklearn/utils/_array_api.py", line 215, in _isdtype_single
    return dtype in supported_float_dtypes(xp)
TypeError: Cannot interpret 'Int64' as a data type
```

@OmarManzoor
Contributor Author

Yes, and the error now shows up in functions that were updated before. Do we want to keep the numpy version, or can we update it? Otherwise we will need to add some kind of check in _find_matching_floating_dtype.

@betatim
Member

betatim commented Jul 15, 2024

> Do we want to keep the numpy version or can we update it?

The way I understand the "min" in doc-min-dependencies is that it uses the minimum version for all dependencies. So I think we can't update Numpy. Even if we do update it, I think we should give users a good experience when they use array API support with an older version of Numpy (maybe not as old as the oldest version scikit-learn supports, but older than Numpy 2, I'd say).

@OmarManzoor
Contributor Author

OmarManzoor commented Jul 15, 2024

I think then we would need to handle this special case in _find_matching_floating_dtype, or in _isdtype_single, where it actually occurs.

@betatim
Member

betatim commented Jul 15, 2024

Do you have a small code snippet that reproduces the problem?

I am trying to understand why we are calling array API code from an example. Naively I was expecting that the examples don't use the array API and hence we shouldn't touch any of the related code.

@OmarManzoor
Contributor Author

> Do you have a small code snippet that reproduces the problem?
>
> I am trying to understand why we are calling array API code from an example. Naively I was expecting that the examples don't use the array API and hence we shouldn't touch any of the related code.

I think to reproduce this we would need to downgrade a number of libraries used in this example, like numpy and polars. So, as @lesteve mentioned, follow the instructions in quick doc to set up an environment. After that we can simply run the example. I don't have a smaller snippet because I haven't been able to reproduce this even with the full example on my mac 😄

@lesteve
Member

lesteve commented Jul 15, 2024

Here is a snippet that reproduces for me (see versions in the doc-min-dependencies environment.yml; amongst other things polars 0.20.23 and numpy 1.19.5).

This one reproduces the issue in main but not in this PR:

```python
import polars as pl

s = pl.Series([1, 2, 3])

from sklearn.metrics import mean_absolute_percentage_error

mean_absolute_percentage_error(s, s)
```

I need to look more into reproducing the warnings; it looks like this is happening in joblib.Parallel for some reason ...

@betatim
Member

betatim commented Jul 15, 2024

Thanks for the example. I also struggle to get it to reproduce, but mostly because of the difficulty of getting an environment set up with the right versions (I suspect).

Looking around the code base for uses of _find_matching_floating_dtype I found

```python
dtype_float = _find_matching_floating_dtype(X, Y, xp=xp)
```

which makes me think we should indeed gate the call to it with a check that the array API is being used. But there are also lots of examples in sklearn/metrics/_regression.py where it is used without such a gate. Loic, could you try your reproducer with r2_score(s, s)? Maybe the problem exists in all of those and we just haven't noticed because they aren't used in a context where a polars series is passed in (I think that is another necessary ingredient for the reproducer).

@OmarManzoor
Contributor Author

OmarManzoor commented Jul 15, 2024

Basically the series dtype Int64 seems to throw an error when we hit this line:

```python
elif kind == "real floating":
    return dtype in supported_float_dtypes(xp)
```

But this works fine with newer versions, as we are just checking whether it matches the float dtypes.
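The failing membership check can be sketched in isolation (a minimal illustration, not scikit-learn's actual code; `isdtype_real_floating` and the hard-coded dtype tuple are assumptions for this sketch): wrapping the comparison in a try/except turns a foreign dtype that old numpy cannot interpret into a plain non-match instead of an error.

```python
import numpy as np

# Hedged sketch, NOT scikit-learn's actual implementation: the helper name
# and this hard-coded dtype tuple are assumptions for illustration only.
_REAL_FLOAT_DTYPES = (np.dtype(np.float16), np.dtype(np.float32), np.dtype(np.float64))


def isdtype_real_floating(dtype):
    """True for a real floating numpy dtype, False for anything else,
    including dtype objects whose comparison raises TypeError
    (as old numpy < 1.21 did for e.g. polars' Int64)."""
    try:
        return dtype in _REAL_FLOAT_DTYPES
    except TypeError:
        # e.g. "TypeError: Cannot interpret 'Int64' as a data type"
        return False
```

With a stand-in dtype whose `__eq__` raises TypeError (mimicking the old-numpy behaviour), the guarded check simply returns False rather than propagating the error.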

@lesteve
Member

lesteve commented Jul 15, 2024

> I also struggle to get it to reproduce, but mostly because of difficulty with getting an environment setup with the right versions (I suspect).

Just curious, I guess the difficulty is that for arm64 macOS it's hard to get older versions, because they have not been built in conda-forge?

Looking a bit further, I think this PR actually fixes the issue for mean_absolute_percentage_error, and the warnings come from the other metrics used in the example, which we did not notice indeed ...

I think a test should be added in sklearn/metrics/tests/test_common.py to check that a metric called on two polars series does what we expect. This would be tested against our min dependencies. I haven't figured out the polars and numpy combination that makes this issue happen.

As asked by Tim, r2_score has the same issue with polars Series.

I tweaked the example (see tweaked version) and it seems that other metrics are still problematic (root_mean_squared_error and mean_absolute_error). Here is the value of cv_results after running the example:

```python
{'fit_time': array([1.8457191 , 1.91570544, 1.12068701]),
 'score_time': array([0.01725125, 0.02016592, 0.01188159]),
 'test_MAPE': array([0.44300752, 0.27772182, 0.3697178 ]),
 'test_RMSE': array([nan, nan, nan]),
 'test_MAE': array([nan, nan, nan]),
 'test_pinball_loss_05': array([15.90543294, 18.45444449, 18.74125985]),
 'test_pinball_loss_50': array([19.07197374, 21.10759731, 18.4528097 ]),
 'test_pinball_loss_95': array([22.23851453, 23.76075013, 18.16435955])}
```

@OmarManzoor
Contributor Author

@lesteve Thank you for reporting. I think the fix in this PR might not be suitable in general. I think we need to handle the error within the functions that cause it. We cannot actually replace _find_matching_floating_dtype in all the places, as this function is used to find the appropriate float dtype even in cases that might not involve the array api.

@adrinjalali
Member

I think in a sense this is similar to #29452: while array api dispatch is not enabled, we see side effects. I think we should make sure there are no side effects when dispatch is not enabled.

@lesteve
Member

lesteve commented Jul 16, 2024

> I haven't figured out the polars and numpy combination that makes this issue happen.

For completeness, it seems like numpy 1.20.3 is the latest version that has the issue; numpy >= 1.21.0 works fine. I am guessing that for scikit-learn 1.6 we will be able to bump numpy enough that maybe we will avoid this issue?

Still, I share Adrin's concern: the coverage needs to be increased, because this can break in mysterious ways ...

@OmarManzoor
Contributor Author

I have removed the warning because it seems unnecessary.

@lesteve
Member

lesteve commented Jul 16, 2024

For completeness still, here is a snippet that shows the issue with numpy 1.20.3 and not with 1.21.0 (the polars version does not matter):

```python
import numpy as np
import polars as pl

s = pl.Series([1, 2, 3])

numpy_float64_dtype = np.dtype(np.float64)
# False
numpy_float64_dtype == pl.Int64
# False
s.dtype == numpy_float64_dtype
# TypeError: Cannot interpret 'Int64' as a data type
numpy_float64_dtype == s.dtype
```
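The asymmetry in the snippet above comes from how Python dispatches `==`: `x == y` calls `type(x).__eq__` first, so whichever dtype class sits on the left decides whether the comparison returns False or raises. A toy illustration (the `PoliteDtype`/`StrictDtype` classes are made-up stand-ins, not numpy's or polars' real dtypes):

```python
class PoliteDtype:
    """Stand-in for a dtype whose __eq__ simply answers False for
    foreign types (roughly how the polars dtype behaves above)."""
    def __eq__(self, other):
        return isinstance(other, PoliteDtype)


class StrictDtype:
    """Stand-in for old numpy's np.dtype, whose __eq__ tried to coerce
    the other operand into a dtype and raised TypeError instead of
    returning NotImplemented (fixed in numpy 1.21)."""
    def __eq__(self, other):
        if not isinstance(other, StrictDtype):
            raise TypeError("Cannot interpret operand as a data type")
        return True


polite, strict = PoliteDtype(), StrictDtype()

# Left operand's __eq__ runs first, returns a real value: no error.
assert (polite == strict) is False

# Now the strict dtype's __eq__ runs first and raises, like numpy < 1.21.
try:
    strict == polite
    raised = False
except TypeError:
    raised = True
assert raised
```

This is why `s.dtype == numpy_float64_dtype` returns False while `numpy_float64_dtype == s.dtype` raises.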

@lesteve
Member

lesteve commented Jul 16, 2024

I added a small common test to check metrics on pandas and polars series. I guess there is probably some room for improvement. This fails locally on main with numpy 1.20.3 and passes on this PR.

The most wonderful thing about it (this is sarcasm, just to be clear 😉) is that this is not going to make the CI red, because it seems like we don't even have a build for numpy 1.19 or numpy 1.20 ... and even if we had one, polars or pandas would not be installed anyway, so 😓 ...

I opened #29502 to actually use our minimum supported version in our min-dependencies-build and add pandas and polars into it.

@OmarManzoor
Contributor Author

OmarManzoor commented Jul 16, 2024

I think we might not be able to fix the codecov issue considering that the TypeError is not raised in the general case.

Member

@lesteve lesteve left a comment


I am going to approve this, but it would be great if someone looked at the test I added, because right now it looks quite minimal.

In particular maybe there is a better way to know which metrics support 1d input than using a try/except?
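The try/except probe being discussed can be sketched like this (illustrative only: `supports_1d` and the toy metrics below are assumptions for the sketch, not scikit-learn's actual test helpers): a metric is considered to support 1d input if calling it on two 1d arrays does not raise.

```python
import numpy as np


def supports_1d(metric):
    """Probe whether `metric` accepts two 1d arrays by just calling it,
    the try/except way mentioned in the review comment above."""
    y = np.asarray([1.0, 2.0, 3.0])
    try:
        metric(y, y)
        return True
    except (ValueError, TypeError):
        return False


def toy_mae(y_true, y_pred):
    # stands in for a regular regression metric that accepts 1d input
    return float(np.mean(np.abs(y_true - y_pred)))


def toy_pairwise(X, Y):
    # stands in for a metric that requires 2d input
    X = np.asarray(X)
    if X.ndim != 2:
        raise ValueError("expected 2d input")
    return X @ np.asarray(Y).T
```

The downside, as the comment hints, is that the probe cannot distinguish "does not support 1d input" from "crashed for an unrelated reason", which is why a more explicit allow-list might be preferable.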

```diff
@@ -439,7 +439,15 @@ def reshape(self, x, shape, *, copy=None):
         return numpy.reshape(x, shape)

     def isdtype(self, dtype, kind):
-        return isdtype(dtype, kind, xp=self)
+        try:
```
Member

@lesteve lesteve Jul 17, 2024


Thinking a bit more about it, we may as well put the try/except closer to where the problem happens, namely in _isdtype_single, i.e. have something like this:

```python
def _isdtype_single(dtype, kind, *, xp):
    try:
        ...  # all the current code of _isdtype_single
    except TypeError:
        return False
```

Contributor Author


Wouldn't it be better to keep it in the _NumPyAPIWrapper, because that is what is used in the default case, when array api dispatch is not enabled?

Member

@lesteve lesteve Jul 17, 2024


Short answer: I know a lot less than you about the array API code, so I would trust you on this 😉

Naively, I thought the problem would happen with _isdtype_single(dtype, "real floating", xp=np), but it doesn't, because np.float32 is not a dtype whereas xp.float32 is a dtype (i.e. np.dtype(np.float32)). Not sure if this is a very naive assumption on my part ...

In other words, with numpy < 1.21 here is the behaviour:

```python
import numpy as np
import polars as pl

from sklearn.utils._array_api import _isdtype_single
from sklearn.utils._array_api import _NumPyAPIWrapper

dtype = pl.Series([1, 2, 3]).dtype

# no issue because dtype is compared to np.float32 which is not a dtype
# I would have naively expected an error since I thought the comparison
# would be with np.dtype(np.float32)
_isdtype_single(dtype, "real floating", xp=np)


xp = _NumPyAPIWrapper()
# issue because under the hood dtype == np.float32 happens, which is an error
# TypeError: Cannot interpret 'Int64' as a data type
_isdtype_single(dtype, "real floating", xp=xp)
```
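To make the class-versus-dtype distinction in the comment above concrete (a small checkable aside, not part of the PR): `np.float32` is a scalar *type*, while `np.dtype(np.float32)` is a dtype *instance*, and only comparisons routed through `np.dtype.__eq__` could hit the old-numpy TypeError.

```python
import numpy as np

# np.float32 is a Python class (a numpy scalar type) ...
assert isinstance(np.float32, type)
# ... while np.dtype(np.float32) is a dtype instance, a different object.
assert isinstance(np.dtype(np.float32), np.dtype)
assert np.float32 is not np.dtype(np.float32)

# Comparing an arbitrary object to the *class* np.float32 is plain object
# equality and simply answers False, with no dtype coercion involved:
assert not (object() == np.float32)
```

This matches the observation that `xp=np` (comparison against the class) sails through, while the wrapper (comparison against dtype instances) triggers the error on numpy < 1.21.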

Contributor Author


I understand your point, and to handle all scenarios I think it would make sense to catch the exception in _isdtype_single. However, I don't think we would want to use this function directly from the code. As for any case other than numpy: since we would have array apis, I think this would break anyway, because series objects would probably not be compatible with other array types.

@OmarManzoor
Contributor Author

> I am going to approve this but it would be great if someone looked at the test I added because right now, it looks quite minimal.

I think the tests look fine, since we are just checking that polars and pandas work without any errors.

@OmarManzoor
Contributor Author

@adrinjalali @betatim Could you have a look at this PR, so that we can resolve the CI failures that are occurring?

Member

@adrinjalali adrinjalali left a comment


Otherwise LGTM.

@lesteve lesteve enabled auto-merge (squash) July 18, 2024 15:35
@lesteve
Member

lesteve commented Jul 18, 2024

I enabled auto-merge, let's make doc-min-dependencies green again 💚!

@lesteve lesteve merged commit 21ab5e1 into scikit-learn:main Jul 18, 2024
28 checks passed
@OmarManzoor OmarManzoor deleted the fix_array_api_mape_error branch July 19, 2024 07:26