API Standardize `X` as `inverse_transform` input parameter #28756

williambdean · 2024-04-02T19:11:36Z

Reference Issues/PRs

closes #27654
related to #27666

What does this implement/fix? Explain your changes.

This changes all X to Xt in inverse_transform methods. This then makes the API:

Transform: Xt = transformer.transform(X)
Inverse Transform: X = transformer.inverse_transform(Xt)

Any other comments?

Hi @glemaitre . I am finally getting back to this!

It turns out that X is much more common as a parameter than Xt. I've started with a few in order to get some feedback before rinsing and repeating the changes. Please let me know if you have any feedback on the:

deprecation helper function
Docstring changes
Changes to variable names

One thing I suspect is that tests might have used keyword arguments as well and thus cause warnings. I will switch if that is the case

github-actions · 2024-04-02T19:12:49Z

✔️ Linting Passed

All linting checks passed. Your pull request is in excellent shape! ☀️

_{Generated for commit: a4e3fe4. Link to the linter CI: here}

betatim · 2024-04-03T12:44:01Z

The approach looks reasonable to me. It is based on how the deprecation of Y for y is done.

For me the big question is if it is worth the effort to do this. What is the problem we are fixing by doing this (except for solving a naming inconsistency)?

williambdean · 2024-04-03T16:00:17Z

The approach looks reasonable to me. It is based on how the deprecation of Y for y is done.

For me the big question is if it is worth the effort to do this. What is the problem we are fixing by doing this (except for solving a naming inconsistency)?

I encountered it while trying to switch a Pipeline out with a StandardScaler and using kwargs.
This was my reproducible example from the issue:

# Has Xt arg
pipeline = make_pipeline(StandardScaler())
Xt = pipeline.fit_transform(X)
X_again = pipeline.inverse_transform(Xt=Xt)

# inverse_transform takes X instead of Xt
transformer = StandardScaler()
Xt = transformer.fit_transform(X)
X_again = transformer.inverse_transform(Xt=Xt)
# TypeError: inverse_transform() got an unexpected keyword argument 'Xt'

Pipelines might be the most used object but I'd think having a uniformed API with the transformers themselves might be good.

What are your thoughts?

betatim · 2024-04-04T09:03:33Z

I agree it is a bit annoying that using/not using a Pipeline changes whether your code works or not.

The big question for me is how typical transformer.inverse_transform(Xt) vs transformer.inverse_transform(Xt=Xt) is. My guess would be that most people use the "positional argument" version. So they never run into the problem that the argument is sometimes called X and sometimes Xt. However I don't know how we could resolve this question with some data, instead of people's hunches :-/

Maybe other people/maintainers have an opinion? @scikit-learn/core-devs

GaelVaroquaux · 2024-04-04T09:17:15Z

I'm not really in favor of such a change:

It breaks backward compatibility for cosmetic reason, which I something I frown on, as it is pushing pain to our users
It breaks consistency across models, all the examples that I looked up use "X" in inverse transform (PCA https://scikit-learn.org/stable/modules/generated/sklearn.decomposition.PCA.html#sklearn.decomposition.PCA.inverse_transform , RobustScaler, feature selection object...)

jeremiedbb · 2024-04-04T10:22:42Z

Here's the list:

CCA                           X
DictVectorizer                X
FastICA                       X
FeatureAgglomeration          Xt
FunctionTransformer           X
GaussianRandomProjection      X
GenericUnivariateSelect       X
GridSearchCV                  Xt
IncrementalPCA                X
KBinsDiscretizer              Xt
KernelPCA                     X
LabelBinarizer                Y
LabelEncoder                  y
MaxAbsScaler                  X
MinMaxScaler                  X
MiniBatchNMF                  Xt
MiniBatchSparsePCA            X
MultiLabelBinarizer           yt
NMF                           Xt
OneHotEncoder                 X
OrdinalEncoder                X
PCA                           X
PLSCanonical                  X
PLSRegression                 X
Pipeline                      Xt
PowerTransformer              X
QuantileTransformer           X
RFE                           X
RFECV                         X
RandomizedSearchCV            Xt
RobustScaler                  X
SelectFdr                     X
SelectFpr                     X
SelectFromModel               X
SelectFwe                     X
SelectKBest                   X
SelectPercentile              X
SequentialFeatureSelector     X
SimpleImputer                 X
SparsePCA                     X
SparseRandomProjection        X
StandardScaler                X
TruncatedSVD                  X
VarianceThreshold             X

It's true that most estimators use X. However it's unfortunate that the main meta-estimators (Pipeline, Search*) use Xt, as explained in #28756 (comment).

I agree that we should not change for cosmetic reasons. But the inconsistency between most estimators and the main meta-estimators gives a bad user experience that we can improve. I don't know however how many users are concerned, i.e. using kwargs for X.

Side note: only the users who pass X as kwarg would actually see the API change and the deprecation warnings and this change aims to improve the experience of these users. It would be transparent for other users.

So overall I'm +0.5 here.

betatim · 2024-04-04T11:22:36Z

I didn't count but from scrolling through the list of estimators it seems many of them use X. Maybe, if we do anything, we should standardise on X?

GaelVaroquaux · 2024-04-04T13:27:49Z

Maybe, if we do anything, we should standardise on X?

That's my feeling

jeremiedbb · 2024-04-04T13:44:23Z

mine too at first, but notice that the Xts are in Pipeline, GridSearchCV, RandomizedSearchCV, ..., so changing them might impact as many users as changing the others :/

thomasjpfan · 2024-04-04T14:44:10Z

I feel like the impact is minimum because it only impacts users that write inverse_transform(Xt=...). In all our documentation, we always set the input of inverse_transform as a positional argument.

We'll still go through a deprecation cycle and the change is fairly simple for users to make. (Make the input positional or use X=...).

williambdean · 2024-04-04T15:28:45Z

Thanks everyone for the feedback!

Thanks for making the list @jeremiedbb. Very helpful!
My sense is that Pipeline, GridSearchCV, RandomizedSearchCV would be the most used and since they use Xt then that should be the way to go.

Personally, X doesn't make much sense to me if
Xt = transform(X)
since then
X = inverse_transform(Xt)

I want to note that the goal is to be backwards compatible. That would mean:

No warning with positional arguments. instance.inverse_transform(Xt)
Standard deprecation warning if keyword X. instance.inverse_transform(X=...)
ValueError if both keywords are used

I will fix all of the examples in docstrings if they use a keyword example. However, I don't think any do at first glance. https://github.com/search?q=repo%3Ascikit-learn%2Fscikit-learn%20%22inverse_transform(X%3D%22&type=code

I didn't have None for Xt which might have caused some confusion. I've added that with the latest commit!

lorentzenchr · 2024-04-04T16:21:55Z

+1 for making it consistent. For me, it‘s not a cosmetic change, but a real flaw. The sooner the fix the better.

williambdean · 2024-04-04T17:00:38Z

+1 for making it consistent. For me, it‘s not a cosmetic change, but a real flaw. The sooner the fix the better.

🎉

If others have the same idea, then I will go through with the rest of the implementation!

lorentzenchr · 2024-04-04T17:28:39Z

@wd60622 I‘m just one voice and @GaelVaroquaux has a different view than me. So it’s not decided yet.

chkoar · 2024-04-05T10:56:10Z

I am not a core dev but regarding the change I am aligned with @GaelVaroquaux.
Also, for me the context has already be provided by the method itself inverse_transform so there is not need for Xt.
For consistency I would prefer everywhere X.
However, I expect that most of us use inverse_transform with positional argument.

Regarding the impact I agree with @thomasjpfan

I feel like the impact is minimum because it only impacts users that write inverse_transform(Xt=...). In all our documentation, we always set the input of inverse_transform as a positional argument.

We'll still go through a deprecation cycle and the change is fairly simple for users to make. (Make the input positional or use X=...).

A public poll regarding the use of positional or keyword arguments usage in inverse_transform would provide a sense of the impact of the change.

lorentzenchr · 2024-04-05T15:16:48Z

A public poll

That's expensive and I would dedicate such a poll to much more impactful questions.

GaelVaroquaux · 2024-04-05T15:19:57Z

@wd60622 I‘m just one voice and @GaelVaroquaux has a different view than me. So it’s not decided yet.

Consistency is a good thing. Hence given the summary above showing that X and Xt are used, I can be convinced :) I'm slightly more in favor of converging to X than to Xt.

williambdean · 2024-04-06T07:51:54Z

Consistency is a good thing.

Totally, agree. Hence the PR 😄

Hence given the summary above showing that X and Xt are used, I can be convinced :) I'm slightly more in favor of converging to X than to Xt.

Is there an issue with the name Xt? Why is X better than Xt given my argument that that name doesn't make much sense?
Even though more transformers use X, I think that should be weighted by the frequency of use. To me, the Pipeline, GridSearchCV, RandomizedSearchCV are the most used. Since they use Xt, the scale leans toward Xt if it is a weighted average.
However, my usage weights might be a wrong assumption. Just my gut off using the package the last few years

lorentzenchr · 2024-04-06T14:35:34Z

I'm slightly more in favor of converging to X than to Xt.

+1. Makes the API simpler.

I guess we have reached a decision.

williambdean · 2024-04-06T15:27:26Z

I'm slightly more in favor of converging to X than to Xt.

+1. Makes the API simpler.

I guess we have reached a decision.

And that is X?

williambdean · 2024-04-06T16:02:10Z

X it is!

betatim · 2024-04-29T09:42:48Z

sklearn/utils/deprecation.py

+def _deprecate_Xt_in_inverse_transform(X, Xt):
+    """Helper to deprecate the `Xt` argument in favor of `X` in inverse_transform."""
+    if X is not None and Xt is not None:
+        raise ValueError("Cannot use both X and Xt. Use X only.")


Sorry for being late to the party with my comment. I'd use a TypeError here as well. It seems more consistent with the case below where the user forgot to pass any argument. ValueError makes me think that I passed the wrong value, not that I tried to use an argument I shouldn't.

betatim · 2024-04-29T09:43:59Z

sklearn/utils/deprecation.py

+        raise ValueError("Cannot use both X and Xt. Use X only.")
+
+    if X is None and Xt is None:
+        raise TypeError("Missing required positional argument: 'X'.")
+
+    if Xt is not None:
+        warnings.warn(
+            "Xt was renamed X in version 1.5 and will be removed in 1.7.",


nit pick: can we use 'X' and 'Xt' everywhere instead of mixing X and 'X'?

+1 for not mixing. However I then prefer bare X because we usually don't put parameter names between quotes in other error/warning messages. Is it ok for you ?

So in strings, variables should have single quote surrounding? That is, X -> 'X'
Does this go for the docstring changes as well (if outside of the normal parameter definitions)?

I defer to Jeremie for what the correct formatting is. It sounds like it should be "Do not pass X and Xt at the same time". Not "Do not pass 'X' and 'Xt' at the same time".

betatim

Looks good to me. Just two small cosmetic comments. Good to merge after we resolve them.

williambdean · 2024-04-29T10:14:44Z

sklearn/utils/deprecation.py

+    if X is None and Xt is None:
+        raise TypeError("Missing required positional argument: 'X'.")


The TypeError makes sense here @betatim ?

I'd say so. For me passing an argument or missing a required argument is a TypeError.

sklearn/cluster/tests/test_feature_agglomeration.py

sklearn/decomposition/tests/test_nmf.py

sklearn/model_selection/tests/test_search.py

sklearn/preprocessing/tests/test_discretization.py

sklearn/tests/test_pipeline.py

sklearn/utils/deprecation.py

sklearn/cluster/tests/test_feature_agglomeration.py

sklearn/model_selection/tests/test_search.py

sklearn/preprocessing/tests/test_discretization.py

sklearn/tests/test_pipeline.py

sklearn/utils/deprecation.py

williambdean · 2024-04-29T13:25:30Z

I have made the adjustments based on the latest feedback @betatim @jeremiedbb. Hope they're squashed 😆

jeremiedbb · 2024-04-29T13:28:21Z

@wd60622 I think that you misunderstood, the decision was not to put them between quotes. Can you please revert your latest changes, or I can do it if you prefer ?

williambdean · 2024-04-29T13:58:59Z

@wd60622 I think that you misunderstood, the decision was not to put them between quotes. Can you please revert your latest changes, or I can do it if you prefer ?

Sorry. I did but them between quotes though! Not sure what you mean then

Yeah, you can revert!

jeremiedbb · 2024-04-29T14:02:25Z

the decision was not to put them between quotes.

"not to" 😄 . Don't worry, I'll revert.

add example deprecation implementation

79d6eed

github-actions bot added module:decomposition module:utils labels Apr 2, 2024

Xt and y example

cc036fb

Merge branch 'main' into standardize-inverse-transform

86686b0

use None value for backward compat positional args

d887f29

williambdean changed the title ~~Standardize Xt as inverse_transform argument~~ Standardize Xt as inverse_transform parameter Apr 4, 2024

lorentzenchr added the Needs Decision Requires decision label Apr 4, 2024

lorentzenchr removed the Needs Decision Requires decision label Apr 6, 2024

betatim reviewed Apr 29, 2024

View reviewed changes

betatim approved these changes Apr 29, 2024

View reviewed changes