Describe the workflow you want to enable

I would like to pass sample properties to the response method (e.g. `predict`) called by a scorer.

For example, the fairlearn package has a `ThresholdOptimizer` estimator which needs (in addition to `X` and `y`) a `sensitive_features` argument for both `fit` and `predict`.

AFAICT I can pass arguments to the score function (the metric), but not to the response method of the estimator:
```python
import numpy as np
import sklearn
from sklearn.dummy import DummyClassifier
from sklearn.metrics import accuracy_score, make_scorer
from fairlearn.postprocessing import ThresholdOptimizer
from fairlearn.metrics import demographic_parity_difference

sklearn.set_config(enable_metadata_routing=True)

rng = np.random.default_rng(0)
X = rng.normal(size=(10, 3))
y = rng.integers(0, 2, size=X.shape[0])
sensitive = rng.integers(0, 2, size=X.shape[0])

classifier = (
    ThresholdOptimizer(estimator=DummyClassifier(), predict_method="auto")
    .set_fit_request(sensitive_features=True)
    .set_predict_request(sensitive_features=True)
    .fit(X, y, sensitive_features=sensitive)
)

scoring = make_scorer(accuracy_score)
scoring(classifier, X, y, sensitive_features=sensitive)
# TypeError: predict() missing 1 argument -- how could I pass
# `sensitive_features` to predict()?

# Passing arguments to the score function (demographic_parity_difference) is OK:
classifier = DummyClassifier().fit(X, y)
scoring = make_scorer(
    demographic_parity_difference, greater_is_better=False
).set_score_request(sensitive_features=True)
scoring(classifier, X, y, sensitive_features=sensitive)
```
This also applies when a scorer is used indirectly, for example in `cross_validate`.
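Here is a minimal sketch of how the same limitation surfaces through `cross_validate`, assuming the `ThresholdOptimizer` pipeline and the accuracy `scoring` from the first snippet above. With metadata routing enabled, `params` reach `fit` (which requested `sensitive_features`), but the scorer has no way to forward them to `predict`:

```python
from sklearn.model_selection import cross_validate

# `sensitive_features` is routed to fit() via set_fit_request above, but the
# scorer cannot forward it to predict(), so scoring fails the same way.
cross_validate(
    classifier,  # the ThresholdOptimizer pipeline from the snippet above
    X,
    y,
    scoring=scoring,
    params={"sensitive_features": sensitive},
    cv=2,  # small cv for the tiny toy dataset
)
```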
Describe your proposed solution
Maybe the scorers could have a method like `set_predict_request` or `set_response_request` to specify which parameters should be forwarded to the response method?
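A hypothetical sketch of what such an API could look like; `set_response_request` does not exist in scikit-learn today, so the name and behavior here are assumptions of this proposal:

```python
# Hypothetical API (not implemented): ask the scorer to forward
# `sensitive_features` to the estimator's response method.
scoring = make_scorer(accuracy_score).set_response_request(
    sensitive_features=True
)

# The scorer would then call something like
#     classifier.predict(X, sensitive_features=sensitive)
# internally, instead of predict(X).
scoring(classifier, X, y, sensitive_features=sensitive)
```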
Describe alternatives you've considered, if relevant
No response

Additional context

https://fairlearn.org/v0.9/api_reference/generated/fairlearn.postprocessing.ThresholdOptimizer.html
https://fairlearn.org/v0.9/api_reference/generated/fairlearn.metrics.demographic_parity_difference.html
So for fairlearn, the scorers can themselves be routers and report the metadata required by the `response_method` as requested by the scorer; then things should work out of the box with the tools for which we've implemented routing here in scikit-learn.

As for scikit-learn's own scorers, yes, we should make this happen, but it's not going to be an easy one, I think.
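A rough sketch of what such a router scorer could look like on the fairlearn side. The class name and structure are assumptions; only the metadata-routing utilities used here are existing scikit-learn API, and it relies on the sub-estimator having requested `sensitive_features` for `predict` (as the `set_predict_request(sensitive_features=True)` call above does):

```python
from sklearn.metrics import accuracy_score
from sklearn.utils.metadata_routing import (
    MetadataRouter,
    MethodMapping,
    process_routing,
)


class RoutedScorer:
    """Sketch of a scorer that is itself a metadata router (assumed design,
    not an existing fairlearn or scikit-learn class)."""

    def __init__(self, metric, estimator):
        self.metric = metric
        self.estimator = estimator

    def get_metadata_routing(self):
        # Declare that metadata requested by the estimator's predict()
        # flows through this scorer's "score" call.
        return MetadataRouter(owner=self.__class__.__name__).add(
            estimator=self.estimator,
            method_mapping=MethodMapping().add(caller="score", callee="predict"),
        )

    def __call__(self, estimator, X, y, **kwargs):
        # Validate and split the incoming metadata, then forward the part
        # requested by predict() -- e.g. sensitive_features.
        routed = process_routing(self, "score", **kwargs)
        y_pred = estimator.predict(X, **routed.estimator.predict)
        return self.metric(y, y_pred)


# Usage with the ThresholdOptimizer fitted in the first snippet:
scorer = RoutedScorer(accuracy_score, classifier)
scorer(classifier, X, y, sensitive_features=sensitive)
```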