Example with ranking metrics #21138
Comments
If I am not mistaken, we don't have any example tackling the problem of recommendation. It would be nice to have a full example with a predictive model and a way to evaluate it.
take
@acse-srm3018 are you still working on this? If you have a draft, feel free to open a draft PR, mark it WIP, and ask for feedback if you need any help.
@adrinjalali Since nothing seems to be happening, I would like to try it. I am quite new to scikit-learn though... could you tell me which file the new example should go in and which function(s) it should use?
You'd need to add a whole new example, @sveneschlbeck; I wouldn't say it's the easiest issue to work on. You need to familiarize yourself with the structure of the examples under the
@adrinjalali Thanks for the explanation. If I understand you correctly, should the example cover recommendations in general, or just the ranking metrics?
@adrinjalali Added a starter example (one I used myself some time back) about a simple content-based movie rec engine using scikit-learn in the new dir
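For context, a content-based engine like the one described could be sketched roughly as below. This is a minimal illustration with made-up movie titles and descriptions, not the code from the actual PR: it represents each description as a TF-IDF vector and recommends by cosine similarity.

```python
# Minimal content-based recommender sketch (illustrative data only).
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import linear_kernel

titles = ["The Matrix", "John Wick", "Toy Story", "Finding Nemo"]
descriptions = [
    "hacker discovers simulated reality action sci-fi",
    "retired hitman seeks revenge action thriller",
    "toys come to life animated family adventure",
    "clownfish searches the ocean animated family adventure",
]

# Represent each description as a TF-IDF vector.
tfidf = TfidfVectorizer()
X = tfidf.fit_transform(descriptions)

# Cosine similarity between all pairs; linear_kernel is equivalent to
# cosine similarity here because TF-IDF rows are L2-normalized.
sim = linear_kernel(X, X)

# Recommend the title most similar to "Toy Story", excluding itself.
query = titles.index("Toy Story")
ranked = sim[query].argsort()[::-1]
best = next(i for i in ranked if i != query)
print(titles[best])
```

The same pattern scales to a real dataset by swapping the toy lists for a plot-summary column of a movies DataFrame.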
As far as I know, ranking is not necessarily a primary evaluation metric in recommendation (see e.g. lightfm.evaluation): one cares more about how many relevant predictions are made in the first N than about whether a given one is first or third. Though I agree that it would still be good to have a recommendation example, maybe more for the top_k_accuracy_score metric (which also doesn't have any examples, apparently)? For DCG and NDCG, what comes to mind is more a search problem or a ranking problem proper. There are a few ranking problems on OpenML; maybe we could pick one (or find some other open dataset and put it there)? Though of course we can also illustrate them on a recommendation example. A side comment: https://www.openml.org/d/40916 looks interesting, but maybe too political. I do wonder what the partial dependence plot of "Dystopia" wrt "Happiness" looks like :)
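The distinction drawn above can be made concrete with the two scikit-learn metrics mentioned. The data below is invented for illustration; only the function names and signatures come from `sklearn.metrics`:

```python
# top_k_accuracy_score: recommendation-style "is the true item in the
# top k?"  ndcg_score: ranking-style "are the most relevant items
# earliest?"  Both exist in sklearn.metrics; the data is made up.
import numpy as np
from sklearn.metrics import top_k_accuracy_score, ndcg_score

# Four samples, three classes; per-class scores from some model.
y_true = np.array([0, 1, 2, 2])
y_score = np.array([
    [0.5, 0.2, 0.2],
    [0.3, 0.4, 0.2],
    [0.2, 0.4, 0.3],
    [0.7, 0.2, 0.1],
])
# The true class lands in the top 2 for three of the four samples.
topk = top_k_accuracy_score(y_true, y_score, k=2)

# One query with graded relevance for five documents; NDCG penalizes
# placing the rel=10 document last even though rel=5 is ranked first.
true_relevance = np.asarray([[10, 0, 0, 1, 5]])
scores = np.asarray([[0.1, 0.2, 0.3, 4.0, 70.0]])
ndcg = ndcg_score(true_relevance, scores)
print(topk, ndcg)
```

This makes the point above visible: top-k accuracy only asks whether the hit is inside the window, while NDCG also cares about position within it.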
@rth I agree... since having a recommendation engine example seemed to be in the interest of multiple people, I got to that first. The other points are also valid but (as you mentioned) not necessarily easy to combine with rec engines
Hi! Happy to pick this up. Just wanted to quickly confirm the direction before jumping in. It seems like there are actually two separate things here, and maybe it's worth splitting this into two different issues:
My suggestion: I could start with the recommender + top_k_accuracy_score example first, since that is what many users might expect when they come to this topic. After that, I can follow up with a ranking example using Kendall's tau and Spearman's rho. Let me know which direction makes more sense - happy to align on whatever would be most valuable for the readers!
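The second half of that plan, comparing a predicted ranking against a reference one, is already covered by SciPy (these are rank-correlation coefficients, not scikit-learn metrics). A tiny sketch with invented scores:

```python
# Kendall's tau and Spearman's rho on a made-up ranking example.
# Both come from scipy.stats and return (statistic, p-value).
from scipy.stats import kendalltau, spearmanr

# Reference relevance of 5 items and a model's predicted scores.
# The model swaps only the two least relevant items.
true_relevance = [5, 4, 3, 2, 1]
predicted_scores = [4.8, 4.1, 3.5, 1.9, 2.2]

tau, tau_p = kendalltau(true_relevance, predicted_scores)
rho, rho_p = spearmanr(true_relevance, predicted_scores)
print(tau, rho)
```

With exactly one discordant pair out of ten, tau = 0.8, and the single rank swap gives rho = 0.9, which is the kind of interpretation the ranking example could walk readers through.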
Let's tag @lorentzenchr here, so he can evaluate your approach.
@StefanieSenger In the meantime I will get started with laying down a rough structure for the example, dataset, etc. (stuff that would remain largely constant irrespective of the approach we take for this one)
Hey, just getting this thread going again
@shivamchhuneja feel free to fire up a pull request with your ideas and we can take the idea from there.
@adrinjalali thanks, sounds great, will set up the base ideas in a PR within the coming week :)
Describe the issue linked to the documentation
Some of the metrics in #2805 were implemented in #7739.
Suggest a potential alternative/fix
It would be nice to add (or extend) an example showing the usage of those ranking metrics, together with, e.g., Kendall's tau and Spearman's rho.
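The metrics the issue refers to are `dcg_score` and `ndcg_score` in `sklearn.metrics`. A minimal sketch of their usage, with illustrative relevance values only:

```python
# DCG and NDCG on a single query with graded relevance labels.
# Both functions exist in sklearn.metrics; the data is made up.
import numpy as np
from sklearn.metrics import dcg_score, ndcg_score

# Graded relevance of 4 documents, and the scores a model assigned.
true_relevance = np.asarray([[3, 2, 3, 0]])
model_scores = np.asarray([[0.9, 0.7, 0.5, 0.2]])

# DCG sums relevance discounted by log2(rank + 1) in score order;
# NDCG divides by the DCG of the ideal (relevance-sorted) ordering.
dcg = dcg_score(true_relevance, model_scores)
ndcg = ndcg_score(true_relevance, model_scores)
print(dcg, ndcg)
```

An example could then contrast these position-sensitive scores with Kendall's tau and Spearman's rho from `scipy.stats`, which measure rank correlation without a top-weighted discount.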