SAGE: Early stopping and convergence checks #29
Merged
The API for convergence tracking is not great yet, but it works, kind of. The early stopping bit still needs some improvement because I found it quite hard to get it to stop, but in general I guess it works.
This now works by taking the number of permutations and creating a checkpoint after some number of permutations (default 5), after which convergence is checked and the current state of the SAGE values is written to a history table.
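To make the checkpoint mechanism concrete, here is a minimal sketch of the loop described above. This is illustrative Python, not the package's (R) implementation: the function names, the standard-error-based convergence criterion, and the `se_threshold` / `check_interval` parameters are all assumptions chosen for the sketch.

```python
import statistics

def sage_with_checkpoints(features, sample_marginal_contribs,
                          n_permutations=100, check_interval=5,
                          se_threshold=0.01):
    """Sketch: accumulate per-permutation marginal contributions,
    checkpoint every `check_interval` permutations by recording the
    running means in a history table, and stop early once the standard
    error of every feature's estimate falls below `se_threshold`.
    `sample_marginal_contribs` stands in for one permutation pass."""
    history = []                       # one row of running means per checkpoint
    contribs = {f: [] for f in features}
    means = {}
    for p in range(1, n_permutations + 1):
        for f, value in sample_marginal_contribs(features).items():
            contribs[f].append(value)
        if p % check_interval == 0:
            means = {f: statistics.fmean(v) for f, v in contribs.items()}
            history.append({"permutation": p, **means})
            # One possible convergence check: standard error of the mean
            # per feature; stop when all features look stable.
            ses = [statistics.stdev(v) / len(v) ** 0.5
                   for v in contribs.values()]
            if max(ses) < se_threshold:
                break
    return means, history
```

The history table then gives one row per checkpoint, which is what the convergence plots below are drawn from.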
I realize this is now somewhat orthogonal to the `$scoretable` in the other methods' implementations, where we also keep per-resampling, per-permutation importance scores, but in the case of SAGE I have so far only ever used holdout due to the general slowness of things, and the number of permutations here has a different meaning than the number of permutation iterations in PFI, for example. It would be nice to unify the API and semantics a bit.
As for SAGE convergence, I wondered whether `max_reference_size` has a large impact on convergence, but it looks like it's not huge. Here are `friedman1`'s first two features (both should have the same contribution, since they only enter through an interaction effect) for `max_reference_size` $\in \{100, 500\}$ after 100 permutations each (quite a lot). In either case I'd argue that 50 permutations seem fine.
Here's the same for the most important feature (including an intermediate reference size):
Here it looks like a few more permutations wouldn't have hurt, but I'm not sure what to conclude about the reference size here.
I should also note that computing SAGE for this task with `max_reference_size = 300` and `n_permutations = 100` took 7 hours or so using the ranger learner with 5 threads. It's just... a lot of stuff to do. Maybe I need to parallelize batchwise predictions after all?