Fix `KBinsDiscretizer` uniform strategy : wrong bin assignment caused by floating point errors when using `uniform` strategy #30962

Rishab260 · 2025-03-08T05:55:47Z

Reference Issues/PRs

What does this implement/fix? Explain your changes.

Fix precision issues in `KBinsDiscretizer` with the `uniform` strategy by replacing `np.linspace` (which relies on binary floating-point arithmetic) with Python's `decimal` module for bin edge computation.

Previously, floating-point rounding errors could shift a computed bin edge slightly, so a value lying exactly on an edge could be assigned to the wrong bin.
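To illustrate the kind of error described here, the following is a minimal standard-library sketch, not the code from this PR. The helper names `uniform_edges_float` and `uniform_edges_decimal` are hypothetical, and the right-sided `bisect_right` lookup over the interior edges is an assumption about how values are mapped to bins.

```python
from bisect import bisect_right
from decimal import Decimal


def uniform_edges_float(col_min, col_max, n_bins):
    """Compute equally spaced edges with plain float arithmetic."""
    step = (col_max - col_min) / n_bins
    return [col_min + i * step for i in range(n_bins + 1)]


def uniform_edges_decimal(col_min, col_max, n_bins):
    """Compute the same edges in decimal arithmetic, then convert to float."""
    lo, hi = Decimal(str(col_min)), Decimal(str(col_max))
    step = (hi - lo) / n_bins
    return [float(lo + i * step) for i in range(n_bins + 1)]


float_edges = uniform_edges_float(0.0, 1.0, 10)
decimal_edges = uniform_edges_decimal(0.0, 1.0, 10)

print(float_edges[3])    # 0.30000000000000004
print(decimal_edges[3])  # 0.3

# Assumed bin lookup: right-sided search over the interior edges only.
bin_with_float_edges = bisect_right(float_edges[1:-1], 0.3)
bin_with_decimal_edges = bisect_right(decimal_edges[1:-1], 0.3)

print(bin_with_float_edges)    # 2
print(bin_with_decimal_edges)  # 3
```

With the float edges, the value 0.3 falls just below the computed edge and lands one bin too low. With the decimal edges it lands in the bin that starts at 0.3.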

Changes:

- Replace `np.linspace` with `decimal` for bin edge computation in the `uniform` strategy.
- Add test `test_kbinsdiscretizer_uniform_strategy`, which checks:
  - Correct bin edges
  - Uniform bin widths
  - Transformation and inverse transformation work correctly
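The three checks listed above could look roughly like the following standard-library sketch. This is not the test added in the PR. The helpers `fit_uniform_edges`, `transform`, and `inverse_transform` are hypothetical stand-ins, and mapping each bin back to its midpoint is an assumption based on the "expected midpoints" commit message.

```python
from bisect import bisect_right
from decimal import Decimal
from math import isclose


def fit_uniform_edges(values, n_bins):
    """Hypothetical stand-in for fitting uniform edges in decimal arithmetic."""
    lo = Decimal(str(min(values)))
    hi = Decimal(str(max(values)))
    step = (hi - lo) / n_bins
    return [float(lo + i * step) for i in range(n_bins + 1)]


def transform(values, edges):
    """Map each value to a bin index via a right-sided search on interior edges."""
    interior = edges[1:-1]
    return [bisect_right(interior, v) for v in values]


def inverse_transform(bins, edges):
    """Map each bin index back to the midpoint of its bin (assumed behaviour)."""
    return [(edges[b] + edges[b + 1]) / 2 for b in bins]


values = [0.0, 0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9, 1.0]
edges = fit_uniform_edges(values, n_bins=5)

# 1. Correct bin edges.
assert edges == [0.0, 0.2, 0.4, 0.6, 0.8, 1.0]

# 2. Uniform bin widths (compared with a tolerance, since the edges are floats).
widths = [hi - lo for lo, hi in zip(edges, edges[1:])]
assert all(isclose(w, 0.2) for w in widths)

# 3. Transformation and inverse transformation.
bins = transform(values, edges)
assert bins == [0, 0, 1, 1, 2, 2, 3, 3, 4, 4, 4]

midpoints = inverse_transform(bins, edges)
expected = [0.1, 0.1, 0.3, 0.3, 0.5, 0.5, 0.7, 0.7, 0.9, 0.9, 0.9]
assert all(isclose(m, e) for m, e in zip(midpoints, expected))
```

The edge comparison uses exact equality because the decimal results convert to the nearest float for each literal. The widths and midpoints are compared with a tolerance because they are derived with float arithmetic.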

Any other comments?

Any suggestions or feedback are highly appreciated. Thanks.


github-actions · 2025-03-08T05:57:03Z

✔️ Linting Passed

All linting checks passed. Your pull request is in excellent shape! ☀️

Generated for commit: 4bd2f72.


Rishab260 · 2025-03-15T14:10:41Z

Hi @ogrisel, could you please take a look and let me know if this fix looks good to you? Thanks.

Rishab260 · 2025-03-24T18:52:26Z

Hi @thomasjpfan, could you please take a look at these changes? Thanks.

Rishab260 added 4 commits March 8, 2025 10:17

use fractions in fit for uniform strategy

e87b671

add test_kbinsdiscretizer_uniform_strategy test

cc05622

make test concise and add check for expected midpoints

a2dbe9d

remove conversion to python's float, np.float64 conversion will happe…

1904d3b

…n later

github-actions bot added the module:preprocessing label Mar 8, 2025

Rishab260 and others added 4 commits March 8, 2025 11:44

Merge branch 'main' into kbinsfix

10779df

use decimal instead of fractions

0bb57e5

Merge branch 'kbinsfix' of github.com:Rishab260/scikit-learn into kbi…

f1cebfc

…nsfix

subsampe = None

0105f73

Rishab260 added 3 commits March 18, 2025 06:39

Merge branch 'main' into kbinsfix

6bab20c

Merge branch 'main' into kbinsfix

0940689

Merge branch 'main' into kbinsfix

4bd2f72
