Thanks to visit codestin.com
Credit goes to github.com

Skip to content

discrete branch: quantile and kmeans strategies #9338

@jnothman

Description

@jnothman

We recently merged a discretizing transformer into the discrete branch (see diff between that branch and master).

The current discretizer there makes fixed width bins based on each feature's input range. While we could do quantile bins by preceding this in a pipeline with a QuantileTransformer, this is inefficient. Quantile bins is quite a common approach to discretizing.

Another approach to discretizing performs KMeans clustering (or perhaps even another clusterer or mixture model) in each feature.

We should either have separate classes for these discretization approaches, or provide a strategy= parameter to the current proposed discretizer.

To dear contributor: Make sure to submit a pull request to the discrete branch.

Metadata

Metadata

Assignees

No one assigned

    Labels

    EnhancementModerateAnything that requires some knowledge of conventions and best practiceshelp wanted

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions