Open
Description
Missing values, represented as NaN, could be treated as a separate category in discretization. This seems much more sensible to me than imputing the missing data then discretizing.
In accordance with recent changes to other preprocessing, NaNs would simply be ignored in calculating fit
statistics, and would be passed on to the encoder in transform
. I can't recall if we're handling this sensibly in OneHotEncoder yet...
Metadata
Metadata
Assignees
Type
Projects
Status
No status