-
-
Notifications
You must be signed in to change notification settings - Fork 25.9k
ENH allows to overwrite read_csv parameter in fetch_openml #26433
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
ENH allows to overwrite read_csv parameter in fetch_openml #26433
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I left some minor comments about the docstring, otherwise LGTM.
I am going to open a subsequent PR to illustrate what this PR is allowing for. |
Co-authored-by: Thomas J. Fan <[email protected]>
I open #26436 which shows how useful this feature will be for copping with the Pandas breaking change. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Otherwise LGTM.
Co-authored-by: Adrin Jalali <[email protected]>
…arn#26433) Co-authored-by: Thomas J. Fan <[email protected]> Co-authored-by: Adrin Jalali <[email protected]>
…arn#26433) Co-authored-by: Thomas J. Fan <[email protected]> Co-authored-by: Adrin Jalali <[email protected]>
Reopening #25488
Context
As pointed out in #25878 (comment), pandas introduced a breaking change from 1.X to 2.X to consider
None
as a missing value by default. If we want to avoid suffering from the breaking change and also not having a different behaviour depending on the pandas version, then we can set the defaultna_values
inread_csv
to the previous 1.X values, and announce a future change.However, to silence this
FutureWarning
, we need our user to provide the future defaultna_values
and thus we need to exposeread_csv_kwargs
.This PR is the part that exposes
read_csv_kwargs
to our user.