Thanks to visit codestin.com
Credit goes to github.com

Skip to content

FIX only consider "?" as missing marker as per ARFF specs #26551

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 2 commits into from
Jun 9, 2023

Conversation

glemaitre
Copy link
Member

@glemaitre glemaitre commented Jun 9, 2023

closes #26436

Alternative proposed in #26436. Strictly following the ARFF specs, only "?" is a missing values marker (c.f. https://www.cs.waikato.ac.nz/~ml/weka/arff.html).

This PR proposes to follow these specs and does not use other default missing values marker defined by pandas.

If a user wants to overwrite this behaviour, the parameter read_csv_kwargs will help.

@glemaitre glemaitre changed the title FIX only consider ? as missing marker as per ARFF specs FIX only consider "?" as missing marker as per ARFF specs Jun 9, 2023
@glemaitre
Copy link
Member Author

ping @ogrisel @adrinjalali @jeremiedbb

@glemaitre glemaitre added this to the 1.3 milestone Jun 9, 2023
Copy link
Member

@adrinjalali adrinjalali left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

Copy link
Member

@jeremiedbb jeremiedbb left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@jeremiedbb jeremiedbb merged commit b044ef8 into scikit-learn:main Jun 9, 2023
manudarmi pushed a commit to primait/scikit-learn that referenced this pull request Jun 12, 2023
REDVM pushed a commit to REDVM/scikit-learn that referenced this pull request Nov 16, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants