-
-
Notifications
You must be signed in to change notification settings - Fork 26k
MNT: mark FeatureHasher and HashingVectorizer with requires_fit=False… #31557
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
MNT: mark FeatureHasher and HashingVectorizer with requires_fit=False… #31557
Conversation
Thanks for helping clean this up! Looks good to me. Let's see what the testing robots have to say. @adrinjalali could you explain in a bit more detail what you meant with "require_fit=False can be tricky if there is validation"? |
So because those estimators do not accept normal NumPy array (they are vectorizer) they are not covered by the common test (tests run in We therefore need non-regression tests for the |
As in, do we expect the transformer to do any kind of feature count / feature name consistency between |
As far that I recall, we defined stateless as: if you call |
Time to update the glossary entry? Lines 788 to 795 in ffe9be7
|
It looks like the current tests already cover calling |
Reference Issues/PRs
Closes #30689
What does this implement/fix? Explain your changes.
Adds
requires_fit=False
tag to these classesAny other comments?
N/A
@betatim @glemaitre