[MRG] Use _check_sample_weight in BaseForest #15492

ritalulu · 2019-11-02T20:05:41Z

Reference Issues/PRs

What does this implement/fix? Explain your changes.

Use _check_sample_weight to validate sample_weight in BaseForest.
Fixes trailing whitespace issues marked by flake8.

Worked on during the WIMLDS Bay Area Sprint with @lakrish
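
For readers unfamiliar with the helper, here is a rough sketch of the kind of validation a shared sample-weight check centralizes. It is a simplified, hypothetical stand-in written in pure Python, not scikit-learn's implementation: the name check_sample_weight_sketch, its n_samples parameter, and the exact error message are assumptions made for illustration.

```python
from numbers import Real


def check_sample_weight_sketch(sample_weight, n_samples):
    """Hypothetical helper: return n_samples float weights or raise ValueError."""
    if sample_weight is None:
        # No weights given: assume every sample counts equally.
        return [1.0] * n_samples
    if isinstance(sample_weight, Real):
        # A scalar weight is repeated for every sample.
        return [float(sample_weight)] * n_samples
    # Otherwise expect one weight per sample.
    weights = [float(w) for w in sample_weight]
    if len(weights) != n_samples:
        raise ValueError(
            "sample_weight has %d entries, expected %d"
            % (len(weights), n_samples)
        )
    return weights
```

Routing every estimator through one such helper means None, scalars, and wrong-length inputs are handled the same way everywhere, which is presumably the motivation for this change.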

TomDLT · 2019-11-02T21:31:19Z

sklearn/ensemble/_forest.py

@@ -249,8 +249,7 @@ def decision_path(self, X):
        X = self._validate_X_predict(X)
        indicators = Parallel(n_jobs=self.n_jobs, verbose=self.verbose,
                              **_joblib_parallel_args(prefer='threads'))(
-            delayed(tree.decision_path)(X,
-                                     check_input=False)
+            delayed(tree.decision_path)(X, check_input=False)


Thanks for fixing all these cosmetic issues, but it adds some noise both to the review process and to the history of this file, and it may lead to unnecessary conflicts with other pull-requests.
Therefore, we usually prefer changing only the strict minimum.

ok to merge or do you want to undo these?

No strong feelings, just wanted to mention it to avoid it in the future.

that's what I figured, just wanted to confirm :)

TomDLT · 2019-11-02T23:16:37Z

sklearn/ensemble/_forest.py

@@ -548,13 +548,16 @@ def _validate_y_class_weight(self, y):
            if isinstance(self.class_weight, str):
                if self.class_weight not in valid_presets:
                    raise ValueError('Valid presets for class_weight include '
-                                     '"balanced" and "balanced_subsample". Given "%s".'
+                                     '"balanced" and "balanced_subsample". \


If we are to keep these changes, I think we prefer to avoid backslashes \.
For strings you can close the string and reopen it below:

    raise ValueError("blah \
                     blah.")
    # can be rewritten into
    raise ValueError("blah "
                     "blah.")
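
To illustrate why the backslash form is discouraged, here is a small sketch. The two function names and the shortened message are made up for this example. A backslash inside a string literal drops the newline but keeps the indentation of the continuation line in the string, while adjacent literals are joined without it.

```python
def message_with_backslash(value):
    # Backslash inside the literal: the newline is removed, but the
    # leading spaces of the next line become part of the message.
    return 'Valid presets include "balanced". \
            Given "%s".' % value


def message_with_adjacent_literals(value):
    # Adjacent literals are concatenated by the parser, so the
    # indentation between them never reaches the message.
    return ('Valid presets include "balanced". '
            'Given "%s".' % value)


if __name__ == "__main__":
    # repr() makes the stray run of spaces in the first message visible.
    print(repr(message_with_backslash("foo")))
    print(repr(message_with_adjacent_literals("foo")))
```

The second form also keeps each line under the flake8 line-length limit without changing the text users see.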

TomDLT · 2019-11-02T23:17:03Z

sklearn/ensemble/_forest.py

                                     % self.class_weight)
                if self.warm_start:
-                    warn('class_weight presets "balanced" or "balanced_subsample" are '
+                    warn('class_weight presets "balanced" or \


Same here, avoid \

adrinjalali

otherwise LGTM, thanks @ritalulu

adrinjalali · 2019-11-04T16:17:02Z

sklearn/ensemble/_forest.py

+                         '"balanced" weights, use compute_class_weight\
+                         ("balanced", '


the same comment regarding \ applies here.

adrinjalali · 2019-11-06T10:31:10Z

There are still linting issues. You can run ./build_tools/circle/linting.sh from your scikit-learn root folder to see them locally. You'll need flake8 installed.

Use _check_sample_weight in BaseForest

ce5ded5

ritalulu changed the title from "Fix: Use _check_sample_weight in BaseForest" to "[MRG] Use _check_sample_weight in BaseForest" on Nov 2, 2019

TomDLT reviewed Nov 2, 2019

amueller approved these changes Nov 2, 2019

TomDLT reviewed Nov 2, 2019

Correcting for PR comments

9cf8ea8

adrinjalali approved these changes Nov 4, 2019

Correcting for PR comments

1bc519d

Correcting for linting issues

958970a

qinhanmin2014 approved these changes Nov 12, 2019

qinhanmin2014 merged commit 0170009 into scikit-learn:master Nov 12, 2019

panpiort8 pushed a commit to panpiort8/scikit-learn that referenced this pull request Mar 3, 2020

MNT Use _check_sample_weight in BaseForest (scikit-learn#15492)

90ceb6d
