[MRG + 1] Fix element-wise comparison for numpy #8011

aashil · 2016-12-08T01:48:25Z

Reference Issue

Working on #7994

What does this implement/fix? Explain your changes.

Any other comments?

Fix the element-wise comparision issue with numpy.

aashil · 2016-12-08T01:52:51Z

@lesteve: I followed your instructions which gave me the below error in this particular line:
multioutput = check_array(multioutput, ensure_2d=False)

Error:

TypeError: Singleton array array('variance_weighted', 
          dtype='|S17') cannot be considered a valid collection.

On a side note, what do you mean when you say install numpy from master ?

lesteve · 2016-12-08T08:09:30Z

Error:

TypeError: Singleton array array('variance_weighted',
dtype='|S17') cannot be considered a valid collection.

Hmmm that means that somewhere the string 'variance_weighted' is turned into an array ... so the string_types clause is skipped. You will need to either:

understand where in our code we do this conversion 'variance_weighted' -> np.array('variance_weighted') and see how easy it is to get rid of it
treat the case of singleton string arrays in the string_types clause

My preference is for 1.

On a side note, what do you mean when you say install numpy from master ?

That means installing the numpy development version because that is what the original issue was about. You can still work on this issue without it but the only way to make sure the original issue is fixed is through Travis. This can be a bit cumbersome and frustrating because the feedback loop is a lot longer that testing things locally.

dalmia · 2016-12-09T04:13:34Z

sklearn/metrics/regression.py

@@ -90,7 +90,10 @@ def _check_reg_targets(y_true, y_pred, multioutput):
    n_outputs = y_true.shape[1]
    multioutput_options = (None, 'raw_values', 'uniform_average',
                           'variance_weighted')
-    if multioutput not in multioutput_options:
+    if isinstance(multioutput, string_types) and multioutput not in multioutput_options:


When any value among raw_values, uniform_average or variance_weighted is passed, this condition is not met since it checks for invalid multioutput_option. We don't have any check for multioutput being a string type and a valid multioutput_option. That check needs to be added.

I will refine the if else to make sure we account for the correct multioutput check there.

dalmia · 2016-12-09T04:14:49Z

sklearn/metrics/regression.py

@@ -90,7 +90,10 @@ def _check_reg_targets(y_true, y_pred, multioutput):
    n_outputs = y_true.shape[1]
    multioutput_options = (None, 'raw_values', 'uniform_average',
                           'variance_weighted')
-    if multioutput not in multioutput_options:
+    if isinstance(multioutput, string_types) and multioutput not in multioutput_options:
+        raise ValueError("Invalid multioutput value")


I feel it's better to use Invalid multioutput option here.

Ok. If that's what you prefer.

dalmia · 2016-12-09T04:17:07Z

However, it does seem natural that check_array should have a default behavior if the array passed is a string, which it currently doesn't seem to have. Your views @lesteve?

aashil · 2016-12-09T05:27:08Z

@lesteve I think the string_types clause is not skipped but within the check_array condition it converts the string variance_weighted to an array using np.asarray("variance_weighted"). Later we check if that array is singleton or not and raise the above error if it is singleton. @dalmia Would you like to help me out here ?

dalmia · 2016-12-09T05:37:11Z

Sure @aashil. From what I see, you need to add another check if multioutput is among the valid multioutput_options and do the corresponding functionality there. So just change:

elif multioutput is not None:

To:

elif multioutput is not None and multioutput not in multioutput_options:

That should be all.

aashil · 2016-12-09T05:50:19Z

@dalmia I believe you mean elif multioutput is not None and multioutput in multioutput_options: But the real problem is in check_array() method inside the elif clause which is not happy with the array being singleton. Take a look at my latest commit.

dalmia · 2016-12-09T06:11:11Z

@aashil Sorry I made a small mistake. Also, I don't think I conveyed properly what I intended to say. Let me elaborate what I intend to say. _check_reg_targets intends to return a proper value of multioutput. So, if a valid multioutput string is passed as a parameter, it won't be modified. However, if it's not a string, only then it needs to go for check_array.
The whole patch then becomes:

multioutput_options = (None, 'raw_values', 'uniform_average',
                           'variance_weighted')
   # If it is a string, but not a valid option, raise an error
    if isinstance(multioutput, string_types) and multioutput not in multioutput_options:
        raise ValueError("Invalid multioutput option")
   # If it is not a string then check for the validity of the array
    elif multioutput is not None and not isinstance(multioutput, string_types):
        multioutput = check_array(multioutput, ensure_2d=False)
        if n_outputs == 1:

aashil · 2016-12-09T06:20:31Z

@dalmia Ahh, that makes it so clear. Thank you.

* Refactored the if clause and add proper check for valid strings. * Fix PEP8 errors.

dalmia · 2016-12-09T07:15:21Z

Sure, happy to help :)

lesteve · 2016-12-09T10:33:55Z

@aashil I pushed a cosmetic change (I feel the if clause logic is more readable this way) and a test for the error message, have a look at a6efd19.

amueller · 2016-12-09T17:40:09Z

LGTM

aashil · 2016-12-09T18:48:12Z

Tested locally. LGTM

lesteve · 2016-12-12T13:56:00Z

OK, merging then, thanks a lot @aashil!

Was causing "ValueError: The truth value of an array with more than one element is ambiguous"

aashil changed the title ~~[WIP]~~ [WIP] Fix element-wise comparison for numpy Dec 8, 2016

dalmia suggested changes Dec 9, 2016

View reviewed changes

aashil force-pushed the dev-fix-numpy-broken branch from 1e8d7c5 to 019f892 Compare December 9, 2016 06:33

[MRG] Fix the element-wise comparison error thrown by numpy.

7ef3323

* Refactored the if clause and add proper check for valid strings. * Fix PEP8 errors.

aashil force-pushed the dev-fix-numpy-broken branch from 019f892 to 7ef3323 Compare December 9, 2016 06:56

dalmia approved these changes Dec 9, 2016

View reviewed changes

aashil changed the title ~~[WIP] Fix element-wise comparison for numpy~~ [MRG] Fix element-wise comparison for numpy Dec 9, 2016

Cosmetic changes and added test for exception message

a6efd19

amueller changed the title ~~[MRG] Fix element-wise comparison for numpy~~ [MRG + 1] Fix element-wise comparison for numpy Dec 9, 2016

lesteve merged commit 6a42ea2 into scikit-learn:master Dec 12, 2016

amueller mentioned this pull request Dec 12, 2016

numpy dev broken again? #7994

Closed

sergeyf pushed a commit to sergeyf/scikit-learn that referenced this pull request Feb 28, 2017

[MRG + 1] Fix failure on numpy master (scikit-learn#8011)

27fa08e

Was causing "ValueError: The truth value of an array with more than one element is ambiguous"

Przemo10 mentioned this pull request Mar 17, 2017

update fork (#1) #8606

Closed

Sundrique pushed a commit to Sundrique/scikit-learn that referenced this pull request Jun 14, 2017

[MRG + 1] Fix failure on numpy master (scikit-learn#8011)

42115c5

Was causing "ValueError: The truth value of an array with more than one element is ambiguous"

NelleV pushed a commit to NelleV/scikit-learn that referenced this pull request Aug 11, 2017

[MRG + 1] Fix failure on numpy master (scikit-learn#8011)

ed5897c

Was causing "ValueError: The truth value of an array with more than one element is ambiguous"

paulha pushed a commit to paulha/scikit-learn that referenced this pull request Aug 19, 2017

[MRG + 1] Fix failure on numpy master (scikit-learn#8011)

64e981a

Was causing "ValueError: The truth value of an array with more than one element is ambiguous"

maskani-moh pushed a commit to maskani-moh/scikit-learn that referenced this pull request Nov 15, 2017

[MRG + 1] Fix failure on numpy master (scikit-learn#8011)

1036b6d

Was causing "ValueError: The truth value of an array with more than one element is ambiguous"

Uh oh!

[MRG + 1] Fix element-wise comparison for numpy #8011

[MRG + 1] Fix element-wise comparison for numpy #8011

Uh oh!

Conversation

aashil commented Dec 8, 2016

Reference Issue

What does this implement/fix? Explain your changes.

Any other comments?

Uh oh!

aashil commented Dec 8, 2016

Uh oh!

lesteve commented Dec 8, 2016 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

dalmia Dec 9, 2016 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

aashil Dec 9, 2016

Choose a reason for hiding this comment

Uh oh!

dalmia Dec 9, 2016

Choose a reason for hiding this comment

Uh oh!

aashil Dec 9, 2016

Choose a reason for hiding this comment

Uh oh!

dalmia commented Dec 9, 2016

Uh oh!

aashil commented Dec 9, 2016

Uh oh!

dalmia commented Dec 9, 2016

Uh oh!

aashil commented Dec 9, 2016 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

dalmia commented Dec 9, 2016

Uh oh!

aashil commented Dec 9, 2016

Uh oh!

dalmia commented Dec 9, 2016

Uh oh!

lesteve commented Dec 9, 2016

Uh oh!

amueller commented Dec 9, 2016

Uh oh!

aashil commented Dec 9, 2016

Uh oh!

lesteve commented Dec 12, 2016

Uh oh!

Uh oh!

lesteve commented Dec 8, 2016 •

edited

Loading

dalmia Dec 9, 2016 •

edited

Loading

aashil commented Dec 9, 2016 •

edited

Loading