Codestin Search App

ppwwyyxx · 2017-12-11T08:20:04Z

Cudnn kernels doesn't work for empty input tensors.
This PR adds support for empty input tensor for FusedBatchNorm,FusedBatchNormGrad,Conv2DBackpropFilter, and cudnn pooling. (fix #14657)

tensorflow-jenkins · 2017-12-11T08:20:07Z

Can one of the admins verify this patch?

yzhwang

Thanks for the PR!
Could you add according test cases for empty input tensor for all these ops?
fused_batch_norm test is at here:https://github.com/tensorflow/tensorflow/blob/master/tensorflow/python/ops/nn_fused_batchnorm_test.py
Others are located at tensorflow/python/kernel_tests.

yzhwang · 2017-12-19T21:45:58Z

    if (filter_shape.num_elements() == 0) {
      return;
    }
+    if (input.shape().num_elements() == 0) {


Could you also add this to conv_input_filter_ops.cc?
Also, could you make the comments for this consistent by saying something like: if there is nothing to cmpute, return empty tensors as the output.

You mean conv_grad_input_ops.cc? It has handled this correctly.
I'll add comments.

ppwwyyxx · 2017-12-20T00:16:09Z

When input tensor has zero elements, the reference BN forward implementation in the test script gives NaN for mean/variance, however I return zeros.
NaN seems to make more sense in terms of math, though it will then require some special treatment in the actual training. If there is no objections I'll switch to NaNs.

yzhwang · 2017-12-20T00:22:55Z

@zhangyaobit Could you comment on this: #15264 (comment) please?

yzhwang

LGTM.

yzhwang · 2017-12-20T16:58:02Z

@tensorflow-jenkins test this please

ppwwyyxx · 2017-12-20T18:01:45Z

I haven't switched from returning zeros to returning NaNs, so the test is failing. I'll do that later

yzhwang · 2017-12-20T18:09:01Z

Let's wait for @zhangyaobit 's comment on this first then.

yzhwang

Undo LGTM for now until fix fused_batch_norm test.

zhangyaobit · 2017-12-20T18:26:24Z

@ppwwyyxx, could you comment on the use case of an empty input (e.g. [0, 64, 64, 3])? Should we require the input is non-empty?

ppwwyyxx · 2017-12-20T19:04:55Z

In object detection we may run CNN on patches (object candidates) cropped from an image with predicted boxes. When the image has no candidates we'll get zero patches. In training we can filter out these data but still can't avoid it in testing. A workaround is to use tf.cond but I hope the op can support it by itself.

In fact, I just found that the existing CPU (eigen) implementation of fused_batch_norm can work with empty input and returns NaNs for mean/variance.

zhangyaobit · 2017-12-20T21:02:20Z

Thanks! This sounds good. Could you make the behavior of GPU implementation consistent with Eigen (return NaNs)? Please let me know once the PR is ready for review.

ppwwyyxx · 2017-12-20T23:39:48Z

@zhangyaobit The changes have been made. Could you review it when you have a time? Thanks!

yzhwang

LGTM.

yzhwang · 2017-12-20T23:49:01Z

            << " offset shape: " << offset.shape().DebugString()
            << " tensor format: " << tensor_format;

+    // If input is empty, weturn NaN mean/variance


s/weturn/return.

zhangyaobit

Thanks!

zhangyaobit · 2017-12-21T23:59:36Z

@tensorflow-jenkins test this please

ppwwyyxx · 2017-12-23T15:37:21Z

The implementation of FillFunctor, SetZeroFunctor, etc, are split in two bazel targets: :fill_functor and :constant_op. However :constant_op depends on a lot of stuff: it depends on :transpose_functor which depends on conv2d (I saw a TODO for this by @yzhwang). But conv2d_grad_filter needs to use SetZeroFunctor which ends up being a cyclic reference.

ppwwyyxx · 2017-12-23T21:42:05Z

-    deps = ARRAY_DEPS,
+    deps = [
+        "//tensorflow/core:array_grad",
+        "//tensorflow/core:array_ops_op_lib",
+        "//tensorflow/core:framework",
+        "//tensorflow/core:lib",
+        "//third_party/eigen3",
+        ":bounds_check",
+        ":fill_functor",
+        ":ops_util",
+    ],

Cherry-picking the dependencies seems to make this PR build, but doesn't sound like an ideal solution. In general I guess functors should not depend on ops, but here fill_functor is actually in :constant_op, and :transpose_functor depends on :conv_ops.

drpngx · 2017-12-26T02:33:12Z

Jenkins, test this please.

ppwwyyxx · 2017-12-26T10:48:18Z

Thanks @drpngx for help! However I haven't yet fixed the cyclic dependency error mentioned above. I
think a proper fix would be one of the following:

Move GPU implementation of fill_functor to target :fill_functor.
Don't let :transpose_functor depend on :conv_ops.

The first one seems to be within my reach. I can give it a try but not sure if there is any reason why this is not done before.

drpngx · 2017-12-26T19:11:10Z

@yifeif for some reason, this is stuck with kokoro-run. There are others PRs as well.

yifeif · 2017-12-27T18:14:58Z

Ah if a PR has ran Kokoro tests before, it will need the force-run tag :).

drpngx · 2017-12-27T22:51:51Z

Oh, makes sense, of course.

drpngx · 2017-12-27T22:52:57Z

@ppwwyyxx there are some build breakages on GPU, could you check?

…2DBackpropFilter (fix tensorflow#14657)

ppwwyyxx · 2017-12-28T09:37:10Z

The build was failing because of the bazel dependency problem, which should've been fixed now after I moved implementations to :fill_functor target.

drpngx · 2017-12-28T18:55:59Z

Jenkins, test this please.

drpngx · 2017-12-28T19:21:14Z

/CC @gunan ran out of devmapper space on the Jenkins build.

devmapper: Thin Pool has 968455 free data blocks which is less than minimum required 983040 free data blocks. Create more free space in thin pool or use dm.min_free_space option to change behavior
ERROR: docker build failed. Dockerfile is at /var/lib/jenkins/workspace/tensorflow-pull-requests-cpu-python3/tensorflow/tools/ci_build/Dockerfile.cpu

Jenkins, test this please.

ppwwyyxx · 2017-12-28T23:59:03Z

Why is 'XLA' build failing without details?

drpngx · 2017-12-29T00:02:44Z

OK, looks like @yifeif might have fixed it. We just ran out of space on the machine.

yifeif · 2017-12-29T00:02:59Z

@ppwwyyxx looks like an internal infra failure. I kicked it off again.

drpngx · 2017-12-29T00:38:44Z

Merged. Woohoo!

googlebot added the cla: yes label Dec 11, 2017

ppwwyyxx mentioned this pull request Dec 19, 2017

FusedBatchNorm & Conv2D backwards doesn't support zero batch size #14657

Closed

yzhwang self-assigned this Dec 19, 2017

yzhwang suggested changes Dec 19, 2017

View reviewed changes

yzhwang approved these changes Dec 20, 2017

View reviewed changes

yzhwang suggested changes Dec 20, 2017

View reviewed changes

yzhwang requested a review from zhangyaobit December 20, 2017 23:47

yzhwang reviewed Dec 20, 2017

View reviewed changes

zhangyaobit approved these changes Dec 21, 2017

View reviewed changes

yzhwang approved these changes Dec 22, 2017

View reviewed changes

yifeif added the kokoro:force-run Tests on submitted change label Dec 22, 2017

kokoro-team removed the kokoro:force-run Tests on submitted change label Dec 22, 2017

yzhwang mentioned this pull request Dec 22, 2017

could not set cudnn filter descriptor: CUDNN_STATUS_BAD_PARAM #5772

Closed

drpngx added awaiting testing (then merge) labels Dec 26, 2017

drpngx added the stat:awaiting response Status - Awaiting response from author label Dec 26, 2017

yifeif added kokoro:force-run Tests on submitted change and removed kokoro:run labels Dec 27, 2017

kokoro-team removed the kokoro:force-run Tests on submitted change label Dec 27, 2017

ppwwyyxx added 8 commits December 28, 2017 00:38

Support empty input tensor for FusedBatchNorm,FusedBatchNormGrad,Conv…

261951e

…2DBackpropFilter (fix tensorflow#14657)

Also fix pooling ops

422654b

Add some comments in ops

0dbe9b4

Add tests for conv/pooling/bn.

89d34c1

Return NaN mean/variance when input is empty

659a3fb

update comments

3e38bed

fix typo

208ab5a

Move fill_functor implementations to :fill_functor

8502d24

ppwwyyxx force-pushed the empty-input-tensor branch from e06cf96 to 8502d24 Compare December 28, 2017 08:42

drpngx added kokoro:force-run Tests on submitted change and removed stat:awaiting response Status - Awaiting response from author labels Dec 28, 2017

kokoro-team removed the kokoro:force-run Tests on submitted change label Dec 28, 2017

drpngx added the kokoro:force-run Tests on submitted change label Dec 29, 2017

kokoro-team removed the kokoro:force-run Tests on submitted change label Dec 29, 2017

drpngx merged commit 3a3b753 into tensorflow:master Dec 29, 2017

Conversation

ppwwyyxx commented Dec 11, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

tensorflow-jenkins commented Dec 11, 2017

Uh oh!

yzhwang left a comment

Choose a reason for hiding this comment

Uh oh!

yzhwang Dec 19, 2017

Choose a reason for hiding this comment

Uh oh!

ppwwyyxx Dec 19, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

ppwwyyxx commented Dec 20, 2017

Uh oh!

yzhwang commented Dec 20, 2017

Uh oh!

yzhwang left a comment

Choose a reason for hiding this comment

Uh oh!

yzhwang commented Dec 20, 2017

Uh oh!

ppwwyyxx commented Dec 20, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

yzhwang commented Dec 20, 2017

Uh oh!

yzhwang left a comment

Choose a reason for hiding this comment

Uh oh!

zhangyaobit commented Dec 20, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ppwwyyxx commented Dec 20, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

zhangyaobit commented Dec 20, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ppwwyyxx commented Dec 20, 2017

Uh oh!

yzhwang left a comment

Choose a reason for hiding this comment

Uh oh!

yzhwang Dec 20, 2017

Choose a reason for hiding this comment

Uh oh!

zhangyaobit left a comment

Choose a reason for hiding this comment

Uh oh!

zhangyaobit commented Dec 21, 2017

Uh oh!

ppwwyyxx commented Dec 23, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ppwwyyxx commented Dec 23, 2017

Uh oh!

drpngx commented Dec 26, 2017

Uh oh!

ppwwyyxx commented Dec 26, 2017

Uh oh!

drpngx commented Dec 26, 2017

Uh oh!

yifeif commented Dec 27, 2017

Uh oh!

drpngx commented Dec 27, 2017

Uh oh!

drpngx commented Dec 27, 2017

Uh oh!

ppwwyyxx commented Dec 28, 2017

Uh oh!

drpngx commented Dec 28, 2017

Uh oh!

drpngx commented Dec 28, 2017

Uh oh!

ppwwyyxx commented Dec 28, 2017

Uh oh!

ppwwyyxx commented Dec 11, 2017 •

edited

Loading

ppwwyyxx Dec 19, 2017 •

edited

Loading

ppwwyyxx commented Dec 20, 2017 •

edited

Loading

zhangyaobit commented Dec 20, 2017 •

edited

Loading

ppwwyyxx commented Dec 20, 2017 •

edited

Loading

zhangyaobit commented Dec 20, 2017 •

edited

Loading

ppwwyyxx commented Dec 23, 2017 •

edited

Loading