Conversation
Force-pushed from ffd6258 to c005f75
Added some documentation
Updated with a few suggestions from @jmancewicz - thanks!
Force-pushed from c005f75 to 9d7c33b
```python
found_softmax = False
for layer in network.layer:
    if layer.type == 'Softmax':
        found_softmax = True
        break
assert found_softmax, 'Your deploy network is missing a Softmax layer! Read the documentation for custom networks and/or look at the standard networks for examples.'
```
Did I miss the bit in the documentation where it is explained to the user that a softmax layer is needed to display a probability distribution?
Oh yeah, good call
Updated.
- Use layer.include.stage to specify train/val/deploy all in a single .prototxt description (see the sketch below)
- Stop automatically creating Softmax layers in deploy networks for classification jobs
- Only set inner_product_param.num_output for classification jobs if it was unset
- Update the standard networks
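For illustration, a minimal sketch of the kind of all-in-one description this enables, parsed with the Caffe protobuf bindings. The layer and blob names (`fc_out`, `softmax`, etc.) are hypothetical, not taken from the standard networks:

```python
from caffe.proto import caffe_pb2
from google.protobuf import text_format

# Hypothetical all-in-one fragment: one .prototxt serves train/val/deploy
# via include/exclude rules on each layer's stages.
ALL_IN_ONE = """
layer {
  name: "fc_out"
  type: "InnerProduct"
  bottom: "features"
  top: "fc_out"
  # num_output left unset so DIGITS can fill in the number of labels
}
layer {
  name: "loss"
  type: "SoftmaxWithLoss"
  bottom: "fc_out"
  bottom: "label"
  top: "loss"
  exclude { stage: "deploy" }  # train/val networks only
}
layer {
  name: "softmax"
  type: "Softmax"
  bottom: "fc_out"
  top: "softmax"
  include { stage: "deploy" }  # the deploy network's only output
}
"""

network = caffe_pb2.NetParameter()
text_format.Merge(ALL_IN_ONE, network)  # raises on malformed prototxt
```

Caffe then instantiates each network by filtering layers against the current NetState, so no Softmax layer has to be injected automatically.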
Force-pushed from 9d7c33b to 2f0873c
digits/model/tasks/caffe_train.py (outdated)
```python
# Check to see if top_k > num_categories
if (layer.accuracy_param.HasField('top_k') and
        layer.accuracy_param.top_k >= num_categories):
    self.logger.warning(
```
`self` is not defined here, and the layer isn't actually being removed.
Whoops, that was sloppy. Thanks for the review! Fixed.
Surprisingly, you can edit an array while enumerating over it. Python is so convenient sometimes.
```python
>>> a = range(10)
>>> for i, x in enumerate(a):
...     if (x%3 == 0):
...         del a[i]
...
>>> a
[1, 2, 4, 5, 7, 8]
```
Oops I misspoke. The way I implemented it will break if two subsequent layers both have an invalid top_k because the second one won't be processed.
I need to fix that tomorrow...
```python
>>> a = range(10)
>>> for i, x in enumerate(a):
...     print 'Processing %d (%d) ...' % (i, x)
...     if (x%3 == 0):
...         del a[i]
...
Processing 0 (0) ...
Processing 1 (2) ...
Processing 2 (3) ...
Processing 3 (5) ...
Processing 4 (6) ...
Processing 5 (8) ...
Processing 6 (9) ...
>>> a
[1, 2, 4, 5, 7, 8]
```
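A pattern that avoids the skipped-element pitfall is to rebuild the list, or iterate over a copy, instead of deleting from the sequence being enumerated. A minimal sketch in the same Python 2 style:

```python
>>> a = range(10)
>>> a = [x for x in a if x % 3 != 0]  # rebuild the list; nothing gets skipped
>>> a
[1, 2, 4, 5, 7, 8]
>>> b = range(10)
>>> for x in b[:]:  # iterate over a copy, mutate the original
...     if x % 3 == 0:
...         b.remove(x)
...
>>> b
[1, 2, 4, 5, 7, 8]
```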
Great PR! Looks good except for the small omission in the processing of accuracy layers. I've also pushed #632 to update examples.
```html
<li>
  The <i>num_output</i> for each <b>InnerProduct</b> layer which is a network output gets set to the number of labels in the chosen dataset.
  The Deploy network <b>must contain a Softmax layer</b>.
  This should produce the only network output.
```
I think GoogLeNet does not abide by this principle. Do the auxiliary classifiers need to be routed to Silence layers in the deploy network?
Actually it does. I pruned the auxiliary classifiers from the deploy network (solving #335).
Oh yes indeed, my mistake, sorry.
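For readers following the thread, this pruning uses the same stage mechanism as above; a sketch of how an auxiliary head can be kept out of the deploy network (the layer names and the 0.3 weight here are illustrative, not copied from the actual description):

```python
# Illustrative fragment: an auxiliary classifier's loss layer carrying a
# rule that prunes it from the deploy network.
AUX_LOSS = """
layer {
  name: "aux_loss"
  type: "SoftmaxWithLoss"
  bottom: "aux_fc"
  bottom: "label"
  top: "aux_loss"
  loss_weight: 0.3
  exclude { stage: "deploy" }  # auxiliary head exists only for training
}
"""
```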
Use layer stages for all-in-one nets
Since NVIDIA#628, DIGITS does not overwrite the number of outputs in the last fully-connected layer if it is already set. If the user accidentally specifies too large a `num_output`, inference will fail, as reported in NVIDIA#678. This change causes classification outputs to be ignored if there is no corresponding label. Close NVIDIA#678.
Close #605, close #623, close #335
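A minimal sketch of the guard that commit message describes, assuming a parsed `caffe_pb2.NetParameter`; the function is simplified and illustrative, not the actual DIGITS code:

```python
from caffe.proto import caffe_pb2

def fill_default_num_output(network, num_labels):
    """Set num_output only where the user left it unset.

    network: a caffe_pb2.NetParameter. The real logic additionally
    restricts this to InnerProduct layers that are network outputs.
    """
    for layer in network.layer:
        if layer.type == 'InnerProduct':
            # proto2 optional field: HasField() distinguishes unset from set,
            # so an explicit user value is never overwritten
            if not layer.inner_product_param.HasField('num_output'):
                layer.inner_product_param.num_output = num_labels
```

Because an explicit value survives, a user-supplied `num_output` larger than the number of labels reaches inference untouched, which is why the extra outputs now have to be ignored when there is no corresponding label.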
TODO: