Codestin Search App

ElaineBao · 2019-08-26T02:00:18Z

Description

add uint8 batchnorm, mkldnn implementation and test
@PatricZhao @ZhennanQin

Details

Usage

Check the doc in https://github.com/apache/incubator-mxnet/tree/master/example/quantization/README.md to quantize models and do inference.
Quantized bn will be used automatically when a bn operator cannot be fused.

Performance

In most cases, bn can be fused, so quantized bn is not introduced. In reset50 v2, some of the bn operators are standalone, quantizing these bn give a performance as follows:

Model	FP32 (Top-1 / Top-5)	Fusion + fp32 bn	Fusion + int8 bn
Resnet50 v2	0.764 / 0.935	0.722 / 0.901	0.712 / 0.897

xinyu-intel · 2019-08-26T02:21:05Z

                excluded_sym_names += ['resnetv10_conv0_fwd']
        elif args.model.find('resnet') != -1 and args.model.find('v2') != -1:
-            excluded_sym_names += ['resnetv20_flatten0_flatten0']
+            excluded_sym_names += ['resnetv20_flatten0_flatten0', 'resnetv20_stage1_batchnorm0_fwd']


why exclude the first one?

This is for the sake of accuracy, if do not exclude this layer, top-1 accuracy will drop to 52.3. Reason of this accuracy drop is under investigation.

xinyu-intel

.

ZhennanQin

LGTM. Just add a comment to remind that the excluded BN layer is for accuracy purpose.

pengzhao-intel

Thanks for the contribution.

LGTM and mering now.

ElaineBao added 2 commits August 26, 2019 09:11

add uint8 bn mkldnn implementation

e1bfae3

update test case for uint8 bn

df4c02a

ElaineBao requested a review from szha as a code owner August 26, 2019 02:00

fix lint

7d00792

xinyu-intel reviewed Aug 26, 2019

View reviewed changes

pengzhao-intel added the MKLDNN label Aug 26, 2019

xinyu-intel reviewed Aug 26, 2019

View reviewed changes

Comment thread tests/python/quantization/test_quantization.py Outdated

xinyu-intel reviewed Aug 26, 2019

View reviewed changes

ZhennanQin approved these changes Aug 26, 2019

View reviewed changes

ElaineBao added 4 commits August 26, 2019 10:29

update test with gpu

f736c04

add comment for quantization

3d0a457

fix quantized_bn test

dd53622

fix quantize_model_with_forward test

eeb60f0

pengzhao-intel approved these changes Aug 26, 2019

View reviewed changes

pengzhao-intel merged commit 9410cc4 into apache:master Aug 26, 2019

ElaineBao deleted the bn-uint8 branch August 29, 2019 04:11

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add uint8 bn mkldnn implementation#16003

add uint8 bn mkldnn implementation#16003
pengzhao-intel merged 7 commits into
apache:masterfrom
ElaineBao:bn-uint8

ElaineBao commented Aug 26, 2019

Uh oh!

xinyu-intel Aug 26, 2019

Uh oh!

ElaineBao Aug 26, 2019

Uh oh!

Uh oh!

xinyu-intel left a comment

Uh oh!

ZhennanQin left a comment

Uh oh!

pengzhao-intel left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

ElaineBao commented Aug 26, 2019

Description

Details

Usage

Performance

Uh oh!

xinyu-intel Aug 26, 2019

Choose a reason for hiding this comment

Uh oh!

ElaineBao Aug 26, 2019

Choose a reason for hiding this comment

Uh oh!

Uh oh!

xinyu-intel left a comment

Choose a reason for hiding this comment

Uh oh!

ZhennanQin left a comment

Choose a reason for hiding this comment

Uh oh!

pengzhao-intel left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants