-
-
Notifications
You must be signed in to change notification settings - Fork 56.3k
DNN: optimize dnn vulkan backend #23349
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
e3b7d04
to
8b7cc81
Compare
e7c6627
to
49f7a12
Compare
49f7a12
to
8ea197c
Compare
@vpisarev Friendly reminder. |
9c339e3
to
f982551
Compare
The CI is greed now. zihaomu#1 |
kernel_size.assign(1, kernel_size[0]); | ||
strides.assign(1, strides[0]); | ||
pads_begin.assign(1, pads_begin[0]); | ||
pads_end.assign(1, pads_end[0]); | ||
kernel_size.resize(1, kernel_size[0]); | ||
strides.resize(1, strides[0]); | ||
pads_begin.resize(1, pads_begin[0]); | ||
pads_end.resize(1, pads_end[0]); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This modification fixes the error reported by Visual Studio 2020.
7635ec6
to
c05dc51
Compare
Please rebase to resolve conflicts:
|
c4f6c54
to
a12cd9a
Compare
a12cd9a
to
5e2594e
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM 👍
Optimize DNN Vulkan backend
merge with: opencv/ci-gha-workflow#95.
My purposes for this PR:
1.0
to1.2
to support thefp16
andint8
data format.Vulkan CI result can be found at this PR
We only optimize the integrated GPU, and the discrete GPU like Nvidia GPU will run relatively slowly.
There are two CIs:
TODO List:
Performance Test
NOTE: Currently PR is only optimized for integrated graphics, it will run very slowly on discrete graphics like Nvidia GPU.
Test on Apple M1 chip.
Patch performance:
Since the old vulkan kernel is almost without optimize, it works very slowly.
Pull Request Readiness Checklist
See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request
Patch to opencv_extra has the same branch name.