DNN: optimize dnn vulkan backend #23349

zihaomu · 2023-03-14T05:29:51Z

Optimize DNN Vulkan backend

The goals of this PR:

Upgrade the Vulkan headers from version 1.0 to 1.2 to support the fp16 and int8 data formats.
Carefully optimize the convolution and GEMM layers. ResNet50 inference with the Vulkan backend speeds up from 170 ms to 36 ms.
Remove support for some layers, such as pooling, permute, LRN, and ReLU. Supporting these layers slows down DNN inference because their kernels are not well optimized. I think we should leave this task for the next step; GSoC students could take on some of the work.
Support the iOS and Mac M1 chip platforms.

Vulkan CI results can be found at this PR

We only optimize for integrated GPUs; discrete GPUs such as Nvidia GPUs will run relatively slowly.
There are two CI configurations:

Mac M1: running the full test takes about 2 minutes.
Win10 with an Nvidia GPU: running the full test takes about 5 minutes.

TODO List:

Add the Vulkan CI to GitHub Actions so that PRs can be tested.

Performance Test

NOTE: This PR is currently optimized only for integrated graphics; it will run very slowly on discrete graphics such as Nvidia GPUs.

Test on Apple M1 chip.

Model Name	ResNet50	MobileNetV2	YoloV3	YoloV4
CPU Backend (4 threads)	26 ms	6 ms	130.05 ms	215.76 ms
CPU without Winograd (4 threads)	35 ms	6.5 ms	218.9 ms	271.7 ms
Vulkan GPU	37.8 ms	13.8 ms	182.3 ms	270.04 ms

Patch performance:
The old Vulkan kernels were almost entirely unoptimized, so they ran very slowly.

Test of ResNet50 on M1 chip	Before patch	With patch
Vulkan GPU	190 ms	37.8 ms (4X faster)
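
As a quick sanity check of the quoted speed-up, the ratio of the two latencies can be computed directly. The helper below is a hypothetical illustration, not code from the patch. With the table's figures (190 ms and 37.8 ms) the ratio is roughly 5.0, and with the description's figures (170 ms and 36 ms) it is roughly 4.7, so "4X faster" reads as a conservative rounding.

```cpp
#include <cassert>

// Hypothetical helper (not part of the PR): speed-up factor of a patch,
// i.e. how many times shorter the new latency is than the old one.
inline double speedup(double before_ms, double after_ms)
{
    // Both latencies must be positive for the ratio to be meaningful.
    assert(before_ms > 0.0 && after_ms > 0.0);
    return before_ms / after_ms;
}
```

Calling `speedup(190.0, 37.8)` uses the numbers from the table above; `speedup(170.0, 36.0)` uses the numbers from the PR description.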

Pull Request Readiness Checklist

See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request

I agree to contribute to the project under Apache 2 License.
To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV
The PR is proposed to the proper branch
There is a reference to the original bug report and related work
There is accuracy test, performance test and test data in opencv_extra repository, if applicable
Patch to opencv_extra has the same branch name.
The feature is well documented and sample code can be built with the project CMake

asmorkalov · 2023-05-03T11:23:02Z

@vpisarev Friendly reminder.

zihaomu · 2023-05-11T05:41:31Z

The CI is green now. zihaomu#1

zihaomu · 2023-05-15T01:31:48Z

modules/dnn/src/int8layers/pooling_layer.cpp

-            kernel_size.assign(1, kernel_size[0]);
-            strides.assign(1, strides[0]);
-            pads_begin.assign(1, pads_begin[0]);
-            pads_end.assign(1, pads_end[0]);
+            kernel_size.resize(1, kernel_size[0]);
+            strides.resize(1, strides[0]);
+            pads_begin.resize(1, pads_begin[0]);
+            pads_end.resize(1, pads_end[0]);


This modification fixes the error reported by Visual Studio 2020.
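
A likely reason for the error, stated here as an assumption since the comment does not quote the diagnostic: `v.assign(1, v[0])` passes a reference to an element of the same vector that `assign` is about to overwrite, and the C++ standard requires that the value argument of `assign(n, t)` not refer into the container. `resize(1, v[0])` only shrinks the vector, so the value argument is never read. The sketch below shows both safe forms; the helper names are hypothetical and do not appear in the patch.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical helper mirroring the patched lines: shrinking with resize()
// erases the trailing elements and never reads the fill value.
inline void keepFirstViaResize(std::vector<std::size_t>& v)
{
    assert(!v.empty());  // v[0] must exist
    v.resize(1, v[0]);
}

// Hypothetical alternative: assign() is fine once the value has been copied
// out, because it no longer aliases storage owned by the vector.
inline void keepFirstViaAssign(std::vector<std::size_t>& v)
{
    assert(!v.empty());
    const std::size_t first = v[0];  // independent copy
    v.assign(1, first);
}
```

Either helper leaves a vector such as `{3, 5, 7}` holding the single element `3`.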

opencv-alalek · 2023-05-18T11:52:43Z

Please rebase to resolve conflicts:

Conflicting files
modules/dnn/src/dnn_common.hpp
modules/dnn/test/test_backends.cpp

opencv-alalek

LGTM 👍

zihaomu requested a review from vpisarev March 14, 2023 05:31

zihaomu changed the title ~~DNN: speed up vulkan dnn, and support ios and apple m1 chip.~~ DNN: optimize dnn vulkan backend Mar 14, 2023

zihaomu added category: dnn optimization labels Mar 14, 2023

zihaomu force-pushed the optimize_vulkan_dnn branch 5 times, most recently from e3b7d04 to 8b7cc81 Compare March 15, 2023 01:23

zihaomu force-pushed the optimize_vulkan_dnn branch 8 times, most recently from e7c6627 to 49f7a12 Compare April 20, 2023 08:32

zihaomu marked this pull request as ready for review April 20, 2023 08:58

zihaomu force-pushed the optimize_vulkan_dnn branch from 49f7a12 to 8ea197c Compare April 20, 2023 09:07

asmorkalov added this to the 4.9.0 milestone May 5, 2023

zihaomu force-pushed the optimize_vulkan_dnn branch from 9c339e3 to f982551 Compare May 11, 2023 03:15

zihaomu mentioned this pull request May 11, 2023

DNN: turn on Vulkan backend at Win and Mac CI opencv/ci-gha-workflow#95

Merged

vpisarev approved these changes May 12, 2023

vpisarev requested a review from opencv-alalek May 12, 2023 08:27

opencv-alalek modified the milestones: 4.9.0, 4.8.0 May 12, 2023

zihaomu commented May 15, 2023

zihaomu force-pushed the optimize_vulkan_dnn branch from 7635ec6 to c05dc51 Compare May 15, 2023 01:39

zihaomu force-pushed the optimize_vulkan_dnn branch 2 times, most recently from c4f6c54 to a12cd9a Compare May 18, 2023 12:49

speed up vulkan dnn, and support ios and apple m1 chip.

5e2594e

zihaomu force-pushed the optimize_vulkan_dnn branch from a12cd9a to 5e2594e Compare May 18, 2023 12:57

opencv-alalek approved these changes May 18, 2023

vpisarev merged commit 5025f29 into opencv:4.x May 18, 2023

asmorkalov mentioned this pull request May 22, 2023

Extend Vulkan tests to 5.x after #23349 merge to 5.x opencv/ci-gha-workflow#100

Open

asmorkalov mentioned this pull request May 31, 2023

(5.x) Merge 4.x #23718

Merged

zihaomu mentioned this pull request Dec 26, 2023

Vulkan backend for NaryEltwiseLayer in DNN module #24768

Merged

6 tasks

thewoz pushed a commit to thewoz/opencv that referenced this pull request Jan 4, 2024

speed up vulkan dnn, and support ios and apple m1 chip. (opencv#23349)

ff99586

thewoz pushed a commit to thewoz/opencv that referenced this pull request May 29, 2024

speed up vulkan dnn, and support ios and apple m1 chip. (opencv#23349)

8728872
