Support ONNX operator QLinearSoftmax in dnn #23655

fengyuentau · 2023-05-22T08:19:00Z

Resolves #23636.
Merge with opencv/opencv_extra#1064.

This PR maps the QLinearSoftmax (from com.microsoft domain) to SoftmaxInt8 in dnn along with some speed optimization.

Todo:

support QLinearSoftmax with opset = 13
add model and test data for QLinearSoftmax with opset = 13
ensure all models have dims >= 3.
add the script to generate model and test data

Pull Request Readiness Checklist

See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request

I agree to contribute to the project under Apache 2 License.
To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV
The PR is proposed to the proper branch
There is a reference to the original bug report and related work
There is accuracy test, performance test and test data in opencv_extra repository, if applicable
Patch to opencv_extra has the same branch name.
The feature is well documented and sample code can be built with the project CMake

modules/dnn/src/int8layers/softmax_layer.cpp

dkurt · 2023-05-23T10:56:06Z

modules/dnn/test/test_onnx_importer.cpp

+TEST_P(Test_ONNX_layers, QLinearSoftmax)
+{
+    // threshold is set for fusion with dequantization
+    testONNXModels("qlinearsoftmax_11", npy, 0.001, 0.002);


Does this test cover all 4 scenarios?

SoftmaxInt8Invoker<true>::run(src, dst, blobs[0], N, D, output_sc, output_zp); SoftmaxInt8Invoker<false>::run(src, dst, blobs[0], N, D, output_sc, output_zp); SoftmaxInt8OutputFloatInvoker<true>::run(src, dst, blobs[0], N, D); SoftmaxInt8OutputFloatInvoker<false>::run(src, dst, blobs[0], N, D);

This test does not cover log softmax, but it is tested by the existing test Test_Int8_layers.Softmax_log_ONNX/0 instead.

The model architecture of this test is like [input]->QuantizeLinear -> QLinearSoftmax -> DequantizeLinear -> [output]. Since we have tryFuse in SoftmaxInt8 which fuses DequantizeLinear and therefore we have output in float instead of int8 in the end, SoftmaxInt8OutputFloatInvoker<false>::run(...) is triggered here in the test. I tested locally with tryFuse disabled which triggers SoftmaxInt8Invoker<false>::run(...) and it passes. Do you think it necessary to add another test case for int8 output like [input] -> QLinearSoftmax -> [output]?

fengyuentau · 2023-05-24T10:12:39Z

All todo items are clear. Please review if possible.

modules/dnn/src/int8layers/softmax_layer.cpp

Support ONNX operator QLinearSoftmax in dnn opencv#23655 Resolves opencv#23636. Merge with opencv/opencv_extra#1064. This PR maps the QLinearSoftmax (from com.microsoft domain) to SoftmaxInt8 in dnn along with some speed optimization. Todo: - [x] support QLinearSoftmax with opset = 13 - [x] add model and test data for QLinearSoftmax with opset = 13 - [x] ensure all models have dims >= 3. - [x] add the script to generate model and test data ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [x] There is a reference to the original bug report and related work - [x] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [x] The feature is well documented and sample code can be built with the project CMake

fengyuentau added 2 commits May 18, 2023 18:07

initial impl

66362d2

fix bugs: wrong D and invert output scale

083db9a

fengyuentau added optimization category: dnn category: dnn (onnx) ONNX suport issues in DNN module labels May 22, 2023

asmorkalov requested review from dkurt and zihaomu May 22, 2023 08:31

asmorkalov reviewed May 22, 2023

View reviewed changes

modules/dnn/src/int8layers/softmax_layer.cpp Outdated Show resolved Hide resolved

fengyuentau added 3 commits May 23, 2023 11:45

support fusion with dequantization

d9a2596

fix log softmax

990b590

support non-coerced softmax

98294be

asmorkalov changed the title ~~dnn: support ONNX operator QLinearSoftmax~~ WIP: support ONNX operator QLinearSoftmax in dnn May 23, 2023

dkurt reviewed May 23, 2023

View reviewed changes

modules/dnn/src/int8layers/softmax_layer.cpp Show resolved Hide resolved

dkurt reviewed May 23, 2023

View reviewed changes

fengyuentau added 2 commits May 24, 2023 10:29

fix for transposend

8f087cc

fix accuracy difference

8aaa37f

fengyuentau changed the title ~~WIP: support ONNX operator QLinearSoftmax in dnn~~ Support ONNX operator QLinearSoftmax in dnn May 24, 2023

update tests

f9dc368

fengyuentau added this to the 4.8.0 milestone May 24, 2023

zihaomu reviewed May 25, 2023

View reviewed changes

modules/dnn/src/int8layers/softmax_layer.cpp Show resolved Hide resolved

zihaomu approved these changes May 25, 2023

View reviewed changes

dkurt approved these changes May 25, 2023

View reviewed changes

asmorkalov merged commit f07b01c into opencv:4.x May 25, 2023

fengyuentau deleted the qlinearsoftmax branch May 31, 2023 06:12

asmorkalov mentioned this pull request May 31, 2023

(5.x) Merge 4.x #23718

Merged

fengyuentau mentioned this pull request Feb 21, 2024

ONNX conformance test results #21078

Open

48 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Support ONNX operator QLinearSoftmax in dnn #23655

Support ONNX operator QLinearSoftmax in dnn #23655

Uh oh!

fengyuentau commented May 22, 2023 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

dkurt May 23, 2023

Uh oh!

fengyuentau May 23, 2023

Uh oh!

fengyuentau commented May 24, 2023

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Uh oh!

Support ONNX operator QLinearSoftmax in dnn #23655

Support ONNX operator QLinearSoftmax in dnn #23655

Uh oh!

Conversation

fengyuentau commented May 22, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Pull Request Readiness Checklist

Uh oh!

Uh oh!

Uh oh!

dkurt May 23, 2023

Choose a reason for hiding this comment

Uh oh!

fengyuentau May 23, 2023

Choose a reason for hiding this comment

Uh oh!

fengyuentau commented May 24, 2023

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

fengyuentau commented May 22, 2023 •

edited

Loading