dnn : int8 quantized layers support in onnx importer #20535
Conversation
int depth = CV_32F;
checkQuantizedLayer(node_proto, layerParams, depth);
if (depth == CV_8S)
    CV_Error(Error::StsNotImplemented, "Int8 resize layer is not supported");
IMHO, we should fall back in such cases - reuse the 32F implementation, which will always have better coverage.
BTW, #20228 should NOT block the implementation of new layers without the "int8" stuff. That experimental code must be optional.
Regarding fallback for current layers without int8 support and for new layers added in the future:
- The functions in #20228 can fall back to the FP32 version automatically if the int8 version of a layer is unavailable. The tests that were added check this logic. Adding an int8 version of a new layer is optional.
- Nodes in this PR which don't have an int8 version do not fall back to the FP32 version; that should probably be added. Will try to add it in the coming days. The logic is slightly tricky, as quantize/dequantize nodes have to be inserted before and after the unsupported layer (see the sketch below).
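For readers following along, here is a minimal self-contained sketch of that wrapping idea, with all names made up for illustration (this is not the actual importer code): the unsupported op runs in FP32 between an explicit dequantize step and a requantize step.

```cpp
#include <algorithm>
#include <cmath>
#include <cstdint>
#include <vector>

// Dequantize int8 activations to float using scale and zero point.
static std::vector<float> dequantize(const std::vector<int8_t>& q, float scale, int zp)
{
    std::vector<float> out(q.size());
    for (size_t i = 0; i < q.size(); ++i)
        out[i] = scale * (static_cast<int>(q[i]) - zp);
    return out;
}

// Quantize float values back to int8 with saturation to [-128, 127].
static std::vector<int8_t> quantize(const std::vector<float>& x, float scale, int zp)
{
    std::vector<int8_t> out(x.size());
    for (size_t i = 0; i < x.size(); ++i)
    {
        int v = static_cast<int>(std::round(x[i] / scale)) + zp;
        out[i] = static_cast<int8_t>(std::min(127, std::max(-128, v)));
    }
    return out;
}

// The "unsupported" op runs in FP32 between the two conversion steps.
std::vector<int8_t> runWrappedRelu(const std::vector<int8_t>& q, float scale, int zp)
{
    std::vector<float> x = dequantize(q, scale, zp);
    for (float& v : x)
        v = std::max(0.f, v);   // FP32 ReLU stands in for any FP32-only layer
    return quantize(x, scale, zp);
}
```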
Marking the pull request as ready for review, as the coding part is done. Only a short tutorial on using quantization and quantized onnx models remains.

@alalek Automatic FP32 fallback for an unsupported INT8 node does not look possible right now. Instead, adding an INT8 path for that unsupported node is much easier, and that's what I did with the resize layer. The INT8 path for the resize layer has been added in the FP32 version of the layer itself to avoid code duplication. But looking at it now, maybe the changes are too invasive and may affect performance of the FP32 layer. If required, I can move the int8 implementation to int8layers/resize_layer.cpp and revert the changes in layers/resize_layer.cpp.
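As background on why an int8 resize path is cheap: nearest-neighbour resize only copies existing int8 values, so the input's scale and zero point remain valid for the output and no requantization is needed. A minimal illustrative sketch (not the actual OpenCV layer code):

```cpp
#include <cstdint>

// Nearest-neighbour resize over a single-channel int8 plane. Values are
// copied unchanged, so the input's quantization parameters carry over.
void resizeNearestInt8(const int8_t* src, int srcH, int srcW,
                       int8_t* dst, int dstH, int dstW)
{
    for (int y = 0; y < dstH; ++y)
    {
        int sy = y * srcH / dstH;          // nearest source row
        for (int x = 0; x < dstW; ++x)
        {
            int sx = x * srcW / dstW;      // nearest source column
            dst[y * dstW + x] = src[sy * srcW + sx];
        }
    }
}
```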
Any solution for this build failure?
I have modified …
@@ -967,6 +967,112 @@ TEST_P(Test_ONNX_layers, ConvResizePool1d)
    testONNXModels("conv_resize_pool_1d");
}

TEST_P(Test_ONNX_layers, Quantized_Convolution)
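The test body is collapsed in the diff view; judging by the neighbouring tests it plausibly follows the same testONNXModels pattern (the basename and tolerances below are placeholders, not the PR's actual values):

```cpp
TEST_P(Test_ONNX_layers, Quantized_Convolution)
{
    // Placeholder basename and thresholds: the real test loads a quantized
    // conv model from opencv_extra with relaxed int8 tolerances.
    testONNXModels("quantized_conv", npy, 0.006, 0.025);
}
```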
Test_ONNX_layers
This test subset is parametrized to run on all available backends/targets.
That doesn't make sense for now.
Create a separate test fixture that runs on the OpenCV/CPU target only and move all "quantized" cases there.
> This doesn't make sense for now

While it's true that other backends don't have quantized layer implementations, we still need to ensure quantized networks fall back to a supported backend. Keeping the tests enabled for all backends helps test this fallback:

[ RUN      ] Test_ONNX_nets.ResNet50_Int8/0, where GetParam() = NGRAPH/CPU
[ WARN:0] global /home/jebastin/opencv_build/opencv/modules/dnn/src/dnn.cpp (4562) setPreferableBackend DNN: Only default backend supports quantized networks
FALLBACK: Layer [Quantize]:[data_quantized] is expected to has backend implementation
...
[       OK ] Test_ONNX_nets.ResNet50_Int8/0 (168 ms)

Internally, the code falls back to the OpenCV/CPU target for unsupported backends, so I don't see a point in limiting the backends of these tests (except to reduce the total time for running the dnn module tests).
If you still think a new test fixture is needed, I will start working on it soon (will probably need some reference code).
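For anyone wanting to reproduce that fallback outside the test suite, something along these lines triggers the same warning (the model path is a placeholder):

```cpp
#include <opencv2/dnn.hpp>

int main()
{
    cv::dnn::Net net = cv::dnn::readNetFromONNX("resnet50_int8.onnx");  // placeholder path
    // A backend without int8 support: dnn prints the warning above and
    // silently falls back to the default OpenCV/CPU implementation.
    net.setPreferableBackend(cv::dnn::DNN_BACKEND_INFERENCE_ENGINE);
    cv::Mat img(224, 224, CV_32FC3, cv::Scalar::all(0));
    net.setInput(cv::dnn::blobFromImage(img));
    cv::Mat out = net.forward();
    return 0;
}
```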
@@ -1103,6 +1209,11 @@ TEST_P(Test_ONNX_nets, ResNet50v1)
    testONNXModels("resnet50v1", pb, default_l1, default_lInf, true, target != DNN_TARGET_MYRIAD);
}

TEST_P(Test_ONNX_nets, ResNet50_Int8)
Test_ONNX_nets
The same. We need a separate test fixture for quantized tests with a limited set of backends.
Let's put it in to have some usable tests for the int8 feature.
Thank you 👍
dnn : int8 quantized layers support in onnx importer
* added quantized layers support in onnx importer
* added more cases in eltwise node, some more checks
* added tests for quantized nodes
* relax thresholds for failed tests, address review comments
* refactoring based on review comments
* added support for unsupported cases and pre-quantized resnet50 test
* relax thresholds due to int8 resize layer
merge with extra : opencv/opencv_extra#896
Final PR for the GSoC'21 project on 8-bit quantization support in the dnn module. This PR adds the new layers currently supported by ONNX quantization.
Docs - https://onnxruntime.ai/docs/how-to/quantization.html
Supported layers - https://github.com/microsoft/onnxruntime/blob/master/onnxruntime/python/tools/quantization/registry.py#L34-L53
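Until the planned tutorial lands, loading a pre-quantized model works like any other ONNX file; a minimal sketch, with placeholder file names and input size:

```cpp
#include <opencv2/dnn.hpp>
#include <opencv2/imgcodecs.hpp>

int main()
{
    // Model produced by onnxruntime's quantization tooling (see docs above).
    cv::dnn::Net net = cv::dnn::readNetFromONNX("resnet50_int8.onnx");  // placeholder
    cv::Mat img = cv::imread("input.jpg");                              // placeholder
    // Preprocessing is unchanged: quantize/dequantize nodes inside the
    // graph convert between the float I/O and the int8 layers.
    cv::Mat blob = cv::dnn::blobFromImage(img, 1.0 / 255, cv::Size(224, 224));
    net.setInput(blob);
    cv::Mat scores = net.forward();
    return 0;
}
```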
resolves : #20188
replaces #20264
TODO:
- fallback to FP32 for unsupported cases (automatic fallback does not look possible)

Pull Request Readiness Checklist
See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request
Patch to opencv_extra has the same branch name.