# G-API: Implement inference only mode for OV backend #24584
## Conversation
@dmatveev Could you have a look, please?
```cpp
 * This mode is used to evaluate the pure inference performance of the model without
 * taking into account the i/o data transfer.
 */
struct inference_only { };
```
- I'd move it to `cv::gapi::wip`
- I'd rename it to `benchmark_mode` to illustrate the purpose.
Should it be `cv::gapi::wip` or `cv::gapi::wip::ov`?
The latter, of course.
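For illustration, the agreed-upon placement could look roughly like this; a minimal sketch, assuming the usual G-API compile-argument machinery, where the doc comment and the tag string are illustrative rather than taken from the patch:

```cpp
#include <opencv2/gapi/gcommon.hpp> // GCompileArg / CompileArgTag machinery

namespace cv { namespace gapi { namespace wip { namespace ov {

/**
 * @brief Run pure inference only, without populating input
 * tensors or copying back output tensors.
 */
struct benchmark_mode { /* no fields */ };

}}}} // namespace cv::gapi::wip::ov

// G-API compile arguments are identified by a tag specialization:
namespace cv { namespace detail {
template<> struct CompileArgTag<cv::gapi::wip::ov::benchmark_mode> {
    static const char* tag() { return "gapi.wip.ov.benchmark_mode"; }
};
}} // namespace cv::detail
```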
Done
```diff
@@ -252,7 +252,8 @@ class OVCallContext
         const std::vector<cv::gimpl::RcDesc>              &  outs,
         cv::GRunArg::Meta                                 && meta,
         std::vector<cv::gimpl::GIslandExecutable::InObj>  && input_objs,
-        std::vector<cv::gimpl::GIslandExecutable::OutObj> && output_objs);
+        std::vector<cv::gimpl::GIslandExecutable::OutObj> && output_objs,
+        const bool inference_only);
```
Probably you can omit it from there, as it is not a mandatory parameter; it can be `false` internally by default.
This is the place where `inference_only` from `GOVExecutable` is propagated to the specific inference kernel (e.g. `Infer`, `InferList`, etc):
https://github.com/TolyaTalamanov/opencv/blob/at/implement-inference-only-mode-for-ov-backend/modules/gapi/src/backends/ov/govbackend.cpp#L1501-L1502

I guess there is no sense in doing something like this:

```cpp
auto ctx = std::make_shared<OVCallContext>(uu, out, op.args, op.outs,
        std::move(stub_meta), std::move(input_objs), std::move(output_objs));
if (m_inference_only) {
    ctx->enableInferenceOnly();
}
```

It's easier to pass it straight to the context ctor.
It should remain in the class, but it shouldn't be a mandatory constructor argument.
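A minimal sketch of that shape (parameter list abbreviated; the default keeps existing call sites compiling):

```cpp
class OVCallContext {
public:
    OVCallContext(/* ...existing parameters..., */
                  bool inference_only = false); // non-mandatory knob
private:
    bool m_inference_only = false;              // stays in the class
};
```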
Done
```cpp
m_inference_only =
    cv::gapi::getCompileArg<cv::gapi::ov::inference_only>(compileArgs).has_value();
```
Probably you don't want to track all the individual fine-tuning knobs here, so I'd propose a configuration-like structure instead.
Do you mean something like this?

```cpp
namespace cv { namespace gapi { namespace wip { namespace ov {

struct execution_config {
    bool enable_benchmark_mode = false;
};

} // namespace ov
} // namespace wip
} // namespace gapi
} // namespace cv

// Example:
auto compile_args = cv::compile_args(
    cv::gapi::wip::ov::execution_config{ true /* benchmark mode */ });
```
Not at the API level. Inside.
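In other words, the knobs could be folded into a backend-internal structure. A sketch with illustrative names (`OVBackendConfig` and `makeConfig` are assumptions, not from the patch):

```cpp
// Internal to the OV backend sources, not exposed through the public API.
struct OVBackendConfig {
    bool benchmark_mode = false;
    // ...future fine-tuning knobs would go here...
};

static OVBackendConfig makeConfig(const cv::GCompileArgs &args) {
    OVBackendConfig cfg;
    // Presence of the compile argument enables the mode.
    cfg.benchmark_mode =
        cv::gapi::getCompileArg<cv::gapi::wip::ov::benchmark_mode>(args).has_value();
    return cfg;
}
```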
Done
```diff
@@ -1471,7 +1499,7 @@ void cv::gimpl::ov::GOVExecutable::run(cv::gimpl::GIslandExecutable::IInput &in
     const auto &op = m_gm.metadata(this_nh).get<Op>();

     auto ctx = std::make_shared<OVCallContext>(uu, out, op.args, op.outs,
-                std::move(stub_meta), std::move(input_objs), std::move(output_objs));
+                std::move(stub_meta), std::move(input_objs), std::move(output_objs), m_inference_only);
```
...and pass it as a whole thing here. This method was quite generic, but now it knows explicitly about the `inference_only` mode, which is purely an abstraction leak.
This method still doesn't know about `inference_only` implementation details; it just propagates "some" configuration into the kernel. But your point is correct: we need to decide who exactly should be responsible for handling this mode. There are a few options:

1. `Kernel` - this is the current approach. `inference_only` is propagated as follows: `compileArgs` -> `GOVExecutable` -> `OVCallContext` -> `Kernel` (e.g. `Infer`, `InferList`, etc). The benefit is that `IInferExecutor`, which is used as a proxy to submit inference tasks, doesn't know about this option at all; the kernel hides it under the `set_input_data` and `read_output_data` callbacks, so it may continue to work in `sync`/`async` modes without any knowledge of this mode.

2. `RequestPool` / `IInferExecutor` - in this approach `GOVExecutable` may configure `RequestPool` in a way that makes it aware of benchmark mode.
   - Pros:
     - Kernels don't know about this mode, so they continue to submit their tasks through the `RequestPool` API.
   - Cons:
     - `inference_only` mode would be enabled for all `Kernel`s (currently it's only relevant for `Infer`; `InferList`, `InferROI` and `InferList2` throw an exception if the mode is enabled).
     - The mode would have to be handled for both `SyncInferExecutor` and `AsyncInferExecutor`. This is a little bit tricky since currently `read_output_data` is also responsible for posting outputs to maintain the contract with the streaming executor:
       https://github.com/TolyaTalamanov/opencv/blob/at/implement-inference-only-mode-for-ov-backend/modules/gapi/src/backends/ov/govbackend.cpp#L471

The current approach looked less invasive (see the sketch after this list).
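For context, here is a rough sketch of how option 1 short-circuits the data transfers inside the kernel callbacks. All helper and accessor names below are hypothetical; the real code lives in govbackend.cpp:

```cpp
// set_input_data: skip filling the request's input tensors in benchmark mode.
auto set_input_data = [ctx](::ov::InferRequest &request) {
    if (!ctx->benchmarkMode()) {          // hypothetical accessor on OVCallContext
        bindInputTensors(ctx, request);   // hypothetical: copy G-API inputs in
    }
};

// read_output_data: skip copying results out, but still post outputs
// to keep the contract with the streaming executor intact.
auto read_output_data = [ctx](::ov::InferRequest &request) {
    if (!ctx->benchmarkMode()) {
        copyOutputTensors(ctx, request);  // hypothetical: copy results out
    }
    postOutputs(ctx);                     // hypothetical: must run in both modes
};
```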
Yes, and my point is to make "some configuration" really *some* configuration, not a specific field.
Done
Force-pushed the branch from `3d2311d` to `8e9f224`.
👍 !
@asmorkalov Can we merge it, please?
### Changes overview

Introduced the `cv::gapi::wip::ov::benchmark_mode{}` compile argument which, if enabled, forces the `OpenVINO` backend to run inference only, without populating input tensors and copying back output tensors.

This mode is only relevant for measuring the performance of pure inference without data transfers. A similar approach is used on the OpenVINO side in `benchmark_app`: https://github.com/openvinotoolkit/openvino/blob/master/samples/cpp/benchmark_app/benchmark_app.hpp#L134-L139

### Pull Request Readiness Checklist

See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request

- [x] I agree to contribute to the project under Apache 2 License.
- [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV
- [x] The PR is proposed to the proper branch
- [ ] There is a reference to the original bug report and related work
- [ ] There is accuracy test, performance test and test data in opencv_extra repository, if applicable. Patch to opencv_extra has the same branch name.
- [x] The feature is well documented and sample code can be built with the project CMake
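For reference, enabling the mode from user code might look like this; a minimal sketch where the network name, model paths, and device are illustrative placeholders:

```cpp
#include <opencv2/gapi.hpp>
#include <opencv2/gapi/infer/ov.hpp>

// Illustrative one-input/one-output network declaration.
G_API_NET(MyNet, <cv::GMat(cv::GMat)>, "my-net");

int main() {
    cv::GMat in;
    cv::GMat out = cv::gapi::infer<MyNet>(in);
    cv::GComputation graph(cv::GIn(in), cv::GOut(out));

    // Illustrative model paths and device.
    auto params = cv::gapi::ov::Params<MyNet>{
        "model.xml", "model.bin", "CPU"
    };

    auto pipeline = graph.compileStreaming(
        cv::compile_args(cv::gapi::networks(params),
                         cv::gapi::wip::ov::benchmark_mode{}));
    // pipeline.setSource(...), pipeline.start(), etc.
    return 0;
}
```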