[OpenVINO backend] OpenVINOQuantizer #15

daniil-lyakhov · 2025-02-12T14:32:26Z

Summary

OpenVINOQuantizer is introduced
aot_openvino_compiler.py is updated with the quantization pipeline (timm and torchvision backends only)
openvino_executor_runner.cpp is updated to take several inputs / produce several outputs in a row. This allows to validate models converted to the edge (inspired by the qualcom example) (code style is refactored by clang-format)
aot_openvino_compiler.py is updated with a validation pipeline (timm and torchvision backends only)

Test plan

Model	FP32 acc	INT8 acc	FP32 avg latency	INT8 avg latency	Batch size
Resnet50d	0.80534	0.8038	538	178.167	125

Co-authored-by: Alexander Suslov <[email protected]>

ynimmaga · 2025-02-12T22:31:38Z

backends/openvino/quantizer/quantizer.py

+            preset=preset, model_type=model_type, **kwargs
+        )
+
+    def set_ignored_scope(


Is this function be exposed to the user so that they can tune the quantization process? If so, it will be good to mention in the documentation

Agree! But I'm not sure which documentation to update: we don't have a document with describe the OpenVINOQuantizer yet. Should we perhaps create one?

examples/openvino/aot/aot_openvino_compiler.py

examples/openvino/executor_runner/openvino_executor_runner.cpp

Co-authored-by: Yamini Nimmagadda <[email protected]>

alexsu52 and others added 14 commits February 12, 2025 15:08

added init integration of quantization

5d2784d

deit3_small_patch16_224_in21ft1k

61488d5

Resnet-like model checked

42155a1

WIP

7c66314

Formating

c1fa9e2

openvino_executor_runner.cpp can run on several inputs

e2415af

Validate option / minor

8cbb117

Input shape from the input dataset

4b60fb4

--batch_size

e0cd644

Adapt subset size to keep +- 300 pics for calibration

2a04ee6

Apply suggestions from code review

db7dc13

Co-authored-by: Alexander Suslov <[email protected]>

Comments

de3f50b

OpenVINOQuantizer: constructor arguments have been refined

17fe62f

set_ignored_scope | readme updates

c7e0758

ynimmaga reviewed Feb 12, 2025

View reviewed changes

examples/openvino/aot/aot_openvino_compiler.py Outdated Show resolved Hide resolved

ynimmaga reviewed Feb 12, 2025

View reviewed changes

examples/openvino/aot/aot_openvino_compiler.py Outdated Show resolved Hide resolved

cavusmustafa reviewed Feb 13, 2025

View reviewed changes

examples/openvino/aot/aot_openvino_compiler.py Outdated Show resolved Hide resolved

examples/openvino/executor_runner/openvino_executor_runner.cpp Outdated Show resolved Hide resolved

daniil-lyakhov and others added 4 commits February 14, 2025 14:10

openvino_executor_runner.cpp: comments

19cbc69

Apply suggestions from code review

0892b9d

Co-authored-by: Yamini Nimmagadda <[email protected]>

aot_openvino_compiler.py: comments

d1aa425

README

b9b604d

daniil-lyakhov requested review from ynimmaga and cavusmustafa February 14, 2025 17:06

daniil-lyakhov mentioned this pull request Feb 14, 2025

Move files from nncf/common/quantization to nncf/quantization openvinotoolkit/nncf#3280

Draft

cavusmustafa merged commit 9d07bbb into ynimmaga:openvino_backend Feb 14, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[OpenVINO backend] OpenVINOQuantizer #15

[OpenVINO backend] OpenVINOQuantizer #15

daniil-lyakhov commented Feb 12, 2025

ynimmaga Feb 12, 2025

daniil-lyakhov Feb 14, 2025

[OpenVINO backend] OpenVINOQuantizer #15

[OpenVINO backend] OpenVINOQuantizer #15

Conversation

daniil-lyakhov commented Feb 12, 2025

Summary

Test plan

ynimmaga Feb 12, 2025

Choose a reason for hiding this comment

daniil-lyakhov Feb 14, 2025

Choose a reason for hiding this comment