Changes from upstream/master #2

bbielawx · 2024-04-17T08:19:46Z

No description provided.

### Details: - Fix for #27903 - Update NF4 precision conversions in NPU plugin - *...*

### Details: - *static strided_sliced crop to use tensor* ### Tickets: - *160254*

### Details:
- Added a common base class for plugin configuration and migrated the GPU plugin to it.
- The new config defines options as typed members of the config class, which should provide more efficient access to values than AnyMap.
- Properties can be set only before finalization, which localizes the point at which the config becomes immutable.
- All config options may be read from the environment as well as from a JSON config file when OV is built with debug caps.
- Debug and release options are now defined and handled similarly.

---------

Signed-off-by: Vladimir Paramuzov <[email protected]>
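The configuration lifecycle described in this commit (typed options, setting allowed only before finalization, optional overrides from the environment) can be illustrated with a small sketch. This is not the GPU plugin's code: the class name, the option names `enable_profiling` and `queue_depth`, and the `SKETCH_` environment prefix are all hypothetical and chosen only for illustration.

```python
import os


class PluginConfigSketch:
    """Hypothetical typed config; every name here is illustrative only."""

    # Option name -> expected Python type (stands in for typed members).
    _OPTIONS = {"enable_profiling": bool, "queue_depth": int}

    def __init__(self):
        self.enable_profiling = False
        self.queue_depth = 1
        self._finalized = False

    def set_property(self, name, value):
        # Setting is only allowed before finalize(); afterwards the config is immutable.
        if self._finalized:
            raise RuntimeError("config is immutable after finalize()")
        if name not in self._OPTIONS:
            raise KeyError(name)
        if not isinstance(value, self._OPTIONS[name]):
            raise TypeError(f"{name} expects {self._OPTIONS[name].__name__}")
        setattr(self, name, value)

    def apply_env(self, env=os.environ, prefix="SKETCH_"):
        # Read overrides such as SKETCH_QUEUE_DEPTH=4 from a mapping of strings.
        for name, typ in self._OPTIONS.items():
            raw = env.get(prefix + name.upper())
            if raw is None:
                continue
            if typ is bool:
                value = raw.lower() in ("1", "true", "yes")
            else:
                value = typ(raw)
            self.set_property(name, value)

    def finalize(self):
        # Single point after which no option can change.
        self._finalized = True
```

Reading an option after `finalize()` is a plain attribute access, which is the kind of cheaper lookup the commit message contrasts with a string-keyed map.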

### Details: - ISTFT-16 specification ### Tickets: - 159378 --------- Co-authored-by: Mateusz Mikolajczyk <[email protected]>

### Details: - *Implement reference conversion from fp16 to f8e4m3, and apply 2-step conversion, i.e., apply fp32->fp16, then fp16->fp8 for conversions from fp32 to fp8.* ### Tickets: - *[CVS-160375](https://jira.devtools.intel.com/browse/CVS-160375)*
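A minimal sketch of the two-step conversion described above, written in Python rather than as the actual reference implementation. It assumes the f8e4m3 layout with 1 sign bit, 4 exponent bits (bias 7), 3 mantissa bits, a largest finite value of 448 and NaN encoded as `0x7F`; it also assumes round-to-nearest-even and saturation of out-of-range values, which the commit message does not state. All function names are hypothetical.

```python
import math
import struct

F8E4M3_MAX = 448.0  # assumed largest finite magnitude (code 0x7E)
F8E4M3_NAN = 0x7F   # assumed NaN encoding


def decode_f8e4m3(code: int) -> float:
    """Decode the magnitude of an f8e4m3 code (sign bit is ignored)."""
    exp, man = (code >> 3) & 0xF, code & 0x7
    if exp == 0:  # subnormal range
        return (man / 8.0) * 2.0 ** -6
    return (1.0 + man / 8.0) * 2.0 ** (exp - 7)


def fp32_to_fp16(x: float) -> float:
    """Step 1: round to half precision using struct's IEEE 754 'e' format."""
    try:
        return struct.unpack("<e", struct.pack("<e", x))[0]
    except OverflowError:  # magnitude too large for fp16
        return math.copysign(math.inf, x)


def fp16_to_f8e4m3(x: float) -> int:
    """Step 2: pick the nearest f8e4m3 code, ties to the even code."""
    if math.isnan(x):
        return F8E4M3_NAN
    sign = 0x80 if math.copysign(1.0, x) < 0 else 0x00
    mag = min(abs(x), F8E4M3_MAX)  # saturate (an assumption of this sketch)
    # Exhaustive search over the 127 finite non-negative codes 0x00..0x7E.
    best = min(range(0x7F), key=lambda c: (abs(decode_f8e4m3(c) - mag), c & 1))
    return sign | best


def fp32_to_f8e4m3(x: float) -> int:
    """fp32 -> fp16 -> fp8, mirroring the two-step path in the commit."""
    return fp16_to_f8e4m3(fp32_to_fp16(x))
```

With these assumptions, `fp32_to_f8e4m3(1.0)` yields `0x38`. The exhaustive search trades speed for obviousness, which suits a reference-style check but not a production kernel.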

Details: Microsoft Contrib Operator "FastGelu" for ONNX RT Tickets: N/A --------- Co-authored-by: Georgy Krivoruchko <[email protected]>

This PR optimizes the ONNX frontend's com.microsoft.MatMulNBits operator with the following changes:

1. It disables constant folding, which used 75 GB for the phi3 INT4 model and 200+ GB for the llama3 INT4 model.
2. It triggers oneDNN matmul primitives, which greatly benefits GPU performance.

We tested these changes together with another PR, #28163, and confirmed that the phi3/llama3 INT4 models run well on LNL.

---------

Co-authored-by: Yu, Zijun <[email protected]>

Details: Microsoft Contrib Operator "QLinearAveragePool" for ONNX RT Tickets: N/A --------- Co-authored-by: Georgy Krivoruchko <[email protected]>

### Details:
- The original PR (#28788) was force-merged by mistake because of incorrect merge queue settings. It was later reverted, so this is the "new" Ref PR.
- Add reference implementation
- Add tests

### Related PRs:
- #28103
- #28698
- #28979
- #28999

### Tickets:
- CVS-158917

---------

Signed-off-by: p-wysocki <[email protected]>
Co-authored-by: Roman Kazantsev <[email protected]>
Co-authored-by: Pawel Raasz <[email protected]>
Co-authored-by: Katarzyna Mitrus <[email protected]>

### Details: - Update onednn for gpu with latest 3.7 branch which includes ARL-H perf improvement ### Tickets: - *ticket-id*

### Details: - *Some customers may set the ProcessDynamicCodePolicy policy to prohibit dynamic code, so JIT needs to be disabled in such cases.* ### Tickets: - *162440*

### Details: - Some nodes are targeted for optimization at runtime; the memory status of these nodes can become corrupted across iterations. ### Tickets: - CVS-161830 --------- Signed-off-by: fishbell <[email protected]>

…#29058) Make an additional conversion from std::string to std::wstring in the uv::util::to_file_path function when OPENVINO_ENABLE_UNICODE_PATH_SUPPORT is set. Tickets: [162461](https://jira.devtools.intel.com/browse/CVS-162461) [146659](https://jira.devtools.intel.com/browse/CVS-146659)

Signed-off-by: Alina Kladieva <[email protected]>

Required for internal tests using requirement files packed here Signed-off-by: Alina Kladieva <[email protected]>

### Description: As C++17 is the default standard of OpenVINO, there is no need to use the self-implemented optional struct. Note: discussed in #28942. ### Related PRs: - #28942 Co-authored-by: Michal Lukaszewski <[email protected]>

+ Add reorder to change input format from blocked to plain for dGPU to select onednn kernel for gemm. ### Tickets: - *162459*

Fix an implicit casting error that occurs when the input and output data types differ. Use in_type instead of out_type during FUSED_OPS_VEC. ### Tickets: - *160534* --------- Signed-off-by: hyunback <[email protected]>

### Details:
- *Use the new ZE_MUTABLE_COMMAND_EXP_FLAG_GRAPH_ARGUMENTS flag; if the flag is still not supported by the driver, fall back to the deprecated one*
- *Create fences or events only if they are used*
- *Check Major version correctly*
- *Make sure the Command queue is kept alive until the Fence is destroyed*
- *Remove unused parameters from compiler adapter*

---------

Signed-off-by: Bogdan Pereanu <[email protected]>

This change relies on the fact that the base edge is always the first edge in the cluster, which avoids unnecessary looping when updating the referencing edges. It also allocates dynamic output edges before the regions are created, so there is no need to create those regions and then remove them anyway.

Reverts #29031 because it introduced sporadic failures (e.g.: https://github.com/openvinotoolkit/openvino/actions/runs/13432909164/job/37528579365?pr=29086) The licenses whitelist should not contain AND expressions (see actions/dependency-review-action#842)

### Details:
After switching from ModelOptimizer to OVC, the order in which PrePostProcessing and MOCTransformations are applied has changed:

- MO path: [fw model conversion -> PrePostProcessing -> MOC] -> nncf
- OVC path: [fw model conversion -> MOC] -> PrePostProcessing -> nncf

Since nncf is applied to a model that is not fully optimized, extra FQ ops might appear, which can affect both accuracy and performance. For example, Mul -> Conv fusion is not applied due to an extra FQ.

<img width="165" alt="{C6E93F2C-2CE3-4596-8D7F-ED7BD8013603}" src="https://github.com/user-attachments/assets/3cbe6e07-9c07-4002-8b4c-9fb5bc662421" />

PrePostProcessing is not part of OVC, so we have to insert additional Transformation calls inside PrePostProcessing.

### Tickets:
- *CVS-160786*
- CVS-161724

---------

Co-authored-by: Andrei Kochin <[email protected]>
Co-authored-by: Andrii Staikov <[email protected]>

### Details: - Mark `ov::element::undefined` as deprecated - Use dynamic type instead of undefined ### Related PRs: - openvinotoolkit/openvino_contrib#945 - #28524 ### Tickets: - CVS-161637 --------- Signed-off-by: Pawel Raasz <[email protected]> Signed-off-by: Raasz, Pawel <[email protected]>

### Details:
- Add utils to core to set names on model input/output tensors
- Define default tensor names and port separator
- Use auto naming in models created by the Python API. Model outputs will also get names.
- Auto naming can become part of the Model API, but that requires adding validation tools to find and resolve name collisions
- Auto naming cannot be added to Model creation in C++ because it can affect (de)serialization; this requires more investigation

### Tickets:
- CVS-159997

---------

Signed-off-by: Raasz, Pawel <[email protected]>

### Details: - *item1* - *...* ### Tickets: - *ticket-id* Co-authored-by: Mikhail Ryzhov <[email protected]>

### Details: - *Reverted the `reset_state()` change (no changes regarding `reset_state()` are present here, regardless of the branch name)* - *Added initialization of tensors in LLMInferRequest* - *Fixed copy of attention_mask* ### Tickets: - *ticket-id*

### Details: - *Return default NonZero type where possible. This allows two NonZero nodes to be merged.* ### Tickets: - *CVS-160578* Signed-off-by: Maxim Vafin <[email protected]>

### Details: - *Extend mean_iou metric in single-image-test for semantic segmentation networks that have ArgMax at the end.* ### Tickets: - *EISW-158075*

### Details: - *Fix turbo mode* ### Tickets: - *E#156694* Signed-off-by: Bogdan Pereanu <[email protected]>

bbielawx force-pushed the master branch from 9311891 to e3fbb57 Compare April 17, 2024 08:26

github-actions bot added category: transformations category: samples category: MO category: ONNX FE category: AUTO category: tools labels Apr 17, 2024

github-actions bot added category: AUTO BATCH category: HETERO labels Apr 25, 2024

StefaniaHergane and others added 30 commits February 18, 2025 07:59

[NPU] Update NF4 precision conversions in NPU plugin (#29026)

b3a32cb

[GPU] static strided_sliced crop to use tensor (#28507)

398842a

[Op][Spec] ISTFT-16 specification (#28807)

ad8e793

[ONNX] Added FastGelu from com.microsoft domain (#28715)

047976e

[ONNX] Added QLinearAveragePool from com.microsoft domain (#28799)

e58f38f

[GPU] update onednn with the latest 3.7 branch (#28986)

c3ae4b0

[CORE][CACHE][HASH] DYNAMIC_CODE_POLICY check on Windows (#29006)

888cc14

[clean-up ] Fix code style (#29064)

6d19e8b

Add 2024.5.1 to conflicting versions (#29055)

6396dc9

[GHA] Save Manylinux openvino_tests archive on share (#29033)

77707fb

[core] Remove custom ov::optional (#28949)

d3cdfe8

[GPU] Add input reorder for onednn gemm (#29059)

45837a4

[GPU] Fix resmlp-12-distilled-224 failure. (#28861)

b0633ff

DOCS Port for Update packages to master (#29079)

1643337

[PT FE] Return default NonZero type where possible (#29072)

61ab2ef

[NPU][SIT] Extand mean_iou metric for networks with ArgMax (#29085)

2ac4fb6

[NPU] Fix turbo mode (#29090)

285a73a
