
[WebNN EP] Support int64 output data type for CoreML backend #21401

Open
Honry opened this issue Jul 18, 2024 · 3 comments
Labels
ep:WebNN WebNN execution provider feature request request for unsupported feature or enhancement platform:mobile issues related to ONNX Runtime mobile; typically submitted using template

Comments

@Honry
Contributor

Honry commented Jul 18, 2024

Describe the feature request

The WebNN CoreML backend doesn't support the int64 data type; however, some ONNX ops produce int64 output, e.g. ArgMax, ArgMin, etc., while CoreML's ArgMax produces int32 output.

That means we should check that the size of the dimension being reduced is within int32 range, then cast the output (int32 -> int64).

A node of such an op must be an output of the subgraph model, because its next node takes an int64 input, which the CoreML backend doesn't support, so that node falls back, unless it is the special case ArgMax followed by a Cast (from int64 to int32).

The following actions can be considered:

  • Use WebNN's opSupportLimits() to check whether the int64 data type is supported
  • Make sure producing int32 output instead is safe
  • Fuse ops, e.g. ArgMax + Cast (int64 -> int32)
  • Convert the int32 output tensor back to int64

Besides, how the CoreML EP handles the int64 data type would be a good reference.

Describe scenario use case

N/A

@Honry Honry added the feature request request for unsupported feature or enhancement label Jul 18, 2024
@github-actions github-actions bot added the platform:mobile issues related to ONNX Runtime mobile; typically submitted using template label Jul 18, 2024
@fdwr
Contributor

fdwr commented Jul 18, 2024

how CoreML EP handles int64 data type would be a good reference

Indeed, I really wonder, given that all indices are int64 in ONNX.

@fdwr fdwr added the ep:WebNN WebNN execution provider label Jul 18, 2024
@sophies927 sophies927 added the ep:CoreML issues related to CoreML execution provider label Jul 18, 2024
@skottmckay
Contributor

The CoreML EP converts all int64 attribute and initializer values to int32 when creating the CoreML model (and checks for overflow errors as it does so).

It also tracks whether it needs to convert specific inputs/outputs between int64 and int32 when executing the CoreML model.

Once you have the attributes, initializers, and CoreML model inputs as int32, the internals of the CoreML model will produce int32 values, and we just need to convert the output from the CoreML model back to int64 where applicable.
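The overflow-checked narrowing described above can be sketched as follows (the function name is hypothetical, not the actual ORT code): reject any value outside int32 range rather than silently wrapping, so a model that cannot be lowered safely fails loudly.

```python
import numpy as np

def narrow_initializer_to_int32(values: np.ndarray) -> np.ndarray:
    # Narrow int64 values to int32, raising on overflow instead of wrapping,
    # mirroring the check the CoreML EP performs during model creation.
    info = np.iinfo(np.int32)
    if values.min() < info.min or values.max() > info.max:
        raise OverflowError("int64 initializer value out of int32 range")
    return values.astype(np.int32)

# In range: narrows cleanly.
narrow_initializer_to_int32(np.array([0, 2**31 - 1], dtype=np.int64))
# Out of range: would raise OverflowError.
# narrow_initializer_to_int32(np.array([2**31], dtype=np.int64))
```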

@Honry
Contributor Author

Honry commented Aug 30, 2024

Thank you @skottmckay, that's really helpful!

  • For initializer conversion, the code is here, right? It looks like most of them are written into local weight files, and the conversion from int64 to int32 is then handled by CoreML, right?

  • For input conversion (int64 -> int32), the code is here, right? How does it handle data overflow for int64 inputs? For the WebNN EP we need to use a Cast op to convert int64 inputs to int32, which is not safe if overflow exists.
