
Simplify UnpackInitializerData API #8736

Merged
merged 11 commits into from
Aug 18, 2021

Conversation

guoyu-wang
Contributor

@guoyu-wang guoyu-wang commented Aug 13, 2021

Description: Simplify UnpackInitializerData API

Motivation and Context

  • There are crashes reported due to unaligned memory access on arm, such as Fix arm crash due to unaligned raw_data buffer of initializer tensors onnx/onnx#3626 and Add timeseries imputer transformer featurizer kernel #2813. Many places in our mobile EP code still access the initializer data directly, which may cause issues later
  • Plan is to move all the EP code accessing initializer data directly to use UnpackInitializerData
  • This change simplifies the API of UnpackInitializerData to use a std::vector instead of a pair of std::unique_ptr for the data and a size_t for the size, and adds one extra UnpackInitializerData overload that works on internal initializers only (it returns an error when used on a tensor with external data)
    • The only disadvantage of using a vector is that the memory needs to be zero-initialized, which may cause a slight perf hit while creating a session
  • There will be one extra change to update all the EP code; one typical update is at onnxruntime/core/providers/shared/utils/utils.cc

@guoyu-wang guoyu-wang requested a review from a team as a code owner August 13, 2021 21:26
@@ -155,23 +155,23 @@ static Status GetExternalDataInfo(const ONNX_NAMESPACE::TensorProto& tensor_prot
// This function does not unpack string_data of an initializer tensor
static Status ReadExternalDataForTensor(const ONNX_NAMESPACE::TensorProto& tensor_proto,
const ORTCHAR_T* tensor_proto_dir,
std::unique_ptr<unsigned char[]>& unpacked_tensor,
SafeInt<size_t>& tensor_byte_size) {
std::vector<uint8_t>& unpacked_tensor) {
Contributor

is it possible to use std::byte?

Contributor Author


Unfortunately, it seems CUDA does not support std::byte yet; reverted back to uint8_t

LOGS(logger, ERROR) << "Error while unpack min tensor: " << status.ErrorMessage();
return false;
}
min = reinterpret_cast<float*>(unpacked_tensor.data())[0];
Contributor


is this vector<uint8_t>::data guaranteed to be suitably aligned for floats?

Contributor Author

@guoyu-wang guoyu-wang Aug 14, 2021


In theory the data in the vector will be aligned to std::max_align_t, which is usually 16, or maybe 8 on some 32-bit systems; this makes it enough for all the scalar types we support for now.
To make it more robust (so far I don't think we have a 16-byte data type yet, but just in case one is added in the future), we can change the unpacked_tensor.resize(tensor_byte_size); to something like
unpacked_tensor = std::vector<std::byte>(tensor_byte_size, custom_aligned_allocator);, can look into this later

Contributor


thanks - sounds like it should be fine for now then

// Onnx quantization uses uint8 [int8 not yet supported], need to cast to int32_t used by NNAPI
zero_point = static_cast<int32_t>(unpacked_tensor.get()[0]);
zero_point = static_cast<int32_t>(unpacked_tensor[0]);
Member

@yuslepukhin yuslepukhin Aug 17, 2021


unpacked_tensor[0]

Need to check that the data is not empty. Perhaps, this was the reason for crashes? #Resolved

Contributor Author


Yes, we should check the length of the buffer, but this is not the reason for the crashes

ORT_RETURN_IF_ERROR(GetExternalDataInfo(
tensor_proto,
tensor_proto_dir,
external_file_path,
file_offset,
tensor_byte_size));

unpacked_tensor.reset(new unsigned char[*&tensor_byte_size]);
unpacked_tensor.resize(tensor_byte_size);
ORT_RETURN_IF_ERROR(onnxruntime::Env::Default().ReadFileIntoBuffer(
Member

@yuslepukhin yuslepukhin Aug 17, 2021


ORT_RETURN_IF_ERROR(onnxruntime::Env::Default().ReadFileIntoBuffer(

Should we check for zero len? #Resolved

Contributor Author


Seems zero length is fine here; same in the posix version:

if (length == 0)
return Status::OK();

Member

@yuslepukhin yuslepukhin left a comment


:shipit:

@guoyu-wang guoyu-wang merged commit 3406b7b into master Aug 18, 2021
@guoyu-wang guoyu-wang deleted the gwang-msft/unpack_initializer_vector branch August 18, 2021 01:11