Fix location planning for initializers used only in nested subgraphs #8642

hariharans29 · 2021-08-06T18:38:30Z

Description: The current logic for planning the location of initializers at a given graph level only walks the nodes at that level. This is incomplete, because an initializer can also be used in any of the subgraphs nested under that level. We need to walk all the nested subgraphs to gather the information needed to decide which device each initializer should reside on.

If an initializer is used only in a nested subgraph (not at the graph level where it is introduced) and is consumed there by a node assigned to a non-CPU provider (e.g. CUDA), the current logic keeps it on CPU. The control flow nodes' implementation then copies it from CPU to the device at runtime. The impact is much worse for Loop nodes, because the copy is repeated on every iteration.

This change adds logic to walk all the nested subgraphs while processing the initializers at a given graph level and to place them statically on the right device, saving all the runtime copies.
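
The idea can be illustrated with a small, self-contained sketch. It is not the onnxruntime implementation: `Device`, `Node`, `Graph`, and `PlanInitializerLocation` are hypothetical stand-ins invented for this example. The policy that the first consumer found decides the device is an assumption, and the sketch ignores the possibility of a subgraph shadowing an outer name.

```cpp
#include <memory>
#include <string>
#include <vector>

// Hypothetical stand-ins for illustration only; these are not onnxruntime types.
enum class Device { CPU, GPU };

struct Graph;  // forward declaration: nodes may own nested subgraphs

struct Node {
  Device device;                                  // device of the provider the node is assigned to
  std::vector<std::string> inputs;                // names this node consumes
  std::vector<std::shared_ptr<Graph>> subgraphs;  // e.g. a Loop body or If branches
};

struct Graph {
  std::vector<Node> nodes;
};

// Records the device of every node that consumes `name`. When `walk_subgraphs`
// is false, only the nodes at this graph level are inspected.
void CollectConsumerDevices(const Graph& graph, const std::string& name,
                            bool walk_subgraphs, std::vector<Device>& out) {
  for (const Node& node : graph.nodes) {
    for (const std::string& input : node.inputs) {
      if (input == name) out.push_back(node.device);
    }
    if (!walk_subgraphs) continue;
    for (const auto& subgraph : node.subgraphs) {
      if (subgraph) CollectConsumerDevices(*subgraph, name, walk_subgraphs, out);
    }
  }
}

// Assumed policy: the first consumer found decides the location; an initializer
// with no consumer stays on CPU.
Device PlanInitializerLocation(const Graph& graph, const std::string& name,
                               bool walk_subgraphs) {
  std::vector<Device> devices;
  CollectConsumerDevices(graph, name, walk_subgraphs, devices);
  return devices.empty() ? Device::CPU : devices.front();
}
```

With a main graph whose only node is a CPU control flow node, and a nested body in which a GPU node consumes `init_data`, walking only the top level leaves `init_data` on CPU, while the recursive walk places it on the GPU.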

Motivation and Context
~50% gains for a 1P model. Should also benefit other models having a similar structure.

onnxruntime/core/framework/session_state.cc

onnxruntime/core/framework/allocation_planner.cc

onnxruntime/core/framework/session_state.h

onnxruntime/core/framework/allocation_planner.cc

pranavsharma · 2021-08-27T21:35:15Z

onnxruntime/core/framework/allocation_planner.cc

+          continue;
+        }
+
+        ORT_TRY {


Can we surround GeneratePlanForWeightsHelper in a try/catch at the point of invocation inside GeneratePlanForWeights instead of doing it in every recursive call?
Secondly, it is not clear why we even need a try/catch.

skottmckay · Aug 31, 2021

Agree. The top-level exception handler will catch an exception and return a status, so unless you're going to do something more than that there's no point catching anything here.

hariharans29 · Aug 31, 2021

Surrounded the call to GeneratePlanForWeightsHelper() in a try/catch. Is there another top-level exception catcher that would make even this unnecessary?

skottmckay · Sep 1, 2021

All of InferenceSession::Initialize is in a try/catch.

hariharans29 · Sep 1, 2021

Removed the superfluous try/catch.
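
The point made in this thread can be sketched as follows, under stated assumptions: `Status`, `VisitLevels`, and `Initialize` are hypothetical names for this example, and plain C++ exceptions are used rather than the `ORT_TRY` macro seen in the diff. The recursive helper throws without catching anything at each level, and a single handler at the outermost entry point converts any exception into a status.

```cpp
#include <stdexcept>
#include <string>

// Hypothetical status type for illustration; not the onnxruntime Status class.
struct Status {
  bool ok;
  std::string message;
};

// Recursive helper with no try/catch of its own: a failure at any depth
// simply propagates up through every recursive frame.
void VisitLevels(int depth, int fail_at) {
  if (depth == fail_at) {
    throw std::runtime_error("failure at depth " + std::to_string(depth));
  }
  if (depth > 0) VisitLevels(depth - 1, fail_at);
}

// Single handler at the outermost entry point turns any exception into a
// Status, so neither the helper nor its direct caller needs a handler.
Status Initialize(int depth, int fail_at) {
  try {
    VisitLevels(depth, fail_at);
    return {true, ""};
  } catch (const std::exception& ex) {
    return {false, ex.what()};
  }
}
```

A handler inside `VisitLevels` would only be useful if it did something beyond what the outer handler already does, such as adding context to the error.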

hariharans29 · 2021-08-31T03:13:49Z

onnxruntime/test/framework/allocation_planner_test.cc

+  OrtValueIndex init_data_index;
+  main_graph_ort_value_index_map.GetIdx("init_data", init_data_index);
+
+  EXPECT_EQ(main_graph_plan->allocation_plan[init_data_index].location.device.Type(), OrtDevice::GPU);


Previously this would have been "planned" to CPU

hariharans29 added 3 commits August 6, 2021 09:52

Initial commit

4daa167

Merge remote-tracking branch 'origin/master' into hari/fixAllocationP…

8a2ecd0

…lanForInitializers

updates

3940b6b

hariharans29 requested a review from a team as a code owner August 6, 2021 18:38

hariharans29 added 2 commits August 6, 2021 12:17

Fixes

24ec939

Fixes

f72bd50

hariharans29 changed the title ~~Fix location planning for initializers~~ Fix location planning for initializers used in nested subgraphs Aug 11, 2021

hariharans29 added 2 commits August 24, 2021 17:24

Merge branch 'master' into hari/fixAllocationPlanForInitializers

e9f7c86

Fix build

6833b19

skottmckay reviewed Aug 25, 2021

onnxruntime/core/framework/session_state.cc (outdated, resolved)

skottmckay reviewed Aug 25, 2021

onnxruntime/core/framework/allocation_planner.cc (outdated, resolved)

skottmckay reviewed Aug 25, 2021

onnxruntime/core/framework/allocation_planner.cc (outdated, resolved)

Merge remote-tracking branch 'origin/master' into hari/fixAllocationP…

d3ffd95

…lanForInitializers

hariharans29 changed the title ~~Fix location planning for initializers used in nested subgraphs~~ Fix location planning for initializers used only in nested subgraphs Aug 27, 2021

hariharans29 added 2 commits August 26, 2021 20:41

Tes and PR review comments

3e97333

Add comment

a1ebe82

pranavsharma reviewed Aug 27, 2021

hariharans29 commented Aug 31, 2021

hariharans29 added 3 commits August 30, 2021 22:13

PR comments

3c3d4e9

PR updates

15045db

Increase binary size threshold for minimal build

1a92beb

hariharans29 mentioned this pull request Aug 31, 2021

GPT-Neo: Torch CUDA 2x faster than ONNX CUDA #7238

Open

pranavsharma previously approved these changes Aug 31, 2021

PR comment

83d97f8

hariharans29 dismissed pranavsharma’s stale review via 83d97f8 September 1, 2021 02:23

skottmckay approved these changes Sep 1, 2021

pranavsharma approved these changes Sep 1, 2021

hariharans29 merged commit acd9db7 into master Sep 1, 2021

hariharans29 deleted the hari/fixAllocationPlanForInitializers branch September 1, 2021 07:02
