Added new flag for GPU peer access API control #7261
Conversation
src/command_line_parser.cc
Outdated
@@ -373,7 +373,8 @@ enum TritonOptionId {
   OPTION_BACKEND_CONFIG,
   OPTION_HOST_POLICY,
   OPTION_MODEL_LOAD_GPU_LIMIT,
-  OPTION_MODEL_NAMESPACING
+  OPTION_MODEL_NAMESPACING,
+  OPTION_PEER_ACCESS
Question: do we gate options based on the enable-GPU compile flag?
Looks like we don't gate for GPU right now, although we do for other build options (e.g., tracing, HTTP).
@@ -777,6 +777,7 @@ SERVER_ARGS="--allow-sagemaker=true --model-control-mode=explicit \
     --load-model=simple --load-model=ensemble_add_sub_int32_int32_int32 \
     --load-model=repeat_int32 \
     --load-model=input_all_required \
+    --load-model=dynamic_batch \
Were we missing this model before? Is it correctly added?
Yes, this is a breakage caused by one of my changes in L0_trace.
I tested the pipeline; this change fixes the currently failing L0_trace.
Co-authored-by: Iman Tabrizian <[email protected]>
Added a new flag, "--enable-peer-access", at Triton startup to control the creation of the CUDA context at server startup.
Jira: https://jirasw.nvidia.com/browse/DLIS-6705
Core: triton-inference-server/core#361