feat: TorchServe support #250
Conversation
Motivation

The Triton runtime can be used with model-mesh to serve PyTorch TorchScript models, but it does not support arbitrary PyTorch models, i.e. eager mode. KServe "classic" has an integration with TorchServe, but it would be good to have one for model-mesh too, so that these kinds of models can be used in distributed multi-model serving contexts.

Modifications

The bulk of the required changes are to the adapter image, covered by PR kserve/modelmesh-runtime-adapter#34. This PR contains the minimal controller changes needed to enable the support:
- TorchServe ServingRuntime spec
- Add "torchserve" to the list of supported built-in runtime types
- Add an "ID extraction" entry for TorchServe's gRPC Predictions RPC so that model-mesh will automatically extract the model name from corresponding request messages

Note the supported model format is advertised as "pytorch-mar" to distinguish it from the existing "pytorch" format, which refers to raw TorchScript .pt files as supported by Triton.

Result

TorchServe can be used seamlessly with ModelMesh Serving to serve PyTorch models, including eager mode.

Resolves #63

Signed-off-by: Nick Hill <[email protected]>
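To illustrate, a minimal sketch of how a model in the new format might be deployed with ModelMesh Serving: the InferenceService below advertises its model as "pytorch-mar" so it is routed to the TorchServe runtime. The metadata name, storage key, and path are placeholder assumptions, not values from this PR.

```yaml
apiVersion: serving.kserve.io/v1beta1
kind: InferenceService
metadata:
  name: pytorch-mar-example      # placeholder name
  annotations:
    serving.kserve.io/deploymentMode: ModelMesh
spec:
  predictor:
    model:
      modelFormat:
        name: pytorch-mar        # routes to the TorchServe ServingRuntime
      storage:
        key: localMinIO          # assumed storage-config secret key
        path: pytorch/mnist.mar  # assumed path to a .mar archive
```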
[APPROVALNOTIFIER] This PR is APPROVED. This pull-request has been approved by: njhill. The full list of commands accepted by this bot can be found here. The pull request process is described here.
This PR should be merged after kserve/modelmesh-runtime-adapter#34.
One follow-on task here would be to add a torchserve-based test to the FVTs.
Force-pushed from 65ba358 to a89172a.

Signed-off-by: Nick Hill <[email protected]>
Hi, thank you so much for your work on bringing TorchServe to ModelMesh. I am really interested in this feature :) Do you have any plans to continue working on this? Though it seems there isn't much work left to do since kserve/modelmesh-runtime-adapter#34 is merged. If you need any help, I would be glad to do some of the work too.
Thanks @kbumsik. This should now be finished and fully functional; I have tested it manually. The reason for the delay in merging the PR is that I also wanted to include an extension to our functional verification tests to exercise the TorchServe integration, but I'm pretty busy with other things, so I'm not sure how soon I'll get a chance. I'm OK with getting this merged and labeling the TorchServe support as "beta" in the meantime. I'll aim to get that done in the next couple of days.
Thank you for your quick and kind response. I will start testing on my own then 👍
I've opened a new issue to cover the FVT additions: #280 |
Tested it locally and looks good @njhill!
/lgtm
FYI @kbumsik this has now been merged.
#### Motivation

Support for TorchServe was added in #250 and kserve/modelmesh-runtime-adapter#34. A test should be added for it as well.

#### Modifications

- Adds a basic FVT for load/inference with a TorchServe MAR model using the native TorchServe gRPC API
- Disables the OVMS runtime and its tests, due to resource constraints, to allow TorchServe to be tested

#### Result

Closes #280

Signed-off-by: Rafael Vasquez <[email protected]>
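The distinction between the "pytorch" and "pytorch-mar" formats comes down to what artifact is produced from the model. A minimal sketch, using a made-up `TinyNet` module (not from this PR), of the two PyTorch artifacts involved:

```python
import torch
import torch.nn as nn

# Hypothetical toy model used only for illustration.
class TinyNet(nn.Module):
    def __init__(self):
        super().__init__()
        self.fc = nn.Linear(4, 2)

    def forward(self, x):
        return self.fc(x)

model = TinyNet().eval()

# "pytorch" format: a compiled TorchScript graph saved as a raw .pt file,
# which the Triton runtime can serve directly.
torch.jit.script(model).save("model.pt")

# Eager mode: only the weights are saved here; for TorchServe these are
# packaged together with handler code into a .mar archive (via the
# torch-model-archiver tool), which is what "pytorch-mar" refers to.
torch.save(model.state_dict(), "tinynet_weights.pth")
```

The eager-mode path is what the Triton-based "pytorch" format cannot cover, and what this TorchServe integration enables.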