docs: Update docs to make isvc focal #190
Conversation
@@ -11,9 +11,9 @@ The model data itself is pulled from one or more external [storage instances](pr
 ModelMesh Serving makes use of two core Kubernetes Custom Resource types:

 - `ServingRuntime` - Templates for Pods that can serve one or more particular model formats. There are three "built-in" runtimes that cover the out-of-the-box model types (Triton, MLServer, and OpenVINO Model Server (OVMS)); [custom runtimes](runtimes/) can be defined by creating additional ones.
-- [`Predictor`](predictors/) - This represents a logical endpoint for serving predictions using a particular model. The Predictor spec specifies the model type, the storage in which it resides, and the path to the model within that storage. The corresponding endpoint is "stable" and will seamlessly transition between different model versions or types when the spec is updated.
+- [`InferenceService`](predictors/) - This is the main interface KServe uses for managing models on Kubernetes. ModelMesh Serving can be used for deploying `InferenceService` predictors, which represent a logical endpoint for serving predictions using a particular model. The `InferenceService` predictor spec specifies the model format, the storage location in which the model resides, and other optional configuration. The corresponding endpoint is "stable" and will seamlessly transition between different model versions or types when the spec is updated. Note that many features like transformers, explainers, and canary rollouts do not currently apply or fully work using InferenceServices with `deploymentMode` set to `ModelMesh`. Also, `PodSpec` fields that are set in the `InferenceService` predictor spec will be ignored.
I am not sure the `InferenceService` link should point to the ModelMesh predictors docs. I understand ModelMesh implements only the predictor component of the KServe InferenceService spec, but it feels a little strange to follow the link and see nothing about InferenceService. I would suggest either removing the link or having it point to the KServe InferenceService docs.
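For readers following the change, here is a minimal sketch of the InferenceService-centric deployment this PR documents. The `serving.kserve.io/deploymentMode: ModelMesh` annotation is what tells KServe to hand the InferenceService to ModelMesh; the name, model format, storage key, and path below are hypothetical placeholders, and depending on the KServe version a `storageUri` field may be used instead of the `storage` block.

```yaml
apiVersion: serving.kserve.io/v1beta1
kind: InferenceService
metadata:
  name: example-sklearn-isvc # hypothetical name
  annotations:
    # Tells KServe to deploy this InferenceService via ModelMesh
    serving.kserve.io/deploymentMode: ModelMesh
spec:
  predictor:
    model:
      modelFormat:
        name: sklearn
      storage:
        key: localMinIO # hypothetical key into the storage-config secret
        path: sklearn/mnist-svm.joblib # hypothetical path within the bucket
```

Note that, per the diff above, any `PodSpec` fields set in the predictor spec would be ignored by ModelMesh.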
- `modelId` - The internal id of the model in question. This includes a hash of the InferenceService's predictor spec.
- `time` - The time at which the failure occurred, if applicable.

Upon creation, InferenceService the model status will always transition to `Loaded` state (unless the loading fails), but later if unused it is possible that they end up in a `Standby` state which means they are still available to serve requests but the first request could incur a loading delay. Whether this happens is a function of the available capacity and usage pattern of other models. It's possible that models will transition from `Standby` back to `Loaded` "by themselves" if more capacity becomes available.
Maybe just me, not quite sure:

- what "InferenceService the model status" means. I suppose it means the model status of the InferenceService?
- what "they" refers to in "...they end up in a `Standby` state which means they are still available to serve requests..." I suppose "they" means InferenceServices?
Yea, it's not quite clear here. I will reword.
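For context on the fields being discussed, a hypothetical status block for a model that failed to load might look like the sketch below. The field names mirror the Predictor-style status described in these docs (`activeModelState`, `lastFailureInfo` with `modelId` and `time`); the concrete values are invented for illustration.

```yaml
status:
  activeModelState: FailedToLoad
  targetModelState: ""
  transitionStatus: BlockedByFailedLoad
  lastFailureInfo:
    # modelId embeds a hash of the InferenceService's predictor spec
    modelId: example-sklearn-isvc__isvc-5f8a2c1d3e # hypothetical id
    reason: ModelLoadFailed # hypothetical reason value
    message: "Model could not be retrieved from storage" # hypothetical
    time: "2021-11-01T18:25:43Z" # when the failure occurred
```

A healthy but idle model would instead report `activeModelState: Standby` (or `Loaded` once traffic arrives), with no `lastFailureInfo` set.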
Thanks, @pvaneck! Looks very good, just a few minor comments.
Thanks, @chinhuang007, for the thorough review!
/lgtm
[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: chinhuang007, pvaneck

The full list of commands accepted by this bot can be found here. The pull request process is described here.

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing `/approve` in a comment.
Motivation
The InferenceService CRD has evolved enough to become the primary interface for interacting with ModelMesh.
The documentation should reflect that.
Modifications
Documentation was adjusted to make the InferenceService CRD focal as opposed to the previous Predictor CRD.
Examples and snippets were added for InferenceServices.
Result
Users will learn and become more familiar with deploying on ModelMesh using the KServe InferenceService.