Fix calls to defunct AsyncEngineClient #138

NickLucche · 2024-09-23T12:40:42Z

Address breaking changes in upstream main (v0.6.2?): vllm-project/vllm#8673 and vllm-project/vllm#8157. The latter is yet another bulky change to the RPC client-server interaction. I couldn't spot any differences in functionality and tests pass on our side, but I could really use some help double-checking that what's been implemented upstream is still fully compatible with how it is used here.

I branched off #136, so we should be merging that PR first.

codecov-commenter · 2024-09-23T12:48:14Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 57.26%. Comparing base (10dd1d9) to head (f929661).

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #138      +/-   ##
==========================================
+ Coverage   56.80%   57.26%   +0.45%     
==========================================
  Files          25       25              
  Lines        1542     1542              
  Branches      256      256              
==========================================
+ Hits          876      883       +7     
+ Misses        588      581       -7     
  Partials       78       78


dtrifiro · 2024-09-23T15:21:59Z

src/vllm_tgis_adapter/grpc/grpc_server.py

@@ -262,7 +262,7 @@ async def Generate(
                log_tracing_disabled_warning()
            generators.append(
                self.engine.generate(
-                    inputs=inputs,
+                    inputs,


I'd rather not remove this. I prefer explicitly specifying the argument names, as this has caused issues in the past.

Can I rename inputs to prompt and assume >v0.6.1.post2, or do I have to stay backward compatible?

Ah, sad this wasn't done in a non-breaking way in vllm: https://github.com/vllm-project/vllm/pull/8673/files#diff-e8f334a55697b72398acbbed4d280bcb3e5eb6c2e66fc81ccde477818b4b0351R38. Maybe we can simply comment here that the kwarg name is removed due to this change, and then in later releases add in prompt=?

Might also be worth changing the name of the inputs variable here to prompt
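If both the explicit keyword and backward compatibility are wanted, one option is to pick the keyword name by inspecting the signature of generate at runtime. This is only a minimal sketch, assuming the first argument is called either `inputs` or `prompt` depending on the installed version; `prompt_kwarg_name`, `call_generate` and the two stand-in functions are hypothetical and not part of vLLM or of this adapter:

```python
import inspect
from typing import Callable


def prompt_kwarg_name(generate: Callable) -> str:
    """Return whichever of 'prompt' / 'inputs' the callable accepts."""
    params = inspect.signature(generate).parameters
    for name in ("prompt", "inputs"):
        if name in params:
            return name
    raise TypeError("generate() accepts neither 'prompt' nor 'inputs'")


# Stand-ins for the old and new signatures (illustrative only).
def generate_old(inputs, sampling_params, request_id):
    return ("old", inputs)


def generate_new(prompt, sampling_params, request_id):
    return ("new", prompt)


def call_generate(generate: Callable, text: str):
    # Build the keyword dynamically so the call stays explicit on both versions.
    kwargs = {prompt_kwarg_name(generate): text}
    return generate(sampling_params=None, request_id="r1", **kwargs)
```

The lookup could be done once at startup rather than per request, since the signature does not change while the process is running.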

We are reverting this upstream for now: vllm-project/vllm#8750.

I guess it could've been done in a less abrupt way upstream, by warning about the deprecation of inputs while accepting both inputs and prompt for some time.
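A gentler rename along those lines could look roughly like the following. This is a sketch of the general deprecation pattern only, not what vLLM actually does; the simplified `generate` signature and its return value are assumptions made for illustration:

```python
import warnings


def generate(prompt=None, sampling_params=None, request_id=None, *, inputs=None):
    """Accept the new 'prompt' name and tolerate the old 'inputs' name for a while."""
    if inputs is not None:
        if prompt is not None:
            raise TypeError("pass either 'prompt' or 'inputs', not both")
        warnings.warn(
            "'inputs' is deprecated, use 'prompt' instead",
            DeprecationWarning,
            stacklevel=2,  # point the warning at the caller, not at this function
        )
        prompt = inputs
    if prompt is None:
        raise TypeError("missing required argument: 'prompt'")
    return prompt  # placeholder for the real work
```

Callers using `inputs=` keep working and see a warning, while callers that already moved to `prompt=` (or pass the value positionally) are unaffected.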

dtrifiro

Looks good, one minor nit

njhill · 2024-09-23T17:40:24Z

src/vllm_tgis_adapter/grpc/grpc_server.py

@@ -177,12 +177,12 @@ class TextGenerationService(generation_pb2_grpc.GenerationServiceServicer):

    def __init__(
        self,
-        engine: AsyncEngineClient | AsyncLLMEngine,
+        engine: EngineClient | AsyncLLMEngine,


I'm not sure the union is needed here. EngineClient is a Protocol and AsyncLLMEngine implements that protocol, so only the former should be needed, I think?
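To illustrate the point about protocols: with `typing.Protocol`, any class whose methods match is accepted by a type checker wherever the protocol is annotated, so a union with the concrete class adds nothing. A minimal sketch with hypothetical stand-in classes (`EngineLike`, `ConcreteEngine`, `Service`), not the real vLLM types:

```python
from typing import Protocol


class EngineLike(Protocol):
    def generate(self, prompt: str) -> str: ...


class ConcreteEngine:
    # No inheritance from EngineLike: a matching method signature is enough.
    def generate(self, prompt: str) -> str:
        return prompt.upper()


class Service:
    # Annotating with the protocol alone already covers ConcreteEngine.
    def __init__(self, engine: EngineLike) -> None:
        self.engine = engine


service = Service(ConcreteEngine())
result = service.engine.generate("hi")
```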

njhill · 2024-09-23T17:42:01Z

src/vllm_tgis_adapter/grpc/grpc_server.py

@@ -65,7 +65,7 @@
    from transformers import PreTrainedTokenizer
    from vllm import CompletionOutput, RequestOutput
    from vllm.config import ModelConfig
-    from vllm.engine.protocol import AsyncEngineClient
+    from vllm.engine.protocol import EngineClient


Since this was renamed in the latest changes, won't this break usage of the adapter with existing releases (which I thought @dtrifiro tries to maintain)?

Suggested change

-    from vllm.engine.protocol import EngineClient
+    try:
+        from vllm.engine.protocol import EngineClient
+    except ImportError:
+        from vllm.engine.protocol import AsyncEngineClient as EngineClient

Maybe this is simple to alias like so?

yep my bad, good one!

joerunde · 2024-09-24T15:00:57Z

Thanks @NickLucche!

NickLucche requested review from dtrifiro and joerunde September 23, 2024 12:45

fix inputs->prompt arg rename in generate; replace defunct AsyncEngineClient type with interface

08652e5

dtrifiro force-pushed the asyncengineclient-replacement-fix branch from 1525dd3 to 08652e5 Compare September 23, 2024 15:20

dtrifiro reviewed Sep 23, 2024

dtrifiro self-requested a review September 23, 2024 15:22

dtrifiro approved these changes Sep 23, 2024

dtrifiro mentioned this pull request Sep 23, 2024

grpc_server: use x-correlation-id as request-id when possible #128

Merged

njhill reviewed Sep 23, 2024

NickLucche and others added 3 commits September 24, 2024 11:49

reverted prompts->inputs in light of upstreams step back

1dbf052

backward compatible protocol import and reduntant union type annotation removal

73a7cf0

Merge branch 'main' into asyncengineclient-replacement-fix

f929661

joerunde merged commit 5852586 into main Sep 24, 2024
3 checks passed

joerunde deleted the asyncengineclient-replacement-fix branch September 24, 2024 14:55
