
[Bug]: AttributeError in OpenAIServingChat when accessing chat_template when using ray serve #4296

Closed
BoussouarSari opened this issue Apr 23, 2024 · 4 comments
Labels
bug Something isn't working

Comments

@BoussouarSari

Your current environment

vLLM version: v0.4.1

🐛 Describe the bug

Description

I'm encountering an AttributeError in the OpenAIServingChat module when integrating Ray Serve with the OpenAI API. The error arises because the tokenizer object is accessed before it is fully initialized.

Error Message

The error occurs at the line where the chat template is read from the tokenizer:

AttributeError: 'NoneType' object has no attribute 'chat_template'

Issue Details

The tokenizer is instantiated asynchronously in the _post_init() function of the OpenAIServing class. However, this instantiation happens conditionally within the constructor, depending on whether an event loop is already running.

The _load_chat_template function, which relies on the tokenizer, is invoked synchronously in the constructor, potentially before the tokenizer has been initialized, as sketched below.
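
A minimal sketch of that ordering problem, using hypothetical names rather than the actual vLLM classes (OpenAIServingLike, _FakeTokenizer, and the asyncio.sleep delay are all illustrative assumptions):

    import asyncio

    class _FakeTokenizer:
        # Stand-in tokenizer exposing only the attribute that matters here.
        chat_template = "{{ messages }}"

    class OpenAIServingLike:
        # Hypothetical stand-in for OpenAIServing, just to show the race.
        def __init__(self):
            self.tokenizer = None
            try:
                # Under Ray Serve an event loop is already running, so the
                # async init is only scheduled here, not awaited.
                loop = asyncio.get_running_loop()
                loop.create_task(self._post_init())
            except RuntimeError:
                # No running loop: block until init completes.
                asyncio.run(self._post_init())
            # In the Ray Serve case this runs before _post_init() has set
            # self.tokenizer, so .chat_template is read from None.
            self._load_chat_template()

        async def _post_init(self):
            await asyncio.sleep(0)  # stand-in for building the tokenizer
            self.tokenizer = _FakeTokenizer()

        def _load_chat_template(self):
            # AttributeError: 'NoneType' object has no attribute 'chat_template'
            print(self.tokenizer.chat_template)

    async def main():
        # Constructing the object inside a running loop reproduces the error.
        OpenAIServingLike()

    asyncio.run(main())

Constructed outside a running loop, the asyncio.run branch finishes _post_init first and the same call succeeds; constructed inside an already-running loop (as under Ray Serve), it raises the AttributeError above.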

Suggested Solution

Convert _load_chat_template to an asynchronous function and invoke it the same way as _post_init, so that it only runs after the tokenizer has been initialized. This keeps the initialization order intact and avoids the premature access.
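
A hedged sketch of that ordering, continuing the hypothetical class above rather than the actual patch that later landed: _load_chat_template becomes a coroutine awaited at the end of _post_init, and the direct call in the constructor is dropped.

        async def _post_init(self):
            await asyncio.sleep(0)  # stand-in for building the tokenizer
            self.tokenizer = _FakeTokenizer()
            # Only runs once the tokenizer is assigned, regardless of whether
            # _post_init was awaited directly or scheduled on a running loop.
            await self._load_chat_template()

        async def _load_chat_template(self):
            print(self.tokenizer.chat_template)  # tokenizer is guaranteed here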

@schoennenbeck
Contributor

See this PR: #2727

@DarkLight1337
Member

Fixed by #2727

@Iven2132

Hi @DarkLight1337, it hasn't been fixed yet; I still hit the same error with this code:

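    # NOTE: snippet from inside a larger deployment function; MODEL_NAME and
    # N_GPU are assumed to be defined elsewhere in the script.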
    import fastapi
    import vllm.entrypoints.openai.api_server as api_server
    from vllm.engine.arg_utils import AsyncEngineArgs
    from vllm.engine.async_llm_engine import AsyncLLMEngine
    from vllm.entrypoints.logger import RequestLogger
    from vllm.entrypoints.openai.serving_chat import OpenAIServingChat
    from vllm.entrypoints.openai.serving_completion import (
        OpenAIServingCompletion,
    )
    from vllm.entrypoints.openai.serving_engine import BaseModelPath
    from vllm.usage.usage_lib import UsageContext

    # create a fastAPI app that uses vLLM's OpenAI-compatible router
    web_app = fastapi.FastAPI(
        title=f"OpenAI-compatible {MODEL_NAME} server",
        description="Run an OpenAI-compatible LLM server with vLLM on modal.com 🚀",
        version="0.0.1",
        docs_url="/docs",
    )

    router = fastapi.APIRouter()

    # wrap vllm's router in auth router
    router.include_router(api_server.router)
    # add authed vllm to our fastAPI app
    web_app.include_router(router)

    engine_args = AsyncEngineArgs(
        model=MODEL_NAME,
        tensor_parallel_size=N_GPU,
        gpu_memory_utilization=0.90,
        max_model_len=8096,
        enforce_eager=False,  # capture the graph for faster inference, but slower cold starts (30s > 20s)
    )

    engine = AsyncLLMEngine.from_engine_args(
        engine_args, usage_context=UsageContext.OPENAI_API_SERVER
    )

    model_config = engine.get_model_config()

    request_logger = RequestLogger(max_log_len=2048)

    base_model_paths = [
        BaseModelPath(name=MODEL_NAME.split("/")[1], model_path=MODEL_NAME)
    ]

    api_server.chat = lambda s: OpenAIServingChat(
        engine,
        model_config=model_config,
        base_model_paths=base_model_paths,
        chat_template=None,
        response_role="assistant",
        lora_modules=[],
        prompt_adapters=[],
        request_logger=request_logger,
    )
    api_server.completion = lambda s: OpenAIServingCompletion(
        engine,
        model_config=model_config,
        base_model_paths=base_model_paths,
        lora_modules=[],
        prompt_adapters=[],
        request_logger=request_logger,
    )

    return web_app

@DarkLight1337
Member

Can you open a new issue and provide more detailed information?
