Performance decreased severely #331

TheOnlyWiseJEDI · 2023-11-06T19:18:04Z

Using: text-generastion-webui
Model: ehartford_dolphin-2.2-mistral-7b - not quantized
Installed: using pip install -r requirements
OS: Windows 10
Shell: cmd.exe
Python: Python 3.11.3
pymemgpt installed: yes
Hardware: 12th Gen i7 64GB/RAM, RTX3060 12GB RAM
LLM and MemGPT on same system

I have recently reinstalled memgpt so all is fresh and up to date. I have NOT altered any settings in text-generation-webui. This problem existed before and after a reinstall.

There is a significant performance decrease since a recent update. 'reporter' said he thought it might be the grammar update and that he noticed it also. What used to be a processing time of 4 - 12 seconds has become more often than not, 20+ seconds. Even as much as 39 seconds.

With the model loaded and memgpt running these results are consistent through the CLI however if I flip to the webui interface and pose the same question the performance time is more than 1/2 faster than when done through memgpt. This was not true last week (presumably prior to whatever update changed things).

cpacker · 2023-11-06T21:05:14Z

@TheOnlyWiseJEDI can you try disabling grammar by forcing a specific wrapper?

If you pass --model airoboros-l2-70b-2.1, it shouldn't use grammar, so things should be as they were before (unless there's something I'm missing).

(If you don't pass any model, the default will be grammar-based)

TheOnlyWiseJEDI · 2023-11-06T21:10:56Z

It is no longer possible to use --model when using memgpt with 'run' ![image](https://github.com/cpacker/MemGPT/assets/123831682/5c338994-1d5d-415c-a442-ac2d90951dc2) ![image](https://github.com/cpacker/MemGPT/assets/123831682/6ea23a2f-604f-4cab-a333-b1a66fbfc6ee)

TheOnlyWiseJEDI · 2023-11-06T21:12:14Z

I tried to include model when creating a new agent - same result

…

On Mon, Nov 6, 2023 at 4:05 PM Charles Packer ***@***.***> wrote: @TheOnlyWiseJEDI <https://github.com/TheOnlyWiseJEDI> can you try disabling grammar by forcing a specific wrapper? If you pass --model airoboros-l2-70b-2.1, it shouldn't use grammar, so things should be as they were before (unless there's something I'm missing). — Reply to this email directly, view it on GitHub <#331 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/A5QYLAREUMC7ENB34FY2BMLYDFGJJAVCNFSM6AAAAAA7AAMNI2VHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMYTOOJWGQ2DEMJYGQ> . You are receiving this because you were mentioned.Message ID: ***@***.***>

TheOnlyWiseJEDI · 2023-11-06T21:22:06Z

it looked like the images didn't get included when I replied from email, not sure if you could see them, I edited and reapplied the images, they should be available now

cpacker · 2023-11-06T22:06:36Z

It is no longer possible to use --model when using memgpt with 'run'

@vivi @sarahwooders can we do a hotpatch on this?

vivi · 2023-11-06T22:09:09Z

@TheOnlyWiseJEDI For now you can run memgpt run --model airoboros-l2-70b-2.1 and hit 'N' when asked "Would you like to select an existing agent?"

Hmm -- just saw your comment that you tried to do that. Looking into this now.

vivi · 2023-11-06T22:26:37Z

@TheOnlyWiseJEDI I can't seem to reproduce your bug on 0.1.18a1. Could you run pip show pymemgpt and if the version is <0.1.18a1, run pip install pymemgpt -U --pre?

I do think that you should be able to reload the same agent with the same wrapper so I have opened #338.

TheOnlyWiseJEDI · 2023-11-07T00:06:42Z

ok done. I had 0.1.17, now updated. I did this early this morning, was there another release today? What should be fixed now in this update?

…

On Mon, Nov 6, 2023 at 5:26 PM Vivian Fang ***@***.***> wrote: @TheOnlyWiseJEDI <https://github.com/TheOnlyWiseJEDI> I can't seem to reproduce your bug on 0.1.18a1. Could you run pip show pymemgpt and if the version is <0.1.18a1, run pip install pymemgpt -U --pre? — Reply to this email directly, view it on GitHub <#331 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/A5QYLAREBJW7GIY5FIEJ3JDYDFP2RAVCNFSM6AAAAAA7AAMNI2VHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMYTOOJWHA4TCNJWGA> . You are receiving this because you were mentioned.Message ID: ***@***.***>

sarahwooders · 2023-11-07T01:29:49Z

If you run pip install pymemgpt -U --pre, you should be able to have the --model flag without an error, as long as it matches the model of the agent you are resuming. You don't need to specify any additional flags if you're resuming an existing agent.

TheOnlyWiseJEDI · 2023-11-07T01:32:20Z

I can't run memgpt at all now, even with just "memgpt run" - take a look at the other ticket you created

…

On Mon, Nov 6, 2023 at 8:30 PM Sarah Wooders ***@***.***> wrote: If you run pip install pymemgpt -U --pre, you should be able to have the --model flag without an error, as long as it matches the model of the agent you are resuming. You don't need to specify any additional flags if you're resuming an existing agent. — Reply to this email directly, view it on GitHub <#331 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/A5QYLATL3HYQY4DUDFZV3BTYDGFJPAVCNFSM6AAAAAA7AAMNI2VHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMYTOOJXGEZDMNZUGI> . You are receiving this because you were mentioned.Message ID: ***@***.***>

cpacker · 2023-11-07T06:20:58Z

Closing this particular issue because it should be resolved in the update that just went live (0.1.18), but @TheOnlyWiseJEDI feel free to reopen if it's still happening or open a new issue if something else is wrong.

cpacker added the local-llm label Nov 6, 2023

cpacker assigned sarahwooders and vivi Nov 6, 2023

cpacker closed this as completed Nov 7, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Performance decreased severely #331

Performance decreased severely #331

TheOnlyWiseJEDI commented Nov 6, 2023

cpacker commented Nov 6, 2023 •

edited

Loading

TheOnlyWiseJEDI commented Nov 6, 2023 via email •

edited

Loading

TheOnlyWiseJEDI commented Nov 6, 2023 via email

TheOnlyWiseJEDI commented Nov 6, 2023

cpacker commented Nov 6, 2023

vivi commented Nov 6, 2023 •

edited

Loading

vivi commented Nov 6, 2023 •

edited

Loading

TheOnlyWiseJEDI commented Nov 7, 2023 via email

sarahwooders commented Nov 7, 2023

TheOnlyWiseJEDI commented Nov 7, 2023 via email

cpacker commented Nov 7, 2023

Performance decreased severely #331

Performance decreased severely #331

Comments

TheOnlyWiseJEDI commented Nov 6, 2023

cpacker commented Nov 6, 2023 • edited Loading

TheOnlyWiseJEDI commented Nov 6, 2023 via email • edited Loading

TheOnlyWiseJEDI commented Nov 6, 2023 via email

TheOnlyWiseJEDI commented Nov 6, 2023

cpacker commented Nov 6, 2023

vivi commented Nov 6, 2023 • edited Loading

vivi commented Nov 6, 2023 • edited Loading

TheOnlyWiseJEDI commented Nov 7, 2023 via email

sarahwooders commented Nov 7, 2023

TheOnlyWiseJEDI commented Nov 7, 2023 via email

cpacker commented Nov 7, 2023

cpacker commented Nov 6, 2023 •

edited

Loading

TheOnlyWiseJEDI commented Nov 6, 2023 via email •

edited

Loading

vivi commented Nov 6, 2023 •

edited

Loading

vivi commented Nov 6, 2023 •

edited

Loading