-
Notifications
You must be signed in to change notification settings - Fork 358
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
cuda+cpu mode panic at unwrap #1114
Comments
meet problem too |
@grinapo @Sherlock-Holo I merged some fixes, can you please try this again after |
I try commit c9ac321 when I ask
it answer strange words
I have to Ctrl+C to stop it |
@Sherlock-Holo @grinapo this error should be somewhat fixed in #1137 (comment). I would recommend specifying a chat template explicitly, as documented here. |
You probably meant rather this about templates. |
commit 8d89c14 (HEAD -> master, origin/master, origin/HEAD)
|
I do not say that I am familar with the system, so apologies if stating the obvious, but the exact same command line works with 1.5B (which fits into GPU) and fails with 7B (which doesn't, see above). Their chat templates (or actually whole |
Describe the bug
Since I got out of memory all the time (GPU memory full after a few prompts, I guess), I tried '-n 16'. The result is not pretty, regardless on what number I choose (between 0-27, for this model)
Latest commit or version
$ ./mistralrs-server.gpu --version
mistralrs-server 0.4.0
(pulled today morning from git)
The text was updated successfully, but these errors were encountered: