GPT-2 segfaults when used through the CLI #167
Comments
I wonder if this has to do w/ loading through the snapshot.
I am not able to reproduce this problem
How weird... is that q4 or f16?
q4? I'm not sure honestly 😅 I think I'm testing w/ this model that appears to have been taken down 🤷🏻 https://huggingface.co/mongolian-basket-weaving/cerebras-gpt-13b-ggml-q4_0
Ok, just tested with https://huggingface.co/xzuyn/GPT-2-124M-ggml-q4_1/blob/main/ggml-model-q4_1.bin on macOS:
Aha - I think you've figured it out... Running with Or maybe not.
Quick findings with a debugger:
This is definitely something we should investigate and fix, but not a showstopper for now, I think.
Fairly certain this fixes rustformers#167
Trying any GPT-2 GGML model through the CLI appears to cause an immediate segfault:
This appears to be true regardless of the model (Cerebras and base GPT-2 seem to both suffer from this).
This doesn't happen when run through the GPT-2 example.