GGUF: missing split.no metadata #604
Comments
@ngxson you can run the gguf js parser in any node/js env without needing hf frontend, as described in readme https://github.com/huggingface/huggingface.js/blob/main/packages%2Fgguf%2FREADME.md If you can produce a snippet to reproduce the bug or even better a fix for the gguf js parser, that would be great!
@julien-c The bug seems easy to fix: do not remove metadata with 0 as value :)
Cool! thanks @mishig25 for the explanations
Keeping the issue open until the UI fix is deployed (the fix is already merged)
The fix is deployed now. You can see results https://huggingface.co/ngxson/tinyllama_split_test/tree/main?show_tensors=stories15M-q8_0-00001-of-00003.gguf
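The kind of bug mentioned in the comments ("do not remove metadata with 0 as value") can be illustrated with a small sketch. This is not the actual parser or viewer code: `dropFalsy` and `dropMissing` are hypothetical helpers, and the metadata values are made up to resemble the first shard of a three-way split. It only shows how a truthiness-based filter silently drops a key whose value is 0, while an explicit null/undefined check keeps it.

```typescript
// Hypothetical metadata for the first shard of a 3-way split (values are illustrative).
type MetadataValue = string | number | boolean | null | undefined;
type Metadata = { [key: string]: MetadataValue };

const shardMetadata: Metadata = {
  "split.count": 3,
  "split.no": 0, // assumed zero-based index of the first shard
  "split.tensors.count": 57,
};

// Buggy variant: filtering on truthiness also removes legitimate zeros.
function dropFalsy(metadata: Metadata): Metadata {
  const kept: Metadata = {};
  for (const key of Object.keys(metadata)) {
    if (metadata[key]) {
      kept[key] = metadata[key];
    }
  }
  return kept;
}

// Safer variant: only remove entries that are genuinely absent.
function dropMissing(metadata: Metadata): Metadata {
  const kept: Metadata = {};
  for (const key of Object.keys(metadata)) {
    const value = metadata[key];
    if (value !== null && value !== undefined) {
      kept[key] = value;
    }
  }
  return kept;
}

console.log(Object.keys(dropFalsy(shardMetadata)));   // "split.no" is dropped
console.log(Object.keys(dropMissing(shardMetadata))); // "split.no" is kept
```

The same pitfall applies to empty strings and `false`, which is why an explicit check for missing values tends to be safer than a truthiness test when filtering metadata.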
Related to: ggerganov/llama.cpp#6343 (comment)
Explanation: We recently introduced the gguf-split tool to llama.cpp, which allows users to split a model into smaller shards. Each shard carries 3 metadata keys describing it:
- split.count: Total number of splits
- split.no: The number of the current split
- split.tensors.count: Total number of tensors of the original model (= sum of tensors of all splits)
However, split.no is missing when viewing a shard in the GGUF viewer on Hugging Face. It is still visible when inspecting the file using gguf-py.
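To make the relationship between the three keys concrete, here is a minimal sketch of a consistency check over a set of shards. The `Shard` type, `checkShards`, and the example values are hypothetical and not part of llama.cpp or the gguf js parser; the check also assumes split.no is zero-based, which is what the zero-value bug discussed in this thread suggests.

```typescript
// Metadata keys written by gguf-split, as listed above.
interface SplitMetadata {
  "split.count": number;
  "split.no": number;
  "split.tensors.count": number;
}

// Hypothetical shard description: its split metadata plus the tensors it holds.
interface Shard {
  metadata: SplitMetadata;
  tensorCount: number;
}

// Returns a list of problems; an empty list means the shards look consistent.
function checkShards(shards: Shard[]): string[] {
  if (shards.length === 0) return ["no shards given"];
  const problems: string[] = [];
  const expectedCount = shards[0].metadata["split.count"];
  const expectedTensors = shards[0].metadata["split.tensors.count"];
  if (shards.length !== expectedCount) {
    problems.push(`expected ${expectedCount} shards, got ${shards.length}`);
  }
  const seen: boolean[] = [];
  let tensorSum = 0;
  for (const shard of shards) {
    const no = shard.metadata["split.no"];
    // Assumption: split.no runs from 0 to split.count - 1.
    if (no < 0 || no >= expectedCount) problems.push(`split.no ${no} out of range`);
    else if (seen[no]) problems.push(`duplicate split.no ${no}`);
    else seen[no] = true;
    if (shard.metadata["split.count"] !== expectedCount) {
      problems.push(`shard ${no} disagrees on split.count`);
    }
    if (shard.metadata["split.tensors.count"] !== expectedTensors) {
      problems.push(`shard ${no} disagrees on split.tensors.count`);
    }
    tensorSum += shard.tensorCount;
  }
  // split.tensors.count should equal the sum of tensors over all shards.
  if (tensorSum !== expectedTensors) {
    problems.push(`tensors sum to ${tensorSum}, expected ${expectedTensors}`);
  }
  return problems;
}

// Illustrative values only: three shards holding 20 + 20 + 17 = 57 tensors.
const exampleShards: Shard[] = [20, 20, 17].map((tensorCount, index) => ({
  metadata: { "split.count": 3, "split.no": index, "split.tensors.count": 57 },
  tensorCount,
}));

console.log(checkShards(exampleShards));
```

A check like this only works if split.no survives parsing for every shard, including the one whose value is 0.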
This can be reproduced using a smaller model: https://huggingface.co/ngxson/tinyllama_split_test/tree/main?show_tensors=stories15M-q8_0-00001-of-00003.gguf
Here is the command that I used to split the model:
I'd be happy to help you guys with this. Feel free to let me know if you need more info.