This change is related to these upstream PRs:
- ggml-org/llama.cpp#11195 allows using a tag-based repo name, as on ollama
- ggml-org/llama.cpp#11214 automatically turns on `--conversation` mode for models that have a chat template
Example:
```sh
# for an "instruct" model, conversation mode is enabled automatically
llama-cli -hf bartowski/Llama-3.2-1B-Instruct-GGUF
# for a non-instruct model, it runs as completion
llama-cli -hf TheBloke/Llama-2-7B-GGUF -p "Once upon a time,"
```
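To illustrate what a tag-based repo name looks like, here is a minimal sketch that splits a `user/model:tag` reference into its repo and tag parts. `split_hf_ref` is a hypothetical helper written for this note, not llama.cpp's actual parsing code, and the `Q8_0` tag is only an assumed example value.

```sh
# Hypothetical helper (not part of llama.cpp): split "user/model:tag"
# into a repo part and an optional tag part.
split_hf_ref() {
  ref=$1
  case "$ref" in
    *:*)
      repo=${ref%%:*}   # everything before the first ':'
      tag=${ref#*:}     # everything after the first ':'
      ;;
    *)
      repo=$ref
      tag=              # no tag given
      ;;
  esac
  # Print "<repo> <tag>", using "<none>" when no tag was supplied
  printf '%s %s\n' "$repo" "${tag:-<none>}"
}

split_hf_ref "bartowski/Llama-3.2-1B-Instruct-GGUF:Q8_0"
split_hf_ref "TheBloke/Llama-2-7B-GGUF"
```

How a missing tag is resolved is left to the upstream implementation; this sketch only reports that none was given.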