
Fix code typo in llama-cli #8198

Merged 1 commit into ggerganov:master on Jun 28, 2024
Conversation

@ngxson (Collaborator) commented on Jun 28, 2024

Fix a small typo that breaks chat template support in llama-cli -cnv


@ngxson added the Review Complexity : Low label on Jun 28, 2024
@ngxson ngxson requested a review from slaren June 28, 2024 21:30
@slaren (Collaborator) commented on Jun 28, 2024

I don't think it is completely right yet; there are still extra new lines randomly added to the assistant messages. I suspect that at least one issue is that the chat template expects a new line after the <end_of_turn> of the assistant, but it is not being added.
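
For context, a Gemma-style prompt lays out turns roughly like this (an illustrative sketch; the exact whitespace comes from the model's own chat template):

```
<start_of_turn>user
say 1<end_of_turn>
<start_of_turn>model
1<end_of_turn>
<start_of_turn>user
say 2<end_of_turn>
<start_of_turn>model
```

In -cnv mode the model stops generating at <end_of_turn>, so any "\n" the template expects after it has to be inserted by the formatting code rather than by the model.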

@ngxson (Collaborator, Author) commented on Jun 28, 2024

Hmm, that's strange. My result is pretty consistent (with -t 0):

> say 1
1

> say 2
2

> say 3
3

> say 4
4

> say 5
5

> say 6
6

> 

@slaren (Collaborator) commented on Jun 28, 2024

That case is also fixed for me, but I still see many messages ending in double or triple new lines during random chat.

@ngxson (Collaborator, Author) commented on Jun 28, 2024

Probably there is something more specific to the Gemma template (or the model itself).

In any case, I'll merge this PR now and take a deeper look at Gemma later.

@ngxson merged commit 72272b8 into ggerganov:master on Jun 28, 2024
53 checks passed
@ngxson (Collaborator, Author) commented on Jun 28, 2024

> I don't think it is completely right yet; there are still extra new lines randomly added to the assistant messages. I suspect that at least one issue is that the chat template expects a new line after the <end_of_turn> of the assistant, but it is not being added.

I see what you mean. In this case, we must either patch the template behavior with add_ass, or patch llama_chat_format_single to be aware of the trailing new line. I'll see which way is better.
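
To illustrate where the new line gets lost, here is a minimal sketch (not the actual llama.cpp code) of the diff-based formatting that llama_chat_format_single conceptually performs, using a hypothetical Gemma-like format_chat() helper:

```cpp
#include <iostream>
#include <string>
#include <vector>

struct chat_msg { std::string role; std::string content; };

// Hypothetical Gemma-like template: note the "\n" after every <end_of_turn>.
static std::string format_chat(const std::vector<chat_msg> & msgs, bool add_ass) {
    std::string out;
    for (const auto & m : msgs) {
        out += "<start_of_turn>" + m.role + "\n" + m.content + "<end_of_turn>\n";
    }
    if (add_ass) {
        out += "<start_of_turn>model\n";
    }
    return out;
}

// Diff two full renderings to format only the newly added message.
static std::string format_single(std::vector<chat_msg> & past,
                                 const chat_msg & new_msg, bool add_ass) {
    const std::string before = format_chat(past, false);
    past.push_back(new_msg);
    const std::string after = format_chat(past, add_ass);
    return after.substr(before.size());
}

int main() {
    std::vector<chat_msg> past = { {"user", "say 1"}, {"model", "1"} };
    // `before` ends with "...1<end_of_turn>\n", so the delta below starts
    // *after* that "\n". But in -cnv mode the real context ends at the
    // model-generated <end_of_turn> with no "\n", so that new line is never
    // appended anywhere -- the mismatch described above.
    std::cout << format_single(past, {"user", "say 2"}, true);
    return 0;
}
```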

Nexesenex pushed a commit to Nexesenex/croco.cpp that referenced this pull request Jun 29, 2024
Nexesenex pushed a commit to Nexesenex/croco.cpp that referenced this pull request Jun 30, 2024
arthw pushed a commit to arthw/llama.cpp that referenced this pull request Jun 30, 2024
Nexesenex pushed a commit to Nexesenex/croco.cpp that referenced this pull request Jun 30, 2024
Nexesenex pushed a commit to Nexesenex/croco.cpp that referenced this pull request Jun 30, 2024
MagnusS0 pushed a commit to MagnusS0/llama.cpp-normistral-tokenizer that referenced this pull request Jul 1, 2024
Nexesenex pushed a commit to Nexesenex/croco.cpp that referenced this pull request Jul 1, 2024