Add assertion informing the API user about missing llama_encode() call #8400

fairydreaming · 2024-07-09T20:17:42Z

Using encoder-decoder models like T5 without calling llama_encode() first currently results in a cryptic error message:

GGML_ASSERT: ggml/src/ggml.c:5278: !ggml_is_transposed(a)

that is already causing confusion: #8398.

This PR adds an assertion to build_t5() that informs the API user about the necessity of calling llama_encode() first if there are no encoder outputs present during llama_decode(). With this PR the error message is:

GGML_ASSERT: src/llama.cpp:13203: n_outputs_enc > 0 && "call llama_encode() first"

I have read the contributing guidelines
Self-reported review complexity:
- Low
- Medium
- High

Co-authored-by: Stanisław Szymczyk <[email protected]>

llama : add assertion informing about missing llama_encode() call

f4c3b96

fairydreaming mentioned this pull request Jul 9, 2024

Bug: ggml.c:5278: !ggml_is_transposed(a) #8398

Closed

ggerganov approved these changes Jul 10, 2024

View reviewed changes

ggerganov merged commit a8be1e6 into ggerganov:master Jul 10, 2024
53 checks passed

Nexesenex pushed a commit to Nexesenex/croco.cpp that referenced this pull request Jul 11, 2024

llama : add assert about missing llama_encode() call (ggerganov#8400)

c670d06

Co-authored-by: Stanisław Szymczyk <[email protected]>

arthw pushed a commit to arthw/llama.cpp that referenced this pull request Jul 13, 2024

llama : add assert about missing llama_encode() call (ggerganov#8400)

a464b6f

Co-authored-by: Stanisław Szymczyk <[email protected]>

arthw pushed a commit to arthw/llama.cpp that referenced this pull request Jul 13, 2024

llama : add assert about missing llama_encode() call (ggerganov#8400)

f4e68cd

Co-authored-by: Stanisław Szymczyk <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add assertion informing the API user about missing llama_encode() call #8400

Add assertion informing the API user about missing llama_encode() call #8400

fairydreaming commented Jul 9, 2024

Add assertion informing the API user about missing llama_encode() call #8400

Add assertion informing the API user about missing llama_encode() call #8400

Conversation

fairydreaming commented Jul 9, 2024