llama : add early return for empty range #8327

danbev · 2024-07-05T12:21:09Z

This commit adds an early return to the llama_kv_cache_seq_add and llama_kv_cache_seq_div functions.

The motivation for adding this is to avoid looping over the cache when the range is empty. I ran into this when using the self-extend feature in main.cpp.

I have read the contributing guidelines
Self-reported review complexity:
- Low
- Medium
- High

src/llama.cpp

This commit adds an early return to the llama_kv_cache_seq_add and llama_kv_cache_seq_div functions. The motivation for adding this is to avoid looping over the cache when the range is empty. I ran into this when using the self-extend feature in main.cpp. Signed-off-by: Daniel Bevenius <[email protected]>

This commit attempts to fix the following warning/error: ```console src/llama.cpp:7271:31: error: comparison of integer expressions of different signedness: ‘int’ and ‘uint32_t’ {aka ‘unsigned int’} [-Werror=sign-compare] 7271 | if (i < hparams.n_layer_dense_lead) { | ~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~ ``` This can be reproduced locally by setting -Wsign-compare in the Makefile. Signed-off-by: Daniel Bevenius <[email protected]>

Remove the setting of cache.head to 0 when the range is empty. Signed-off-by: Daniel Bevenius <[email protected]>

src/llama.cpp

* llama : add early return for empty range This commit adds an early return to the llama_kv_cache_seq_add and llama_kv_cache_seq_div functions. The motivation for adding this is to avoid looping over the cache when the range is empty. I ran into this when using the self-extend feature in main.cpp. Signed-off-by: Daniel Bevenius <[email protected]> * llama : add static_cast to fix CI warning/error This commit attempts to fix the following warning/error: ```console src/llama.cpp:7271:31: error: comparison of integer expressions of different signedness: ‘int’ and ‘uint32_t’ {aka ‘unsigned int’} [-Werror=sign-compare] 7271 | if (i < hparams.n_layer_dense_lead) { | ~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~ ``` This can be reproduced locally by setting -Wsign-compare in the Makefile. Signed-off-by: Daniel Bevenius <[email protected]> * squash! llama : add early return for empty range Remove the setting of cache.head to 0 when the range is empty. Signed-off-by: Daniel Bevenius <[email protected]> * Update src/llama.cpp --------- Signed-off-by: Daniel Bevenius <[email protected]> Co-authored-by: Georgi Gerganov <[email protected]>

ggerganov reviewed Jul 5, 2024

View reviewed changes

src/llama.cpp Outdated Show resolved Hide resolved

danbev added 3 commits July 5, 2024 18:34

squash! llama : add early return for empty range

eb572f9

Remove the setting of cache.head to 0 when the range is empty. Signed-off-by: Daniel Bevenius <[email protected]>

danbev force-pushed the llama-kv-cache-range-check branch from d6c5e3d to eb572f9 Compare July 5, 2024 16:38

ggerganov approved these changes Jul 6, 2024

View reviewed changes

src/llama.cpp Outdated Show resolved Hide resolved

Update src/llama.cpp

c9d6700

ggerganov merged commit 87e25a1 into ggerganov:master Jul 6, 2024
5 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

llama : add early return for empty range #8327

llama : add early return for empty range #8327

danbev commented Jul 5, 2024

llama : add early return for empty range #8327

llama : add early return for empty range #8327

Conversation

danbev commented Jul 5, 2024