
Xsn/llama batch remove compat #316

Merged

Conversation

Nexesenex (Owner)

No description provided.

ngxson and others added 19 commits October 11, 2024 11:48
* ggml : move more prints to the ggml log system

* show BLAS OpenMP warnings in all builds using debug print
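
With prints routed through the ggml log system, an application can capture or filter them with a single callback. A minimal sketch, assuming the `llama_log_set()` API from `llama.h`; the debug-level filter mirrors the BLAS/OpenMP warning note above:

```cpp
#include <cstdio>

#include "llama.h"

// Forward every non-debug message to stderr; debug-level chatter
// (e.g. the BLAS/OpenMP notices mentioned above) is filtered out.
static void my_log_cb(enum ggml_log_level level, const char * text, void * /*user_data*/) {
    if (level != GGML_LOG_LEVEL_DEBUG) {
        fputs(text, stderr);
    }
}

int main() {
    llama_log_set(my_log_cb, nullptr); // install the callback before other llama/ggml calls
    llama_backend_init();
    // ... load a model, run inference ...
    llama_backend_free();
    return 0;
}
```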
* llama : improve infill support (ggerganov#9798)

ggml-ci

* llama : add more FIM token strings

ggml-ci

* server : update prompt on slot restore (ggerganov#9800)
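
For context, a hedged sketch of how the FIM special tokens touched by these commits can be used to assemble an infill prompt. The `llama_token_fim_pre`/`_suf`/`_mid` accessors are the names introduced around ggerganov#9798; verify them against your `llama.h` (they return `-1` when the model does not define the token):

```cpp
#include <vector>

#include "llama.h"

// Build a prefix-suffix-middle (PSM) infill prompt:
//   <FIM_PRE> prefix <FIM_SUF> suffix <FIM_MID>
// Generation then continues after the FIM_MID token.
static std::vector<llama_token> build_fim_prompt(
        const llama_model * model,
        const std::vector<llama_token> & prefix,
        const std::vector<llama_token> & suffix) {
    std::vector<llama_token> out;
    out.push_back(llama_token_fim_pre(model));
    out.insert(out.end(), prefix.begin(), prefix.end());
    out.push_back(llama_token_fim_suf(model));
    out.insert(out.end(), suffix.begin(), suffix.end());
    out.push_back(llama_token_fim_mid(model));
    return out;
}
```

A real caller should check each accessor's result before use, since not every model defines all FIM tokens.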

* gguf : deprecate old FIM token KVs
* server : remove legacy system_prompt feature

ggml-ci

* readme : update [no ci]

* server : fix non-transformer logic + remove response from /props
* server : remove self-extend

ggml-ci

* server : fix context limit check to use slot.n_past

ggml-ci
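
On the deprecated FIM token KVs: a hedged sketch of a reader that prefers the new key and falls back to the old one. The key names (`tokenizer.ggml.fim_pre_token_id`, `tokenizer.ggml.prefix_token_id`) are assumptions based on the commit message; `llama_model_meta_val_str()` is the public metadata accessor in `llama.h`:

```cpp
#include <cstdio>

#include "llama.h"

// Look up a GGUF metadata value by key; returns true when the key exists.
static bool read_meta(const llama_model * model, const char * key, char * buf, size_t n) {
    return llama_model_meta_val_str(model, key, buf, n) >= 0;
}

static void print_fim_pre_id(const llama_model * model) {
    char buf[64];
    // Prefer the new KV; fall back to the deprecated name for older GGUF files.
    // Key names are assumptions, not confirmed by this PR's diff.
    if (read_meta(model, "tokenizer.ggml.fim_pre_token_id", buf, sizeof(buf)) ||
        read_meta(model, "tokenizer.ggml.prefix_token_id",  buf, sizeof(buf))) {
        printf("FIM prefix token id: %s\n", buf);
    }
}
```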
Nexesenex merged commit bbb1ca9 into Nexesenex:lcpp_pr_llama_batch on Oct 15, 2024
6 of 7 checks passed
github-actions bot added the documentation, examples, python, server, ggml, and android labels on Oct 15, 2024
5 participants