Const ref pair #302

Nexesenex · 2024-08-15T22:31:00Z

No description provided.


* Optimize Vulkan REPEAT performance
* Use Vulkan GLSL fused multiply-add instruction where possible
* Add GGML_VULKAN_PERF option to output performance data per operator
* Rework and fix Vulkan descriptor set and descriptor pool handling
* Fix float32 concat f16 shader validation error
* Add Vulkan GROUP_NORM eps parameter
* Fix validation error with transfer queue memory barrier flags
* Remove trailing whitespaces


GermanAizek and others added 11 commits May 13, 2024 20:07

Added const reference for std::pair<> and std::tuple<> larger than 16 bytes:

ced5bfe

- std::pair<llama_ngram, llama_ngram_cache_part> (72 bytes -> 8 bytes)
- std::tuple<std::string, float> (40 bytes -> 8 bytes)
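The effect of this change can be illustrated with a minimal sketch. It is not code from the PR: `Tracked`, `Entry`, `make_entries`, `copies_by_value` and `copies_by_const_ref` are hypothetical names introduced only for this example, and `Tracked` stands in for any member type that is expensive to copy. Binding the loop variable by const reference avoids copy-constructing the whole pair on every iteration; on a typical 64-bit platform a reference is passed as a pointer-sized value, which is presumably where the "8 bytes" figure comes from.

```cpp
#include <cassert>
#include <string>
#include <utility>
#include <vector>

// Global copy counter (hypothetical helper, for illustration only).
static int & copy_count() {
    static int n = 0;
    return n;
}

// Hypothetical stand-in for a type that is costly to copy.
struct Tracked {
    std::string payload;
    explicit Tracked(std::string p) : payload(std::move(p)) {}
    Tracked(const Tracked & other) : payload(other.payload) { ++copy_count(); }
    Tracked(Tracked &&) noexcept = default;
};

using Entry = std::pair<Tracked, float>;

// Build n entries; the elements are moved into place, never copied.
static std::vector<Entry> make_entries(int n) {
    std::vector<Entry> entries;
    entries.reserve(n);
    for (int i = 0; i < n; ++i) {
        entries.emplace_back(Tracked("entry-" + std::to_string(i)), 1.0f);
    }
    assert(static_cast<int>(entries.size()) == n);
    return entries;
}

// Loop variable taken by value: every iteration copies a whole Entry.
static int copies_by_value(const std::vector<Entry> & entries) {
    copy_count() = 0;
    float sum = 0.0f;
    for (Entry e : entries) {
        sum += e.second;
    }
    (void) sum;
    return copy_count();
}

// Loop variable taken by const reference: no Entry is copied.
static int copies_by_const_ref(const std::vector<Entry> & entries) {
    copy_count() = 0;
    float sum = 0.0f;
    for (const Entry & e : entries) {
        sum += e.second;
    }
    (void) sum;
    return copy_count();
}
```

Calling both functions on the result of `make_entries(3)` shows the difference: the by-value loop performs one copy per element, while the const-reference loop performs none. The same reasoning applies to function parameters declared as `const std::pair<...> &` or `const std::tuple<...> &`.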

Merge branch 'ggerganov:master' into const-ref-pair

ce4a390

Added const reference for std::pair<> and std::tuple<> larger than 16 bytes:

f2e4d92

- std::pair<llama_ngram, llama_ngram_cache_part> (72 bytes -> 8 bytes)
- std::tuple<std::string, float> (40 bytes -> 8 bytes)

server : init stop and error fields of the result struct (ggerganov#9026)

234b306

Signed-off-by: Jiri Podivin <[email protected]>

ci : disable bench workflow (ggerganov#9010)

d5492f0

llama : add pre-tokenizer regexes for BLOOM and gpt3-finnish (ggerganov#8850)

6bda7ce

common : remove duplicate function llama_should_add_bos_token (ggerganov#8778)

4af8420

server : fix duplicated n_predict key in the generation_settings (ggerganov#8994)

37501d9

retrieval : fix memory leak in retrieval query handling (ggerganov#8955)

4b9afbb

* retrieval
* Reuse querybatch to reduce frequent memory allocation
* delete unused white space

Merge branch 'master' into const-ref-pair

2793b86

Nexesenex merged commit 6d504c4 into Nexesenex:lcpp_pr_ngram_cache Aug 15, 2024
8 of 11 checks passed

github-actions bot added examples python server ggml devops Vulkan labels Aug 15, 2024
