0cc4m/vulkan iq4 nl #253

Nexesenex · 2024-07-21T10:13:16Z

No description provided.

Signed-off-by: thxCode <[email protected]>

* Add additional error information when model files fail to load. * Adding additional error information to most instances of fopen.

* llama : bump max layers from 256 to 512 * llama : replace asserts with exceptions

* ggml : fix iq4_nl dot product with odd number of blocks * ggml : fix odd blocks for ARM_NEON (#8556) * ggml : fix iq4_nl dot product with odd number of blocks * ggml : fix q4_1 * ggml : fix q5_0 * ggml : fix q5_1 * ggml : fix iq4_nl metal ggml-ci * ggml : fix q4_0 * ggml : fix q8_0 ggml-ci * ggml : remove special Q4_0 code for first 2 blocks * ggml : fix sumf redefinition --------- Co-authored-by: slaren <[email protected]> --------- Co-authored-by: Georgi Gerganov <[email protected]>

mofosyne and others added 10 commits July 19, 2024 17:51

convert-*.py: add general.name kv override (#8571)

3d0e436

fix: typo of chatglm4 chat tmpl (#8586)

f299aa9

Signed-off-by: thxCode <[email protected]>

ggml : add friendlier error message to fopen errors (#8575)

b57eb9c

* Add additional error information when model files fail to load. * Adding additional error information to most instances of fopen.

readme : fix server badge

be0cfb4

llama : bump max layers from 256 to 512 (#8530)

d197545

* llama : bump max layers from 256 to 512 * llama : replace asserts with exceptions

convert-*.py: remove add_name from ChatGLMModel class (#8590)

57b1d4f

Fix Vulkan matmul tests compile errors

c8ee1bc

Add Vulkan IQ4_NL support

6274b3f

Fix Vulkan DeepSeek-Coder-V2-Lite MoE support

3252afb

Nexesenex merged commit 75db6f7 into Nexesenex:lcpp_pr_vulk_iq4nl_support Jul 21, 2024
6 checks passed

github-actions bot added testing python ggml Vulkan labels Jul 21, 2024

0cc4m deleted the 0cc4m/vulkan-iq4_nl branch July 23, 2024 08:56

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

0cc4m/vulkan iq4 nl #253

0cc4m/vulkan iq4 nl #253

Nexesenex commented Jul 21, 2024

0cc4m/vulkan iq4 nl #253

0cc4m/vulkan iq4 nl #253

Conversation

Nexesenex commented Jul 21, 2024