Releases · githubcto/llama.cpp · GitHub

27 Oct 15:10

SD3GGUF-b3986-89406a9

SD3GGUF-b3986-89406a9 Latest

Latest

llama-quantize.exe for SD SDXL SD3 FLUX SD3.5
GGUF lcpp_sd3.patch patched
GPU_TARGETS="gfx1100;gfx1101;gfx1102;gfx1030;gfx906"

Assets 19

cudart-llama-bin-win-cu11.7.1-x64.zip

293 MB 2024-10-27T15:10:01Z
cudart-llama-bin-win-cu12.2.0-x64.zip

413 MB 2024-10-27T15:10:07Z
llama-SD3GGUF-b1-89406a9-bin-win-hip-x64-gfx1030.zip

236 MB 2024-10-27T15:10:15Z
llama-SD3GGUF-b1-89406a9-bin-win-hip-x64-gfx1100-gfx1101-gfx1102-gfx1030-gfx906.zip

269 MB 2024-10-27T15:10:20Z
llama-SD3GGUF-b1-89406a9-bin-win-hip-x64-gfx1100.zip

238 MB 2024-10-27T15:10:25Z
llama-SD3GGUF-b1-89406a9-bin-win-hip-x64-gfx1101.zip

238 MB 2024-10-27T15:10:30Z
llama-SD3GGUF-b3986-89406a9-bin-win-avx-x64.zip

7.89 MB 2024-10-27T15:10:34Z
llama-SD3GGUF-b3986-89406a9-bin-win-avx2-x64.zip

7.88 MB 2024-10-27T15:10:35Z
llama-SD3GGUF-b3986-89406a9-bin-win-avx512-x64.zip

7.89 MB 2024-10-27T15:10:36Z
llama-SD3GGUF-b3986-89406a9-bin-win-cuda-cu11.7.1-x64.zip

145 MB 2024-10-27T15:10:36Z
Source code (zip)

2024-10-27T14:24:15Z
Source code (tar.gz)

2024-10-27T14:24:15Z

15 Oct 11:17

qt-b3923-a58a0a4

llama-quantize.exe for SD SDXL SD3 FLUX qt-b3923-a58a0a4

llama-quantize.exe for SD SDXL SD3 FLUX
GGUF lcpp.patch hand patched

Assets 19

15 Oct 02:47

b3920

add ROCm6 gfx1100;gfx1101;gfx1102;gfx1030;gfx906

Assets 23

14 Oct 08:02

b3917

server : handle "logprobs" field with false value (#9871)

Co-authored-by: Gimling <[email protected]>

Assets 22