Skip to content

ggml-cuda : add TQ2_0 kernels, for ternary inference on GPU #18222

ggml-cuda : add TQ2_0 kernels, for ternary inference on GPU

ggml-cuda : add TQ2_0 kernels, for ternary inference on GPU #18222

Annotations

1 error

windows-latest-cmake (avx512-x64, -DGGML_NATIVE=OFF -DLLAMA_BUILD_SERVER=ON -DGGML_RPC=ON -DGGML_...

succeeded Jan 10, 2025 in 6m 47s