ggml-cuda : add TQ2_0 kernels, for ternary inference on GPU #18222

windows-latest-cmake (openblas-x64, -DGGML_NATIVE=OFF -DLLAMA_BUILD_SERVER=ON -DGGML_RPC=ON -DGGM...

succeeded Jan 10, 2025 in 5m 47s