Skip to content

ggml-cuda : add TQ2_0 kernels, for ternary inference on GPU #18222

ggml-cuda : add TQ2_0 kernels, for ternary inference on GPU

ggml-cuda : add TQ2_0 kernels, for ternary inference on GPU #18222

windows-latest-cmake (msvc-arm64, -G "Ninja Multi-Config" -D CMAKE_TOOLCHAIN_FILE=cmake/arm64-win...

succeeded Jan 10, 2025 in 2m 53s