ggml-cuda : add TQ2_0 kernels, for ternary inference on GPU #11183

Open
compilade wants to merge 7 commits into master from compilade/cuda-tq2_0

Commits

Commits on Dec 28, 2024

Commits on Jan 9, 2025

Commits on Jan 10, 2025

Commits on Jan 12, 2025