ggml-cuda : add TQ2_0 kernels, for ternary inference on GPU #11183
Open
compilade wants to merge 7 commits into ggerganov/llama.cpp:master from ggerganov/llama.cpp:compilade/cuda-tq2_0