ggml: aarch64: implement mmla kernels for q8_0_q8_0, q4_0_q8_0 and q4_1_q8_1 quantized gemm#4966
Merged
ggerganov merged 5 commits intoggml-org:masterfrom snadampal:smmla_aarch64Feb 11, 2024
+442-89
Commits
Commits on Feb 8, 2024
- committed
- committed
- committed
- committed
- committed