ggml : storing strides as number of elements instead of number of bytes #623

slaren · 2023-11-27T13:54:42Z

Currently, we store the strides between elements of each dimension as a number of bytes in ggml_tensor::nb. In practice, this complicates code because strides always need to be multiplied by the element size, and accessing elements requires first casting the pointers to char *.

I am not sure if there are any cases where we would want a byte stride that isn't a multiple of the element size, since the addresses would then no longer be aligned to the element size, which is not OK on many platforms. Therefore, I think we could simplify the code a bit by storing strides as numbers of elements instead of numbers of bytes.
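
To make the difference concrete, here is a minimal sketch of the two access patterns. It assumes a simplified 2-D float view. `struct view2d`, `view2d_init`, `get_via_bytes` and `get_via_elems` are hypothetical names for illustration, not the real ggml API, and the `ns` field only stands in for what an element-stride array might look like.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical 2-D view for illustration only; this is NOT ggml_tensor. */
struct view2d {
    float  *data;
    int64_t ne[2]; /* number of elements per dimension; dimension 0 is contiguous */
    size_t  nb[2]; /* strides in bytes (the current convention) */
    size_t  ns[2]; /* strides in elements (the proposed convention) */
};

/* Build a contiguous view over ne0 * ne1 floats and fill in both stride arrays. */
static struct view2d view2d_init(float *data, int64_t ne0, int64_t ne1) {
    struct view2d v;
    assert(ne0 > 0 && ne1 > 0);
    v.data  = data;
    v.ne[0] = ne0;
    v.ne[1] = ne1;
    v.nb[0] = sizeof(float);
    v.nb[1] = v.nb[0] * (size_t) ne0;
    v.ns[0] = 1;
    v.ns[1] = (size_t) ne0;
    return v;
}

/* Byte strides: cast to char *, add the byte offset, then cast back. */
static float get_via_bytes(const struct view2d *v, int64_t i0, int64_t i1) {
    const char *p = (const char *) v->data
                  + (size_t) i0 * v->nb[0]
                  + (size_t) i1 * v->nb[1];
    return *(const float *) p;
}

/* Element strides: plain typed indexing, no casts and no element-size factor. */
static float get_via_elems(const struct view2d *v, int64_t i0, int64_t i1) {
    return v->data[(size_t) i0 * v->ns[0] + (size_t) i1 * v->ns[1]];
}
```

Both accessors read the same element. The element-stride version never mentions `sizeof(float)`, which is the simplification being proposed.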

ggerganov · 2023-11-27T16:16:02Z

Yup, I guess it would be an improvement. We can probably rename nb to ns so that we get errors in every place in the codebase that needs updating during the refactor, and leave a comment explaining to 3rd-party devs how to update if they are using nb somewhere in their projects.
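
As a rough sketch of what such a migration note could show, the helper below converts byte strides to element strides. It assumes a plain, non-block element type such as float. `MAX_DIMS` and `strides_bytes_to_elems` are hypothetical names and not part of ggml.

```c
#include <assert.h>
#include <stddef.h>

#define MAX_DIMS 4 /* hypothetical fixed dimension count for this sketch */

/*
 * Convert byte strides (nb) to element strides (ns).
 * Returns 0 on success, or -1 if some byte stride is not a multiple of
 * elem_size, in which case it cannot be expressed as an element count.
 */
static int strides_bytes_to_elems(const size_t nb[MAX_DIMS],
                                  size_t       elem_size,
                                  size_t       ns[MAX_DIMS]) {
    assert(elem_size > 0);
    for (int i = 0; i < MAX_DIMS; i++) {
        if (nb[i] % elem_size != 0) {
            return -1;
        }
        ns[i] = nb[i] / elem_size;
    }
    return 0;
}
```

The failure path corresponds to the case questioned in the original post: a byte stride that is not a multiple of the element size has no element-stride equivalent.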

MarioSieg · 2025-02-13T16:28:57Z

Are the ggml strides and memory actually row-major? When I compare ggml strides to NumPy strides, the ggml strides are reversed, which looks like column-major ordering, but the ggml source comments say that tensors are stored in row-major order. It's confusing to convert data between ggml and NumPy when the stride layout differs so significantly...

ggerganov · 2025-02-13T18:13:06Z

I agree it's confusing, but it's too late to change. The ggml data is stored in row-major order. The shapes and strides are listed in reverse order compared to Python.
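
A small sketch may help when converting between the two conventions. It assumes a contiguous 2 x 3 float32 array, which NumPy would describe as shape (2, 3) with byte strides (12, 4), and which ggml would describe with the contiguous dimension first. `reverse_dims` and `byte_offset` are hypothetical helpers, not ggml or NumPy functions.

```c
#include <stdint.h>

/* Copy n dimension sizes (or strides) from src to dst in reverse order. */
static void reverse_dims(const int64_t *src, int64_t *dst, int n) {
    for (int i = 0; i < n; i++) {
        dst[i] = src[n - 1 - i];
    }
}

/* Byte offset of an element given per-dimension byte strides and indices. */
static int64_t byte_offset(const int64_t *strides, const int64_t *idx, int n) {
    int64_t off = 0;
    for (int i = 0; i < n; i++) {
        off += strides[i] * idx[i];
    }
    return off;
}
```

With ggml-style sizes {3, 2} and byte strides {4, 12}, reversing both arrays gives the NumPy-style shape (2, 3) and strides (12, 4). The ggml index (i0 = 2, i1 = 1) and the NumPy index [1][2] then give the same byte offset, so the memory layout is identical and only the order in which the dimensions are listed differs.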

ggerganov added the refactoring Refactoring label Nov 27, 2023

ggerganov changed the title ~~Storing strides as number of elements instead of number of bytes~~ ggml : storing strides as number of elements instead of number of bytes Nov 27, 2023

ggerganov added this to ggml : roadmap Nov 27, 2023

ggerganov moved this to Todo in ggml : roadmap Nov 27, 2023

ggerganov mentioned this issue Jan 30, 2024

ggml: aarch64: implement mmla kernels for q8_0_q8_0, q4_0_q8_0 and q4_1_q8_1 quantized gemm ggml-org/llama.cpp#4966

Merged

ggerganov added the roadmap Part of a roadmap project label Feb 4, 2025
