Notes #1

Open
Amranax opened this issue Feb 15, 2025 · 5 comments

Amranax (Owner) commented Feb 15, 2025

This thread is purely for tracking general information and research.

Amranax self-assigned this Feb 15, 2025
Amranax (Owner) commented Feb 16, 2025

mauer4 (Collaborator) commented Feb 16, 2025

Relevant GitHub issues:

- perf: improve plan performance by using non-blocking memcpy #547 (see the sketch below for what I take "non-blocking memcpy" to mean here)
- feat: support MLA decode #551
- Support MLA (Multi-Head Latent Attention) in DeepSeek-v2

Keywords: prefill kernel, refactor common prefill kernel, CUTE
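
For context on #547: my read (an assumption on my part, not code from that PR) is that the "non-blocking memcpy" is a cudaMemcpyAsync issued from pinned host memory on a stream, so host-side plan construction overlaps the transfer instead of stalling on a synchronous copy. A minimal sketch:

```cuda
// Minimal sketch of a non-blocking host-to-device copy (illustrative only;
// not taken from #547). Error checking omitted for brevity.
#include <cuda_runtime.h>

int main() {
    const size_t n = 1 << 20;
    float *h_buf = nullptr, *d_buf = nullptr;

    // Pinned (page-locked) host memory is required for the copy to be
    // truly asynchronous with respect to the host thread.
    cudaMallocHost((void**)&h_buf, n * sizeof(float));
    cudaMalloc((void**)&d_buf, n * sizeof(float));

    cudaStream_t stream;
    cudaStreamCreate(&stream);

    // Returns to the host immediately; the copy proceeds on the stream.
    cudaMemcpyAsync(d_buf, h_buf, n * sizeof(float),
                    cudaMemcpyHostToDevice, stream);

    // ... host-side planning work can run here, overlapping the transfer ...

    cudaStreamSynchronize(stream);  // block only when the data is needed

    cudaStreamDestroy(stream);
    cudaFree(d_buf);
    cudaFreeHost(h_buf);
    return 0;
}
```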

mauer4 (Collaborator) commented Feb 16, 2025

In my head, the goal of milestone 1 distills to the following two tasks:

  1. Determine the MLA (+RoPE) / MoE computation graph in depth, including its linear-algebra mathematical representation (a first sketch follows below)
  2. Survey inference optimizations and low-level techniques

By the end of milestone 1, these two tasks should give us a narrowed-down algorithmic section that we are 'attacking' and the specific techniques we propose to use for the optimization.
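
For task 1, here is a rough per-token sketch of the MLA computation as I understand it from the DeepSeek-V2 paper (per-head indexing omitted, notation approximately the paper's; we should verify this against the paper before relying on it):

$$
\begin{aligned}
c^{KV}_t &= W^{DKV} h_t \quad \text{(low-rank joint KV latent; this is what gets cached)} \\
k^{C}_t &= W^{UK} c^{KV}_t, \qquad v^{C}_t = W^{UV} c^{KV}_t \\
k^{R}_t &= \mathrm{RoPE}\!\left(W^{KR} h_t\right), \qquad k_t = \left[\, k^{C}_t ;\, k^{R}_t \,\right] \\
c^{Q}_t &= W^{DQ} h_t, \qquad q^{C}_t = W^{UQ} c^{Q}_t \\
q^{R}_t &= \mathrm{RoPE}\!\left(W^{QR} c^{Q}_t\right), \qquad q_t = \left[\, q^{C}_t ;\, q^{R}_t \,\right] \\
o_t &= W^{O} \sum_{j \le t} \mathrm{softmax}_j\!\left( \frac{q_t^{\top} k_j}{\sqrt{d_h + d^R_h}} \right) v^{C}_j
\end{aligned}
$$

If this is right, only $c^{KV}_t$ and the shared $k^{R}_t$ need to live in the KV cache at decode time, which is where the memory savings over standard MHA come from and presumably why MLA decode (#551) needs its own kernel path.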
