Use Linear Layout to describe 2D block loads #3487

alexbaden · 2025-02-21T13:58:29Z

This PR introduces a new linear layout in the Triton Load to LLVM lowering for block loads. The layout describes the block load in terms of three input parameters:

offset which is the 1D offset into the loaded data for a single DPAS invocation inside a sub-group
iteration which identifies the DPAS invocation when multiple DPAS invocations share a single load
load which identifies the load index when multiple loads occur for a given operand

The output of the layout function identifies the global (x,y) tensor coordinate. This was designed to allow composition of the DPAS layout and the load layout to go from offset, iteration, load to block, warp, lane, register or vice versa. Note that I do not encode all the information about the load into the layout currently - I wanted to maintain surjective properties of the layout and it's a bit easier to construct this way. So, sometimes a manual offset must be applied depending on the desired layout parameter.

Currently the block load / tile layout is implemented within the existing loop structure. But, the layout was designed to be used to generate the 2D block loads. I left the existing loop structure in-place along with lots of debug info so we can more easily check any regressions. I am planning to remove the existing loop structure and generate loads only using layout parameters in a follow-up PR.

Close #3008

alexbaden added 15 commits February 21, 2025 02:26

Describe block load to dpas layout conversion using linear layout

1113d35

Support linear layouts in shuffle

e6a2d1e

cleanup

2d830ad

fix iteration dimensions

7bc974e

fixup loads indexing

45e45d4

fix packed elements per slot calculation

a496cbd

cleanup

62daca1

lit tests passing

b69dc6a

format, cleanup, remove unused debug info

c9b5a7f

remove more debug code

36ab568

better itr offset col coord computation

28cec20

format

ee55e30

fixup # of loads and load indexing 1/?

22b212a

fixup # of loads and load indexing 2/?

1e3ab5b

fixup # of loads and load indexing 3/?

711e645

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use Linear Layout to describe 2D block loads #3487

Use Linear Layout to describe 2D block loads #3487

alexbaden commented Feb 21, 2025

Use Linear Layout to describe 2D block loads #3487

Are you sure you want to change the base?

Use Linear Layout to describe 2D block loads #3487

Conversation

alexbaden commented Feb 21, 2025