I started to take a look at this and found that it fails on both Hopper and Blackwell machines. I narrowed it down to a single culprit commit in Triton where bf16 op lowering is offloaded to LLVM instead of using custom conversion code.
Here is the commit and related PR:
Looking closer at the kernel, it seems the exponents of non-denormal values are not being biased correctly. I am currently trying to understand what is going wrong in the vectorized LLVM code.
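For reference, here is a minimal Python sketch of the exponent re-bias that a bf16 -> fp4 (e2m1) cast has to perform for normal (non-denormal) values. This is not the Triton/LLVM lowering itself, just the format arithmetic it needs to preserve; the constants are the standard bf16 and e2m1 format parameters, and rounding/denormal handling is omitted for brevity.

```python
# Sketch of the exponent re-bias step in a bf16 -> fp4 (e2m1) cast.
# Not Triton's actual code path; illustration of the expected arithmetic only.

import struct

BF16_EXP_BIAS = 127    # bf16: 8-bit exponent, bias 127
FP4_E2M1_EXP_BIAS = 1  # e2m1: 2-bit exponent, bias 1

def bf16_bits(x: float) -> int:
    """Truncate a Python float to bf16 and return its 16 raw bits."""
    f32 = struct.unpack("<I", struct.pack("<f", x))[0]
    return f32 >> 16  # bf16 is the upper half of an f32

def rebias_exponent(bf16: int) -> int:
    """Re-bias a normal (non-denormal) bf16 exponent for e2m1.

    The unbiased exponent must survive the cast:
        unbiased   = biased_bf16 - 127
        biased_fp4 = unbiased + 1
    Applying the wrong bias scales every normal value by a power of
    two, which is the kind of error the failing test would surface.
    """
    biased_bf16 = (bf16 >> 7) & 0xFF       # extract the 8-bit exponent field
    unbiased = biased_bf16 - BF16_EXP_BIAS
    return unbiased + FP4_E2M1_EXP_BIAS    # caller still has to clamp to [0, 3]

# Example: 2.0 has unbiased exponent 1, so its e2m1 biased exponent is 2 (0b10).
assert rebias_exponent(bf16_bits(2.0)) == 2
```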
Summary
To run:
TRITON_ALWAYS_COMPILE=1 TRITON_DUMP_DIR=my_directory_2 TRITON_KERNEL_DUMP=1 pytest -s -v test/prototype/mx_formats/test_custom_cast.py -k "test_fp4_triton_unscaled_cast"
Bad TTIR on the left, good on the right. No real differences:
https://www.diffchecker.com/ueX5YZw4
TTGIR:
https://www.diffchecker.com/M5PS6QJg/
Differences in PTX:
https://www.diffchecker.com/8mseNnKA/