Doc only change to CUTLASS 3.3 changelog #1180

manishucsd · 2023-11-08T23:05:57Z

This doc-only PR addresses the discussion at #1170

Mixed-Precision to Mixed-Input. Mixed-Precision is taken by the GEMM data-type where inputs (DataType(operandA) == DataType(operandB) are mixed with a different accumulation data type (F16*F16+F32 and BF16*BF16+F32). The code uses cutlass::arch::OpMultiplyAddMixedInputUpcast tag to navigate and communicate that input data types are mixed. It would be good to set a nomenclature that is consistent and distinguishes between Mixed-Precision and Mixed-Input use-case.
Update the hyperlink for Mixed Precision Ampere GEMMs to the PR#1084 which has detailed description, steps to only compile Ampere mixed-input GEMMs, reproduce performance results, and a performance graph.

hwu36 · 2023-11-09T19:22:30Z

could you pleae also change CHANGELOG.md?

manishucsd · 2023-11-09T19:52:32Z

could you pleae also change CHANGELOG.md?

done

manishucsd force-pushed the doc_only_change_changelog_3.3 branch from 2e578f9 to a87fb92 Compare November 9, 2023 00:22

Doc only change changelog 3.3

56fb032

manishucsd force-pushed the doc_only_change_changelog_3.3 branch from a87fb92 to 56fb032 Compare November 9, 2023 19:37

hwu36 approved these changes Nov 13, 2023

View reviewed changes

hwu36 merged commit 5ae8133 into NVIDIA:main Nov 13, 2023

manishucsd deleted the doc_only_change_changelog_3.3 branch June 24, 2024 15:50

Provide feedback