
Add --n_iters flag to CLI and perform constant propagation before running model #202

Merged: 3 commits merged into main from rten-cli-multiple-iters on May 21, 2024

Conversation

robertknight
Owner

  • The `--n_iters` flag is useful for assessing initial-run vs "warmed up" inference times for models
  • The constant propagation step highlights where models contain unnecessary operators that could be eliminated by a graph optimization (ORT does this automatically)

The flag runs model inference in a loop, with a fixed set of options passed as a
parameter to the function that runs the loop. Running the model repeatedly is
useful for assessing how much performance varies between the initial run and
runs after warm-up has completed.
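
The timing loop itself is simple. The sketch below only illustrates the idea; `run_model` and the hard-coded `n_iters` value are placeholders, not rten-cli's actual code (which is not shown in this conversation).

```rust
use std::time::Instant;

// Placeholder for a single inference call; the real CLI would invoke
// rten's model-run API here with a fixed set of options.
fn run_model() {
    // ... run inference ...
}

fn main() {
    // Iteration count, as would be supplied via a flag like `--n_iters`.
    let n_iters = 5;

    for i in 0..n_iters {
        let start = Instant::now();
        run_model();
        // The first iteration includes one-time costs (cache warm-up,
        // allocations); later iterations show steady-state times.
        println!("iteration {i}: {:.2?}", start.elapsed());
    }
}
```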
When testing `decoder_model.onnx` from
https://huggingface.co/Mozilla/distilvit/tree/main/onnx I found that it was very
slow in RTen compared to ONNX Runtime (>100ms vs 20ms). The main culprit turned
out to be an expensive `Transpose` operator with constant inputs. ORT does
constant propagation as part of its automatic optimizations when the model is
loaded. RTen doesn't have an optimization like this implemented, but by
implementing it "manually" in the CLI, we can see how much benefit this would
provide.
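
For reference, the core idea of constant propagation is to walk the graph and replace any operator whose inputs are all known at load time with its precomputed result, so it no longer runs on every inference. The sketch below uses toy types (`Node`, `eval_const_op`, `propagate_constants`, a fake `Transpose`) that are purely illustrative and do not correspond to rten's graph representation or to the code in this PR.

```rust
use std::collections::HashMap;

// Toy graph types, purely for illustration; rten's real graph differs.
#[derive(Clone, Debug)]
enum Node {
    // A value that is already known when the model is loaded.
    Constant(Vec<f32>),
    // An operator applied to named inputs.
    Operator { op: String, inputs: Vec<String> },
}

// Evaluate an operator whose inputs are all constants. Only a fake
// "Transpose" of a flat vector is handled here.
fn eval_const_op(op: &str, inputs: &[Vec<f32>]) -> Option<Vec<f32>> {
    match op {
        "Transpose" => {
            let mut data = inputs.first()?.clone();
            data.reverse(); // stand-in for a real transpose
            Some(data)
        }
        _ => None,
    }
}

// One pass of constant propagation: operators whose inputs are all
// constants are replaced by their precomputed results.
fn propagate_constants(graph: &mut HashMap<String, Node>) {
    let names: Vec<String> = graph.keys().cloned().collect();
    for name in names {
        let Some(Node::Operator { op, inputs }) = graph.get(&name).cloned() else {
            continue;
        };
        // Gather input values, but only if every input is already a constant.
        let const_inputs: Option<Vec<Vec<f32>>> = inputs
            .iter()
            .map(|input| match graph.get(input) {
                Some(Node::Constant(v)) => Some(v.clone()),
                _ => None,
            })
            .collect();
        if let Some(vals) = const_inputs {
            if let Some(result) = eval_const_op(&op, &vals) {
                graph.insert(name, Node::Constant(result));
            }
        }
    }
}

fn main() {
    let mut graph = HashMap::new();
    graph.insert("w".to_string(), Node::Constant(vec![1.0, 2.0, 3.0]));
    graph.insert(
        "w_t".to_string(),
        Node::Operator { op: "Transpose".into(), inputs: vec!["w".into()] },
    );
    propagate_constants(&mut graph);
    // `w_t` is now a Constant rather than an Operator, so the expensive
    // operator no longer runs per inference.
    println!("{:?}", graph.get("w_t"));
}
```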
robertknight merged commit 2c24dbb into main on May 21, 2024
2 checks passed
robertknight deleted the rten-cli-multiple-iters branch on May 21, 2024 at 23:20