Perform constant propagation when loading models #241

robertknight · 2024-06-17T07:14:05Z

Add initial infrastructure to perform graph optimizations as part of loading a model, and implement constant propagation (pre-evaluating the parts of the graph that don't depend on dynamic input values) as the first graph-level optimization. This is an optimization that was already implemented downstream in rten-cli and rten-generate after the model was loaded. Those downstream implementations are also removed in this change.

Optimization is performed by default when a model is loaded, but can be disabled using ModelOptions. This mirrors ONNX Runtime's approach of enabling optimizations unless the consumer opts out.

TODO:

Tests
Consider making the graph immutable once optimized. This avoids hazards where eg. cached execution plans could become invalidated by model changes (Edit: Will revisit this later)

This causes parts of the graph that depend only on constant inputs (ie. weights, biases) to be replaced when the model is loaded.

This optimization is now performed as part of graph optimizations inside `Model::load`, so does not need to be done by the CLI.

Since constant propagation is now performed as a graph optimization when the model is loaded, it only needs to be re-done if additional constants are added. This change assumes that graph optimizations were enabled when the model was loaded. If they were disabled, and no other constants were added, this could lead to expensive re-evaluation of unchanging parts of the graph on each run.

Add initial graph optimization APIs

3949b96

robertknight force-pushed the const-prop-optimize branch 2 times, most recently from cf31d1a to 4a7cfe5 Compare June 18, 2024 06:12

robertknight added 3 commits June 18, 2024 09:00

Add constant propagation optimization to graph optimizer

24b7f6d

This causes parts of the graph that depend only on constant inputs (ie. weights, biases) to be replaced when the model is loaded.

Remove constant propagation step from rten-cli evaluation

fd579e4

This optimization is now performed as part of graph optimizations inside `Model::load`, so does not need to be done by the CLI.

robertknight force-pushed the const-prop-optimize branch from 4a7cfe5 to 75570a9 Compare June 18, 2024 08:02

Add a note on graph optimizations to the Model docs

d99e393

robertknight marked this pull request as ready for review June 18, 2024 08:18

robertknight merged commit 7750899 into main Jun 18, 2024
2 checks passed

robertknight deleted the const-prop-optimize branch June 18, 2024 08:18

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Perform constant propagation when loading models #241

Perform constant propagation when loading models #241

robertknight commented Jun 17, 2024 •

edited

Loading

Perform constant propagation when loading models #241

Perform constant propagation when loading models #241

Conversation

robertknight commented Jun 17, 2024 • edited Loading

robertknight commented Jun 17, 2024 •

edited

Loading