Keras structured SIMD pruning #871

reuvenperetz · 2023-11-29T12:44:26Z

Pull Request Description:

This pull request introduces structured SIMD pruning for Keras models. The primary goal is to prune models so that they meet specified Key Performance Indicators (KPIs), such as a target memory footprint. Key components include:

keras_pruning_experimental Function:
- Interface to prune Keras models. It includes converting Keras models into an internal graph representation, applying compression configuration, and initiating the pruning process.
- Integrates with Pruner class to apply graph pruning based on specified KPIs.
Pruner Class:
- Central component responsible for executing the pruning process on the computational graph.
- Computes importance scores for each node using the provided data generator.
- Supports multiple pruning strategies, with an initial focus on the Greedy approach.
GreedyMaskCalculator Class:
- Computes pruning masks for each prunable node in a Keras model's computational graph, using a greedy algorithm. It aims to meet a target KPI for memory footprint.
- Initializes the SIMD group indices and scores used for SIMD mask updates.
- Incorporates a MemoryCalculator to estimate the memory footprint of the pruned graph.
- get_mask(): Retrieves the pruning mask for the graph, computing it if not already done.
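For reviewers who want a mental model of the greedy SIMD masking described above, here is a minimal NumPy sketch. It is not the code in this PR: the function name `greedy_simd_masks`, the per-channel byte cost, and the rule of always keeping the best group of every node are assumptions made for illustration.

```python
import numpy as np

def greedy_simd_masks(node_scores, simd, bytes_per_channel, target_bytes):
    """Hypothetical greedy mask search over SIMD-sized channel groups.

    node_scores: dict node name -> per-output-channel importance scores.
    simd: number of channels per SIMD group.
    bytes_per_channel: dict node name -> memory cost of keeping one channel.
    target_bytes: memory budget (the target KPI in this sketch).
    """
    masks, pending = {}, []
    used = 0
    for node, scores in node_scores.items():
        scores = np.asarray(scores, dtype=float)
        # Sort channels by descending score, then chunk them into SIMD groups.
        order = np.argsort(-scores, kind="stable")
        groups = [order[i:i + simd] for i in range(0, len(order), simd)]
        masks[node] = np.zeros(len(scores), dtype=int)
        # Assumption: the best group of each node is always kept,
        # so no layer ends up with zero channels.
        masks[node][groups[0]] = 1
        used += bytes_per_channel[node] * len(groups[0])
        for group in groups[1:]:
            pending.append((float(scores[group].sum()), node, group))
    # Visit the remaining groups from highest to lowest group score.
    pending.sort(key=lambda item: -item[0])
    for _, node, group in pending:
        cost = bytes_per_channel[node] * len(group)
        if used + cost > target_bytes:
            break  # the next-best group no longer fits the budget
        masks[node][group] = 1
        used += cost
    return masks, used
```

The sketch stops at the first group that does not fit, which keeps the result a prefix of the score ranking; a variant could skip that group and keep trying smaller ones.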
LFHImportanceMetric Class:
- Implements the calculation of Label-Free-Hessian (LFH) based importance scores for nodes in the graph.
- get_entry_node_to_score: Calculates importance scores for each entry node in a provided list using the LFH method. It then normalizes scores using L2 norms and the number of parameters for each output channel.
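The normalization step can be pictured with the sketch below. The description does not spell out the exact formula, so the combination used here (raw score times the channel's L2 norm, divided by the channel's parameter count) is an assumption, as is the helper name `normalize_channel_scores`. The kernel is assumed to follow the Keras Conv2D layout with output channels on the last axis.

```python
import numpy as np

def normalize_channel_scores(raw_scores, kernel, oc_axis=-1):
    """Hypothetical per-output-channel normalization of importance scores.

    raw_scores: one raw (e.g. Hessian-based) score per output channel.
    kernel: weight tensor; oc_axis is its output-channel axis.
    """
    kernel = np.asarray(kernel, dtype=float)
    num_channels = kernel.shape[oc_axis]
    # Bring the output-channel axis to the front and flatten the rest,
    # giving one row of weights per output channel.
    per_channel = np.moveaxis(kernel, oc_axis, 0).reshape(num_channels, -1)
    l2_norms = np.linalg.norm(per_channel, axis=1)
    params_per_channel = per_channel.shape[1]
    # Assumed formula: scale by the L2 norm, divide by the parameter count.
    return np.asarray(raw_scores, dtype=float) * l2_norms / params_per_channel
```

With this scaling, a channel whose weights are all zero gets a score of zero regardless of its raw score, so it is among the first candidates for removal.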
MemoryCalculator Class:
- Estimates the memory usage of a neural network's computational graph under various pruning masks.
- get_pruned_graph_memory: Calculates the estimated memory usage of the pruned graph.
- get_pruned_graph_num_params: Computes the total number of parameters in the pruned graph.
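As a rough illustration of the two estimates above, the sketch below counts parameters and bytes for a plain chain of dense layers. It is a simplification under stated assumptions: the function names are hypothetical, every layer has a kernel and a bias, the network input is never pruned, and each parameter takes 4 bytes (float32). It also ignores SIMD padding and null channels.

```python
import numpy as np

def pruned_num_params(layer_shapes, out_masks):
    """Count parameters of a dense chain after output-channel pruning.

    layer_shapes: list of (n_in, n_out) kernel shapes, in order.
    out_masks: list of 0/1 arrays over output channels (None = unpruned).
    """
    total = 0
    kept_in = layer_shapes[0][0]  # assumption: network inputs are not pruned
    for (_, n_out), mask in zip(layer_shapes, out_masks):
        kept_out = n_out if mask is None else int(np.sum(mask))
        # Kernel shrinks on both axes; bias shrinks with the output channels.
        total += kept_in * kept_out + kept_out
        # Channels removed here are also removed from the next layer's input.
        kept_in = kept_out
    return total

def pruned_memory_bytes(layer_shapes, out_masks, bytes_per_param=4):
    """Memory estimate, assuming float32 parameters by default."""
    return pruned_num_params(layer_shapes, out_masks) * bytes_per_param
```

For example, pruning half of the 16 output channels of the first layer in an 8-16-4 chain reduces the count in both that layer and the layer that consumes its output.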
build_pruned_graph Function:
- This function prunes a given computational graph based on specified pruning masks for output channels. It returns a new, pruned version of the computational graph.
PruningConfig Class:
- Serves as a configuration class that specifies the approach and criteria for pruning a neural network.
- num_score_approximations (int): Specifies the number of score approximations to be performed when calculating the importance of channels.
- importance_metric (ImportanceMetric): Dictates the metric used to assess the importance of channels in the network. The default is set to Label-Free-Hessian (LFH) approximation.
- channels_filtering_strategy (ChannelsFilteringStrategy): Determines the strategy for filtering or pruning channels, with the default strategy being a Greedy approach.
PruningSectionMask Class:
- Represents the masks to be applied to a specific section of a neural network during pruning, including both input and output channels at the entry and exit nodes.
PruningSection Class:
- Represents a section within a graph that is targeted for pruning, including an entry node, any intermediate nodes, and an exit node.
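To make the section idea concrete, here is a small sketch of applying a mask to one section made of two dense kernels with no intermediate nodes. The names `SectionMask` and `prune_section` are hypothetical and the mask holds only two of the channel masks mentioned above: the entry node's output mask and the exit node's input mask, which must agree.

```python
from dataclasses import dataclass

import numpy as np

@dataclass
class SectionMask:
    """Hypothetical, reduced stand-in for a section mask."""
    entry_output_mask: np.ndarray  # 0/1 over the entry node's output channels
    exit_input_mask: np.ndarray    # 0/1 over the exit node's input channels

def prune_section(entry_kernel, entry_bias, exit_kernel, mask):
    """Slice an (in, out) entry kernel, its bias, and an (in, out) exit kernel."""
    keep_out = np.asarray(mask.entry_output_mask).astype(bool)
    keep_in = np.asarray(mask.exit_input_mask).astype(bool)
    if not np.array_equal(keep_out, keep_in):
        raise ValueError("exit input channels must match entry output channels")
    # Drop output channels of the entry node and the matching
    # input channels of the exit node.
    return entry_kernel[:, keep_out], entry_bias[keep_out], exit_kernel[keep_in, :]
```

For a purely linear section, running the sliced weights gives the same output as running the original weights with the masked channels zeroed, which is a convenient sanity check when testing.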

Checklist before requesting a review:

I set the appropriate labels on the pull request.
I have added/updated the release note draft (if necessary).
I have updated the documentation to reflect my changes (if necessary).
All functions and files are well documented.
All functions and classes have type hints.
There is a license in all files.
The function and variable names are informative.
I have checked for code duplications.
I have added new unit tests (if necessary).

haihabi · 2023-11-29T13:14:14Z

Missing description

model_compression_toolkit/constants.py

model_compression_toolkit/core/common/data_loader.py

model_compression_toolkit/core/common/framework_implementation.py

model_compression_toolkit/core/common/graph/base_graph.py

model_compression_toolkit/core/common/pruning/importance_metrics/importance_metric_factory.py

model_compression_toolkit/core/common/pruning/importance_metrics/lfh_importance_metric.py

model_compression_toolkit/core/common/pruning/mask/per_channel_mask.py

ofirgo · 2023-12-25T14:14:31Z

model_compression_toolkit/core/common/pruning/memory_calculator.py

+            np.ndarray: The input mask for the specified node, or None if not found.
+        """
+        for section in pruning_sections:
+            # If the node is the exit node of a pruning section, return the entry node's mask.


still not sure what the answer is here

model_compression_toolkit/core/common/pruning/pruning_section.py

tutorials/notebooks/example_keras_pruning.py

tutorials/notebooks/example_keras_pruning_mnist.ipynb

reuvenp added 8 commits November 29, 2023 10:15

Init pruning support

b276815

Separate is_entry and is_exit for keras nodes functions

96f6df5

split intermediate section mask to 2 masks

3a8ffbb

fixed pruned model to be trainable

58eeedd

split keras functions into multiple files

c0fbf11

Add l2norm and params count to lfh scores

877cf45

Add check for exit nodes #IC to match the #OC of their entry node

957133c

Consider null channels in graph memory computation

6a2c099

github-actions bot added auto:core auto:target_platform_capabilities auto:tests auto:tutorials labels Nov 29, 2023

Remove debug code

b50dc1c

haihabi reviewed Nov 29, 2023

model_compression_toolkit/constants.py (outdated, resolved)

haihabi reviewed Nov 29, 2023

model_compression_toolkit/core/common/data_loader.py (outdated, resolved)

haihabi reviewed Nov 29, 2023

model_compression_toolkit/core/common/framework_implementation.py (outdated, resolved)

Add PruningFrameworkImplementation

a3b3c92

haihabi requested review from ofirgo and Idan-BenAmi November 29, 2023 14:34

reuvenp added 3 commits November 29, 2023 16:51

add small sections tests

c1a3bc1

Fix tf imports when tf was not found

476fcc8

Run pruning tests in keras workflow

73b9971

reuvenperetz added the pr: major feature label Nov 30, 2023

reuvenp added 6 commits December 3, 2023 12:12

add memory calculator test

54e6fb3

add simd padding to tpc

0e69771

Take score computation out to a new LFH importance score calculator

0433744

split memory count from params count in memory calculator

8c7f0ee

rename pruning section attributes

6fd3fa3

move has_matching_channel_count to common

92cdaf5

reuvenp added 17 commits December 24, 2023 13:29

Use unittests assert when needed

3518e64

use only one cr for testing pretrained models

fd5ebb2

use keras kernel constant in random importance metric

04b3652

Add assertion in get_Attributes_info before iterating on kernel attributes

0613e95

rename get_node_attributes_with_oi_axis to attrs_oi_channels_info_for_pruning in PruningKerasImplementation

bf6c00f

Merged from main

7f124e8

remove todos

19100b0

use property in channels_grouping

aff8068

remove todo

dfb194e

add comments to mask files

a30ab5d

remove commented out code from pruner

4ffc45c

remove pytorch init file

becc499

remove todos

78e8d29

add example usage in notebooks

5eab6d4

Add notebook

b492be6

fixes to notebook

6b305f1

add docs for pruning API

38d61e1

github-actions bot added the auto:docsrc label Dec 25, 2023

ofirgo requested changes Dec 26, 2023

reuvenp added 7 commits December 28, 2023 10:51

Rename importance metric factory

7513006

use mask indicator enum when initializing masks

5cf8994

replace a list of params count in LFH with a np array

449bd2e

Add fw_impl typehints

60a24ea

create pruning config separately in pruning tutorial

fa709b4

rename function name in memory calculator for specifying the input mask is based on the section

294f203

Update readme

0c2290c

ofirgo approved these changes Dec 28, 2023

reuvenperetz merged commit 5306a8d into sony:main Dec 28, 2023
23 of 24 checks passed

reuvenperetz deleted the add-pruning-support branch December 28, 2023 12:08

haihabi mentioned this pull request Jan 1, 2024

Add pruning capability #885

Closed
