Bdellabe/awq modifier v3 #1177

brian-dellabetta · 2025-02-19T18:04:57Z

SUMMARY:
Draft of AWQModifier, replaces #181 and #824 (hence v3)

TEST PLAN:
"please outline how the changes were tested"

Signed-off-by: Brian Dellabetta <[email protected]>

github-actions · 2025-02-19T18:05:11Z

👋 Hi! Thank you for contributing to llm-compressor. Please add the ready label when the PR is ready for review.

Note: This is required to complete the testing suite, please only add the label once the PR is code complete and local testing has been performed.

kylesayrs · 2025-02-19T18:07:28Z

src/llmcompressor/modifiers/awq/base.py

-                )
+                with align_module_device(fc):
+                    fc.weight.mul_(scales_view)
+                    fc.weight.data = (


Here you're updating a parameter, so you'll have to use update_offload_parameter

…LENGTH are very low Signed-off-by: Brian Dellabetta <[email protected]>

kylesayrs · 2025-02-19T18:07:59Z

src/llmcompressor/modifiers/awq/base.py

+                    if module in balance_layers:
+                        module.weight.mul_(scales.view(1, -1).to(module.weight.device))
+                    elif module == smooth_layer:
+                        if module.weight.ndim == 1:


Here you're updating a weight and bias, so you'll have to use update_offload_parameter

we have to use update_offload_parameter for weights and biases too?

weights and biases are parameters, so yes

You can keep your inplace modification and just call

update_offload_parameter(module, "weight") update_offload_parameter(module, "bias")

Signed-off-by: Brian Dellabetta <[email protected]>

kylesayrs · 2025-02-20T18:47:12Z

src/llmcompressor/modifiers/awq/base.py

+
+        samples = [batch["input_ids"] for batch in dataloader]
+
+        samples = torch.cat(samples, dim=0)


Why are you putting all samples into the same batch. This is likely the source of your memory issues

thanks for pointing that out! will toggle this and see if it affects anything

brian-dellabetta added 5 commits February 18, 2025 17:40

cherry picked files from stale PR #181 branch awq-feature-branch

98a5b73

Signed-off-by: Brian Dellabetta <[email protected]>

updated to be compatible with latest, unit tests passing

2611966

Signed-off-by: Brian Dellabetta <[email protected]>

switch to using HooksMixin api

88aeab8

Signed-off-by: Brian Dellabetta <[email protected]>

pydantic serialization issue fix

2b74ccf

Signed-off-by: Brian Dellabetta <[email protected]>

switch to accelerate with align_module_device

cb5956e

Signed-off-by: Brian Dellabetta <[email protected]>

kylesayrs reviewed Feb 19, 2025

View reviewed changes

AWQ running but OOMs unless NUM_CALIBRATION_SAMPLES and MAX_SEQUENCE_…

5cb055c

…LENGTH are very low Signed-off-by: Brian Dellabetta <[email protected]>

kylesayrs reviewed Feb 20, 2025

View reviewed changes

brian-dellabetta force-pushed the bdellabe/awq-modifier-v3 branch from 052ed7e to 9273ef3 Compare February 20, 2025 17:20

working with larger num_calibration_samples

28f8bca

Signed-off-by: Brian Dellabetta <[email protected]>

brian-dellabetta force-pushed the bdellabe/awq-modifier-v3 branch from 9273ef3 to 28f8bca Compare February 20, 2025 17:27

fix pile dataset issue

2226bfd

Signed-off-by: Brian Dellabetta <[email protected]>

kylesayrs reviewed Feb 20, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Bdellabe/awq modifier v3 #1177

Bdellabe/awq modifier v3 #1177

brian-dellabetta commented Feb 19, 2025

github-actions bot commented Feb 19, 2025

kylesayrs Feb 19, 2025

brian-dellabetta Feb 19, 2025 •

edited

Loading

kylesayrs Feb 19, 2025

brian-dellabetta Feb 20, 2025

kylesayrs Feb 20, 2025

kylesayrs Feb 20, 2025

kylesayrs Feb 20, 2025

brian-dellabetta Feb 20, 2025


		samples = [batch["input_ids"] for batch in dataloader]

		samples = torch.cat(samples, dim=0)

Bdellabe/awq modifier v3 #1177

Are you sure you want to change the base?

Bdellabe/awq modifier v3 #1177

Conversation

brian-dellabetta commented Feb 19, 2025

github-actions bot commented Feb 19, 2025

kylesayrs Feb 19, 2025

Choose a reason for hiding this comment

brian-dellabetta Feb 19, 2025 • edited Loading

Choose a reason for hiding this comment

kylesayrs Feb 19, 2025

Choose a reason for hiding this comment

brian-dellabetta Feb 20, 2025

Choose a reason for hiding this comment

kylesayrs Feb 20, 2025

Choose a reason for hiding this comment

kylesayrs Feb 20, 2025

Choose a reason for hiding this comment

kylesayrs Feb 20, 2025

Choose a reason for hiding this comment

brian-dellabetta Feb 20, 2025

Choose a reason for hiding this comment

brian-dellabetta Feb 19, 2025 •

edited

Loading