[Test Fix] Add Quantization then finetune tests #964

horheynm · 2024-12-09T17:47:47Z

~~Contingent on merge of huggingface/transformers#34719~~
^ That PR has been merged but is not yet released.

SUMMARY:
Add a test that, given a model, applies oneshot quantization and then runs training on the quantized (PTQ) model.
The model must be loaded with run_compressed = False for training to run.

Note:

When running finetune on an already optimized (one-shotted) model, the model needs to be decompressed explicitly using CompressedTensorsConfig. See https://github.com/vllm-project/llm-compressor/pull/964/files#diff-e480ed475c0a5b2beb4052c1dd2aca671999634ace41a5ea017fdff1ce68be0bR130-R135
Tests passed on 2x H100s.
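
A minimal sketch of what explicit decompression at load time might look like. It assumes a transformers build that includes huggingface/transformers#34719 (which provides the `run_compressed` argument on `CompressedTensorsConfig`) and an installed compressed-tensors package. The helper name `load_decompressed_model` and its `checkpoint_dir` argument are hypothetical and are not taken from the PR.

```python
def load_decompressed_model(checkpoint_dir):
    """Load a one-shotted checkpoint with its weights decompressed for training.

    Hypothetical helper: `checkpoint_dir` is assumed to be a directory that a
    previous oneshot run saved in compressed-tensors format.
    """
    # Imported lazily so that defining the helper does not pull in heavy deps.
    from transformers import AutoModelForCausalLM
    from transformers.utils.quantization_config import CompressedTensorsConfig

    # run_compressed=False requests decompressed weights at load time
    # instead of keeping the model in its compressed form.
    quantization_config = CompressedTensorsConfig(run_compressed=False)
    return AutoModelForCausalLM.from_pretrained(
        checkpoint_dir,
        quantization_config=quantization_config,
    )
```

The returned model could then be handed to the finetuning entry point in place of the compressed checkpoint path.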

Also fixes a bug in log_sparsification where the layer name was not recognized, causing a failure. Here nothing is being sparsified, so num params is set to zero.

TEST PLAN:
Ran the test against transformers main.
tests/llmcompressor/transformers/finetune/test_oneshot_then_finetune.py must pass.

github-actions · 2024-12-09T17:47:59Z

👋 Hi! Thank you for contributing to llm-compressor. Please add the ready label when the PR is ready for review.

horheynm · 2024-12-11T22:31:42Z

/ready

tests/llmcompressor/transformers/finetune/test_oneshot_then_finetune.py

src/llmcompressor/pytorch/utils/sparsification.py

dsikka

LGTM.

As per offline conversation, let's decompress before running finetune
and verify whether we still need the skipif.

dsikka

LGTM. Please fix quality.

rahul-tuli

LGTM

kylesayrs

Need some clarification on params_quantized, otherwise LGTM!

src/llmcompressor/pytorch/utils/sparsification.py

kylesayrs · 2025-01-22T20:00:53Z

src/llmcompressor/pytorch/utils/sparsification.py

-                else 0
+        num_params = 0
+        for name, layer in get_quantized_layers(self.module):
+            num_param_weight = torch.numel(


    if getattr(layer, "weight", None) is not None:
        num_params += torch.numel(layer.weight)
    if getattr(layer, "bias", None) is not None:
        num_params += torch.numel(layer.bias)
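
The counting logic in the suggestion above can be sketched as a standalone helper. This is an illustration under stated assumptions, not the code that was merged: the function name `count_weight_and_bias_params` is hypothetical, and plain `nn.Linear` layers stand in for whatever `get_quantized_layers` yields in the PR.

```python
import torch
from torch import nn


def count_weight_and_bias_params(named_layers):
    """Sum the element counts of each layer's weight and bias, skipping absent ones."""
    num_params = 0
    for _name, layer in named_layers:
        # A layer may have no weight or no bias (e.g. bias=False), so guard both.
        if getattr(layer, "weight", None) is not None:
            num_params += torch.numel(layer.weight)
        if getattr(layer, "bias", None) is not None:
            num_params += torch.numel(layer.bias)
    return num_params


# Toy stand-ins for quantized layers: one Linear with a bias, one without.
toy_layers = [
    ("with_bias", nn.Linear(4, 3)),
    ("no_bias", nn.Linear(2, 2, bias=False)),
]
total = count_weight_and_bias_params(toy_layers)
print(total)
```

Guarding with `getattr(..., None)` keeps the count from failing on layers that lack one of the two attributes.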

add quantization then finetune -- run_compressed=False

92fbddc

horheynm added 4 commits December 9, 2024 12:48

add test

299eed3

Merge branch 'main' into quant-then-finetune

aebae9a

clean up

9ea94ed

Merge branch 'main' into quant-then-finetune

a264fc0

dsikka marked this pull request as draft December 12, 2024 17:01

kylesayrs reviewed Dec 12, 2024

tests/llmcompressor/transformers/finetune/test_oneshot_then_finetune.py (outdated)

horheynm changed the title ~~Add Quantization then finetune tests~~ [Test Fix] Add Quantization then finetune tests Dec 16, 2024

comments

ee4c70d

horheynm marked this pull request as ready for review December 23, 2024 14:12

Merge branch 'main' into quant-then-finetune

0d32d23

horheynm marked this pull request as draft December 23, 2024 15:03

Merge branch 'main' into quant-then-finetune

72b5431

kylesayrs previously approved these changes Jan 9, 2025

tests/llmcompressor/transformers/finetune/test_oneshot_then_finetune.py

horheynm added 2 commits January 10, 2025 08:43

add clarity on loading ckpt and carrying out finetune on saved model

8696be2

Merge branch 'quant-then-finetune' of github.com:vllm-project/llm-compressor into quant-then-finetune

cd22e88

horheynm dismissed kylesayrs’s stale review via cd22e88 January 10, 2025 13:43

horheynm marked this pull request as ready for review January 10, 2025 13:43

Merge branch 'main' into quant-then-finetune

fa5f3ff

kylesayrs previously approved these changes Jan 10, 2025

dsikka requested changes Jan 10, 2025

src/llmcompressor/pytorch/utils/sparsification.py

horheynm added 3 commits January 10, 2025 14:44

Merge branch 'main' into quant-then-finetune

ff318f9

update calculations

8ebf898

Merge branch 'quant-then-finetune' of github.com:vllm-project/llm-compressor into quant-then-finetune

5e952a0

horheynm dismissed kylesayrs’s stale review via 5e952a0 January 10, 2025 20:12

kylesayrs previously approved these changes Jan 10, 2025

Merge branch 'main' into quant-then-finetune

c8f56e6

dsikka reviewed Jan 14, 2025

dsikka and others added 3 commits January 14, 2025 17:30

Merge branch 'main' into quant-then-finetune

9270c09

decompress model explicitly

336c867

Merge branch 'quant-then-finetune' of github.com:vllm-project/llm-compressor into quant-then-finetune

d9c806e

horheynm dismissed kylesayrs’s stale review via d9c806e January 14, 2025 22:39

remove skipif

ab35528

dsikka previously approved these changes Jan 14, 2025

horheynm added 2 commits January 14, 2025 20:36

Merge branch 'main' into quant-then-finetune

00e4d9f

lint

a219e95

horheynm dismissed dsikka’s stale review via a219e95 January 15, 2025 01:38

Merge branch 'main' into quant-then-finetune

ca743a5

rahul-tuli previously approved these changes Jan 20, 2025

kylesayrs reviewed Jan 20, 2025

src/llmcompressor/pytorch/utils/sparsification.py

dsikka and others added 2 commits January 20, 2025 12:30

Merge branch 'main' into quant-then-finetune

837430b

Merge branch 'main' into quant-then-finetune

305a93f

kylesayrs reviewed Jan 22, 2025

comment

15884dc

horheynm dismissed rahul-tuli’s stale review via 15884dc January 22, 2025 20:52

unindent

6e7bfa6

kylesayrs approved these changes Jan 22, 2025

dsikka approved these changes Jan 22, 2025

dsikka added the "ready" label (When a PR is ready for review) Jan 22, 2025

Merge branch 'main' into quant-then-finetune

6884b78

dsikka merged commit b105c55 into main Jan 23, 2025
5 of 7 checks passed

dsikka deleted the quant-then-finetune branch January 23, 2025 00:48
