[Test Fix] Quant model reload #974

horheynm · 2024-12-11T19:59:23Z

~~Contingent on merge of huggingface/transformers#34719~~
~~^ has been merged not yet released~~
^ has been released

SUMMARY:
Update the test to reload the model through AutoModelForCausalLM instead of manually instantiating the compressor and decompressing. When AutoModelForCausalLM recognizes the model's quantization_config, it runs the same decompression during loading.
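
A minimal sketch of the reload path described above, not the actual test code. The helper names `declares_compressed_tensors` and `reload_model` are hypothetical, and the check assumes the saved `config.json` marks compressed-tensors checkpoints with `quant_method: "compressed-tensors"`. Whether the weights come back decompressed depends on the installed transformers version and the loaded quantization config.

```python
import json
from pathlib import Path


def declares_compressed_tensors(model_dir):
    """Return True if config.json in model_dir carries a quantization_config
    whose quant_method is "compressed-tensors" (assumed marker)."""
    config = json.loads((Path(model_dir) / "config.json").read_text())
    quant_config = config.get("quantization_config") or {}
    return quant_config.get("quant_method") == "compressed-tensors"


def reload_model(model_dir):
    """Reload a saved model through AutoModelForCausalLM (hypothetical helper).

    The loader reads quantization_config itself, so no compressor is
    instantiated by hand here.
    """
    # Imported lazily so the config check above works without transformers.
    from transformers import AutoModelForCausalLM

    return AutoModelForCausalLM.from_pretrained(model_dir)
```

A test could call `declares_compressed_tensors(path)` as a sanity check before `reload_model(path)` and then compare the reloaded weights against the original model.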

TEST PLAN:
Ran the test using transformers main
Must pass: tests/llmcompressor/transformers/sparsification/test_compress_tensor_utils.py

github-actions · 2024-12-11T19:59:35Z

👋 Hi! Thank you for contributing to llm-compressor. Please add the ready label when the PR is ready for review.

horheynm · 2024-12-11T22:31:22Z

/ready

rahul-tuli

Could we make the description a little clearer with details of what was going on and why this change is needed?

horheynm · 2025-01-10T21:19:57Z

> Could we make the description a little clearer with details of what was going on and why this change is needed?

Done!

~~Contingent on merge of huggingface/transformers#34719~~
~~^ has been merged not yet released~~
^ has been released

SUMMARY:
Update the test to reload the model through AutoModelForCausalLM instead of manually instantiating the compressor and decompressing. When AutoModelForCausalLM recognizes the model's quantization_config, it runs the same decompression during loading.

TEST PLAN:
Ran the test using transformers main
Must pass: tests/llmcompressor/transformers/sparsification/test_compress_tensor_utils.py

Signed-off-by: Kyle Sayers <[email protected]>

fix test - use automodelforcausallm decompress

5da01c4

horheynm marked this pull request as ready for review December 11, 2024 19:59

dsikka marked this pull request as draft December 12, 2024 17:00

kylesayrs approved these changes Dec 14, 2024

horheynm changed the title ~~fix test - use automodelforcausallm decompress~~ [Test Fix] Decompression Dec 16, 2024

horheynm changed the title ~~[Test Fix] Decompression~~ [Test Fix] Sparse model reload Dec 16, 2024

horheynm marked this pull request as ready for review December 23, 2024 14:09

Merge branch 'main' into fix-test_compress-tensors-utils

c0a552a

horheynm marked this pull request as draft December 23, 2024 15:02

kylesayrs reviewed Jan 9, 2025

horheynm changed the title ~~[Test Fix] Sparse model reload~~ [Test Fix] Quanti model reload Jan 9, 2025

kylesayrs previously approved these changes Jan 9, 2025

horheynm changed the title ~~[Test Fix] Quanti model reload~~ [Test Fix] Quant model reload Jan 9, 2025

horheynm added 2 commits January 9, 2025 15:51

change num calib samples to 16

83eac37

Merge branch 'fix-test_compress-tensors-utils' of github.com:vllm-project/llm-compressor into fix-test_compress-tensors-utils

2f1581b

horheynm dismissed kylesayrs’s stale review via 2f1581b January 9, 2025 20:52

horheynm marked this pull request as ready for review January 10, 2025 13:49

horheynm added 2 commits January 10, 2025 08:49

Merge branch 'main' into fix-test_compress-tensors-utils

9c66d05

Merge branch 'main' into fix-test_compress-tensors-utils

4a68e3d

kylesayrs approved these changes Jan 10, 2025

dsikka requested changes Jan 10, 2025

Merge branch 'main' into fix-test_compress-tensors-utils

25e870e

dsikka approved these changes Jan 10, 2025

Merge branch 'main' into fix-test_compress-tensors-utils

fc241b2

rahul-tuli approved these changes Jan 10, 2025

dsikka merged commit 0535613 into main Jan 10, 2025
6 of 7 checks passed

dsikka deleted the fix-test_compress-tensors-utils branch January 10, 2025 21:26
