-
Notifications
You must be signed in to change notification settings - Fork 84
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Test Fix] Quant model reload #974
Conversation
👋 Hi! Thank you for contributing to llm-compressor. Please add the ready label when the PR is ready for review. |
/ready |
tests/llmcompressor/transformers/sparsification/test_compress_tensor_utils.py
Outdated
Show resolved
Hide resolved
…ject/llm-compressor into fix-test_compress-tensors-utils
tests/llmcompressor/transformers/sparsification/test_compress_tensor_utils.py
Show resolved
Hide resolved
tests/llmcompressor/transformers/sparsification/test_compress_tensor_utils.py
Show resolved
Hide resolved
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Could we make the description a little clearer with details of what was going on and why this change is needed?
Done! |
~~Contingent on merge of huggingface/transformers#34719 ~~^ has been merged not yet released~~ ^ has been released SUMMARY: Update test to use AutoModelForCausalLM decompressor instead of manually instantiating the compressor and decompressing. AutoModelForCausalLM will run code that if quantization_config is recognized, it will run the same decompression TEST PLAN: Ran the test using transformers main Must pass: tests/llmcompressor/transformers/sparsification/test_compress_tensor_utils.py Signed-off-by: Kyle Sayers <[email protected]>
Contingent on merge of huggingface/transformers#34719^ has been merged not yet released^ has been released
SUMMARY:
Update test to use AutoModelForCausalLM decompressor instead of manually instantiating the compressor and decompressing. AutoModelForCausalLM will run code that if quantization_config is recognized, it will run the same decompression
TEST PLAN:
Ran the test using transformers main
Must pass: tests/llmcompressor/transformers/sparsification/test_compress_tensor_utils.py