
Fix: Disable Sparse Decompression for Dense Compressors #237

Merged: 3 commits merged into main on Jan 10, 2025

Conversation

@rahul-tuli (Member) commented Jan 9, 2025

Problem

When the sparse compressor is set to "dense", sparse decompression is incorrectly triggered, causing uninitialized weights and downstream errors.
Example CI failure: [GitHub Actions Log](https://github.com/vllm-project/llm-compressor/actions/runs/12659596814/job/35326229412).

Solution

Added a condition to skip sparse decompression when the sparsity configuration format is "dense".
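The guard described above can be sketched as follows. Note that `SparsityConfig`, `ModelCompressor`, and the method names here are illustrative placeholders for this sketch, not the actual compressed-tensors API:

```python
from dataclasses import dataclass


@dataclass
class SparsityConfig:
    # Hypothetical stand-in for the sparsity configuration; "dense"
    # and "sparse_bitmask" are example format strings.
    format: str


class ModelCompressor:
    # Simplified stand-in for the real compressor class.
    def __init__(self, sparsity_config=None):
        self.sparsity_config = sparsity_config

    def decompress(self, model_state):
        # The fix: skip sparse decompression when the stored format is
        # "dense". Dense weights are already fully materialized, and
        # running the sparse path would leave them uninitialized.
        if (
            self.sparsity_config is not None
            and self.sparsity_config.format != "dense"
        ):
            model_state = self._sparse_decompress(model_state)
        return model_state

    def _sparse_decompress(self, model_state):
        # Placeholder for the real sparse decompression routine.
        return {k: ("decompressed", v) for k, v in model_state.items()}
```

With this guard, a compressor configured with `format="dense"` returns the state dict untouched, while any other format still goes through the sparse path.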

Testing

  • Verified against llm-compressor main commit: 03e21770.
  • Confirmed weights load correctly for dense compressors.
  • All CI workflows pass without regressions.

@rahul-tuli rahul-tuli changed the title Turn off sparse decompression when sparse compressor is dense Fix: Disable Sparse Decompression for Dense Compressors Jan 9, 2025
@rahul-tuli rahul-tuli marked this pull request as ready for review January 10, 2025 00:22
@mgoin (Member) previously approved these changes Jan 10, 2025:

Makes sense, thanks!

@kylesayrs (Contributor) left a comment:

LGTM

@dsikka (Contributor) left a comment:

LGTM

@rahul-tuli rahul-tuli requested a review from kylesayrs January 10, 2025 14:20
@rahul-tuli rahul-tuli force-pushed the turn-off-sparse-decompression-when-dense branch from 51cee4e to cc4f78e Compare January 10, 2025 14:28
@dsikka dsikka merged commit 6fffbd7 into main Jan 10, 2025
1 check failed
@dsikka dsikka deleted the turn-off-sparse-decompression-when-dense branch January 10, 2025 15:51
dsikka added a commit to vllm-project/llm-compressor that referenced this pull request Jan 23, 2025
~~Contingent on merge of huggingface/transformers#34719~~ (since merged and released)


Blocked on 
neuralmagic/compressed-tensors#237

SUMMARY:
* In several optimization tests, automatically decompress the model when an optimized model is provided
* Fix recipe stage length
* Revive old code
* When running multiple optimizations (e.g. oneshot then finetune, or oneshot then oneshot), the recipes need to be added to the session using `initialize_recipe`. Example here:
https://github.com/vllm-project/llm-compressor/pull/971/files#diff-c9ae8b3ad24d13abeea5b649a5fd6d0b0925f5c9cc40220cbfbe21ae81242f8dR63-R65
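The consecutive-run bookkeeping described in the last bullet can be sketched as below. `Session` and `initialize_recipe` here are simplified stand-ins for the llm-compressor session API, not its real classes or signatures:

```python
class Session:
    # Hypothetical minimal session that tracks recipe history.
    def __init__(self):
        self.recipes = []

    def initialize_recipe(self, recipe):
        # Register this stage's recipe so that later stages (e.g. a
        # second oneshot or a finetune pass) see the full history.
        self.recipes.append(recipe)


def run_stages(session, stages):
    """Apply optimization stages in order, registering each recipe first.

    `stages` is a list of (recipe, apply_fn) pairs; apply_fn is a
    placeholder for the actual oneshot/finetune call.
    """
    for recipe, apply_fn in stages:
        session.initialize_recipe(recipe)
        apply_fn()
    return session.recipes
```

The point of the sketch is only the ordering: each stage's recipe is added to the session before the stage runs, so a second optimization pass sees the first pass's recipe.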


TEST PLAN:
* Ran the tests using transformers main
* Must pass tests/llmcompressor/transformers/obcq/test_consecutive_runs.py

---------

Co-authored-by: Dipika Sikka <[email protected]>
Co-authored-by: Rahul Tuli <[email protected]>
rahul-tuli added a commit to vllm-project/llm-compressor that referenced this pull request Jan 28, 2025
5 participants