Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Enable GPTQModel #2064

Merged
merged 28 commits into from
Dec 19, 2024
Merged
Show file tree
Hide file tree
Changes from 15 commits
Commits
Show all changes
28 commits
Select commit Hold shift + click to select a range
3b6ddfc
align gptq check to transformers for supporting cpu
jiqing-feng Oct 16, 2024
50a405a
fix comment
jiqing-feng Oct 16, 2024
1cce05b
Merge branch 'huggingface:main' into gptq
jiqing-feng Nov 29, 2024
9bb7694
gptqmodel
jiqing-feng Nov 29, 2024
4709869
compatible with auto-gptq
jiqing-feng Dec 2, 2024
32d434f
fix compatible with auto-gptq
jiqing-feng Dec 2, 2024
0640350
fix compatible with auto-gptq linear
jiqing-feng Dec 2, 2024
8c1c142
revert unrelated changes
jiqing-feng Dec 2, 2024
27d2f2b
gptqmodel need use checkpoint_format (#1)
LRL-ModelCloud Dec 3, 2024
b65cd7f
Mod backend code (#2)
LRL-ModelCloud Dec 4, 2024
20e81c4
fix format and log
jiqing-feng Dec 4, 2024
1bad53e
fix version check
jiqing-feng Dec 4, 2024
ea29c3c
enable gptqmodel tests
jiqing-feng Dec 4, 2024
aa3d558
update check quant type
jiqing-feng Dec 4, 2024
5979473
Fix optimum compat (#3)
ZX-ModelCloud Dec 5, 2024
3603a0b
fix format and convert v2 to v1
jiqing-feng Dec 16, 2024
32b0e7d
[Fix] all tensors not same device (#5)
ZX-ModelCloud Dec 16, 2024
dea8a47
fix format
jiqing-feng Dec 16, 2024
69cf2e3
add gptqmodel tests which contains cpu
jiqing-feng Dec 16, 2024
7312b7a
fix all auto-gptq tests
jiqing-feng Dec 17, 2024
f9b30c1
revert tests
jiqing-feng Dec 17, 2024
19e7261
rm gptqmodel yaml
jiqing-feng Dec 17, 2024
7125fe9
fix comment
jiqing-feng Dec 17, 2024
b61ef4a
enable real cpu tests by fp32
jiqing-feng Dec 18, 2024
d81ce2e
fix test model name
jiqing-feng Dec 18, 2024
f2b9688
keep the original device setting when using auto-gptq
jiqing-feng Dec 18, 2024
c446522
Update optimum/gptq/quantizer.py
jiqing-feng Dec 19, 2024
c8a6528
Update optimum/gptq/quantizer.py
jiqing-feng Dec 19, 2024
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Loading
Loading