Skip tests broken by change of `_convert_weight_to_int4pack` #504

jerryzh168 · 2024-07-15T23:17:03Z

Summary:
Skips torchao tests after the bc breaking change in pytorch/pytorch#129940

waiting for @yanbing-j to fix the issue

Test Plan:
python test/quantization/test_quant_api.py -k test_quantized_tensor_subclass_int4 python test/integration/test_integration.py -k test_save_load_int4woqtensors_2_cpu

Reviewers:

Subscribers:

Tasks:

Tags:

pytorch-bot · 2024-07-15T23:17:05Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/504

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 35ac8f5 with merge base 1029df3 ():
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

msaroufim · 2024-07-15T23:29:44Z

Can you also include this commit? https://github.com/pytorch/ao/pull/501/files

jerryzh168 · 2024-07-15T23:32:37Z

Fixes torchao code after the bc breaking change in pytorch/pytorch#129940

sure, this PR is not done yet btw

msaroufim · 2024-07-16T01:18:48Z

per offline discussion let's just add a skip test for torch version greater than 2.5

Summary: Fixes torchao code after the bc breaking change in pytorch/pytorch#129940 Test Plan: python test/quantization/test_quant_api.py -k test_quantized_tensor_subclass_int4 python test/integration/test_integration.py -k test_save_load_int4woqtensors_2_cpu Reviewers: Subscribers: Tasks: Tags:

…#504) Summary: Fixes torchao code after the bc breaking change in pytorch/pytorch#129940 Test Plan: python test/quantization/test_quant_api.py -k test_quantized_tensor_subclass_int4 python test/integration/test_integration.py -k test_save_load_int4woqtensors_2_cpu Reviewers: Subscribers: Tasks: Tags:

* remove macos-12 test * pip to pip3

* make --device fast the default * Update iOS.md (pytorch#517) * Update iOS.md * Update iOS.md * Pip to pip3 (pytorch#504) * remove macos-12 test * pip to pip3 * break aoti CI jobs separately (pytorch#500) * init * fixes * more fixes * fixes * fix * fix * bug fix * add objcopy update * suppress int8 * undefined variable --------- Co-authored-by: Michael Gschwind <[email protected]> * Support llama3 in chat in run.cpp (pytorch#486) * refactor chat runner in preparation for llama3 * add sketch for llama3 prompt template and move to returning tokens * fix tiktoken * fixes to chat * add default llama_ver * Add tests for quantize json, add cuda device specification and precision to cuda.json (pytorch#519) * remove code for no KV Cache path (pytorch#527) * Update ADVANCED-USERS.md (pytorch#529) Update Advanced Users description to reflect changes in the repo since the description was initially created. * runner-aoti on cuda (pytorch#531) * runner-aoti on cuda * transfer results back to CPU * transfer results back to CPU * runner-aoti on cuda * Update runner_build.md (pytorch#530) Update description of runner and build process in runner_build.md * clean up runner code a little (pytorch#532) * clean up runner code a little * update * update * pull out generate loop in chat * updates * edit docs * typo * move int8 linear class and function into qops.py (pytorch#534) * add dtype tests for runner-aoti + runner-et (pytorch#539) * add dtype tests for runner-aoti + runner-et * typo * Quantized embedding (pytorch#536) * move int8 linear class and function into qops.py * move Quantized Embedding to qops.py * Move Linear int4 to qops (pytorch#537) * move int8 linear class and function into qops.py * move Quantized Embedding to qops.py * move int4 linear to qops * Revert "add dtype tests for runner-aoti + runner-et (pytorch#539)" (pytorch#548) This reverts commit a7a24577a65be67ac9ae4dc05452f35d9c49e5d1. * fix generate for llama3 (pytorch#538) * fix generate for llama3 * switch more things to C * remove C++ header * add delegation visualization instructions (pytorch#551) * Add dtype runner aoti (pytorch#552) * add dtype tests for runner-aoti + runner-et * typo * add dtype test runner-aoti * test sdpa with fp16 (pytorch#553) * test sdpa with fp16 * kv cache fp32 * typo * update (pytorch#560) * Only support newest versions of lm-eval (pytorch#556) Summary: remove support for lm-eval 0.3 to reduce the options we have Test Plan: CI Reviewers: Subscribers: Tasks: Tags: * split cpu eval CI by dtype (pytorch#554) * split cpu eval CI by dtype * fix * differentiate names with checks * keep one name the same as old * fix * Removing duplicate HF issue message from README (pytorch#559) Co-authored-by: Michael Gschwind <[email protected]> * doc updates (pytorch#567) * Add VM-safe MPS check --------- Co-authored-by: Anthony Shoumikhin <[email protected]> Co-authored-by: metascroy <[email protected]> Co-authored-by: Nikita Shulga <[email protected]> Co-authored-by: lucylq <[email protected]> Co-authored-by: Jerry Zhang <[email protected]> Co-authored-by: Jack-Khuu <[email protected]>

* code beautification * code beautification, move functions together * make --device fast the default (pytorch#515) * make --device fast the default * Update iOS.md (pytorch#517) * Update iOS.md * Update iOS.md * Pip to pip3 (pytorch#504) * remove macos-12 test * pip to pip3 * break aoti CI jobs separately (pytorch#500) * init * fixes * more fixes * fixes * fix * fix * bug fix * add objcopy update * suppress int8 * undefined variable --------- Co-authored-by: Michael Gschwind <[email protected]> * Support llama3 in chat in run.cpp (pytorch#486) * refactor chat runner in preparation for llama3 * add sketch for llama3 prompt template and move to returning tokens * fix tiktoken * fixes to chat * add default llama_ver * Add tests for quantize json, add cuda device specification and precision to cuda.json (pytorch#519) * remove code for no KV Cache path (pytorch#527) * Update ADVANCED-USERS.md (pytorch#529) Update Advanced Users description to reflect changes in the repo since the description was initially created. * runner-aoti on cuda (pytorch#531) * runner-aoti on cuda * transfer results back to CPU * transfer results back to CPU * runner-aoti on cuda * Update runner_build.md (pytorch#530) Update description of runner and build process in runner_build.md * clean up runner code a little (pytorch#532) * clean up runner code a little * update * update * pull out generate loop in chat * updates * edit docs * typo * move int8 linear class and function into qops.py (pytorch#534) * add dtype tests for runner-aoti + runner-et (pytorch#539) * add dtype tests for runner-aoti + runner-et * typo * Quantized embedding (pytorch#536) * move int8 linear class and function into qops.py * move Quantized Embedding to qops.py * Move Linear int4 to qops (pytorch#537) * move int8 linear class and function into qops.py * move Quantized Embedding to qops.py * move int4 linear to qops * Revert "add dtype tests for runner-aoti + runner-et (pytorch#539)" (pytorch#548) This reverts commit a7a24577a65be67ac9ae4dc05452f35d9c49e5d1. * fix generate for llama3 (pytorch#538) * fix generate for llama3 * switch more things to C * remove C++ header * add delegation visualization instructions (pytorch#551) * Add dtype runner aoti (pytorch#552) * add dtype tests for runner-aoti + runner-et * typo * add dtype test runner-aoti * test sdpa with fp16 (pytorch#553) * test sdpa with fp16 * kv cache fp32 * typo * update (pytorch#560) * Only support newest versions of lm-eval (pytorch#556) Summary: remove support for lm-eval 0.3 to reduce the options we have Test Plan: CI Reviewers: Subscribers: Tasks: Tags: * split cpu eval CI by dtype (pytorch#554) * split cpu eval CI by dtype * fix * differentiate names with checks * keep one name the same as old * fix * Removing duplicate HF issue message from README (pytorch#559) Co-authored-by: Michael Gschwind <[email protected]> * doc updates (pytorch#567) * Add VM-safe MPS check --------- Co-authored-by: Anthony Shoumikhin <[email protected]> Co-authored-by: metascroy <[email protected]> Co-authored-by: Nikita Shulga <[email protected]> Co-authored-by: lucylq <[email protected]> Co-authored-by: Jerry Zhang <[email protected]> Co-authored-by: Jack-Khuu <[email protected]> * add unpacking support (pytorch#525) * add unpacking support * fix typos and linter * perform parallel prefill when possible (pytorch#568) * perform parallel prefill when possible * typo * disable hack * remove print * remove debug messages which prevent export * fixes * stream results in generate.py (#571) * remove logging interfering with export --------- Co-authored-by: Anthony Shoumikhin <[email protected]> Co-authored-by: metascroy <[email protected]> Co-authored-by: Nikita Shulga <[email protected]> Co-authored-by: lucylq <[email protected]> Co-authored-by: Jerry Zhang <[email protected]> Co-authored-by: Jack-Khuu <[email protected]>

facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Jul 15, 2024

jerryzh168 changed the title ~~Update torchao after https://github.com/pytorch/pytorch/pull/129940~~ Update torchao after bc breaking change of _convert_weight_to_int4pack Jul 15, 2024

jerryzh168 changed the title ~~Update torchao after bc breaking change of _convert_weight_to_int4pack~~ Update torchao after bc breaking change of _convert_weight_to_int4pack Jul 15, 2024

jerryzh168 mentioned this pull request Jul 15, 2024

update the input weight of _convert_weight_to_int4pack to [n][k / 2] uint8 pytorch/pytorch#129940

Closed

jerryzh168 force-pushed the fix-ci branch from bd4b1da to bc021f8 Compare July 15, 2024 23:35

gau-nernst mentioned this pull request Jul 16, 2024

Fix failing uint4 test #501

Closed

msaroufim mentioned this pull request Jul 16, 2024

pin nightly to 2.5.0.dev20240709+cu121 #505

Merged

jerryzh168 force-pushed the fix-ci branch 2 times, most recently from 6b204df to 78a3c09 Compare July 16, 2024 03:01

jerryzh168 force-pushed the fix-ci branch from 78a3c09 to 35ac8f5 Compare July 16, 2024 17:22

jerryzh168 requested review from msaroufim and gau-nernst July 16, 2024 18:14

jerryzh168 changed the title ~~Update torchao after bc breaking change of _convert_weight_to_int4pack~~ Skip tests broken by change of _convert_weight_to_int4pack Jul 16, 2024

andrewor14 approved these changes Jul 16, 2024

View reviewed changes

jerryzh168 merged commit 6e7cf71 into pytorch:main Jul 16, 2024
13 checks passed

yanbing-j pushed a commit to yanbing-j/ao that referenced this pull request Dec 9, 2024

Pip to pip3 (pytorch#504)

217cb72

* remove macos-12 test * pip to pip3

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Skip tests broken by change of `_convert_weight_to_int4pack` #504

Skip tests broken by change of `_convert_weight_to_int4pack` #504

jerryzh168 commented Jul 15, 2024 •

edited

Loading

pytorch-bot bot commented Jul 15, 2024 •

edited

Loading

msaroufim commented Jul 15, 2024

jerryzh168 commented Jul 15, 2024

msaroufim commented Jul 16, 2024

Skip tests broken by change of _convert_weight_to_int4pack #504

Skip tests broken by change of _convert_weight_to_int4pack #504

Conversation

jerryzh168 commented Jul 15, 2024 • edited Loading

pytorch-bot bot commented Jul 15, 2024 • edited Loading

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/504

✅ No Failures

msaroufim commented Jul 15, 2024

jerryzh168 commented Jul 15, 2024

msaroufim commented Jul 16, 2024

Skip tests broken by change of `_convert_weight_to_int4pack` #504

Skip tests broken by change of `_convert_weight_to_int4pack` #504

jerryzh168 commented Jul 15, 2024 •

edited

Loading

pytorch-bot bot commented Jul 15, 2024 •

edited

Loading