Mimic adamw_torch_4bit and have adamw_torch_8bit #34893
Labels: Feature request (request for a new feature)
Comments
cc @muellerzr for deepspeed/accelerate!
A PR for this would be great 🤗 cc @SunMarc
Thanks! I will do that later.
Feel free to add it! Let me know if you need any help.
Thanks! I will first mimic the 4bit one and see whether it works.
PR created: #34993
Feature request
Hi, thanks for the lib! Currently there is adamw_torch_4bit, but I hope to mimic it to have an adamw_torch_8bit that uses the 8-bit torchao AdamW. The reason is that I would like to use DeepSpeed CPU offload for the optimizer together with 8-bit AdamW. However, the 8-bit optimizer in current HF Transformers does not support CPU, so I need to use the torchao one.
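For background, the core idea behind 8-bit AdamW variants is storing the optimizer's moment tensors in 8-bit codes with per-block scales, dequantizing on the fly at each step. The sketch below is a minimal, stdlib-only illustration of block-wise absmax int8 quantization of optimizer state; it is not torchao's or bitsandbytes' actual implementation, and the function names are made up for illustration.

```python
# Illustrative sketch of block-wise 8-bit quantization of optimizer state
# (the mechanism that lets 8-bit AdamW cut state memory ~4x vs fp32).
# Hypothetical helpers, not the torchao API.

def quantize_blockwise(values, block_size=4):
    """Quantize floats to int8 codes, one absmax scale per block."""
    blocks = []
    for i in range(0, len(values), block_size):
        block = values[i:i + block_size]
        scale = max(abs(v) for v in block) or 1.0  # avoid div-by-zero
        codes = [round(v / scale * 127) for v in block]  # int8 range
        blocks.append((scale, codes))
    return blocks

def dequantize_blockwise(blocks):
    """Recover approximate floats from (scale, codes) blocks."""
    out = []
    for scale, codes in blocks:
        out.extend(c / 127 * scale for c in codes)
    return out

# Example: a fake slice of an Adam moment buffer.
state = [0.01, -0.5, 0.25, 0.003, 1.5, -0.75, 0.0, 0.125]
restored = dequantize_blockwise(quantize_blockwise(state))
# Per-element error is bounded by that block's scale / 254.
assert all(abs(a - b) < 0.01 for a, b in zip(state, restored))
```

The per-block scale is what keeps small-magnitude entries accurate even when another part of the tensor has large values, which matters for Adam's second-moment estimates.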
Motivation
Your contribution
Yes, I am willing to open a PR.