Enable pyTorch-IMage-Models (TIMM) with HPUs #1459

ZhengHongming888 · 2024-10-28T16:13:36Z

What does this PR do?

This PR contains the scripts that showcases how to inference/fine-tune the TIMM models on intel's HPUs with the lazy/graph modes. We support the training for single/multiple HPU cards both two. Currently we support several most downloadable models from Hugging Face as below list.

Here we support the below features:

Single-HPU training
Here we show how to fine-tune the imagenette2-320 dataset and model with timm/resnet50.a1_in1k from Hugging Face.

training with hpu lazy mode

python train_hpu_lazy.py \
    --data-dir ./imagenette2-320/ \
    --device 'hpu' \
    --model resnet50.a1_in1k

training with hpu graph mode

python train_hpu_graph.py \
    --data-dir ./imagenette2-320/ \
    --device 'hpu' \
    --model resnet50.a1_in1k

Multi-HPU training

training with hpu lazy mode

torchrun --nnodes 1 --nproc_per_node 2 \
    train_hpu_lazy.py \
    --data-dir ./imagenette2-320/ \
    --device 'hpu' \
    --model resnet50.a1_in1k

training with hpu graph mode

torchrun --nnodes 1 --nproc_per_node 2 \
    train_hpu_graph.py \
    --data-dir ./imagenette2-320/ \
    --device 'hpu' \
    --model resnet50.a1_in1k

Single HPU inference

hpu with graph_mode

python inference.py \
    --data-dir='./download_ds/imagenette2-320' \
    --device='hpu' \
    --model resnet50.a1_in1k \
    --graph_mode

hpu with lazy mode

python inference.py \
    --data-dir='./download_ds/imagenette2-320' 
    --device='hpu' \
    --model resnet50.a1_in1k

Welcome for any suggestions/comments. Thanks.

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you make sure to update the documentation with your changes?
Did you write any new necessary tests?

github-actions · 2024-11-25T18:45:30Z

The code quality check failed, please run make style.

HuggingFaceDocBuilderDev · 2024-11-25T18:48:49Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

examples/pytorch-image-models/README.md

Co-authored-by: regisss <[email protected]>

regisss · 2024-11-28T23:46:29Z

@ZhengHongming888 Can you add a link to the scripts these examples are inspired of please? You can put it in the docstring at the beginning of each file

ZhengHongming888 · 2024-11-30T05:35:46Z

@ZhengHongming888 Can you add a link to the scripts these examples are inspired of please? You can put it in the docstring at the beginning of each file

@regisss updated please check! Thanks.

regisss

Finally can you create a test_timm.py file in https://github.com/huggingface/optimum-habana/tree/main/tests where you quickly test the example scripts please?
For example just running a few training/inference steps to make sure the examples work.

ZhengHongming888 · 2024-12-03T07:33:36Z

Finally can you create a test_timm.py file in https://github.com/huggingface/optimum-habana/tree/main/tests where you quickly test the example scripts please? For example just running a few training/inference steps to make sure the examples work.

Thanks @regisss Done please check! Thanks.

regisss

LGTM!

ZhengHongming888 · 2024-12-03T16:57:43Z

LGTM!

@regisss @libinta Thanks much for your time on review!!

Co-authored-by: regisss <[email protected]>

ZhengHongming888 added 9 commits October 15, 2024 11:04

create branch for enable_timm_with_hpu

1a27976

readme.md

3bfe47f

Merge branch 'huggingface:main' into enable_timm_with_hpu

e6bb532

add inference

b1d33d0

minor

c44436f

minor

e9374e6

minor

a70980c

minor on readme

69f30f9

minor on readme

0a40a13

ZhengHongming888 requested a review from regisss as a code owner October 28, 2024 16:13

ssarkar2 approved these changes Nov 13, 2024

View reviewed changes

libinta added the run-test Run CI for PRs from external contributors label Nov 25, 2024

regisss reviewed Nov 25, 2024

View reviewed changes

ZhengHongming888 and others added 15 commits November 25, 2024 14:20

Merge branch 'huggingface:main' into enable_timm_with_hpu

2fd4cec

Merge branch 'huggingface:main' into enable_timm_with_hpu

97ea298

Update examples/pytorch-image-models/README.md

c85fe96

Co-authored-by: regisss <[email protected]>

Update examples/pytorch-image-models/README.md

fd4abed

Co-authored-by: regisss <[email protected]>

Update examples/pytorch-image-models/README.md

18c29a1

Co-authored-by: regisss <[email protected]>

Update examples/pytorch-image-models/README.md

f19d5cf

Co-authored-by: regisss <[email protected]>

Update examples/pytorch-image-models/README.md

4349fe3

Co-authored-by: regisss <[email protected]>

Update examples/pytorch-image-models/README.md

675038e

Co-authored-by: regisss <[email protected]>

Update examples/pytorch-image-models/README.md

e1fafc5

Co-authored-by: regisss <[email protected]>

Update examples/pytorch-image-models/README.md

5c72ab8

Co-authored-by: regisss <[email protected]>

Update examples/pytorch-image-models/README.md

8db5ce0

Co-authored-by: regisss <[email protected]>

update readme

2fde87a

Merge branch 'huggingface:main' into enable_timm_with_hpu

bb35584

update readme

0965d8f

update readme

2824d94

ZhengHongming888 added 5 commits November 26, 2024 19:54

make style

6ef5066

minor

d52179f

minor

0387e2e

minor

499d785

minor

56f5b0f

ZhengHongming888 added 2 commits November 29, 2024 21:18

Merge branch 'huggingface:main' into enable_timm_with_hpu

b9f83f8

add link in each script

40c9a6d

regisss reviewed Dec 1, 2024

View reviewed changes

ZhengHongming888 added 3 commits December 2, 2024 22:54

Merge branch 'huggingface:main' into enable_timm_with_hpu

84398b9

add timm example into tests

ee969ce

minor

47b8458

regisss approved these changes Dec 3, 2024

View reviewed changes

regisss merged commit 5485726 into huggingface:main Dec 3, 2024
4 checks passed

regisss added a commit that referenced this pull request Dec 3, 2024

Enable pyTorch-IMage-Models (TIMM) with HPUs (#1459)

7ea6a54

Co-authored-by: regisss <[email protected]>

Liangyx2 pushed a commit to HabanaAI/optimum-habana-fork that referenced this pull request Jan 20, 2025

Enable pyTorch-IMage-Models (TIMM) with HPUs (huggingface#1459)

e0e2ec4

Co-authored-by: regisss <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Enable pyTorch-IMage-Models (TIMM) with HPUs #1459

Enable pyTorch-IMage-Models (TIMM) with HPUs #1459

ZhengHongming888 commented Oct 28, 2024

github-actions bot commented Nov 25, 2024

HuggingFaceDocBuilderDev commented Nov 25, 2024

regisss commented Nov 28, 2024

ZhengHongming888 commented Nov 30, 2024

regisss left a comment

ZhengHongming888 commented Dec 3, 2024

regisss left a comment

ZhengHongming888 commented Dec 3, 2024

Enable pyTorch-IMage-Models (TIMM) with HPUs #1459

Enable pyTorch-IMage-Models (TIMM) with HPUs #1459

Conversation

ZhengHongming888 commented Oct 28, 2024

What does this PR do?

Before submitting

github-actions bot commented Nov 25, 2024

HuggingFaceDocBuilderDev commented Nov 25, 2024

regisss commented Nov 28, 2024

ZhengHongming888 commented Nov 30, 2024

regisss left a comment

Choose a reason for hiding this comment

ZhengHongming888 commented Dec 3, 2024

regisss left a comment

Choose a reason for hiding this comment

ZhengHongming888 commented Dec 3, 2024