Exported beam search model consumes a lot more memory #12246

HaoboGu · 2022-07-20T03:25:08Z

Hello, I used convert_beam_search.py from the latest master to convert my gpt2 model into a beam search gpt2 model. I found that loading the converted beam search model consumes roughly twice as much memory as loading the original gpt2 model. Here is a screenshot of loading my gpt2 model before and after the conversion:

I also tried tiny-gpt2 from the Hugging Face model hub. Memory consumption also increases noticeably (30MB -> 43MB) after converting tiny-gpt2 to a beam search model.
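For anyone who wants to reproduce the comparison without screenshots, here is a minimal sketch of one way to measure it, using only the Python standard library on macOS or Linux. The helper names `peak_rss_mb` and `measure_load` are made up for illustration and are not part of ONNX Runtime. It reads the process's peak resident set size before and after a loading step. Peak RSS is a high-water mark, so each model should be measured in a fresh process.

```python
import resource
import sys


def peak_rss_mb():
    """Return this process's peak resident set size in MiB."""
    peak = resource.getrusage(resource.RUSAGE_SELF).ru_maxrss
    # ru_maxrss is reported in bytes on macOS and in kilobytes on Linux.
    divisor = 1024 * 1024 if sys.platform == "darwin" else 1024
    return peak / divisor


def measure_load(load_fn):
    """Call load_fn() and return (result, growth of peak RSS in MiB)."""
    before = peak_rss_mb()
    result = load_fn()  # the caller keeps the result, so its memory stays alive
    after = peak_rss_mb()
    return result, after - before


if __name__ == "__main__":
    # Stand-in workload: allocate and fill about 64 MiB of memory.
    blob, grew = measure_load(lambda: b"x" * (64 * 1024 * 1024))
    print(f"peak RSS grew by {grew:.1f} MiB")
```

With onnxruntime installed, the stand-in lambda could be swapped for something like `lambda: onnxruntime.InferenceSession("gpt2.onnx", providers=["CPUExecutionProvider"])`, where `gpt2.onnx` is a placeholder path. Running the script once for the original model and once for the converted one gives two numbers to compare.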

System information

OS Platform and Distribution (e.g., Linux Ubuntu 16.04): macOS Monterey
ONNX Runtime installed from (source or binary): ort-nightly
ONNX Runtime version: 1.12.0
Python version: 3.7.9


tianleiwu · 2022-07-20T22:23:35Z

I can reproduce the issue. Let me investigate the cause.

tianleiwu · 2022-07-26T21:42:15Z

@HaoboGu, please git pull the master branch, and re-generate the onnx model with convert_generation.py.

HaoboGu · 2022-07-27T03:45:04Z

Installing from source is quite slow. Has an ort-nightly build that contains this fix been released?

harshithapv added the "core runtime" label (issues related to core runtime) Jul 20, 2022

tianleiwu added the bug label Jul 20, 2022

tianleiwu mentioned this issue Jul 25, 2022

Move initializers from subgraph to the main graph to reduce memory #12310

Merged

sophies927 removed the bug label Aug 12, 2022
