Help needed to export Phi3v in ONNX #685

ladanisavan · 2024-07-08T13:07:25Z

Hi there,

I'm seeking guidance on exporting a custom fine-tuned Phi-3 Vision model to ONNX. I've followed the ONNX build model guide from this link.

The build command I used was:
python3 -m onnxruntime_genai.models.builder -i ep_2_grad_32_lr_3e-5/ -o onnx_output/ -p int4 -e cuda --extra_options int4_block_size=32 int4_accuracy_level=4

The build process was successful and generated the following files:

genai_config.json
model.onnx
model.onnx.data
special_tokens_map.json
tokenizer.json
tokenizer_config.json

However, the number of files generated doesn't match the file count in the official HF repo for ONNX microsoft/Phi-3-vision-128k-instruct-onnx-cuda

Files highlighted in red below are missing:

Additionally, while loading the model using ONNX Runtime, the following error occurs:
OrtException: Load model from onnx_output failed: Protobuf parsing failed.

I have also noticed that sections for "embedding" and "vision" are missing from the genai_config.json

Can someone help me identify if I'm missing anything? Thanks

The text was updated successfully, but these errors were encountered:

kunal-vaishnavi · 2024-07-09T00:50:17Z

The Phi-3 vision ONNX models are created as follows.

The vision component (phi-3-v-128k-instruct-vision.onnx) is created using torch.onnx.export with some modifications to the original PyTorch source code.
The text embedding component (phi-3-v-128k-instruct-text-embedding.onnx) is created using the ONNX helper APIs.

The text component (phi-3-v-128k-instruct-text.onnx) is created using the model builder with --extra_options exclude_embeds=true enabled. The model builder prints a warning that only the text component is created.

onnxruntime-genai/src/python/py/models/builder.py

Lines 2387 to 2388 in 00ceb80

    
           elif config.architectures[0] == "Phi3VForCausalLM": 
        
               print("WARNING: This is only generating the text component of the model. Setting `--extra_options exclude_embeds=true` by default.")

The genai_config.json and processor_config.json are created manually.

I can open-source the scripts used to create these ONNX models and run them with ONNX Runtime GenAI.

2U1 · 2024-07-09T02:42:47Z

@kunal-vaishnavi If you open-source it, I would really appreciate it!

ladanisavan · 2024-07-09T02:47:41Z

@kunal-vaishnavi open-source scripts would be really helpful to the Phi community.

kunal-vaishnavi · 2024-07-17T03:23:56Z

I have uploaded the necessary files in each of the Hugging Face repos and created this PR to show how to use them.

### Description This PR open-sources the scripts used to generate the Phi-3 vision ONNX models that run with ONNX Runtime GenAI. The extra files needed for generating the Phi-3 vision ONNX models have been uploaded to the Hugging Face repos. - [`microsoft/Phi-3-vision-128k-instruct-onnx-cpu`](https://huggingface.co/microsoft/Phi-3-vision-128k-instruct-onnx-cpu/tree/main/onnx) - [`microsoft/Phi-3-vision-128k-instruct-onnx-cuda`](https://huggingface.co/microsoft/Phi-3-vision-128k-instruct-onnx-cuda/tree/main/onnx) - [`microsoft/Phi-3-vision-128k-instruct-onnx-directml`](https://huggingface.co/microsoft/Phi-3-vision-128k-instruct-onnx-directml/tree/main/onnx) ### Motivation and Context This PR allows users to build the ONNX models needed for Phi-3 vision. It also helps the following issues. - #571 - #685

tgalery · 2024-09-22T15:14:51Z

Quick question, would the same guide work for Phi3.5 vision model family ?

kunal-vaishnavi · 2024-09-23T17:32:43Z

Yes, but the num_crops value in processor_config.json here needs to be set to 4 for Phi-3.5 vision as the value has changed.

Please note that this guide only works for a single image, however. We have re-designed the ONNX models so that there is multi-image support for both Phi-3 vision and Phi-3.5 vision. As mentioned here, the new ONNX models are undergoing Microsoft's Responsible AI evaluations before they can be published officially.

The changes needed within ONNX Runtime GenAI for the new ONNX models have already been merged in this PR. A revised guide as well as a new ONNX Runtime GenAI stable release will be published together to support this work.

github-actions bot added the ep:CUDA label Jul 8, 2024

kunal-vaishnavi mentioned this issue Jul 9, 2024

Help needed to export in ONNX microsoft/onnxruntime#21282

Closed

kunal-vaishnavi self-assigned this Jul 9, 2024

kunal-vaishnavi mentioned this issue Jul 17, 2024

Add example to build Phi-3 vision ONNX models #705

Merged

kunal-vaishnavi closed this as completed Jul 17, 2024

kunal-vaishnavi mentioned this issue Sep 30, 2024

Does Microsoft.ML.OnnxRuntimeGenAI.Cuda (version 0.4.0) support Phi-3.5 Vision Onnx format? #943

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Help needed to export Phi3v in ONNX #685

Help needed to export Phi3v in ONNX #685

ladanisavan commented Jul 8, 2024

kunal-vaishnavi commented Jul 9, 2024 •

edited

Loading

2U1 commented Jul 9, 2024

ladanisavan commented Jul 9, 2024

kunal-vaishnavi commented Jul 17, 2024

tgalery commented Sep 22, 2024

kunal-vaishnavi commented Sep 23, 2024

Help needed to export Phi3v in ONNX #685

Help needed to export Phi3v in ONNX #685

Comments

ladanisavan commented Jul 8, 2024

kunal-vaishnavi commented Jul 9, 2024 • edited Loading

2U1 commented Jul 9, 2024

ladanisavan commented Jul 9, 2024

kunal-vaishnavi commented Jul 17, 2024

tgalery commented Sep 22, 2024

kunal-vaishnavi commented Sep 23, 2024

kunal-vaishnavi commented Jul 9, 2024 •

edited

Loading