Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Play around with QDQ Support #32

Open
pranav-prakash opened this issue Apr 4, 2021 · 2 comments
Open

Play around with QDQ Support #32

pranav-prakash opened this issue Apr 4, 2021 · 2 comments

Comments

@pranav-prakash
Copy link
Collaborator

pranav-prakash commented Apr 4, 2021

As mentioned in microsoft/onnxruntime#7033 ORT added support for converting QDQ internally into the quantized equivalent. Since this is done via graph transform, it should work for Systolic -- we would need to change the assigned EP though I think

As also mentioned with microsoft/onnxruntime#7144 this makes it easy to run quantize-aware-trained models. We can play around with this if we need more accuracy than post-training quantization can give us.

@pranav-prakash
Copy link
Collaborator Author

More info on QDQ is also seen in this presentation I found: https://gist.github.com/daquexian/4107a9c94038af3aa429b3a3371b332d

@pranav-prakash
Copy link
Collaborator Author

If this is done we also need to bump quantizer to take advantage of QAT export support: https://github.com/pranav-prakash/onnxruntime-riscv/blob/2021-04-02/systolic_runner/docs/quantizer.md#exporting-quantized-models

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant