You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
As mentioned in microsoft/onnxruntime#7033 ORT added support for converting QDQ internally into the quantized equivalent. Since this is done via graph transform, it should work for Systolic -- we would need to change the assigned EP though I think
As also mentioned with microsoft/onnxruntime#7144 this makes it easy to run quantize-aware-trained models. We can play around with this if we need more accuracy than post-training quantization can give us.
The text was updated successfully, but these errors were encountered:
As mentioned in microsoft/onnxruntime#7033 ORT added support for converting QDQ internally into the quantized equivalent. Since this is done via graph transform, it should work for Systolic -- we would need to change the assigned EP though I think
As also mentioned with microsoft/onnxruntime#7144 this makes it easy to run quantize-aware-trained models. We can play around with this if we need more accuracy than post-training quantization can give us.
The text was updated successfully, but these errors were encountered: