Skip to content
This repository has been archived by the owner on Feb 3, 2025. It is now read-only.

No speed improvements after TF-TRT optimizing on a tensorflow BERT model #330

Open
SohaKhazaeli opened this issue Nov 7, 2022 · 0 comments

Comments

@SohaKhazaeli
Copy link

SohaKhazaeli commented Nov 7, 2022

After optimizing the model with either FP32 or FP16 I don't get any speed improvements.

The optimization is done on tensorflow/tensorflow:2.10.0-gpu docker image. The model uses tensorflow-text and tf-models-official libraries

This is the log from optimization process:

image

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant