Slow response times #178
We also noticed that the response time is quite slow (~1 second for the 95th percentile ModelLatency), even on a very simple model. We were using the
Same here, any advice?
@eldonaldo If you don't need multi-model capabilities, I would build the docker image from this commit. |
@samueleresca many thanks, that already helped! It reduced the latency by 1s. If anyone has further hints on how to reduce the latency, please let me know.
@eldonaldo If your endpoint is under load and receives a lot of requests, you can also try to tweak the number of Gunicorn workers and threads, see this
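For reference, a minimal sketch of how those knobs could be passed at deploy time with the SageMaker Python SDK. The environment variable names `SAGEMAKER_GUNICORN_WORKERS` / `SAGEMAKER_GUNICORN_THREADS`, the S3 path, and the IAM role below are assumptions, not confirmed settings; check the container's README for the variables it actually honours.

```python
# Hedged sketch: tuning the serving container's Gunicorn workers/threads
# via environment variables passed at deploy time. The variable names
# below are assumptions -- verify them against the container's README.
from sagemaker.tensorflow import TensorFlowModel

model = TensorFlowModel(
    model_data="s3://my-bucket/model.tar.gz",               # hypothetical model artifact
    role="arn:aws:iam::123456789012:role/MySageMakerRole",  # hypothetical execution role
    framework_version="2.1",
    env={
        "SAGEMAKER_GUNICORN_WORKERS": "4",  # assumed variable name
        "SAGEMAKER_GUNICORN_THREADS": "4",  # assumed variable name
    },
)

predictor = model.deploy(
    initial_instance_count=1,
    instance_type="ml.c5.xlarge",
)
```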
@samueleresca Thanks a lot for your help! The issue was fixed with this commit (8ddbbdc). Additionally, tuning the workers and threads worked perfectly when multiple requests come in.
Two fixes have been released
The two PRs have been merged:
Has something changed recently that may affect the response time of an endpoint?
I was using the docker image for TF 2.1 built around 5 months ago (this commit). The response time was around 100 ms (ModelLatency average on CloudWatch). I rebuilt the serving docker image yesterday and am running it with the same endpoint config and the same .tar.gz model, but now the response time is around 1000 ms (ModelLatency average on CloudWatch). I'm also noticing a new record in the logs:
Has something changed that could have impacted the performance at this level?
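To compare the two images on the same metric, the p95 ModelLatency can be pulled directly from CloudWatch. A minimal sketch with boto3, assuming a placeholder endpoint name `my-tf-endpoint` and the default `AllTraffic` variant; ModelLatency is reported in microseconds.

```python
# Sketch: fetch p95 ModelLatency for the endpoint over the last hour.
# Endpoint and variant names are placeholders.
import datetime

import boto3

cloudwatch = boto3.client("cloudwatch")
now = datetime.datetime.utcnow()

response = cloudwatch.get_metric_statistics(
    Namespace="AWS/SageMaker",
    MetricName="ModelLatency",
    Dimensions=[
        {"Name": "EndpointName", "Value": "my-tf-endpoint"},  # placeholder
        {"Name": "VariantName", "Value": "AllTraffic"},       # default variant name
    ],
    StartTime=now - datetime.timedelta(hours=1),
    EndTime=now,
    Period=300,
    ExtendedStatistics=["p95"],
)

for point in sorted(response["Datapoints"], key=lambda p: p["Timestamp"]):
    p95_ms = point["ExtendedStatistics"]["p95"] / 1000.0  # microseconds -> milliseconds
    print(point["Timestamp"], f"{p95_ms:.1f} ms")
```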