
Clock Producer received a too early ping event, rescheduling #439

Closed
jean-smaug opened this issue Jun 15, 2020 · 24 comments

@jean-smaug
Contributor

jean-smaug commented Jun 15, 2020

Hello 👋

I created a Telegram bot.
One of its features is sending a message every day.

But sometimes there is a problem with Quantum and it blocks the other modules of the app. The cron job keeps running, but all the slash commands are broken.

When I inspect the logs with journalctl -ef, I get these messages:

Jun 15 11:14:45 raspberrypi my_app[446]: 11:14:45.992 [warn]  [:my_app@raspberrypi][Elixir.Quantum.ClockBroadcaster] Clock Producer received a too early ping event, rescheduling
Jun 15 11:14:46 raspberrypi my_app[446]: 11:14:46.997 [warn]  [:my_app@raspberrypi][Elixir.Quantum.ClockBroadcaster] Clock Producer received a too early ping event, rescheduling
Jun 15 11:14:47 raspberrypi my_app[446]: 11:14:46.995 [warn]  [:my_app@raspberrypi][Elixir.Quantum.ClockBroadcaster] Clock Producer received a too early ping event, rescheduling

Here is my config:

config :my_app, MyApp.Scheduler,
  timezone: "Europe/Paris",
  jobs: [
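    # runs at 10:02 (Europe/Paris), Monday to Friday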
    {"2 10 * * 1-5", {MyApp, :send_msg, []}},
  ]

And my scheduler:

defmodule MyApp.Scheduler do
  use Quantum, otp_app: :my_app
end

Thx.

@maennchen
Member

@jean-smaug What version of quantum are you using?

@jean-smaug
Contributor Author

I'm using the RC version.

I didn't notice that you released 3.0.
Congrats and thx.

I will upgrade. I think we can close the issue for now, and if the problem happens again I'll re-open this one.

@maennchen
Member

@jean-smaug I don't think that this will resolve it. There are no relevant changes since the RC.

Are you sure that this really blocks execution? When examining the code I can see that the execution should continue normally. This is only a warning.

As for the warning itself: it would make sense to switch to using abs and monotonic time. This should prevent the message from being sent too early. I'll see when I can come up with a PR.

@silvadanilo

In my case it happens in a dev environment when my PC is suspended at the moment the job should have been executed; on wake-up I get flooded with this warning.

@kotsius

kotsius commented Jun 19, 2020

Elixir 1.10.3, Quantum 3.0.1, Windows 10 here. The most frequently scheduled event occurs every minute, yet these warnings are constant:

[screenshot: a steady stream of "Clock Producer received a too early ping event, rescheduling" warnings]

Had no issues with Quantum v2.x.

@maennchen
Member

@kotsius Do you basically receive this every second or do those messages trickle in over time?

@kotsius

kotsius commented Jun 19, 2020

@kotsius Do you basically receive this every second or do those messages trickle in over time?

I seem to be getting one every few seconds. The interval varies widely (say, from 1 to 40 seconds).

@maennchen
Member

@kotsius Ok, then send_after seems to be imprecise and should be converted to use abs.

Does this block anything, or does the scheduler continue normally for you?

@kotsius

kotsius commented Jun 19, 2020

Thanks. I can locate send_after/4 under Process, but not anywhere in the Quantum docs. Is this something that requires tweaking on my side?

The scheduler seems to be working fine.

@maennchen
Member

@kotsius Ah no, this is something that Quantum needs to solve.

Quantum calculates the diff between the next second to trigger and now, and then schedules a message with Process.send_after/3 here: https://github.com/quantum-elixir/quantum-core/blob/master/lib/quantum/clock_broadcaster.ex#L136

To make it more reliable, the next time should be converted to Erlang monotonic time and then supplied to send_after with the abs option:
https://hexdocs.pm/elixir/Process.html#send_after/4-options
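
Roughly, that would look something like the following (a minimal sketch of the idea, not the code that ended up in Quantum; next_time and now stand in for the broadcaster's NaiveDateTime values):

delay_ms =
  next_time
  |> NaiveDateTime.diff(now, :millisecond)
  |> max(0)

# Absolute point on the Erlang monotonic clock, in milliseconds.
abs_ms = System.monotonic_time(:millisecond) + delay_ms

# With abs: true, the third argument is an absolute monotonic timestamp
# rather than a relative delay.
Process.send_after(self(), :ping, abs_ms, abs: true)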

@kotsius

kotsius commented Jun 19, 2020

Good to know, thanks again.

@ijunaid8989

Does anyone face issues in the whole application due to this?

Our application's HTTP requests got really slow because of it.

@maennchen
Member

@ijunaid8989 As I mentioned before, this warning should not cause a slowdown of anything.

Do you get thousands of messages a second, or do you only get it occasionally?

If it is the latter, then the only thing that will really change after this is fixed is that you're no longer going to see the warning.

If you get thousands of messages (some kind of infinite loop), then I would be really interested in a reproduction repo and debug logs.

@ghost

ghost commented Jul 4, 2020

I'm seeing the same issue as @kotsius, at the same interval, on Windows (but haven't seen it on Linux yet). Is there any specific reason why this even needs to be a warning? Clock drift can happen and shouldn't be anything that needs to be warned about. I would think this log line should be downgraded to debug.

@maennchen WDYT?

@janpieper

I have the same issue with quantum 3.0.1 running in a Docker container (elixir:1.10-slim). I get the message once per second.

I am able to reproduce this error by keeping a Phoenix application running (mix phx.server) while suspending my computer and starting it again.

@johannesE

I'm also only getting it on my development machine (Ubuntu) after suspending it. Restarting the application fixes the issue for me. Is this something I should be worried about?

@maennchen
Member

As I said before: this is not something anyone should be worried about.

I'll push a fix that should prevent it from happening in the future.

If that warning is logged, it only means that the event was triggered too early and that the clock producer will wait again until it's the right time.

@maennchen
Member

The problem seems to lie deeper than expected. Process.send_after/3 seems to trigger too early.

I only get the problem to go away if I don't try to trigger as closely to the start of the second as possible, but instead add a few milliseconds:

next_event_abs =
  %{time | microsecond: {0, 0}}
  |> NaiveDateTime.add(1, :second)
  |> NaiveDateTime.diff(now, :native)
  |> Kernel.max(0)
  |> IO.inspect(label: "diff")
  |> Kernel.+(System.monotonic_time())
  |> Kernel.+(:erlang.convert_time_unit(10, :millisecond, :native)) # Round Up
  |> :erlang.convert_time_unit(:native, :millisecond)
  |> IO.inspect(label: "abs")

Process.send_after(self(), :ping, next_event_abs, abs: true)

I'd like to get input from the people in this issue (@jean-smaug, @kotsius, @ijunaid8989, @CharlotteDunois, @janpieper, @johannesE):

If a job is triggered every second, which result would you expect?

  1. Triggered at the start of the second (or as close as possible): ~N[2020-08-04 09:59:15.000057]
  2. Triggered at the middle of the second (or as close as possible): ~N[2020-08-04 09:59:15.500032]
  3. Triggered at the end of the second (or as close as possible): ~N[2020-08-04 09:59:15.999967]
  4. Triggered at any point in that second (this would also mean that the jobs don't have to be roughly a second apart but could also run faster / slower).

Both options 1 & 3 are hard since Process.send_after/4 seems to be too imprecise.

Options 2 & 4 should be easy since we don't have to hit the mark very closely.

@maennchen
Member

I personally think that option 4 is the best. #449 shows how this could be improved for that scenario.

I'd like to get some feedback, and you can also check out that branch and see if it works the way you'd expect.

@kotsius

kotsius commented Aug 5, 2020

Thanks for looking into this, Jonatan. Responding to your call for feedback, I would say that my intuitive expectation would be option 1. I suppose, however, that the constancy between any two jobs (expected to be equal to one second as per your question) would be more important than the position of each one within the window of every second.

@maennchen
Member

@kotsius

however, that the constancy between any two jobs (expected to be equal to one second as per your question) would be more important

I completely agree on that.

I myself would lean towards option 4, while still trying to keep the ticks more or less evenly spaced a second apart.

@maennchen
Member

Released as v3.0.2

@maennchen maennchen self-assigned this Aug 18, 2020
@jean-smaug
Contributor Author

@maennchen thanks for your work!

@kotsius

kotsius commented Aug 25, 2020

No more command line pollution, thanks Jonatan!
