We had a 2 hour outage on processing new events; see the âupdatedAtâ and âcreatedAtâ fields jumping from 14:41:19 UTC to 16:27:44 UTC.
The issue shows up on all synced events for the exact same timeframe - they just werenât processed, and then they show up. Itâs not even caught up by the time itâs processing new events.
Thereâs no useful logs that I can view, other than showing that the server was responding to client read requests just fine:
- 2021-11-17T17:06:23.163Z - Can not find client undefined on disconnect
- 2021-11-17T15:43:06.899Z - Can not find client undefined on disconnect
- 2021-11-17T14:41:33.798Z - Can not find client undefined on disconnect
- 2021-11-17T13:45:19.949Z - Can not find client undefined on disconnect
We also have a second server (with a mostly identical setup, but some different calculations) processing the same events. It had the same outage.
My guess is some kind of node issue with syncing events? Iâd like to understand the mitigations youâve got in place here so we can prevent this in the future.