does anyone's work queue periodically go unhealthy? Happens to be every couple days, then I have to restart the agent. Automated restarts don't seem to be working, or I cannot set it up right, but it'd be nice for the service to not crash in the first place. Is there a recommended path to diagnose the root of the unhealthy agent states?
b
Bianca Hoch
08/29/2023, 5:56 PM
Hey Albert, where/how are you hosting the agent? Definitely seems like daemonizing it is going to help here. What have you tried so far?
a
Albert Wong
08/29/2023, 5:58 PM
I added an onfailure item to the systemd startup script, and tested it by killing the service, seemed to respawn ... but in an actual scenario the process doesn't respawn after its died
Albert Wong
08/29/2023, 5:59 PM
so, I don't know if the process actually died, maybe it's in some corrupted state
Albert Wong
08/29/2023, 5:59 PM
that's why I want to look deeper into the issue instead of putting on a bandaid
j
Jorge Severino
09/22/2023, 8:43 PM
Hello everyone,
Something similar happened to us, and was that the API Key expired:
Maybe you should check it out too.
I hope to help