Albert Wong

08/29/2023, 5:28 PM
Does anyone's work queue periodically go unhealthy? It happens every couple of days, and then I have to restart the agent. Automated restarts don't seem to be working, or I haven't set them up correctly, but it'd be nice for the service not to crash in the first place. Is there a recommended way to diagnose the root cause of the unhealthy agent state?

Bianca Hoch

08/29/2023, 5:56 PM
Hey Albert, where/how are you hosting the agent? Definitely seems like daemonizing it is going to help here. What have you tried so far?

Albert Wong

08/29/2023, 5:58 PM
I added an onfailure item to the systemd startup script and tested it by killing the service, and it seemed to respawn ... but in an actual scenario the process doesn't respawn after it's died
so I don't know if the process actually died; maybe it's in some corrupted state
that's why I want to look deeper into the issue instead of putting a band-aid on it
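One way to tell "the process died" apart from "the process is alive but stuck" is to ask systemd directly the next time the queue goes unhealthy. The sketch below is only an illustration, not a Prefect-recommended procedure: it parses the output of `systemctl show` for a unit and classifies it. The unit name `prefect-agent.service` and the helper names are assumptions, and `NRestarts` is only reported by reasonably recent systemd versions. Note that systemd's restart and failure hooks only fire when the process actually exits, so a hung agent would show up here as "alive".

```python
import subprocess

# Properties requested from `systemctl show`; all are standard systemd unit properties.
PROPS = ("ActiveState", "SubState", "NRestarts", "ExecMainStatus")


def parse_show(output: str) -> dict:
    """Turn the KEY=VALUE lines printed by `systemctl show` into a dict."""
    state = {}
    for line in output.splitlines():
        key, sep, value = line.partition("=")
        if sep:  # skip lines that contain no '='
            state[key.strip()] = value.strip()
    return state


def classify(state: dict) -> str:
    """Rough classification of the unit state (labels are hypothetical)."""
    if state.get("ActiveState") == "active" and state.get("SubState") == "running":
        # Process exists: an unhealthy queue then points to a hang,
        # a network problem, or an authentication problem, not a crash.
        return "alive"
    if state.get("ActiveState") == "failed":
        return "dead"
    return "other"


def unit_state(unit: str) -> dict:
    """Query systemd for the unit; requires systemctl on the host."""
    result = subprocess.run(
        ["systemctl", "show", unit, "--property=" + ",".join(PROPS)],
        capture_output=True,
        text=True,
        check=True,
    )
    return parse_show(result.stdout)


if __name__ == "__main__":
    # Assumed unit name; replace with whatever the agent service is called.
    state = unit_state("prefect-agent.service")
    print(classify(state), state)
```

If this reports "alive" while the work queue is unhealthy, the agent's own logs (for example via `journalctl -u` with the unit name) are the next place to look, since a restart rule would never have been triggered.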

Jorge Severino

09/22/2023, 8:43 PM
Hello everyone, something similar happened to us, and the cause was that the API key had expired. Maybe you should check that too. I hope this helps.