with a self-hosted Prefect server. When a Prefect flow runs as a pod in a Kubernetes cluster and the pod gets removed due to a reboot, the flow’s state remains perpetually in the “running” status. How can I resolve this issue?
a
Ankit
01/08/2025, 9:36 AM
there's no solve for this probably, we run a flow which checks all the running flows and marks them cancelled if they have exceeded the time it should have taken
b
Bianca Hoch
01/09/2025, 7:17 PM
Hi Shelley and Ankit! Upgrading to version 3.1.8 of Prefect should help you out with this. It introduces flow run heartbeats to prevent flow runs from getting stuck in
RUNNING
due to infrastructure failures. There's instructions for setting this up in the link
Bring your towel and join one of the fastest growing data communities. Welcome to our second-generation open source orchestration platform, a completely rethought approach to dataflow automation.