
Prefect Community

I find myself running scripts that <https://discourse.prefect.io/t/i-have-a-flow-run-that-got-stuck-in-a-running-state-how-can-i-cancel-it-from-the-orion-client/800|set 'stuck' flow runs to Cancelled or Crashed>. Is there a more automatic way to recover from that? Stuck runs can cause jams, because they eat up queue concurrency. Let me know if Discourse or elsewhere is the right place to ask. I also found <https://github.com/PrefectHQ/prefect/issues/7116|this GitHub issue>. How do people handle this when Kubernetes or ECS tasks crash?

I _think_ these are similar to the issues raised here, <https://prefect-community.slack.com/archives/CL09KU1K7/p1668691671670139>
But I'm mostly asking about:
> if a pod is terminated, the flow status remains Running indefinitely
Is there an automated way to recover now?
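
For reference, here is a minimal sketch of the selection step that a cleanup script like the one described above might use. It only decides which runs look stuck. `FlowRunInfo`, `find_stuck_runs`, and the two-hour threshold are hypothetical names and values chosen for illustration, not part of Prefect. The actual state change (to Cancelled or Crashed) would still go through the Prefect client, as in the linked Discourse post.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta, timezone
from typing import Iterable, List, Optional


@dataclass
class FlowRunInfo:
    """Hypothetical, simplified stand-in for a flow run record (not a Prefect class)."""
    id: str
    state_name: str
    start_time: datetime


def find_stuck_runs(
    runs: Iterable[FlowRunInfo],
    max_runtime: timedelta,
    now: Optional[datetime] = None,
) -> List[FlowRunInfo]:
    """Return runs still reported as Running after more than max_runtime."""
    now = now or datetime.now(timezone.utc)
    return [
        run
        for run in runs
        if run.state_name == "Running" and now - run.start_time > max_runtime
    ]


if __name__ == "__main__":
    now = datetime.now(timezone.utc)
    # Made-up sample data; a real script would fetch runs from the API.
    sample = [
        FlowRunInfo("run-1", "Running", now - timedelta(hours=5)),
        FlowRunInfo("run-2", "Running", now - timedelta(minutes=10)),
        FlowRunInfo("run-3", "Completed", now - timedelta(hours=9)),
    ]
    for run in find_stuck_runs(sample, max_runtime=timedelta(hours=2), now=now):
        # Placeholder: here each run would be set to Crashed or Cancelled.
        print(f"would mark {run.id} as Crashed")
```

A runtime threshold is only a heuristic: a legitimately long-running flow would be flagged too, so the cutoff needs to sit well above the longest expected run.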