The symptom is independent from the agents location or infrastructure used. My question is how to deal with "Zombies", e.g. flow-runs that have been started but due to an incident (agent crashing, careless trainee, hardware failure, power outage, ...) never changed their status. Especially when using concurrency limits these zombies keep blocking the workqueue and can have a wide impact on the entire system. Are there any methods or recommendations how to detect these "Zombies" that I'm missing?