Can timeouts be specified at the flow level? Occasionally the dask-cloudprovider process that instantiates my Dask cluster on ECS hangs indefinitely, preventing any tasks from starting. In this scenario the task timeout doesn’t help because the run never gets to the point of running a task; it just hangs during the setup process. So it’d be helpful to be able to specify a timeout at the overall flow level, if that is currently possible.
Kevin Kho
05/27/2021, 9:05 PM
Hi @Andrew Hannigan, what Cloud plan are you on? You can do this through Automations.
Standard+ plans get Flow SLA Failure. I don't think it's documented yet.
Andrew Hannigan
05/28/2021, 1:35 PM
Starter plan at the moment @Kevin Kho
Kevin Kho
05/28/2021, 1:36 PM
I’ll check to see what you can do and get back to you
Zanie
05/28/2021, 2:37 PM
Hey @Sean Harkins -- your best option is probably to implement your own timeout logic for Dask cluster startup. I can help you get started with that if you want. Otherwise, unless you have a lot of task runs, it's generally easy to fit into the free tier of Standard, which gets you some of these "guarantees" from Cloud.
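For reference, a minimal sketch of what that kind of startup timeout could look like, assuming the cluster is created by a single blocking call. `start_with_timeout` and `ClusterStartupTimeout` are hypothetical names made up for this example, not part of Prefect or dask-cloudprovider, and `factory` stands in for whatever call builds your cluster.

```python
import threading
import time


class ClusterStartupTimeout(Exception):
    """Raised when the cluster factory does not return in time (hypothetical name)."""


def start_with_timeout(factory, timeout_seconds):
    """Run factory() in a daemon thread and give up after timeout_seconds.

    `factory` is an assumption: any zero-argument callable that blocks
    until the cluster is ready and then returns it.
    """
    result = {}

    def target():
        # Capture either the return value or the exception so the
        # calling thread can re-raise it.
        try:
            result["value"] = factory()
        except BaseException as exc:
            result["error"] = exc

    worker = threading.Thread(target=target, daemon=True)
    worker.start()
    worker.join(timeout_seconds)

    if worker.is_alive():
        # The hung call is abandoned, not killed; the daemon thread
        # will not keep the process alive once the main thread exits.
        raise ClusterStartupTimeout(
            f"cluster startup exceeded {timeout_seconds}s"
        )
    if "error" in result:
        raise result["error"]
    return result["value"]


if __name__ == "__main__":
    # Stand-in factory that simulates a hanging startup.
    try:
        start_with_timeout(lambda: time.sleep(2), timeout_seconds=0.2)
    except ClusterStartupTimeout as exc:
        print(f"gave up: {exc}")
```

Raising from here fails the run instead of letting it hang. One caveat: the abandoned thread is not stopped, so anything the hung call already created on ECS would still need to be cleaned up separately.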
Andrew Hannigan
06/01/2021, 6:49 PM
@Kevin Kho Just wanted to check in here on what the suggested approach is for those using Cloud?
Kevin Kho
06/01/2021, 6:53 PM
The Standard plan would allow you to use Automations to cancel the flow run after a certain amount of time. Otherwise, you would need the timeout logic for Dask cluster startup as Michael suggested.