David Elliott
03/23/2021, 8:09 PM400 Client Error:
... "input.states[0].task_run_id"; Expected non-nullable type UUID! not to be null.
on some of the tasks when I run the flow. I'll put the full stack trace in the 🧵. It's happening on maybe 1 in every 20 tasks or so. The task then gets put into state 'ClientFailed' (and the UI can't see them) and all downstream dependents of these tasks then get set to state 'Pending'.
I've tried many dask workers, then just 1 dask worker (for simplicity), same issue. Can't replicate it with the smaller (196 task) flow. I'm wondering if there's some kind of rate limiting going on whereby there are so many concurrent tasks running simultaneously (there are a tonne all trying to be ran at the same time) that some of them are getting a generic error from cloud or something?
I would try adding a task concurrency limit to see if this helps with the above hypothesis, but the UI says it's not included in our plan (even though we're an enterprise tenant). Is it possible to set task concurrency at the flow level?
Also, the UI can't load the schematic of the big flow, though that's less of an immediate concern. Thanks in advance for any advice!David Elliott
03/23/2021, 8:09 PMJenny
03/23/2021, 8:46 PMJenny
03/23/2021, 10:09 PMDavid Elliott
03/23/2021, 10:27 PM