Erik Amundson
07/19/2021, 10:17 PMDaskExecutor
on GKE with cluster class dask_kubernetes.KubeCluster
and it seems to be dropping 1-2 mapped children per run. It's like the scheduler doesn't realize they exist, or is losing track somehow - there is no error message in the logs. This prevents the flow from proceeding to the downstream tasks, so I end up having to cancel the flow. So far it's shown the same behavior on all four test runs. If it matters, we're running on prefect 0.14.16.Kevin Kho
Erik Amundson
07/20/2021, 4:30 PMErik Amundson
07/20/2021, 4:43 PMErik Amundson
07/20/2021, 5:00 PMErik Amundson
07/20/2021, 5:13 PM2021-07-20T16:57:32.459764320Zdistributed.core - INFO - Event loop was unresponsive in Worker for 10.01s. This is often caused by long-running GIL-holding functions or moving large chunks of data. This can cause timeouts and instability.
Kevin Kho
Erik Amundson
07/20/2021, 5:18 PMKevin Kho
Erik Amundson
07/20/2021, 5:24 PMKevin Kho
Kevin Kho
Erik Amundson
07/20/2021, 5:30 PM