Marwan Sarieddine
03/25/2021, 4:17 PMMarwan Sarieddine
03/25/2021, 4:17 PMJim Crist-Harif
03/25/2021, 4:19 PMMarwan Sarieddine
03/25/2021, 4:20 PMJim Crist-Harif
03/25/2021, 4:21 PMJim Crist-Harif
03/25/2021, 4:22 PMcluster.close()
, so this also might be a dask-kubernetes issue.Marwan Sarieddine
03/25/2021, 4:22 PMMarwan Sarieddine
03/25/2021, 4:23 PMMarwan Sarieddine
03/25/2021, 7:47 PM@task()
def long_sleep():
time.sleep(10 * 60)
with Flow(self.name) as flow:
long_sleep()
Setting the task to failed doesn’t interrupt the task or kill the worker pod - but instead waits until the sleep is complete
Cancelling the flow on the other hand is more explicit about this behavior in the logs:
Stopping executor, waiting for 1 active tasks to complete
Is there a way from to interrupt this long running task from the cloud and kill the worker?Jim Crist-Harif
03/25/2021, 7:53 PMMarwan Sarieddine
03/25/2021, 7:53 PMDaskKubernetesEnvironment
Jim Crist-Harif
03/25/2021, 7:55 PMJim Crist-Harif
03/25/2021, 7:55 PMMarwan Sarieddine
03/25/2021, 7:56 PMJim Crist-Harif
03/25/2021, 7:56 PMDaskKubernetesEnvironment
looks like an external dask cluster (not a temporary dask cluster), so you get the same behavior (cancellation waits for the cluster to shutdown).Marwan Sarieddine
03/25/2021, 7:56 PMMarwan Sarieddine
03/25/2021, 7:57 PMJim Crist-Harif
03/25/2021, 7:58 PM