Arsenii
03/10/2020, 1:53 AMDaskExecutor
can be used for parallelization inside the flow between tasks, but what about parallelization between flows themselves? I see that there's DaskKubernetes
environment that spawns pods for flows, each with a temporary Dask cluster inside, which makes sense to me on the surface but Kubernetes is not currently an option for us.
Would setting up something like `FargateEnvironment`/`Fargate Agent` bring significant improvements compared to, say, regular DockerAgent
? If a flow is run as a Fargate Task
with a specified remote DaskExecutor
, where does it actually ""run"" the flow? Does it make more sense to have a dedicated remote Dask cluster somewhere, or start up a local one for each flow?
Thanks again for all the help!Chris White
Arsenii
03/10/2020, 6:36 AMBraun Reyes
03/10/2020, 1:47 PMChris White
Arsenii
03/13/2020, 6:33 AMExecutors
, the drawbacks of LocalExecutor
compared to others was very clear)... Hence this Slack discussion
It might be objectively obvious but I felt something could be added :)