Riley Hun
06/17/2021, 5:09 PMKevin Kho
Gateway Cluster but I read up on this and I think I understand it. If something I say sounds off, just let me know.
My understanding is that Dask Gateway is a server for managing clusters. You can create a Prefect DaskExecutor by passing the GatewayCluster to the DaskExecutor. Similarly, you can use the Resource Manager to spin up a cluster (and you can use the Gateway Cluster for that temporary cluster.
That said, I don’t think the question is Resource Manager or Dask Gateway because they can be used together. So the comparison here isn’t cost-effectiveness or scalability, but what the Prefect use-case is for both approaches.
If you are manipulating Dask DataFrames or writing Dask specific code, it is better to use the Resource Manager because Prefect does not pass data between tasks well and the Dask Cluster will handle garbage collection this way. If you are writing non-Dask code and parallelizing over the Dask engine with Prefect's map, then passing the GatewayCluster to the DaskExecutor is better.Riley Hun
06/17/2021, 5:28 PMEric Jurotich
06/17/2021, 5:30 PM