Riley Hun
06/17/2021, 5:09 PMKevin Kho
Gateway Cluster
but I read up on this and I think I understand it. If something I say sounds off, just let me know.
My understanding is that Dask Gateway
is a server for managing clusters. You can create a Prefect
DaskExecutor
by passing the GatewayCluster
to the DaskExecutor
. Similarly, you can use the Resource Manager
to spin up a cluster (and you can use the Gateway Cluster
for that temporary cluster.
That said, I don’t think the question is Resource Manager or Dask Gateway
because they can be used together. So the comparison here isn’t cost-effectiveness or scalability, but what the Prefect use-case is for both approaches.
If you are manipulating Dask DataFrames
or writing Dask
specific code, it is better to use the Resource Manager
because Prefect
does not pass data between tasks well and the Dask
Cluster will handle garbage collection this way. If you are writing non-Dask
code and parallelizing over the Dask
engine with Prefect's map,
then passing the GatewayCluster
to the DaskExecutor
is better.Riley Hun
06/17/2021, 5:28 PMEric Jurotich
06/17/2021, 5:30 PM