John Jacoby
10/19/2021, 8:33 PMKevin Kho
executor=DaskExecutor(
cluster_class="dask_kubernetes.KubeCluster",
cluster_kwargs={
"pod_template": make_pod_spec(
image=os.environ["AZURE_BAKERY_IMAGE"],
labels={
"flow": flow_name
},
memory_limit=None,
memory_request=None,
)
},
adapt_kwargs={"maximum": 10}
)
So I am thinking the best bet is to use the dask_jobqueue.SLURMCluster
in the same way with
executor=DaskExecutor(
cluster_class="dask_jobqueue.SLURMCluster",
cluster_kwargs=...
)
and then attach this to your flow and see if it works?
I suspect it won’t though because the SLURMCluster signature is quite different from the Dask clustersKevin Kho
JobQueueCluster
inherits from SpecCluster
, which the KubeCluster
does as well so we’ll see.John Jacoby
10/19/2021, 8:48 PMKevin Kho
KubeCluster
takes. So I put that as an example to give an idea of how to pass the kwargs
that your SLURM Cluster would take. Do you spin it up on the fly or is it long running and you want to connect with IP address?John Jacoby
10/19/2021, 8:55 PMKevin Kho
executor = DaskExecutor(address="192.0.2.255:8786")
but I guess not if it’s password protected.
How do you connect to the cluster if you weren’t using Prefect? Like just plain Python? I think if you can give me that,, we can find a way to make this work. I’m not familiar with SLURMJohn Jacoby
10/19/2021, 9:03 PMKevin Kho
John Jacoby
10/19/2021, 9:12 PMJohn Jacoby
10/19/2021, 9:13 PMZanie
Zanie
scheduler="processes", num_workers=1000
or similar might unblock you?Zanie
John Jacoby
10/19/2021, 9:47 PMJohn Jacoby
10/19/2021, 9:48 PMZanie
Zanie
scheduler="threads"
still and see if that clears up your issue.John Jacoby
11/15/2021, 3:54 PM