Chris Arderne
11/12/2021, 3:04 PM
[…] VertexRun and GitHub Storage. I'm now trying to get a distributed DaskExecutor to run using dask_cloudprovider.gcp.GCPCluster. Using the same Docker image that I already had working with VertexRun, with the Dask dependencies added. I also created a Packer image based on this.
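For reference, a minimal sketch of the kind of setup described here, assuming the Prefect 1.x API that was current at the time of this thread. The project ID, zone, image name, repo, and flow path are all hypothetical placeholders, and the helper names (gcp_cluster_kwargs, build_flow) are made up for illustration; this is not the poster's actual configuration.

```python
# Sketch only: Prefect 1.x-style configuration. Every project, zone, image,
# and repo name here is a placeholder, not a value from this thread.

CLUSTER_CLASS = "dask_cloudprovider.gcp.GCPCluster"


def gcp_cluster_kwargs(project, zone, docker_image, n_workers=2):
    """Keyword arguments that DaskExecutor forwards to GCPCluster."""
    return {
        "projectid": project,          # GCP project that owns the VMs
        "zone": zone,                  # zone for scheduler and worker VMs
        "docker_image": docker_image,  # image run on every Dask VM
        "n_workers": n_workers,        # number of worker VMs to start
    }


def build_flow():
    """Assemble the flow. Prefect is imported lazily so the helper above
    can be inspected without Prefect installed."""
    from prefect import Flow, task
    from prefect.executors import DaskExecutor
    from prefect.run_configs import VertexRun
    from prefect.storage import GitHub

    image = "gcr.io/my-project/my-flow:latest"  # hypothetical image

    @task
    def say_hello():
        return "hello"

    with Flow(
        "dask-on-gcp",
        storage=GitHub(repo="my-org/my-repo", path="flows/flow.py"),
        run_config=VertexRun(image=image),
        executor=DaskExecutor(
            cluster_class=CLUSTER_CLASS,
            cluster_kwargs=gcp_cluster_kwargs("my-project", "us-central1-a", image),
        ),
    ) as flow:
        say_hello()
    return flow
```

With this kind of configuration, DaskExecutor creates the temporary cluster from whichever process executes the flow run, so under VertexRun the GCPCluster call is made from inside the Vertex job rather than from a local machine.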
It works if I run the Flow locally (with prefect flow run …, so Vertex is bypassed): it spins up a Dask cluster and completes successfully. But when I ran it from Prefect Cloud, via Vertex, it provisioned a scheduler which had some errors (failed to restart for crond, nscd, unscd) and then didn't do anything. Aside: after I cancelled the Flow, I had to manually delete this scheduler. The VPC is set up to allow full access within the network, so it shouldn't be anything to do with that.
Any ideas? Has anyone got this working well?
Kevin Kho
Chris Arderne
11/12/2021, 3:23 PM
Chris Arderne
11/12/2021, 3:25 PM
Creating a new Dask cluster with GCPCluster.
Kevin Kho
Chris Arderne
11/12/2021, 3:31 PM
Chris Arderne
11/12/2021, 3:53 PM
Kevin Kho
Chris Arderne
11/12/2021, 4:18 PM