Brian Keating
04/01/2021, 2:11 AMEC2Cluster
. I'm new to dask. The relevant part of my flow script is:
flow.executor = DaskExecutor(
cluster_class='dask_cloudprovider.aws.EC2Cluster',
cluster_kwargs={'n_workers': 2, 'docker_image': 'prefecthq/prefect', 'debug': True}
)
flow.run()
This fails with FlowRunner: ClientError('An error occurred (InvalidParameterValue) when calling the RunInstances operation: User data is limited to 16384 bytes')
. The issue is that the docker run command looks like this: docker run --net=host prefecthq/prefect env DASK_INTERNAL_INHERIT_CONFIG="a_very_very_long_string" python -m distributed.cli.dask_scheduler
, so I guess the command winds up being too long. Does anyone know a workaround for this issue?Jim Crist-Harif
04/01/2021, 2:13 AMEC2Cluster
implementation), I suggest filing an issue in the dask-cloudprovider
repo (https://github.com/dask/dask-cloudprovider) and see if they have any ideas there.Jim Crist-Harif
04/01/2021, 2:15 AMFargateCluster
) shouldn't have this issue.Brian Keating
04/01/2021, 2:17 AMBrian Keating
04/01/2021, 2:30 AMFargateCluster
: it creates a cluster and a scheduler task. however, no worker tasks are created (I have n_workers=2
) and produces the error prefect.FlowRunner | Unexpected error: OSError('Timed out trying to connect to <tcp://54.186.22.123:8786> after 10 s')
Brian Keating
04/01/2021, 2:30 AMJim Crist-Harif
04/01/2021, 4:27 PMdask-cloudprovider
(but maybe not a bug, rather some setting/config you're missing). Apologies that I'm not more helpful here.Brian Keating
04/01/2021, 10:39 PM