# ask-community
c
Hey folks 👋 Having some difficulties installing `dask-cloudprovider[aws]==2021.3.0` when I have `prefect[aws]==0.14.13` installed. It seems they have different requirements for botocore:
```
There are incompatible versions in the resolved dependencies:
  botocore<1.19.53,>=1.19.52 (from aiobotocore==1.2.2->dask-cloudprovider[aws]==2021.3.0->-r /var/folders/kf/93zlmdv15vz6sjhr2xd0j7y40000gn/T/pipenv_bvj4rpkrequirements/pipenv-1_o8bqwg-constraints.txt (line 6))
  botocore<1.21.0,>=1.20.38 (from boto3==1.17.38->prefect[aws]==0.14.13->-r /var/folders/kf/93zlmdv15vz6sjhr2xd0j7y40000gn/T/pipenv_bvj4rpkrequirements/pipenv-1_o8bqwg-constraints.txt (line 5))
```
Is there a specific version of `dask-cloudprovider` that `prefect` works with?
Also, the listed requirements (`prefect`, `dask`, `distributed`) don't mention `dask-cloudprovider` either
j
We don't require `dask-cloudprovider` for prefect; it's only needed if you want to use dask-cloudprovider with prefect.
As for the version incompatibility, this looks like `aiobotocore` is pinning `botocore` to a single release, which doesn't work with boto3's pins (see https://github.com/aio-libs/aiobotocore/issues/855). For now, if you disable the new pip resolver to ignore these issues, you can install things together. This isn't a great solution though.
c
hmmm I'm using `pipenv` so not sure how that plays with things
You don't require it, but the example in the documentation only specifies `dask` and `distributed`, when https://docs.prefect.io/orchestration/flow_config/executors.html#using-a-temporary-cluster is explicitly using `dask-cloudprovider`, so I think it probably should be listed
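For reference, the pattern on that docs page boils down to something like this (a rough sketch; the image and worker count are just illustrative), and it only works if `dask-cloudprovider` is installed alongside prefect:
```python
from prefect.executors import DaskExecutor

# The cluster class is passed as a dotted string, so nothing here imports
# dask_cloudprovider directly - but it still has to be importable at flow runtime.
executor = DaskExecutor(
    cluster_class="dask_cloudprovider.aws.FargateCluster",
    cluster_kwargs={
        "image": "prefecthq/prefect:0.14.13-python3.8",  # illustrative image
        "n_workers": 4,                                   # illustrative worker count
    },
)
```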
j
Ah, gotcha. Yeah, that text could be updated to mention the dependency requirement.
If there's not a way to forward pip flags through pipenv, downgrading pip to 20.2.4 might also work.
Hmmm, actually we're pretty flexible with our boto3 versioning in prefect (supporting back to 1.9); it looks like this might be an issue with the pip resolver not backtracking enough to find the boto3 version that goes with botocore. If you manually find a compatible boto3 version and add it to your install requirements, that might help pip out.
I think `boto3==1.16.52` might work for you.
c
consoleoutput.txt
Weird, as it appears that `pipenv` tried almost every version of botocore
But I'll try pinning boto3
j
It's moving botocore, but aiobotocore has pinned that. The thing it should be trying is different versions of boto3.
pip's resolver is a bit complicated since PyPI doesn't offer efficient queries for version info (frequently the package has to be downloaded to get this info), so it has to iteratively backtrack. conda can do a much better job here (solving for valid versions upfront) since all version metadata is available via a separate route.
c
Cool, pinning to that `boto3` version seems to work. How did you get to that? I'd love to be able to figure that out for the next time `pipenv` bites me in the backside 🤣 In terms of the issue (without pinning boto3), is this something to raise in `aiobotocore`? To bump their `botocore` version?
j
Glad to hear it. I don't really have any tips here. The conflict pip was reporting was between the botocore version `aiobotocore` pins to and the latest botocore (which no dependency pins to, but pip was using). Looking at the aiobotocore `setup.py`, you can see their pin for `botocore`, but they also have an optional dep on `boto3` (which I assumed was compatible). https://github.com/aio-libs/aiobotocore/blob/master/setup.py#L23
As for the fix: aiobotocore shouldn't be this strict - exact pins aren't friendly for users. The issue I linked above covers that.
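If it helps for next time, here's a rough sketch (nothing we ran here) of automating that check: query PyPI's JSON API for the newest boto3 release whose botocore requirement still allows 1.19.52, the single version aiobotocore 1.2.2 pins to. Assumes `requests` and `packaging` are installed.
```python
import requests
from packaging.requirements import Requirement
from packaging.version import Version

PINNED_BOTOCORE = Version("1.19.52")  # the only botocore release aiobotocore 1.2.2 allows

# Walk boto3 releases from newest to oldest (one HTTP request each, so it's slow)
# and stop at the first one whose botocore requirement admits the pinned version.
releases = requests.get("https://pypi.org/pypi/boto3/json").json()["releases"]
for version in sorted(releases, key=Version, reverse=True):
    info = requests.get(f"https://pypi.org/pypi/boto3/{version}/json").json()["info"]
    for dep in info.get("requires_dist") or []:
        req = Requirement(dep)
        if req.name == "botocore" and req.specifier.contains(PINNED_BOTOCORE):
            print(f"boto3=={version} accepts botocore=={PINNED_BOTOCORE}")
            raise SystemExit
```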
c
Cool okay, I'll just add fuel to that issue 🔥 Appreciate all your help!
Oh, sad times. I don't think botocore supports FARGATE with that version 🤣 😭
```
An error occurred (InvalidParameterException) when calling the RunTask operation: Task definition does not support launch_type FARGATE.
```
j
That's not the botocore version - that's an issue we've seen with several users' ECS setups. I haven't been able to reproduce it locally.
How did you create your ECS cluster in AWS?
c
With the CDK
It can definitely launch Fargate instances, the agent itself is a Fargate instance
j
Oh huh, never heard of a CDK (had to google). What I'm looking for is the description of your cluster. If you have the AWS CLI, this would be the output of
```
aws ecs describe-clusters --clusters <YOUR-CLUSTER-NAME>
```
c
Oh, AWS CDK is lovely. CloudFormation, but good
awsclioutput.json
j
Ah, you have no capacity providers. Cool, should be able to debug and work with that.
For now, if you add `FARGATE` to your capacity providers, things should work. If you create an ECS cluster using the AWS console this is added automatically for you (this difference has caused issues).
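Something like this, for example (a rough boto3 sketch; the cluster name is a placeholder, and in CDK/CloudFormation you'd set the capacity providers on the cluster definition instead):
```python
import boto3

ecs = boto3.client("ecs")

# Attach the Fargate capacity providers to an existing cluster.
# Note: put_cluster_capacity_providers needs a reasonably recent boto3/botocore;
# older releases only accept capacityProviders at create_cluster time.
ecs.put_cluster_capacity_providers(
    cluster="my-prefect-cluster",  # placeholder - your cluster name or ARN
    capacityProviders=["FARGATE", "FARGATE_SPOT"],
    defaultCapacityProviderStrategy=[
        {"capacityProvider": "FARGATE", "weight": 1},
    ],
)
```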
c
Ah okay. I assumed it defaulted to Fargate
I'll give that a go
Strange, as not using the DaskExecutor happily let me start up other Fargate tasks with ECSRun
Guess it's different API calls
j
Wait, what? Were you running against the same ECS cluster?
c
Yep
j
That makes no sense, the agent doesn't look at the executor at all.
c
In terms of error or choice 🤣
j
Did the above error message show up preventing your flow from starting? Or after your flow started but before the dask cluster started?
If it's the former, I'm confused. If it's only preventing your `FargateCluster` from being started, then that makes sense.
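One way to tell the two apart (a rough sketch, outside of prefect entirely; note that by default `FargateCluster` spins up its own ECS cluster and networking unless you point it at yours):
```python
from dask_cloudprovider.aws import FargateCluster

# Try to start a Dask Fargate cluster directly. If this raises the same
# RunTask error, the problem is in the dask-cloudprovider/executor path,
# not in the prefect agent.
cluster = FargateCluster(
    image="prefecthq/prefect:0.14.13-python3.8",  # same image the flow uses
    n_workers=1,
)
print("scheduler:", cluster.scheduler_address)
cluster.close()
```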
c
That's all I got on the UI
j
baffling.
maybe this is a change in the defaults sent by botocore.
c
The ECS agent is in the cluster, and I'm pointing FargateCluster to the same cluster
j
if things were working before but now aren't
c
If I take out the executor option, leave the prefect default, it runs fine
j
Now? After updating botocore?
c
Oh, good question
I'll try that.
With or without the FARGATE capacity provider?
Without I guess..
j
The agent never touches the executor, so the execution path in the agent code doesn't change if you set one or don't. Which is why I'm confused why removing it would fix things (it shouldn't change anything).
c
Removing the executor definition also fails
So I think it's botocore that doesn't allow FARGATE at that version, because before installing `boto3` and `dask-cloudprovider` it ran fine
j
I don't think it's not allowed; I bet the newer botocore sets some default field in the JSON blob that fixes things. Can you tell me what versions you had before and now? I'm hoping I can squash this issue for good.
boto3 & botocore versions.
c
Sure, let me rollback to the working versions
Now:
```
boto3==1.16.52
botocore==1.19.52
```
Before (without an executor specified and without using dask):
```
boto3==1.17.38
botocore==1.20.38
```
j
And can you confirm with the before (higher) versions things did work successfully?
c
Just registering & running the flow
Sorry, having to re-deploy the agent
j
No worries, thanks for helping to debug this.
c
Haha thanks yourself!
Now I'm confused. Same error.
`PREFECT_IMAGE` is just pointing to an ECR image built with:
```
FROM prefecthq/prefect:0.14.13-python3.8
ENTRYPOINT [ "prefect", "agent", "ecs", "start", "--agent-address", "http://:8080"]
```
Oh hang on, looks like my versions are still wrong locally
Okay. Got a running flow.
I had to change to `image="prefecthq/prefect:0.14.13-python3.8"` in `ECSRun`
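That change is roughly this (a sketch; the rest of the run-config options are omitted):
```python
from prefect.run_configs import ECSRun

# Point the flow run explicitly at the public prefect image
# (instead of the custom ECR image the agent was built from).
run_config = ECSRun(image="prefecthq/prefect:0.14.13-python3.8")
```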
j
Wait, what? That also doesn't make sense - the issue you're experiencing should only be specific to what the agent is running.
Anyway, with that working setup, what versions of `boto3`/`botocore`/`prefect` does your agent have running?
c
Two secs. I'm deploying it so that the agent and the ECSRun point to the same image. I think something funky happened with ECR, but it should be the versions in `prefecthq/prefect:0.14.13-python3.8`, as all I'm currently doing extra in the Dockerfile is adding an entrypoint
Okay, the failure I'm seeing with my rolled-back stuff is something f***y going on with ECR caching
But the working flow I had yesterday, with just the ECSRun and no Executor set, was the `prefecthq/prefect:0.14.13-python3.8` image - for both the Agent and the ECSRun image.
j
Sure, but I'm wondering if something else has changed. If you can't reproduce it, I'm skeptical that a change in the code (boto or otherwise) is causing it, as we haven't seen that before.
Anyway, I'll try to reproduce locally and see if we can prevent this issue from happening regardless of versions.
c
Cool, thanks. We're gonna cut out using CDK to handle Docker images; it's still experimental, so I think that's not helping the situation. I'll try to get to a state where I can confidently say it's working again. Then break it for you haha