Hey everyone my teammate < Zaid Naji> and I are trying to ru Prefect Community #prefect-server

Hey everyone, my teammate <@U019ZNA00KC> and I are...

Emma Willemsma

10/01/2020, 8:15 PM

Hey everyone, my teammate @Zaid Naji and I are trying to run flows using the Fargate agent. We're getting this error when we try to run a flow:

Copy code

An error occurred (InvalidParameterException) when calling the RegisterTaskDefinition operation: Invalid 'cpu' setting for task.

The entrypoint we're using for the agent is:

Copy code

["prefect","agent","start","fargate","cpu=256","memory=1024","networkConfiguration=$NETWORK_CONFIGURATION"],

Does anyone know what we should we be setting for

cpu

to get it working?

nicholas

10/01/2020, 8:17 PM

Hi @Emma Willemsma - I think that setting should be a string itself, so

cpu="256"

nicholas

10/01/2020, 8:18 PM

The same with

memory

Emma Willemsma

10/01/2020, 8:20 PM

Oh is this a docs issue then? We were following this as an example: https://docs.prefect.io/orchestration/agents/fargate.html#prefect-cli-using-kwargs

nicholas

10/01/2020, 8:25 PM

Ah it looks like that might be incorrect, can you give it a try with them as strings and report back? If so we can update the documentation

Emma Willemsma

10/02/2020, 2:36 PM

We're having a really hard time making this work. We're running the agent as a Fargate service, and we've tried a bunch of variants (with and without quotes and and escape characters) for the

cpu

parameter in the

entryPoint

and we keep getting the same error. So this for example isn't working:

Copy code

["prefect","agent","start","fargate","cpu=\"256\"","memory=\"1024\"","networkConfiguration=$NETWORK_CONFIGURATION"],

Has anyone gotten this to work?

👀 1

Dylan

10/02/2020, 2:43 PM

What are acceptable CPU values for Fargate? We’re on GCP but I’ll do my best to help

Dylan

10/02/2020, 2:44 PM

https://docs.aws.amazon.com/AmazonECS/latest/developerguide/AWS_Fargate.html

Dylan

10/02/2020, 2:44 PM

Looks like 256 is an acceptable value

Dylan

10/02/2020, 2:44 PM

And it’s not working as a number or as a string?

Emma Willemsma

10/02/2020, 2:45 PM

Yeah we've tried it both ways

josh

10/02/2020, 2:46 PM

Just jumping in here 🙂 did you happen to try

"cpu='256'"

Spencer

10/02/2020, 2:47 PM

Hi, I'm using the Fargate Agent for my runs. I found that using the ENV is the only way to configure it reliably. The Agent won't load some configurations from arguments due to oversight in the code. I resorted to using ENV for almost all of it. My entrypoint is

["prefect", "agent", "start", "fargate", "enable_task_revisions=true"]

With the environment set of:

Copy code

PREFECT__BACKEND: server
      PREFECT__CLOUD__AGENT__AGENT_ADDRESS: <http://127.0.0.1:8080>
      PREFECT__CLOUD__AGENT__LABELS: '["s3-flow-storage"]'
      executionRoleArn: ecs-task-execution-role
      memory: 512
      cpu: 256
      networkConfiguration: ${NETWORK_CONFIGURATION}
      taskRoleArn: prefect-agent-role
      containerDefinitions_logConfiguration: ${LOG_CONFIGURATION}
      cluster: ${CLUSTER_NAME}

Emma Willemsma

10/02/2020, 2:49 PM

Ah, good to know

Emma Willemsma

10/02/2020, 2:52 PM

Ok thanks, we'll give this a try!

🙏 1

josh

10/02/2020, 2:53 PM

I think I might know why this is failing from the CLI! It’s most likely due to some parsing mismatch. When passing in cpu and memory we don’t evaluate the literal value because the agent assumes it’s being passed in as a string. However when passing it in from the CLI entrypoint like above it seems as if it is interpreting it as an integer! (which is fine for every other kwargs except cpu and memory because their literal value is being interpolated) https://github.com/PrefectHQ/prefect/blob/master/src/prefect/agent/fargate/agent.py#L327 I’m going to put together a fix for this 🙂

😀 1

Spencer

10/02/2020, 2:55 PM

The entire

_parse_kwargs

stuff is rather difficult to follow 😓 I didn't make a PR because I couldn't quite understand the reason for the complexity (nor had the time to understand the flow).

josh

10/02/2020, 2:56 PM

Oh yes it is big time and we actually have a way cleaner path forward with a new RunConfig pattern we are introducing 🙂

🤝 2

josh

10/02/2020, 3:22 PM

PR for fix: https://github.com/PrefectHQ/prefect/pull/3423

👍 2

👏 2

Emma Willemsma

10/02/2020, 6:19 PM

@Spencer thanks for the help, we finally got our Fargate agent working using your suggestion 🙂

🙌 2

Zaid Naji

10/02/2020, 7:18 PM

Hi thanks guys for the support. So we are running into an interesting issue. When passing the executionRoleArn and taskRoleArn to the agent, it will not pass them to the flow tasks it will pass its own task and execution roles to them. Is that intended? Given that the prefect agent is deployed on ecs itself as a service

Spencer

10/02/2020, 7:29 PM

It is intended; the task/execution roles for the flows are configured in the

FargateTaskEnvironment

that you attach to the flows

Zaid Naji

10/02/2020, 7:29 PM

Ah prefect thank you 🙏

Spencer

10/02/2020, 7:30 PM

The task and execution roles that you configured before are solely for the flow boot task (which starts the flow run)

Spencer

10/02/2020, 7:31 PM

I found it to be a bit tricky to cleanly attach the environment to the flows; so I wrote an internal deployment library that will crawl the files, gather all the flows (essentially imports each file and pulls out all module attributes of type

prefect.Flow

) and update their environments to the proper

FargateTaskEnvironment

before registering them. Also, I am using the

S3Storage

alongside this. You can of course just set it directly on your flows; I just wasn't a fan of that configuration. I wanted my data engineers to not have to be aware of it.

Zaid Naji

10/02/2020, 7:34 PM

Got it thanks. So we have to provision the flow roles separately

Zaid Naji

10/02/2020, 7:34 PM

And attach them to the flow and register

Spencer

10/02/2020, 7:35 PM

Yeah

Zaid Naji

10/02/2020, 7:36 PM

Cool thanks 🙏

Spencer

10/02/2020, 7:37 PM

The Fargate agent is a bit funny to host on ECS because its runtime will be running on ECS (polling the API). It will spawn a task on ECS that will download the configuration from the API; which will then spawn another task the will run your flow. Agent polling -> Intermediate task (flow run) -> Your flow executing

👍 1

Zaid Naji

10/02/2020, 8:16 PM

Is there documentation to show us how to add the flow role arns before registering them?

Spencer

10/02/2020, 8:41 PM

You just specify them in the

FargateTaskEnvironment

constructor:

taskRoleArn

and

executionRoleArn

Spencer

10/02/2020, 8:42 PM

If using a custom docker image for the tasks, you need to specify it in the metadata field too

Spencer

10/02/2020, 8:42 PM

My FargateTaskEnvironment looks something like:

Copy code

FargateTaskEnvironment(
        # Task Definition
        family=task_definition_name,
        taskRoleArn=settings.task_role_arn,
        executionRoleArn=settings.execution_role_arn,
        cpu=settings.cpu,
        memory=settings.memory,
        containerDefinitions=[
            {
                "name": "flow-container",
                "image": "image",
                "command": [],
                "environment": [],
                "essential": True,
                "logConfiguration": {
                    "logDriver": "awslogs",
                    "options": {
                        "awslogs-group": settings.awslogs_group,
                        "awslogs-region": settings.awslogs_region,
                        "awslogs-stream-prefix": settings.awslogs_stream_prefix,
                    },
                },
            }
        ],
        networkMode="awsvpc",
        requiresCompatibilities=["FARGATE"],

        # Task Run
        cluster=settings.cluster_name,
        region=aws_settings.region,
        taskDefinition=task_definition_name,
        launch_type="FARGATE",
        networkConfiguration={
            "awsvpcConfiguration": {
                "assignPublicIp": "ENABLED"
                if settings.assign_public_ip
                else "DISABLED",
                "subnets": settings.subnets,
            }
        },
        metadata={"image": image},
    )

Zaid Naji

10/03/2020, 4:31 PM

Thank you for the example. When we pass the container definition like this the fargate agent is not picking it up after registering the flow and it complains about missing parameters.

Zaid Naji

10/03/2020, 4:39 PM

After checking your above comment on the ECS deployment of the agent now I get what’s happening. The initial env variables are for the task that pulls the flow config not the flow definition itself. Do you suggest a better deployment model for the fargate agent?

Spencer

10/03/2020, 5:13 PM

I host the Fargate Agent on Fargate myself 🤷‍♂️ it works for me

Spencer

10/03/2020, 5:14 PM

The containerDefinitions in the

FargateTaskEnvironment

get overridden by the agent when constructing the task IIRC

Zaid Naji

10/03/2020, 5:41 PM

So in our use case, we need to pass different roles to the flow tasks. We don’t want the agent to forward its role to the flow tasks.

Zaid Naji

10/03/2020, 5:42 PM

I guess it will only pass its own roles and that might not work for us

Spencer

10/03/2020, 6:01 PM

Oh wow, OK. Using a different role per flow would be a bit cumbersome. I think you'd have a construct a separate

FargateTaskEnvironment

with different

taskRoleArn

for each role.

Spencer

10/03/2020, 6:02 PM

The task that pulls down the flow from the API uses the task role from the agent configuration. I'm not sure what to call that launching task 🤷‍♂️

Zaid Naji

10/03/2020, 6:24 PM

Oh so the task that pulls the config gets passed the role from the agent. What about the flow task itself (that runs the flow)?

Zaid Naji

10/03/2020, 6:27 PM

Yea we have a single tenancy requirement which demands each customer to have their isolated environment. Our other option would be to delegate executions to something like AWS batch or Databricks and the prefect task would not have direct access to customer data.

Spencer

10/03/2020, 7:11 PM

The flow task itself should use the FargateTaskEnvironment ARNs

Spencer

10/03/2020, 7:12 PM

I'm not sure how you could have the roles be dynamically applied in the flows.. Perhaps you can specify a dynamic role that the task should assume within the task code? Having a root role that can only

sts:AssumeRole

that the tasks can use directly; then you setup boto3 with an

AssumeRoleCredentialsProvider

(or some such; I know boto3 doesn't have this natively but other AWS SDKs do; for boto3 https://stackoverflow.com/a/45834847) in the session?

Zaid Naji

10/03/2020, 7:49 PM

Fair thanks for the suggestion. Will look at the different strategies and get back to you.

ale

10/05/2020, 1:38 PM

Hi @Spencer! Regarding this https://prefect-community.slack.com/archives/C014Z8DPDSR/p1601671347075000?thread_ts=1601583340.049800&cid=C014Z8DPDSR How do you specify taskRoleArn and taskExecutionRoleArn in the metadata field?

Spencer

10/05/2020, 1:39 PM

This comment is referring to using a custom docker container for your flow execution; you specify this in the FargateTaskEnvironment's metadata field

metadata={"image": ...}

Specifying the taskRoleArn and executionRoleArn are native fields in the FargateTaskEnvironment separate from the metadata field.

ale

10/05/2020, 1:40 PM

Ok, thank you! 👍

2 Views

Open in Slack

Previous Next