Hello there New Prefect user here slightly smiling face We r Prefect Community #prefect-cloud

Hello there! New Prefect user here. :slightly_smil...

Giuliano Mega

11/03/2022, 10:29 PM

Hello there! New Prefect user here. 🙂 We're trying to run Prefect on a GKE cluster with autopilot (agent on GKE + Prefect cloud). I was able to get the agent working, and to figure out how to set up the blocks (GCS, KubernetesJob). Now that I'm finally able to run my example flow, though, I'm getting the following weird error from the job pod:

Copy code

Invalid flow run id. Recieved arguments: ['/usr/local/lib/python3.10/site-packages/prefect/engine.py']
Traceback (most recent call last):
  File "/usr/local/lib/python3.10/site-packages/prefect/engine.py", line 1594, in <module>
    flow_run_id = UUID(
  File "/usr/local/lib/python3.10/uuid.py", line 171, in __init__
    raise TypeError('one of the hex, bytes, bytes_le, fields, '
TypeError: one of the hex, bytes, bytes_le, fields, or int arguments must be given

Giuliano Mega

11/03/2022, 10:30 PM

it appears it's sending command line arguments when it shouldn't?

Giuliano Mega

11/03/2022, 10:32 PM

I'm building my own image for the task runner (not sure that's the proper name - the pod that the agent fires to run the flow)

Giuliano Mega

11/03/2022, 10:32 PM

but I didn't do anything too funky with it, I think, other than installing packages:

Copy code

FROM prefecthq/prefect:2.6.5-python3.10

COPY pyproject.toml /opt/prefect
COPY poetry.lock /opt/prefect

RUN pip install poetry
RUN poetry config virtualenvs.create false && poetry install --only main

Giuliano Mega

11/03/2022, 10:33 PM

Any ideas of where I might be messing things up? Anyhow, thanks in advance for the great project and in any help getting this to work. 🙂

Mason Menges

11/03/2022, 10:36 PM

For Sure 😄, I don't think the image is the problem in this case, What version of prefect is your agent running?

Giuliano Mega

11/03/2022, 10:37 PM

Hey! I'm using

prefecthq/prefect:2.6.5-python3.9

Giuliano Mega

11/03/2022, 10:37 PM

do Python versions have to match? 😬

Mason Menges

11/03/2022, 10:41 PM

I'm actually not sure off the top of my head I know that the Prefect Agent needs to be a greater or equal version of the version of prefect running your flow in the deployment but I'm not sure about the python version, though it can't hurt to try

Giuliano Mega

11/03/2022, 10:41 PM

OK, will try it!

Giuliano Mega

11/03/2022, 10:42 PM

for reference, this is my terraform descriptor for the agent:

Giuliano Mega

11/03/2022, 10:42 PM

Copy code

resource "kubernetes_deployment" "prefect-agent" {
  metadata {
    name      = "prefect-agent"
    namespace = "default"
    labels    = {
      app = "prefect-agent"
    }
  }

  spec {
    replicas = 1
    selector {
      match_labels = {
        app = "prefect-agent"
      }
    }
    template {
      metadata {
        labels = {
          app = "prefect-agent"
        }
      }
      spec {
        container {
          name              = "agent"
          image             = "prefecthq/prefect:2.6.5-python3.9"
          command           = ["prefect", "agent", "start", "-q", "clarity-production"]
          image_pull_policy = "IfNotPresent"
          env {
            name  = "PREFECT_API_URL"
            value = local.prefect_cloud_api_url
          }
          env {
            name  = "PREFECT_API_KEY"
            value = data.google_secret_manager_secret_version.prefect-cloud-api-key.secret_data
          }
        }
      }
    }
  }
}

Mason Menges

11/03/2022, 10:43 PM

For sure, also side not our senior community engineer wrote a blog on an adjacent topic so there might be some insight to glean from there as well in regards to this https://medium.com/the-prefect-blog/serverless-prefect-flows-with-google-cloud-run-jobs-23edbf371175 😄

Giuliano Mega

11/03/2022, 10:45 PM

Yeah that looked awesome! But I have a recollection that cloud run has a 1 hour time limit for tasks and that scared me away from using it in this case 🙂

Giuliano Mega

11/03/2022, 10:55 PM

Ugh, still getting the same error 😕

Giuliano Mega

11/03/2022, 11:00 PM

tried using an unmodified prefect image, same result. Will try downgrading to an older version and see what happens

Mason Menges

11/03/2022, 11:07 PM

Hmm let me check a couple things.

👍 1

Giuliano Mega

11/04/2022, 12:20 AM

So this is looking really weird. 🙂 As far as I can see, it's tripping like, in the first few lines of code after the engine boots

Giuliano Mega

11/04/2022, 12:21 AM

Copy code

if __name__ == "__main__":
    import os
    import sys

    try:
        flow_run_id = UUID(
            sys.argv[1] if len(sys.argv) > 1 else os.environ.get("PREFECT__FLOW_RUN_ID")
        )
    except Exception:
        engine_logger.error(
            f"Invalid flow run id. Recieved arguments: {sys.argv}", exc_info=True
        )
        exit(1)

Giuliano Mega

11/04/2022, 12:21 AM

so I instrumented the startup script to print the actual command that's being run

Giuliano Mega

11/04/2022, 12:21 AM

and I get:

Giuliano Mega

11/04/2022, 12:21 AM

Copy code

****** RUNNING COMMAND: python -m prefect.engine *****
Invalid flow run id. Recieved arguments: ['/usr/local/lib/python3.10/site-packages/prefect/engine.py']
Traceback (most recent call last):

Giuliano Mega

11/04/2022, 12:22 AM

hmm...

Giuliano Mega

11/04/2022, 12:23 AM

OK so -m unpacks the module into its full path, so that explains why I see "/usr/local/lib..." instead of just the module's name, but it should not be setting argv[1] to the path of the module. That's not how python -m behaves in my local machine

Giuliano Mega

11/04/2022, 12:24 AM

that's the weirdness I still can't explain

Giuliano Mega

11/04/2022, 12:32 AM

will instrument the engine to see what on earth it's getting

Mason Menges

11/04/2022, 12:35 AM

Yeah I haven't seen that before either, I'll try asking around and see what I can dig up too

Giuliano Mega

11/04/2022, 12:36 AM

thanks a lot!

Ryan Peden

11/04/2022, 12:36 AM

I think that's normal; I believe that when you run

python -m

, getting the module location as argv[0] is expected. There's no argv[1] in your args array, but that's not the problem

Giuliano Mega

11/04/2022, 12:37 AM

ah, I see, I'm looking at the wrong hypothesis

Ryan Peden

11/04/2022, 12:37 AM

What you're seeing is (I think) happening because the

PREFECT__FLOW_RUN_ID

isn't present

Ryan Peden

11/04/2022, 12:37 AM

I get the same error message if I run

python -m prefect.engine

without that env var present, at least

Giuliano Mega

11/04/2022, 12:38 AM

Hmmm any ideas why it's not being injected?

Giuliano Mega

11/04/2022, 12:40 AM

btw now I realize it's printing the whole argv in the error message already. Sorry, I must be sleepy 🙂

Ryan Peden

11/04/2022, 12:41 AM

I don't yet know why it's not being added; I haven't worked with

KubernetesJob

much, but I wrote another infrastructure block so I'm decently familiar with them. I'm looking through the block's code now to see if I can find where/why this might happen

Ryan Peden

11/04/2022, 12:41 AM

And no problem; I only realized because I was working with the args in one of my Python scripts a couple of hours ago, so it was fresh in my mind

Giuliano Mega

11/04/2022, 12:41 AM

hey thanks a lot, will do the same

Giuliano Mega

11/04/2022, 12:42 AM

btw I'm messing around with the env in KubernetesJob

Giuliano Mega

11/04/2022, 12:42 AM

I have

Giuliano Mega

11/04/2022, 12:42 AM

Copy code

customizations=[
        {
            'op': 'add',
            'path': '/spec/template/spec/resources',
            'value': {
                'limits': {
                    'memory': '1024Mi',
                    'cpu': '500m'
                }
            }
        },
        {
            'op': 'add',
            'path': '/spec/template/spec/containers/0/env',
            'value': [
                {
                    'name': 'ENV',
                    'value': 'prod'
                },
                {
                    'name': 'COLLECTIONS_PREFIX',
                    'value': ''
                },
                {
                    'name': 'PROJECT_ID',
                    'value': 'window-finance-production'
                }
            ]
        },
        {
            'op': 'add',
            'path': '/spec/template/backoffLimit',
            'value': 3
        }
    ]

Giuliano Mega

11/04/2022, 12:43 AM

wonder if I'm overriding something I shouldn´t 😕

Ryan Peden

11/04/2022, 12:43 AM

That might be overwriting the default env variables the infrastructure block is setting

Ryan Peden

11/04/2022, 12:44 AM

I think

KubernetesJob

has an extra

env

attribute where you can put environment variables

🙌 1

Giuliano Mega

11/04/2022, 12:44 AM

oh man

Giuliano Mega

11/04/2022, 12:45 AM

you're right 🤩

Giuliano Mega

11/04/2022, 12:45 AM

lemme try that

Ryan Peden

11/04/2022, 12:45 AM

I think adding them in customizations instead gets rid of the flow ID - not your fault, I think the block should check for that so I will open a GitHub issue

Giuliano Mega

11/04/2022, 12:46 AM

awesome, thanks a lot! Will let you guys know if that solves it in a min

👍 1

Giuliano Mega

11/04/2022, 12:55 AM

yay, it worked! 😅

🙌 1

Giuliano Mega

11/04/2022, 12:56 AM

Thanks a lot for the help, it would've taken me a lot of time to figure this out on my own ❤️

Mason Menges

11/04/2022, 12:56 AM

Awesome I'm glad you got it working :) thanks @Ryan Peden

🎉 1

Ryan Peden

11/04/2022, 12:57 AM

You're welcome, I'm happy to hear it worked 😄

10 Views

Open in Slack

Previous Next