# best-practices
b
Hi, I would like to reiterate this question šŸ™‚ https://prefect-community.slack.com/archives/C03D12VV4NN/p1671798107554319
m
If your parquet file is small, you could store it directly with your code in storage (S3, GitHub, etc.). If the parquet file is too big for that, you could grab it from a bucket at run time.
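A minimal sketch of the second approach, assuming pandas and s3fs are installed; the bucket and key names are placeholders:
```python
# Fetch a parquet file from a bucket at run time instead of shipping it with the code.
import pandas as pd
from prefect import flow, task

@task
def load_reference_data() -> pd.DataFrame:
    # pandas hands s3:// paths off to fsspec/s3fs
    return pd.read_parquet("s3://my-bucket/reference/data.parquet")

@flow
def my_flow():
    df = load_reference_data()
    print(f"loaded {len(df)} rows")
```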
As for environment variables, that depends on your execution infrastructure. We use ECSTask blocks to execute all of our flows, so the environment variables are configured within the block (which gets translated into an ECS task definition in AWS).
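A minimal sketch of setting environment variables on an ECSTask infrastructure block, assuming prefect-aws is installed; the image, variable names, and values are placeholders:
```python
from prefect_aws.ecs import ECSTask

ecs_block = ECSTask(
    image="123456789.dkr.ecr.us-east-1.amazonaws.com/my-image:latest",
    env={"DATA_BUCKET": "my-bucket", "LOG_LEVEL": "INFO"},
)
# Save the block so deployments can reference it as infrastructure.
ecs_block.save("prod-ecs", overwrite=True)
```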
Also, blocks are worth looking at as an alternative to environment variables. You can store configuration, secrets, or whatever you need as a block and read it in at run time.
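A minimal sketch of reading configuration from blocks inside a flow; the block names "app-config" and "third-party-api-key" are placeholders for blocks created beforehand in the UI or via `.save()`:
```python
from prefect import flow
from prefect.blocks.system import JSON, Secret

@flow
def configured_flow():
    settings = JSON.load("app-config").value             # arbitrary JSON config
    api_key = Secret.load("third-party-api-key").get()   # stored encrypted
    print("loaded config keys:", list(settings))
    assert api_key  # use the secret for an API client, etc.
```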
b
it seems in my case, since I use DVC for all the things and Minio as an S3-like backend, I'll have to use DVC's Python API to get the files that I need.
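A minimal sketch of that, using the DVC Python API; the repo URL, path, and rev are placeholders, and the Minio remote is whatever the repo's `.dvc/config` points at:
```python
import dvc.api
import pandas as pd

# Stream a DVC-tracked parquet file from the remote at run time.
with dvc.api.open(
    "data/features.parquet",
    repo="https://gitlab.example.com/my-org/my-repo.git",
    rev="main",
    mode="rb",
) as f:
    df = pd.read_parquet(f)
```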
What happens when a Prefect flow depends on other modules? @Mike Grabbe
It's a Python library that I wrote.
m
I've dealt with these situations in two different ways so far:
• for specialized modules, you can include them as a child directory alongside the flow script and import your code directly (see the sketch below)
• for generalized modules, bundle your code into a Python package and install it 1) locally, and 2) on all infrastructure running Prefect flows
šŸ‘ 1
b
What if I just change the PREFECT_API_URL in Python and run it anywhere? It seems possible to have GitLab CI schedule these flows as a job. I'm asking because that seems like the most straightforward way to run updated code; I'm not really seeing how the distribution of the Python package would happen.
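A minimal sketch of that idea, pointing a script at a specific Prefect API before running a flow; the URL is a placeholder, and in CI you would more likely export PREFECT_API_URL as a job variable:
```python
import os

# Must be set before Prefect reads its settings.
os.environ.setdefault("PREFECT_API_URL", "https://prefect.example.com/api")

from prefect import flow

@flow
def ci_flow():
    print("running against", os.environ["PREFECT_API_URL"])

if __name__ == "__main__":
    ci_flow()
```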