# ask-community
m
example, I have a task that submits about 200 subtasks. You can see there is about 1 minute 30 seconds latency before any of them start
n
can you show what your code looks like? it likely has to do with the tasks you're submitting
In [1]: from prefect import flow, task

In [2]: @flow
   ...: def f(): task(lambda x: x + 1).map(range(200)).result()

In [3]: f()
since this takes about 3 seconds to finish in prefect 3.x
m
can you elaborate please?
n
18 seconds
i think that should be representative of what you're doing
essentially i was just saying that bc of how you were checking for completion, you were blocking. as_completed should help with that
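A minimal sketch of that suggestion, assuming Prefect 3.x (the prefect.futures.as_completed import path and the add_one / process_all names are illustrative, not from the thread):

from prefect import flow, task
from prefect.futures import as_completed  # import path assumed

@task
def add_one(x: int) -> int:
    return x + 1

@flow
def process_all():
    futures = add_one.map(range(200))
    # act on each task run as soon as it finishes, rather than
    # waiting on the futures one by one in submission order
    for future in as_completed(futures):
        print(future.result())

if __name__ == "__main__":
    process_all()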
m
still not 100% sure I understand. certainly the lines I linked are blocking. But the task that is waiting on them is itself submitted as an async task using task.submit. Are you saying that it's still blocking?
like it blocks the main thread?
n
https://github.com/MCGallaspy/pokemon_showdown_replay_tools/blob/main/scripts/populate.py#L161C9-L161C22 yeah my assumption in looking at the code was that this future.wait() was blocking the main thread for each future, I could be wrong
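Roughly the shape of the pattern being described, as a sketch only (not the actual populate.py code; fetch_replay and the sleep are stand-ins):

import time
from prefect import flow, task

@task
def fetch_replay(replay_id: int) -> int:
    time.sleep(1)  # stand-in for the slow download
    return replay_id

@flow
def populate():
    futures = [fetch_replay.submit(i) for i in range(200)]
    # waiting on the futures in submission order keeps the flow parked on
    # earlier futures even after later ones have already finished
    for future in futures:
        future.wait()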
m
ok, I think I understand. ty for the help and the great gist
n
šŸ‘ feel free to post back here if you're still seeing delays
n
hrmmmmm yes. but once a future comes out of as_completed you know it's done, so the future.result() call which would normally be blocking should happen instantly. but overall yeah, the list comp would be blocking bc I'm exhausting as_completed instead of doing something for each completed future that pops out
i could be missing something about your intention w whats going on, i sort of whipped that together quickly. if you have a minimal example of where concurrency via submit / map or as_completed is behaving in an unexpected manner, I'd be happy to take a look, or you can create a discussion so other folks can benefit from the convo
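The distinction being drawn, sketched under the same assumptions as above (prefect.futures.as_completed, illustrative task names):

from prefect import flow, task
from prefect.futures import as_completed  # import path assumed

@task
def add_one(x: int) -> int:
    return x + 1

@flow
def compare():
    # list-comprehension form: each .result() returns immediately because
    # as_completed only yields finished futures, but nothing after this line
    # runs until the comprehension has drained every future
    results = [f.result() for f in as_completed(add_one.map(range(200)))]

    # incremental form: act on each result as its future completes, so
    # downstream work overlaps with the task runs still in flight
    for f in as_completed(add_one.map(range(200))):
        print(f.result())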
m
I'm just trying to understand it myself, I don't have a minimal example
Certainly I believe that your gist has very little latency, so I'm trying to find the crucial difference between mine and yours
It's a learning thing
n
maybe

this is helpful

🙌 1
m
I ran your script without modification and I'm still seeing big latency. I'm assuming it's a me issue at this point. I wonder if there's any way to profile the prefect internals?
Screenshot 2024-11-21 145350.png
n
hrm what does prefect config view say? i.e. are you running against an ephemeral server, oss server, or cloud?
m
🚀 you are connected to:
http://127.0.0.1:4200
PREFECT_PROFILE='local'
PREFECT_API_URL='http://127.0.0.1:4200/api' (from profile)
n
looks like an open source server. so you have prefect server start going someplace?
m
yup
n
hm. and the script you copied w/o modification, that's the gist I shared?
m
yeah
n
huh - off the top of my head im not sure. is the delay only in the resulting timeline in the UI, or do you see the work literally delayed in your terminal?
m
it's literally delayed
could it be related to task caching?
I think I'm focusing on the wrong thing here. Let me clarify my intent. Is there an idiomatic way to write a consumer-producer pattern in prefect? In my example, results from the search api produce replay ids, which I then want to consume to fetch a remote database row and persist it to disk. It is much faster to get replay ids than it is to download the corresponding data. But I ideally want the production of replay ids to execute concurrently with the consumption of them.
both the producer and consumer processes may be very long lived
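One way to express that inside a single flow, sketched under assumptions (the page-based search, the task names, and the prefect.futures.as_completed import are illustrative, not from the thread): submit consumer tasks as each producer result arrives, rather than waiting for all replay ids first.

from prefect import flow, task
from prefect.futures import as_completed  # import path assumed

@task
def search_page(page: int) -> list[int]:
    # hypothetical producer: one page of replay ids from the search API
    return list(range(page * 10, (page + 1) * 10))

@task
def download_replay(replay_id: int) -> None:
    # hypothetical consumer: fetch the remote row and persist it to disk
    ...

@flow
def populate(pages: int = 20):
    search_futures = search_page.map(range(pages))

    # start downloading each page's replays as soon as that page is back,
    # rather than waiting for the whole search to finish
    download_futures = []
    for page_future in as_completed(search_futures):
        for replay_id in page_future.result():
            download_futures.append(download_replay.submit(replay_id))

    # only block at the very end, once everything has been submitted
    for f in as_completed(download_futures):
        f.result()

This keeps both sides in one flow run, though, which gets awkward when producer and consumer need to stay up indefinitely; the background-task pattern described next is aimed at that case.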
n
im tempted to point you here (a relatively new pattern in prefect), but im not sure how literally you mean consumer/producer. what im sharing above is akin to redis streams / celery, where you have
• a finite set of tasks you serve (this is a websocket client that gets pushed task runs from the server), i.e. serve(*many_tasks)
  ◦ you can horizontally scale these (e.g. N pods) arbitrarily without race conditions because of consumer groups
  ◦ all the task features apply, i.e. caching, results, retries etc
• from somewhere like a webapp, you can some_task.delay(**task_kwargs) to "background" that task without blocking
so this is good for cases where you want to offload a bunch of work to happen concurrently somewhere on static infra, but the caller (or delay-er) doesn't need the result of that background task. am I going off the rails here or does that sound like something you're interested in?
m
you're not going off the rails
we're thinking along the same lines
I think that I literally mean consumer/producer, but I guess I'm not sure what a metaphorical consumer/producer pattern is
n
what a metaphorical consumer/producer pattern is
haha fair enough. well cool, those examples are almost all docker-compose and should be mostly up to date, lmk if you have any specific questions