# ask-community
a
With Prefect Server and a LocalAgent, is there any way to limit job execution concurrency? If I get 1,000 jobs submitted in one minute, but the server can really only handle executing 150 at a time before it falls over, what are my options for improving this situation?
k
Hey @Adam Brusselback, unfortunately not. It was explored here, but it was not performant, so the effort was stopped.
a
These are fast-executing tasks, so the server can easily get through all 1,000 in a full minute, but if it tries launching them all at once, progress stops. If it launches a reasonable number of them, waits for them to finish, then launches more, we get through that queue of 1,000 launched flows nice and quick.
Well, shoot
k
It sounds like you could rearchitect this to be controlled by a "main flow" using LocalDaskExecutor to treat it like a queue?
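Roughly like this, as a sketch in Prefect 1.x — `get_client_ids` and `process_client` are hypothetical stand-ins for your per-client work:
```python
from prefect import Flow, task
from prefect.executors import LocalDaskExecutor

@task
def get_client_ids():
    # Placeholder: look up the clients/projects that need a run.
    return list(range(1000))

@task
def process_client(client_id):
    # Placeholder: the fast per-client job.
    ...

with Flow("client-queue") as flow:
    ids = get_client_ids()
    process_client.map(ids)

# num_workers caps how many mapped tasks run at once; the rest
# queue behind them instead of all launching simultaneously.
flow.executor = LocalDaskExecutor(scheduler="threads", num_workers=8)
```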
a
Maybe. These are individual jobs running for each client in my SaaS. They can be on the default schedule or a custom schedule
so there isn't a possibility for a main flow, I don't believe, as they all need to be independent flows (per-client)
(a client is a project in this instance)
k
I am honestly not seeing another way other than starting up another agent (with different compute backing it). Or Cloud, because Cloud has flow run concurrency limiting and task run concurrency limiting, but those need orchestration background services that are not shipped with Server.
Or this might need some tool other than Prefect to handle these fast jobs, like a task queue?
a
Yeah, that was essentially what I was trying to use Prefect to replace; my current job scheduler infrastructure handles these small tasks fine.
I'm just running the jpgAgent job scheduler, and I implemented some concurrency limits in it years ago to smooth this type of workload over.
k
The lower bound where Prefect jobs are still performant is around 30 seconds to 1 minute, because it's primarily a batch orchestrator and there is some overhead from the API state updates.
a
Yeah, a lot of these jobs are going to run for <1 second if it turns out there is nothing to do (the majority of the time), and then take ~20 s to 2 min if there is work to do.
Question: is the LocalDaskExecutor process/thread limit a global limit for all flows using that same LocalDaskExecutor object, or is it just for that single flow's execution, i.e., do not use more than X threads/processes to execute the tasks?
And if so, would setting up an explicit Dask cluster be a better way to deal with the concurrency issue?
k
LocalDaskExecutor is a multiprocessing pool, so it's the "do not use more than X threads/processes to execute the tasks" one, scoped to that single flow run.
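So two flows each configured with their own executor don't share a pool — a quick illustration (`flow_a` and `flow_b` are placeholders):
```python
from prefect.executors import LocalDaskExecutor

# Each flow run gets its own worker pool, so these limits don't combine:
# running both flows at once can use up to 4 + 4 workers in total.
flow_a.executor = LocalDaskExecutor(scheduler="processes", num_workers=4)
flow_b.executor = LocalDaskExecutor(scheduler="processes", num_workers=4)
```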
In my opinion, using LocalDaskExecutor where each of those Flows are now Tasks should be smoother than the DaskExecutor, because that implies you have a remote cluster and are sending work to it, which would be even more overhead.
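For contrast, the explicit-cluster setup would look something like this (the address is a placeholder), and every task submission then goes over the network to the scheduler:
```python
from prefect.executors import DaskExecutor

# Sends task execution to an existing remote Dask cluster;
# adds serialization and network round-trips for every task.
flow.executor = DaskExecutor(address="tcp://dask-scheduler:8786")
```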