In Prefect, push work pools allow Prefect Cloud to submit flow runs for execution to serverless computing infrastructure without running a worker. They currently support execution in GCP Cloud Run Jobs, Azure Container Instances, and AWS ECS Tasks. You can create a push work pool that sends work to these services.
However, the order in which flow runs are submitted for execution is determined by priority. If all flow runs are capable of being executed with no limitation due to concurrency or otherwise, priority is still used to determine the order of submission, but there is no impact on execution. If not all flow runs can be executed, usually as a result of concurrency limits, priority is used to determine which queue.
For more detailed information, you can refer to the following documents:
-
Push Work to Serverless Computing Infrastructure
-
Work Pools
Please note that the documents might not directly answer your question about limiting the number of workers at the same time while other jobs wait for execution. It's recommended to reach out to the Prefect community or support for more specific guidance on this topic.