# ask-community
l
Hello everyone! How are concurrency limit collision strategies intended to work? I set up a deployment to run every 30 seconds with a concurrency limit of 1 and a collision strategy of CANCEL_NEW, but I'm having it call a task that sleeps for 5 minutes as a test. I expected that when the scheduler tried to trigger a new run while the old one was still going, it either wouldn't start it or would cancel it immediately. Instead, I ended up with an ever-increasing queue of waiting flow runs. Is this the expected behavior? If so, what is the intended usage of CANCEL_NEW, and is there a way to avoid creating new scheduled runs if the previous run doesn't finish as soon as expected?
b
Hey Lee! So the CANCEL_NEW collision strategy should cancel new runs that exceed the concurrency limit provided for a deployment (rather than queuing them). Once the limit is hit, the new runs should be cancelled immediately.
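For reference, this is roughly how that kind of config looks when deploying from Python instead of prefect.yaml (a sketch, not tested, with made-up flow/pool names; the exact import path and accepted parameters can vary a bit between Prefect versions):

from prefect import flow
from prefect.client.schemas.objects import ConcurrencyLimitConfig, ConcurrencyLimitStrategy


@flow
def my_flow():
    ...


if __name__ == "__main__":
    # Hypothetical names; assumes the flow source is available to the work pool
    # (e.g. loaded via flow.from_source or baked into an image).
    my_flow.deploy(
        name="my-deployment",
        work_pool_name="my-pool",
        interval=30,  # schedule a run every 30 seconds
        concurrency_limit=ConcurrencyLimitConfig(
            limit=1,
            collision_strategy=ConcurrencyLimitStrategy.CANCEL_NEW,  # cancel runs over the limit
        ),
    )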
What are the states of the flow runs that are amassing in the queue? 👀
l
They are all Late.
b
Hmm.. when you go to look at the collision strategy on the deployment, does it show CANCEL_NEW?
^this page shows up when you go to edit the deployment
l
Looks right to me.
b
Weird. Mind dropping the code you're using to test this, too?
l
I can't drop the exact code but I'll try to get a minimal reproduction.
b
That'd be perfect, thank you!
l
This seems sufficient to reproduce it. flows.py:
from prefect import flow
from time import sleep


@flow
def slow_flow():
    # sleep long enough that the next 30-second scheduled run overlaps this one
    sleep(90)
prefect.yaml:
# Welcome to your prefect.yaml file! You can use this file for storing and managing
# configuration for deploying your flows. We recommend committing this file to source
# control along with your flow code.

# Generic metadata about this project
name: slow_flow_test

# build section allows you to manage and build docker images
build: null

# push section allows you to manage if and how this project is uploaded to remote locations
push: null

# pull section allows you to provide instructions for cloning this project in remote locations
pull: null

# the deployments section allows you to provide configuration for deploying flows
deployments:
  - name: "slow_flow"
    entrypoint: flows.py:slow_flow
    work_pool:
      name: slow_flow_pool
    schedule:
      interval: 30
    concurrency_limit:
      limit: 1
      collision_strategy: CANCEL_NEW
Then run something like this:
prefect server start &
sleep 10

export PREFECT_API_URL=http://localhost:4200/api

prefect work-pool create slow_flow_pool --type process
prefect work-pool set-concurrency-limit slow_flow_pool 1
prefect deploy --all
prefect worker start --pool slow_flow_pool --type process --limit 1 &
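To watch what happens I've just been listing runs from the CLI; there may be a nicer way to filter, but something like this shows more and more runs stuck in Late after a couple of minutes:

prefect flow-run ls --limit 20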
After looking at it I'm guessing the issue is that there is a concurrency limit on both the work pool and the deployment.
Well, maybe not. Manually removing the concurrency limit on the pool didn't change anything.
It looks like removing the concurrency limits on both the work pool and the worker causes the extra runs to be canceled, as expected. I can work around that for now but it is a bit confusing.
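Concretely, the workaround that made the extra runs get cancelled was (assuming I've got the CLI right):

# drop the work pool concurrency limit
prefect work-pool clear-concurrency-limit slow_flow_pool
# restart the worker without the --limit flag
prefect worker start --pool slow_flow_pool --type process &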
b
Interesting, good thinking testing that out.
I'll give this a whirl to try and replicate, and share with the team. Maybe I'm missing something here too, but I feel like the runs should still be clearing out regardless of there being a work pool/worker concurrency limit in place. 🤔
l
Sounds great, thanks!
b
Hey Lee! I was able to reproduce what you saw yesterday. Effectively, if there was a limit on either the work pool or worker, the late runs would begin to show up. I created an issue here: https://github.com/PrefectHQ/prefect/issues/16984
Let me know if there are any other details you'd like to add. You're welcome to comment there as well.