Suppose I have a flow that consists of 4 tasks that run run Prefect Community #ask-community

Suppose I have a flow that consists of 4 tasks tha...

Thomas Furmston

12/01/2021, 7:00 PM

Suppose I have a flow that consists of 4 tasks that run run one after the other, e.g.,

A -> B -> C -> D

. Now suppose I run the flow and tasks

and

complete, but

fails because of a bug. I therefore want to fix the code in task C and redeploy the flow.

Kevin Kho

12/01/2021, 7:15 PM

So the checkpointing behavior to be specific to a flow and not apply when you re-register. You can still do this though if you use

caching

targets

. Docs are here . These are persistence mechanisms that work across Flow runs.

Thomas Furmston

12/01/2021, 7:17 PM

Thomas Furmston

12/01/2021, 7:17 PM

thanks for the response.

Thomas Furmston

12/01/2021, 7:17 PM

So the checkpointing behavior to be specific to a flow and not apply when you re-register.

-> Sorry, I am not sure I understand your first sentence.

Kevin Kho

12/01/2021, 7:21 PM

Oh sorry I had a couple of typos there. The default checkpoint is from flow run to flow run. If you restart the flow from failure, they will be loaded but if you create a new flow run, it doesn’t. In order for the changes in task C to take effect though, it implies you registered and re-ran the flow

Thomas Furmston

12/01/2021, 7:21 PM

ah, ok. cool, makes sense.

Thomas Furmston

12/01/2021, 7:21 PM

Thanks

Thomas Furmston

12/01/2021, 7:22 PM

Yeah, persistence looks like what I was after

Kevin Kho

12/01/2021, 7:23 PM

So you can have it work by either setting an explicit cache valid for a given duration or something. Or you can create a filename to persist the output. If any future flow runs find the file exists, then it will just be loaded in. You can create filenames with timestamps by rounding the time (to nearest day or hour or example), and then they future flow runs will pull that file if it exists, otherwise it will re-run the task

Thomas Furmston

12/01/2021, 7:24 PM

Does it depend on parameters I pass to the task too?

Kevin Kho

12/01/2021, 7:25 PM

It can. For caching, you can validate/invalidate the cache based on inputs. For targets, you can maybe template them into the file name

Thomas Furmston

12/01/2021, 7:25 PM

ok, cool

Kevin Kho

12/01/2021, 7:25 PM

Target is file based persistence. Caches are time based of validation based (like the inputs)

Thomas Furmston

12/01/2021, 7:25 PM

I think I have everything I need now. Thanks a lot!

Kevin Kho

12/01/2021, 7:25 PM

Of course!

Thomas Furmston

12/01/2021, 7:25 PM

yep, got it

8 Views

Open in Slack

Previous Next