Prefect Community #ask-community

Constantino Schillebeeckx

04/15/2024, 6:49 PM

I'm trying to understand retry behavior, specifically which tasks retry when a flow fails. I'm debugging with the flow:

from prefect import get_run_logger, task, flow

@task
def child_a():
    logger = get_run_logger()
    logger.info("child A")
    raise ValueError("simulated")

@task
def child_b():
    logger = get_run_logger()
    logger.info("child B")

@task
def child_c():
    logger = get_run_logger()
    logger.info("child C")

@flow(
    name="test_flow_retry",
)
def hello_world():
    a = child_a.submit()
    child_b.submit(wait_for=[a])
    child_c.submit()

Constantino Schillebeeckx

04/15/2024, 6:50 PM

when I deploy, run the flow, and then retry it, I'm observing that all tasks are run again (after I click retry in the UI). that's expected, right? do I need to persist results for each of those tasks so that successful tasks (e.g. child_c) are not executed again when the flow is retried?

Kevin Grismore

04/15/2024, 6:51 PM

yep, results need to be persisted to skip execution for successful tasks upon retry

Constantino Schillebeeckx

04/15/2024, 6:52 PM

even when the task doesn't return anything (i.e. has no results)?

Kevin Grismore

04/15/2024, 6:53 PM

Python functions always return something, even if that something is an implicit None

Constantino Schillebeeckx

04/15/2024, 6:54 PM

so then why doesn't this actually work for me? it sounds like the task results should already be getting stored (since they return None)

Kevin Grismore

04/15/2024, 6:54 PM

hm, good point

Kevin Grismore

04/15/2024, 6:55 PM

let me try that too

Kevin Grismore

04/15/2024, 7:01 PM

yeah, result persistence still has to be turned on for this to happen

Constantino Schillebeeckx

04/15/2024, 7:02 PM

so are the docs wrong about None? do they persist with True/False?

Kevin Grismore

04/15/2024, 7:02 PM

If persist_result is set to False, these values will never be stored.

Kevin Grismore

04/15/2024, 7:02 PM

@task(persist_result=True)
def child_c():
    logger = get_run_logger()
    logger.info("child C")

Kevin Grismore

04/15/2024, 7:03 PM

maybe it's just unclear wording?

Constantino Schillebeeckx

04/15/2024, 7:03 PM

but persist_result defaults to None (not False)

Constantino Schillebeeckx

04/15/2024, 7:03 PM

this feels like an unusual default - I would expect that, when my flow has some failing tasks, asking the flow to restart would just rerun the failed tasks (like in Prefect 1?)

Kevin Grismore

04/15/2024, 7:05 PM

yeah, I think I agree: "stored by the API without persistence to storage" seems to imply that you shouldn't have to set persist_result=True for None and bool results

Kevin Grismore

04/15/2024, 7:07 PM

I think I'll have to do some asking around to know for sure what's intended, so I know whether this is something we should clarify in writing or something that needs fixing

Constantino Schillebeeckx

04/15/2024, 7:12 PM

hmmm, I'm wondering: although persist_result defaults to None on tasks, there is also the setting PREFECT_RESULTS_PERSIST_BY_DEFAULT (docs), which defaults to False - so perhaps that setting is ultimately dictating the behavior?

Kevin Grismore

04/15/2024, 7:28 PM

from what I can see, that setting only applies if persist_result is not set to True or False in the flow or task decorator

Kevin Grismore

04/15/2024, 7:29 PM

https://github.com/PrefectHQ/prefect/blob/main/src/prefect/results.py#L332

Kevin Grismore

04/15/2024, 7:31 PM

looking over how we decide to persist results, my takeaway is that we will only persist results, regardless of type, if you have persistence enabled or Prefect decides it should be enabled. persist_result never actually stays as None at runtime. After that, where the result is persisted to depends on its type.
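
The decision described above can be sketched roughly as follows. This is a simplified illustration, not Prefect's actual code: the helper name resolve_persist_result and its parameters are hypothetical, and it ignores any other conditions under which Prefect itself decides persistence should be enabled.

```python
from typing import Optional


def resolve_persist_result(
    decorator_value: Optional[bool],
    persist_by_default: bool = False,
) -> bool:
    """Hypothetical helper: collapse a tri-state persist_result into a bool.

    decorator_value    -- what was passed as persist_result on the flow or
                          task decorator (None means "not set")
    persist_by_default -- stands in for PREFECT_RESULTS_PERSIST_BY_DEFAULT,
                          which the thread reports as defaulting to False
    """
    # An explicit True or False on the decorator always wins.
    if decorator_value is not None:
        return decorator_value
    # Otherwise fall back to the global setting, so None never survives.
    return persist_by_default


if __name__ == "__main__":
    for value in (None, True, False):
        for default in (False, True):
            resolved = resolve_persist_result(value, default)
            print(f"persist_result={value}, default={default} -> {resolved}")
```

Under these assumptions, a task that leaves persist_result unset only has its result persisted once the global setting is switched on, which matches the behavior observed in this thread.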

Constantino Schillebeeckx

04/15/2024, 7:33 PM

I've set PREFECT_RESULTS_PERSIST_BY_DEFAULT=True, and the flow that I originally shared behaves as I expected: retrying the flow after an initial run only reruns the previously failed child_a

Kevin Grismore

04/15/2024, 7:33 PM

I think that's a solid approach to get the behavior you're looking for

Constantino Schillebeeckx

04/15/2024, 7:34 PM

agreed - I think in this case the docs could have better called out that the vanilla default is not to persist any results, regardless of type, unless either:
• persist_result=True is set on the tasks, OR
• the PREFECT_RESULTS_PERSIST_BY_DEFAULT=True setting is set

Constantino Schillebeeckx

04/15/2024, 7:35 PM

Thanks for all the help @Kevin Grismore

Kevin Grismore

04/15/2024, 7:35 PM

thanks for digging into it with me!

Yaron Levi

04/16/2024, 4:04 AM

Very informative thread! I'm pinning this. Thanks 👍
