Is there any way I can identify a task run by some runtime i Prefect Community #ask-community

Is there any way I can identify a task-run by some...

Sven Teresniak

10/05/2020, 12:14 PM

Is there any way I can identify a task-run by some runtime information? I need to query task-runs by information that is only available during runtime, based on

Parameter

. I cannot use slug or tags because I cannot set them to parameter values (or can I?). I still work on a Lock-like ResourceManager but thats very difficult when it comes to scheduling-/parameter-dependent locking. Creating a Lock on constants (e.g. a constant string) seems rather easy. What I need (and trying to build) is tag concurrency limits for the standalone version. :)

Kyle Moon-Wright

10/05/2020, 4:01 PM

Hey @Sven Teresniak, Not sure exactly what you're after given your use case, but we can access task run information through prefect context like this:

Copy code

@task
def my_task():
    import prefect
    logger = prefect.context.get("logger")
    <http://logger.info|logger.info>(prefect.context.task_run_id)
    pass

In addition, we can grab the

task_tags

and the

task_slug

at runtime with other variables at runtime. The full list can be found here.

Sven Teresniak

10/05/2020, 4:04 PM

You cannot set tags or slugs at runtime, right?

Sven Teresniak

10/05/2020, 4:06 PM

If not: Is it possible to access the context (or parts of it) of other flow-runs? Or in other words: Is it possible to access the context from task-run B from (a task in) task-run A (using GraphQL maybe)?

Kyle Moon-Wright

10/05/2020, 4:08 PM

I believe we can, check out this example:

Copy code

from prefect.utilities.tasks import tags

@task
def add(x, y):
    return x + y

with Flow("My Flow") as flow:
    with tags("math", "function"):
        result = add(1, 5)

print(result.tags)

Kyle Moon-Wright

10/05/2020, 4:09 PM

This would set the tag on the instance of the

add

task, named

result

Sven Teresniak

10/05/2020, 4:09 PM

Can I add a tag based on a parameter?

Sven Teresniak

10/05/2020, 4:09 PM

yes, but "function" and "math" are constants.

Sven Teresniak

10/05/2020, 4:10 PM

Is it possible to have values here ONLY known at runtime (based on schedule and/or parameters etc.)

Sven Teresniak

10/05/2020, 4:11 PM

Copy code

with Flow("My Flow") as flow:
    p = Parameter("foo")
    with tags("param {}".format(p)):
        result = add(1, 5)

Sven Teresniak

10/05/2020, 4:12 PM

i know that code is not working but it shows what i call "runtime-dependent"

Kyle Moon-Wright

10/05/2020, 4:13 PM

yeah, I'm seeing that. looks to be a hard wall there. hmmm....

Sven Teresniak

10/05/2020, 4:13 PM

Its NOT about parameters. I want to synchronize task-runs and for that I need the GraphQL API to query for other tasks

Sven Teresniak

10/05/2020, 4:14 PM

I want to build a LOCK-like ResourceManager

Sven Teresniak

10/05/2020, 4:14 PM

I have code blocks (a few tasks) that are not allowed to run concurrently.

Sven Teresniak

10/05/2020, 4:16 PM

pseudocode:

Copy code

with Flow("isolated-flow") as flow:
    with isolated(name="locktask", 
                  setup_task_kwargs={…]},
                  cleanup_task_kwargs={'tags':['locktag']})(
            skip_downstream_on_conflict=True, keys=t):
        sleep(t)

Here,

isolated

is a custom

ResourceManager

that uses GraphQL to query for other task-runs with the name

locktask.cleanup

with state

running

pending

Sven Teresniak

10/05/2020, 4:17 PM

If I can find another task then I wait (blocking lock) or throw an exception for task retry (cannot acquire lock, lock timeout, you name it)

Sven Teresniak

10/05/2020, 4:19 PM

Basically, in ResourceManager's

init

setup

I query Prefect and get a clean locking. Its not implemented yet but this could work because GraphQL is able to give me all task-runs and I can query all tags and and so on

Sven Teresniak

10/05/2020, 4:20 PM

BUT this is entirely on a task level and not on a task-run level. This kind of locking is bound to a task, its name, maybe some constant tags.

Sven Teresniak

10/05/2020, 4:21 PM

I cannot build this kind of locking for tags (slugs, labels, don't care what!) only known at runtime.

Sven Teresniak

10/05/2020, 4:23 PM

I can query some runtime-dependent stuff like parameters. But that does not help when I don't have parameters because my flow-run operates on schedule-times (batch processing)

Sven Teresniak

10/05/2020, 4:23 PM

I can set tags but they all have to be constant -- known at the time I call

flow.register()

Sven Teresniak

10/05/2020, 4:24 PM

The thing is: Its common to call tasks in a flow using results and/or parameters. But how can I call the ResourceManager's subtasks (setup, cleanup) with results/parameters?

Sven Teresniak

10/05/2020, 4:26 PM

On a larger perspective this kind of Locking is a missing feature in Prefect. It allows users to re-use code, e.g. call tasks with side effects in different flows/flow runs because of the I in ACID (from relational databases): isolation

Sven Teresniak

10/05/2020, 4:27 PM

We have flows with side effects but the flows are all idempotent. I can call it several times using the same parameters sequentially without a problem. But when I call them CONCURRENTLY I get a big fuckup. because of race conditions between "checks" and "actions"

Sven Teresniak

10/05/2020, 4:27 PM

I really need help here 🙂

Sven Teresniak

10/05/2020, 4:27 PM

EOM

Kyle Moon-Wright

10/05/2020, 4:29 PM

Gotcha, let me get some insight for you. Thanks for the details and the feedback.

Sven Teresniak

10/05/2020, 4:33 PM

If you need more input or whatever, please drop me a line. We tried really hard to workaround this but we failed.

Kyle Moon-Wright

10/05/2020, 8:05 PM

Hey @Sven Teresniak, I apologize for not being able to provide further insight, for this you'd likely need an external service to manage this, which is out of scope for our public repos. Our definitive position for this functionality is Prefect Cloud.

Sven Teresniak

10/06/2020, 5:24 AM

Prefect cloud does not have this functionality. The tag concurrency does not provide this.

3 Views

Open in Slack

Previous Next