is there a way to explicitly remove a task from another task Prefect Community #ask-community

Join Slack

is there a way to explicitly remove a task from an...

# ask-community

Sean Talia

03/29/2021, 4:28 PM

is there a way to explicitly remove a task from another task's set of upstream dependencies?

Sean Talia

03/29/2021, 4:29 PM

I have a task (actually a

PrefectSecret

) that prefect thinks is downstream of a

Parameter

that I'm using only because it's the first place that the

PrefectSecret

is actually called; it's used in several other downstream tasks, but because of this first place it's used, Prefect thinks that the Secret task only ought to be run (and therefore the secret only retrieved) if the

Parameter

True

, whereas in my use case it's going to be

False

~95% of the time

Dylan

03/29/2021, 4:33 PM

Hi @Sean Talia! That is pretty interesting. Can you share a little code with me? Your Flow schematic would also be helpful

Dylan

03/29/2021, 4:33 PM

My first thought is that configuration seems a bit strange

Sean Talia

03/29/2021, 4:43 PM

certainly, just give me a sec

👍 1

Sean Talia

03/29/2021, 4:47 PM

Copy code

password = PrefectSecret(name="PASSWORD")

# dbt source
with case(source_snapshot_freshness(), True):
  result1 = task1(env={"PASSWORD": password})

result2 = task2(env={"PASSWORD": password})

result2.set_upstream(result1)

okay this is in essence what it looks like

Sean Talia

03/29/2021, 4:47 PM

i realize that the

result2.set_upstream(result1)

is perhaps part of what's causing the issue here

Sean Talia

03/29/2021, 4:48 PM

although in my task2 initialization I do have:

Copy code

skip_on_upstream_skip=False,
trigger=all_finished

Dylan

03/29/2021, 4:49 PM

Can you post your full

with Flow() as flow:

context block?

Sean Talia

03/29/2021, 4:51 PM

it's a little gnarly (i also do some task initialization outside of the

flow

context) but I can post the relevant stuff

Dylan

03/29/2021, 4:53 PM

That’s totally okay 😄

Sean Talia

03/29/2021, 4:54 PM

okay I think this has all of it:

Sean Talia

03/29/2021, 4:54 PM

Copy code

dbt_run_task = DbtShellTask(
    name="run_dbt",
    log_stderr=True,
    return_all=True,
    skip_on_upstream_skip=False,
    trigger=all_finished
)

dbt_snapshot_freshness_task = DbtShellTask(
    name="snapshot_freshness_dbt",
)

with Flow(...) as flow:
  source_snapshot_freshness = Parameter("source_snapshot_freshness", default=False)

  snowflake_password = PrefectSecret(
        name="PREFECT_TEST_SNOWFLAKE_PW"
      , skip_on_upstream_skip=False
      , trigger=all_finished
  )

  # dbt source
  with case(source_snapshot_freshness(), True):
      dbt_snapshot_freshness_result = dbt_snapshot_freshness_task(
          env={"SNOWFLAKE_PASSWORD": snowflake_password},
          command="<...>"
      )

  # dbt run
  dbt_run_command = set_dbt_run_command(
      debug,
      full_refresh,
      models,
      strict,
      target,
  )
  dbt_run_result = dbt_run_task(
      env={"SNOWFLAKE_PASSWORD": snowflake_password},
      command=dbt_run_command,
  )

Sean Talia

03/29/2021, 4:56 PM

like really what I'm trying to do is tell prefect "hey, this

Parameter

/ case block is not really an upstream dependency of the

PrefectSecret

, I don't care what ultimately the result is there:

Sean Talia

03/29/2021, 4:57 PM

sometimes i need that case block to execute, but usually not

Sean Talia

03/29/2021, 4:57 PM

but regardless of whether or not it executes i need that

PrefectSecret

available for lot of other stuff

Dylan

03/29/2021, 4:59 PM

Just out of curiosity, why are you setting a trigger on the

snowflake_password

Sean Talia

03/29/2021, 5:00 PM

oh sorry yes that part is just from me messing around to try to get this to work

Sean Talia

03/29/2021, 5:01 PM

the:

Copy code

, skip_on_upstream_skip=False
      , trigger=all_finished

should not really be a part of the task here

Sean Talia

03/29/2021, 5:01 PM

i just wanted to see if it would work

Sean Talia

03/29/2021, 5:01 PM

but yeah, please ignore that

Dylan

03/29/2021, 5:02 PM

Okay

Dylan

03/29/2021, 5:04 PM

So I think the real problem here is that the

dbt_run_task

later on isn’t picking up the dependency for that secret

Dylan

03/29/2021, 5:04 PM

I think that’s happening because Prefect is expecting a second case statement

Dylan

03/29/2021, 5:04 PM

https://docs.prefect.io/core/idioms/conditional.html#using-conditional-logic-in-a-flow

Sean Talia

03/29/2021, 5:06 PM

okay i think you pre-empted my question which was going to be, is the easier way to take care of this just to add an

else

block and define

snowflake_password = PrefectSecret(name="PREFECT_TEST_SNOWFLAKE_PW")

in there as well or something?

Dylan

03/29/2021, 5:06 PM

If you instead have something like:

Copy code

with Flow(...) as flow:
  source_snapshot_freshness = Parameter("source_snapshot_freshness", default=False)

  snowflake_password = PrefectSecret(
        name="PREFECT_TEST_SNOWFLAKE_PW"
      , skip_on_upstream_skip=False
      , trigger=all_finished
  )

  # dbt source
  with case(source_snapshot_freshness(), True):
      dbt_snapshot_freshness_result = dbt_snapshot_freshness_task(
          env={"SNOWFLAKE_PASSWORD": snowflake_password},
          command="<...>"
      )

 with case(source_snapshot_freshness(), False):
    # dbt run
    dbt_run_command = set_dbt_run_command(
        debug,
        full_refresh,
        models,
        strict,
        target,
    )
    dbt_run_result = dbt_run_task(
        env={"SNOWFLAKE_PASSWORD": snowflake_password},
        command=dbt_run_command,
    )

Dylan

03/29/2021, 5:06 PM

I think if you want to merge the conditional branches back you need to explicitly use the

merge

control flow utility

Sean Talia

03/29/2021, 5:07 PM

well the

dbt_run_task

is the one that I always want to have run

Sean Talia

03/29/2021, 5:07 PM

like i essentially just want to do nothing if

source_snapshot_freshness

is False

Dylan

03/29/2021, 5:08 PM

Yup, totally makes sense

Sean Talia

03/29/2021, 5:08 PM

I could have some dummy task execute, merge the dummy task result with the

dbt_snapshot_freshness_result

, and then use that merged result as an upstream dependency of

dbt_run_task

Sean Talia

03/29/2021, 5:08 PM

but in considering that i was like i have to be doing something wrong here

Dylan

03/29/2021, 5:09 PM

So you shouldn’t have to have dummy tasks I don’t think

Dylan

03/29/2021, 5:10 PM

You should be able to say “if

source_snapshot_freshness

is False, do just this”

Sean Talia

03/29/2021, 5:17 PM

sorry when you say, "you should be able to...", do you mean "you ought be able to, but right now you can't", or "i believe what you're trying to do will work by using an

else

block + `merge`"

Dylan

03/29/2021, 5:19 PM

The latter haha

Sean Talia

03/29/2021, 5:31 PM

alright i'll give this a try

Open in Slack

Previous Next