Hello I have a quite high level design question that I don t Prefect Community #ask-community

Hello! I have a quite high level design question t...

Adrien Besnard

03/14/2025, 8:25 PM

Hello! I have a quite high level design question that I don't know how to handle correctly. I think I need some advice <3 Let's say that I have this

Source

class (which has a

refresh()

method) that I want both to be able to call through a CLI and within Prefect. Does it make sense to do something like this?

Copy code

class Source():
  
  def refresh(self):
    self.extract
    self.load()
    self.transform()

  @task()
  def extract(self):
    ...

  @task()
  def load(self):
    ...

  @task()
  def tranform(self):
    ...

@flow
def refresh_source():
  source = Source()
  source.load()

This way, I'm able to instanciate an instance of the

Source

class elsewhere without relying on Prefect. But in the meantime, I find weird to "infect" everything using

@task

. What do you think?

Nate

03/14/2025, 8:46 PM

hi @Adrien Besnard - we generally encourage decorating "pure" functions as opposed to instance methods what do you think about inverting this to make it functional?

Copy code

class Source:...

@task
def extract(): ...

@task
def load(): ...

@task
def transform(): ...

@flow
def refresh_source(source: Source):
    extract(...)
    load(...)
    transform(...)

that way (if you want) you can define modular (prefect-unaware) logic elsewhere and call it from the prefect tasks, where

Source

is just a serializable / portable dataclass one reason I'd recommend this is because a nice advantage of tasks is cache policies and you'd have to explicitly omit

self

from consideration (as far as computing cache keys) if your tasks are instance methods, and then there's our somewhat subjective preference for functional style 🙂 but if you are stuck with the

class Source

paradigm, I'd just define something that looks like

Copy code

@flow
def refresh_source(source: Source):

and call that from within

Source.refresh

- I just wouldn't recommend decorating instance methods if you can help it

Adrien Besnard

03/14/2025, 9:22 PM

Thanks a lot for your answer @Nate! I get the pure function paradigm, but I think even with that my question remains: I'll have to write the logic (i mean the sequential invocation of extract, load and transform) twice: • Once within a Prefect flow (to nicely display the tasks within the UI) • Once within a classic function (let's say a @click.command()) That's the repetition I'm trying to avoid actually.

Adrien Besnard

03/14/2025, 9:23 PM

(I hope I'm clear: I'm quickly writing from my phone, sorry for that)

Nate

03/14/2025, 9:24 PM

hmm i don't understand this part

Once within a classic function (let's say a @click.command())

why not just call the flow you already wrote here?

Adrien Besnard

03/14/2025, 9:29 PM

Because in some circonstances, I need to able to run the source refresh directly from the CLI but without leveraging on Prefect for the execution

Adrien Besnard

03/14/2025, 9:30 PM

Or maybe you mean doing something like refresh_source.fn()?

Nate

03/14/2025, 9:30 PM

Copy code

» ipython

#[1]
from prefect import flow

#[2]
@flow
def foo(): print('hi no prefect pls')

#[3]
foo.fn()
hi no prefect pls

Nate

03/14/2025, 9:30 PM

yeah

Adrien Besnard

03/14/2025, 9:32 PM

Okay, fair enough, I can do that

Nate

03/14/2025, 9:33 PM

or another option (which we do in library land often to keep APIs fungible but non-breaking)

Copy code

@flow
def public_foo():
  _private_foo()

but i think that's a situational preference thing (i personally don't like arbitrary extra wrapping unless we really need it)

Adrien Besnard

03/14/2025, 9:35 PM

To give you more context: we have a python project with all our flows defined, and that package depends on other packages that do not depends on Prefect. In the end, a flow is "just" a passthrough to a function defined in one of the other packages

Adrien Besnard

03/14/2025, 9:35 PM

In the end, none of our flows have tasks because the logic is already embedded in the other packages

👍 2

Nate

03/14/2025, 9:36 PM

that makes sense, my second suggestion wouldnt be as good then bc you'd to dynamically make tasks

Adrien Besnard

03/14/2025, 9:37 PM

What do you mean by "dynamically make tasks" ?

Nate

03/14/2025, 9:42 PM

im assuming that youd have some function

foo

that has (normal non-prefect) functions inside which you'd want to "be tasks" when you want

foo

to run as a flow, but since > none of our flows have tasks because the logic is already embedded in the other packages those task definitions don't exist yet

Adrien Besnard

03/14/2025, 9:43 PM

Yes exactly

Adrien Besnard

03/14/2025, 9:44 PM

And I wonder if it make sense to actually make the project containg those function dependent from Prefect anyway and adding the @task

Adrien Besnard

03/14/2025, 9:45 PM

It will still be able to run "standalone", but if used within a flow, it will be well integrated with Prefect

Adrien Besnard

03/14/2025, 9:48 PM

(I have the exact same question with the artifacts, actually)

Adrien Besnard

03/14/2025, 9:52 PM

(In the sens that we have to choose if we're okay if the Prefect API "leaks" inside our internal librairies)

Nate

03/14/2025, 10:03 PM

hmm yeah this has come up before - I will think about this!

5 Views

Open in Slack

Previous Next