are there any best practices around testing flows tasks in p Prefect Community #ask-community

are there any best practices around testing flows/...

Jai P

04/20/2022, 12:58 AM

are there any best practices around testing flows/tasks in prefect 2.0? I see this page but something we're noticing that testing can be particularly slow on `flow`s, (sometimes taking ~1s to start up a each test) and it appears we always need to wrap `task`s inside of a flow to test them

discourse 1

Zanie

04/20/2022, 1:07 AM

We don’t have recommendations yet, we’re hoping to design a nice testing UX for both tasks and flows. In particular, we’re planning to create a way to test tasks outside of flows. cc @alex Can you share an example where your test takes a second to start the flow? We’re running thousands of flows in our internal tests and I haven’t seen that.

Jai P

04/20/2022, 1:26 AM

here's a trivial example, and the associated output from running

pytest

Copy code

==================================================================================== test session starts =====================================================================================
platform darwin -- Python 3.10.2, pytest-7.1.1, pluggy-1.0.0
rootdir: /path/to/dir
plugins: anyio-3.5.0
collected 10 items

tests/test_flow.py ..........                                                                                                                                                          [100%]

==================================================================================== 10 passed in 11.09s =====================================================================================

Untitled.py

Zanie

04/20/2022, 1:35 AM

Hm interesting this seems to be related to the test harness utility

Zanie

04/20/2022, 1:35 AM

We run our internal tests with a higher performance lower level reset of the database

Zanie

04/20/2022, 1:36 AM

If I switch your example to that, it runs in about 3.5 seconds

Zanie

04/20/2022, 1:38 AM

The test harness we provide creates a temporary directory and new database for each test. You’ll find it much more performant to use that at the session scope then use a separate fixture to delete all the data between tests. We can probably expose this in the near future.

Jai P

04/20/2022, 1:41 AM

ah yeah, switching it to session scope cut the time in half! i guess there's a risk with conflicts between tests if i do that? or should i be generally safe because flows shouldn't really interact between tests

Jai P

04/20/2022, 1:42 AM

and i guess when you say

expose this in the near future

you're talking about the higher performance lower level reset? the session scope is just a pytest change right?

Zanie

04/20/2022, 1:44 AM

If it’s session scoped, yeah your tests can collide if you’re making assertions about state that requires a clean database. You should be fine since you’re just testing your flows and not asserting things like one call of a flow function results in one flow run in the backend like we are

Zanie

04/20/2022, 1:45 AM

And yeah we can expose a lower-level faster reset in the future, you can definitely just change the scope of the fixture yourself immediately.

Jai P

04/20/2022, 1:49 AM

gotcha. i think we may have cases where we want to assert subflows are kicked off but i think if things are a little slower when it comes to that stuff, its ok. we can always just use this as a local workaround and let our CI be a little bit slower until the lower level faster reset is available. is there anywhere i may be able to track the progress/availability of that? also thanks so much for responding so quickly!

Zanie

04/20/2022, 1:52 AM

You can still make assertions about the subflows by returning their states and querying for the associated flow run ids. That’s exactly the kind of thing we want to make a great UX for, like

my_flow.test(...)

returns a

TestResult

object that gives you full introspection of all of the task and flow runs that it created, their states, number of retries, return values, etc. so you can make the assertions you want.

Zanie

04/20/2022, 1:53 AM

@Marvin open “Using

prefect_test_harness

per test is slow”

Marvin

04/20/2022, 1:53 AM

https://github.com/PrefectHQ/prefect/issues/5693

Jai P

04/20/2022, 1:59 AM

ohhh that type of UX would be epic, definitely looking forward to that being rolled out! Also thanks for the issue link! i'll be sure to record it on our side so we can keep an eye on it. thanks so much and have a good one!

davzucky

04/20/2022, 12:31 PM

I run into when I developed a new feature for the was collection, At the time I did this PR https://github.com/PrefectHQ/prefect-aws/pull/27 which i replaced with the test harness after. @Zanie Why using session level can be a problem? Every run of the flow will have a different flow run which should not be a problem unless I'm missing something

Zanie

04/20/2022, 2:40 PM

We cannot use a session level fixture internally because we’re making assertions about the contents of the database directly 🙂 for most users, a session scoped fixture will definitely be fine!

Danny Sepler

04/20/2022, 6:00 PM

hi there! (i actually work with jai 🙂) thinking of testing, i'd also like to be able to tasks without all the overhead of needing a flow. ideally, it'd be nice to unit-test tasks as if they were plain old python functions! (both cause it'd be faster to run, and simpler to read) in my code, i'm doing an approach like this...

Copy code

# my_task.py
from prefect import task

@task
def double_value(value: int) -> int:
    return value * 2


# test_my_task.py
from my_task import double_value

def test_double_value():
    assert double_value.__wrapped__(1) == 2

is this an ok approach? would it make sense to make a lil helper function for this? if this is ok, i could add it to your testing docs!

Zanie

04/20/2022, 6:08 PM

You can use

double_value.fn(1)

Zanie

04/20/2022, 6:09 PM

You can’t run a task without a flow while it is being orchestrated, but yeah you can test the behavior of your underlying function if it does not rely on any Prefect behavior.

Danny Sepler

04/20/2022, 7:39 PM

ah!

.fn()

is even clearer than wrapped. thanks! yeah a few of our tasks won't rely on prefect behavior, so this is nice for those could be a nice addition to these docs, if you'd like me to diff it in?

Zanie

04/20/2022, 8:18 PM

Yeah go for it!

Zanie

04/20/2022, 8:18 PM

Thanks 🙂

davzucky

04/20/2022, 10:57 PM

Ok that make sense. I was in the context of flow or task testing only

14 Views

Open in Slack

Previous Next