# ask-community

Brett Naul

10/09/2020, 6:57 PM

@Jim Crist-Harif quick q about https://github.com/PrefectHQ/prefect/pull/3333: where would a callback like https://docs.prefect.io/orchestration/execution/overview.html#environment-callbacks go once environments are ✂️?

Jim Crist-Harif

10/09/2020, 6:59 PM

I'm planning on porting those over to the executors

Jim Crist-Harif

10/09/2020, 6:59 PM

Since they'll be new on the executors, we might rethink what callbacks we support. What do you use these for?

Brett Naul

10/09/2020, 7:03 PM

I actually don't use them exactly, but the same kind of thing lives on my custom environment class right now: basically `distributed.Client(executor.address).profile()` and save to HTML

Jim Crist-Harif

10/09/2020, 7:05 PM

Would a callback on start and on end that takes the executor itself be sufficient? You could do that then with the `DaskExecutor` in an `on_exit` callback.

Brett Naul

10/09/2020, 7:05 PM

yep definitely

Jim Crist-Harif

10/09/2020, 7:05 PM

Or perhaps we'd make the callbacks executor specific. The dask one might take in the client object, but the local one might take no args.

👍 1

Joe Schmid

10/09/2020, 7:22 PM

@Jim Crist-Harif FWIW, we currently use the `on_execute` callback in `DaskCloudProviderEnvironment` to size the number of Dask workers based on Flow Parameters. Effectively something like "oh, you have a list of 12 models to train, let's get you 12 workers for that."

Jim Crist-Harif

10/09/2020, 7:23 PM

Hmmm, interesting. I think that's pointing towards executor-specific callbacks. I'll push something up, thanks for the use-cases y'all.

👍 2

Jim Crist-Harif

10/09/2020, 9:17 PM

So I don't think the `on_execute` callback is necessary with the new scheme. We may want to add something to make things more composable, but for now I'd like to avoid it. To get the behavior you'd want, you can write a function to create your cluster object and pass it to `cluster_class`. When that function gets called, the parameters (and anything else provided as `context`) will already be in `context`, so you'd have full access to configure the cluster however you wanted.
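A minimal sketch of the cluster-factory idea described above, written against stand-ins so it runs without Prefect or Dask installed. The `context` dict, `FakeCluster`, `make_cluster`, and the `models` parameter name are all hypothetical illustrations; in a real flow run the values would come from Prefect's own context, and the factory would return a real Dask cluster object.

```python
# Stand-in for Prefect's context (an assumption for illustration only).
# In a real flow run, Prefect would populate the parameters before the
# cluster factory is called.
context = {"parameters": {"models": ["model_a", "model_b", "model_c"]}}


class FakeCluster:
    """Hypothetical stand-in for a Dask cluster class that accepts n_workers."""

    def __init__(self, n_workers=1, **kwargs):
        self.n_workers = n_workers
        self.kwargs = kwargs


def make_cluster(**cluster_kwargs):
    """Factory intended to be passed as `cluster_class`.

    Reads the flow parameters from the context stand-in and requests
    one worker per model, with a floor of one worker.
    """
    models = context.get("parameters", {}).get("models", [])
    n_workers = max(1, len(models))
    return FakeCluster(n_workers=n_workers, **cluster_kwargs)


cluster = make_cluster()
print(cluster.n_workers)
```

With a real executor, the function itself (not its result) would be what gets passed as `cluster_class`, so that it runs only once the flow run's context is available.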

Jim Crist-Harif

10/09/2020, 9:18 PM

This should already work, with either an environment (`LocalEnvironment` / `FargateTaskEnvironment` / `KubernetesJobEnvironment` only), or the new `KubernetesRun` run config.

Jim Crist-Harif

10/09/2020, 9:24 PM

Use cases I see for callbacks:
• Do something completely unrelated to the executor before the job starts (maybe ping an external service 🤷).
• Dynamically configure the dask cluster. This can already be done with `cluster_class` as a function, but maybe we'd want a clearer way?
• Dynamically configure the cluster scale/adapt. Currently either you'd have to do that in `cluster_class` (create the cluster and call `scale`/`adapt` before returning), or use static kwargs in `adapt_kwargs`.
• Do something before the cluster shuts down (@Brett Naul's case of saving the job profile)
• Do something completely unrelated to the executor before the job stops
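The "create the cluster and call `scale`/`adapt` before returning" option might look roughly like the sketch below. `FakeCluster` and `make_adaptive_cluster` are hypothetical names used only for illustration; the stand-in mimics the `adapt(minimum=..., maximum=...)` method that Dask cluster objects provide, and the parameters are passed in explicitly here rather than read from Prefect's context.

```python
class FakeCluster:
    """Hypothetical stand-in exposing an adapt() method like Dask clusters do."""

    def __init__(self):
        self.adaptive = None

    def adapt(self, minimum=0, maximum=1):
        # Record the requested bounds instead of actually scaling anything.
        self.adaptive = (minimum, maximum)


def make_adaptive_cluster(parameters, cluster_factory=FakeCluster):
    """Create a cluster and set adaptive bounds before returning it."""
    cluster = cluster_factory()
    n_models = len(parameters.get("models", []))
    # Allow up to one worker per model, but never fewer than one.
    cluster.adapt(minimum=1, maximum=max(1, n_models))
    return cluster


cluster = make_adaptive_cluster({"models": list(range(12))})
print(cluster.adaptive)
```

The trade-off against static `adapt_kwargs` is that the bounds here can depend on values known only at run time.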

Jim Crist-Harif

10/09/2020, 9:34 PM

All possible callbacks for `DaskExecutor` (names subject to change):
• `on_start(executor) -> None`, first thing called in `executor.start()`.
• `on_cluster_start(executor) -> None`, called after the cluster starts but before the flow is run. Could do scaling here if needed.
• `on_flow_run_stop(executor) -> None`, called after the flow run has completed, but before shutting the cluster down. Could save the profile here.
• `on_stop(executor) -> None`, last thing called before exiting `executor.start()`.

We could merge the last two together into a single `on_stop` if we don't care about having a callback after the cluster has stopped (would happen before cluster shutdown but after flow execution). Likewise, we could merge the first two if we don't care about having a callback before the cluster starts. I'd like to minimize the number of configurables if possible, so minimizing possible callbacks would be nice unless they're all needed. Y'all have more experience actually running prefect than I do, so your thoughts would be useful here :).
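To make the proposed ordering concrete, here is a small sketch of where each of the four callbacks would fire. `SketchExecutor` and `record` are hypothetical and are not Prefect's `DaskExecutor`; the class only mimics the shape of a `start()` context manager and logs events, under the assumption that cluster startup and shutdown happen inside `start()`.

```python
from contextlib import contextmanager


class SketchExecutor:
    """Hypothetical executor that only illustrates callback ordering."""

    def __init__(self, on_start=None, on_cluster_start=None,
                 on_flow_run_stop=None, on_stop=None):
        self.on_start = on_start
        self.on_cluster_start = on_cluster_start
        self.on_flow_run_stop = on_flow_run_stop
        self.on_stop = on_stop
        self.events = []

    def _fire(self, callback):
        # Callbacks are optional; each one receives the executor itself.
        if callback is not None:
            callback(self)

    @contextmanager
    def start(self):
        self._fire(self.on_start)              # first thing in start()
        self.events.append("cluster up")       # stand-in for starting the cluster
        try:
            self._fire(self.on_cluster_start)  # cluster is up, flow not yet run
            yield self
            # Reached only if the body did not raise: flow done, cluster still up.
            self._fire(self.on_flow_run_stop)
        finally:
            self.events.append("cluster down")  # stand-in for cluster shutdown
            self._fire(self.on_stop)            # last thing before leaving start()


def record(name):
    """Build a callback that appends its own name to the executor's event log."""
    return lambda executor: executor.events.append(name)


executor = SketchExecutor(
    on_start=record("on_start"),
    on_cluster_start=record("on_cluster_start"),
    on_flow_run_stop=record("on_flow_run_stop"),
    on_stop=record("on_stop"),
)
with executor.start():
    executor.events.append("flow run")

print(executor.events)
```

In this sketch, a profile-saving hook would belong in `on_flow_run_stop`, since that is the last point at which the cluster is still running.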
