@Yaron Levi thats definitely how i use prefect. i create and use dask clusters in most of my flows that require any sort of meaningful scale
y
Yaron Levi
07/03/2024, 6:01 PM
Got it. So the Prefect flow would have a single task that would start the compute on a dask cluster and monitor it?
Yaron Levi
07/03/2024, 6:01 PM
(And not many tasks)
j
john
07/03/2024, 6:02 PM
yep. i use prefect for what its great at. being a really thin orchestration wrapper for logging, retries, state management, etc.
john
07/03/2024, 6:02 PM
in cases where you are connecting multiple etls/bigger systems you can have multiple tasks or flows with subflows
john
07/03/2024, 6:03 PM
but for parallelization like you're talking about, prefect doesnt need to be involved beyond executing the script/task and monitoring logs and exit status