Gabe Grand
05/18/2021, 4:23 PMLocalDaskExecutor
in combination with RunNamespacedJob
to launch a bunch of trivially parallelizable child processes on Kubernetes. However, it looks like only 2 child jobs are getting run at a time. Is there a way to increase the parallelism // are we doing this right? 😄Kevin Kho
Dana Merrick
05/18/2021, 4:31 PMLocalDaskExecutor()
Kevin Kho
Kevin Kho
flow.executor = LocalDaskExecutor(num_workers=2)
Dana Merrick
05/18/2021, 4:35 PMDana Merrick
05/18/2021, 4:35 PMDana Merrick
05/18/2021, 4:36 PMGabe Grand
05/18/2021, 4:44 PMGabe Grand
05/18/2021, 4:44 PMKevin Kho
Gabe Grand
05/18/2021, 4:48 PMnum_workers
for different tasks? we want high num_workers
for these RunNamespacedJob
tasks, but there are other tasks in the flow that I could see breaking with too many threadsDana Merrick
05/18/2021, 4:49 PMexecutor
in the flow definitionKevin Kho
Dylan
Gabe Grand
05/18/2021, 4:59 PMt1 = RunNamespacedJob.map(body=[b1, b2, …])
to create a bunch of parallel K8s jobs, and then we have some downstream tasks that depend on t1
. Not sure if this is the best way to achieve MapReduce-style parallelism with Prefect.Kevin Kho
Gabe Grand
05/18/2021, 4:59 PMKevin Kho
Gabe Grand
05/18/2021, 5:01 PM