Bring your towel and join one of the fastest growing data communities. Welcome to our second-generation open source orchestration platform, a completely rethought approach to dataflow automation.

Prefect Community

Hi all
I would like to understand performance tuning options for `RayTaskRunner` versus `DaskTaskRunner` ,
I have a requirement to develop a Prefect Workflow for a high volume ingestion workflow. The architecture I am looking at is based on Data Mesh.
I have a local environment and worked through several POCs similar to the workflow described <https://towardsdatascience.com/scaling-your-prefect-workflow-to-the-cloud-2dec4e0b213b|here>.
Could I please be directed to relevent reading materials or examples?

I don’t think we have any resources about making this decision. What normally happens is that people on Dask just choose the DaskTaskRunner and the people on Ray just choose the RayTaskRunner (not a lot at the moment). And then you would just tune the engine you are using. The configuration would be passed <https://orion-docs.prefect.io/concepts/task-runners/#running-tasks-on-ray|through> during initialization. There isn’t much material though about Dask versus Ray though from what I’ve seen.

Thanks Kevin, evidently it would be the chosen technology stack with Dask or Ray. My question was more around benchmarking however that can addressed through testing.