Prefect Community

Hi there!
We are working on ML applied to computer vision, so our pipelines take a long time and deal with "_heavy_" data that's best kept _close_ to the training code. So far we have been babysitting the processing and training pipelines, but that of course doesn't scale, and they are starting to take too long. I am currently investigating how we can make our pipelines fully automated, reproducible and traceable. However, I am not finding many examples of Prefect applied to image processing and computer vision on the internet. My concern is that images are usually batch-processed, whereas most often we actually want to stream them, process them and load them to another location. Using Prefect's `.map` seems like a good choice, but won't it make the `Flow Runs UI` cluttered and unusable? In your experience, how do people solve this problem?

Hello Maciej! You have several options for reducing the clutter in your UI. You can customize the <https://docs.prefect.io/latest/tutorials/flow-task-config/?h=flow#flow-and-task-configuration|naming convention> for flow runs and task runs. Additionally, you can add tags to any of your tasks, flows, and deployments to help filter them in the UI. <https://docs.prefect.io/latest/concepts/tasks/?h=tags#tags|Tags> at the task level also let you limit concurrency.