Hey folks, I'm thinking about using Prefect at a medium-sized company. I'm looking for guidelines on how to set up a flow that spans multiple teams / code repos. For example, node A in the dag may be owned by team A, runs in container A, outputs a large dataset to be consumed by the node B, owned by team B, in container B. Does anyone know of best practices for such a situation?
Robert Sokolowski
11/05/2020, 3:40 PM
One idea I have is a "repo of repos", but that seems like it can grow in complexity
You’ll be able to have isolated Repos.
But will need to use the same Prefect
r
Robert Sokolowski
11/05/2020, 3:54 PM
Hm that might work too
j
Joe Schmid
11/05/2020, 4:06 PM
@Robert Sokolowski I like @Raphaël Riel's idea of separate Flows as that will get you the best isolation / independence and be very simple. If you really need code from multiple teams to be in one Flow you might look at using git submodules. We do this in several cases, not specifically for Prefect Flows, but in principle I think it could address your needs. Nice intro that also includes some caveats/cautions: https://github.blog/2016-02-01-working-with-submodules/
upvote 1
r
Robert Sokolowski
11/05/2020, 4:49 PM
Thanks for the suggestions, I'll try them both and ask the team what we prefer