Hello! I have a few flows that take a dataframe as input and return a transformed dataframe. I would like to create an orchestrator flow that calls these flows using run_deployment. What would be the most efficient way of passing the dataframes around?
Jeff Hale
12/23/2022, 2:20 PM
My first thought is to use df.to_parquet as you go to save out the file in compressed form and read it in at later steps.