question about memory management - I have a flow along the lines of:
Copy code
with Flow() as f:
big_obj = gen_obj1()
big_obj2 = gen_obj2(big_obj) #never need to use big_obj again, want to reclaim memory
big_obj3 = gen_obj3(big_obj2) # finished with big_obj2
...etc
what's the canonical way of freeing up these objects within the flow, once they've been consumed and are no longer needed?
k
Kevin Kho
07/29/2021, 6:03 PM
Hey @Philip MacMenamin, for larger datasets, the canoncial way is to persist them inside the task and then pass the location downstream, then open it in the downstream task so that they don’t need to be held in memory.
Bring your towel and join one of the fastest growing data communities. Welcome to our second-generation open source orchestration platform, a completely rethought approach to dataflow automation.