We want to use AWS ECS to run flows using AWS ElasticFileSystem (EFS) to store the data created during data processing.
We are able to mount the EFS from EC2 instance, but we are failing terribly when trying to do so when using runner
prefect.run_configs.ECSRun
. Our flow keeps starting and never completes - not even printing single line of text.
Is there any working example of AWS ECS based prefect runner storing data on mounted AWS EFS volume?
k
Kevin Kho
07/11/2021, 10:54 PM
Hey @Jan Vlčinský, I’ll look into this tomorrow
🙂 1
Kevin Kho
07/12/2021, 1:58 PM
Could you show me how you’re currently attaching EFS to ECS or what you tried?
j
Jan Vlčinský
07/12/2021, 7:08 PM
I will share the repository with "hello world" example which I have tried to extend to use EFS. I shall have it done within an hour or sooner.
Gotcha. Did you try the container definition under number 5 here ? There seems to be a lot of networking to get it right though. You also add it as a volume.
j
Jan Vlčinský
07/12/2021, 9:09 PM
We used this tutorial, but not exactly step by step (we could be "too smart").
I will try the tutorial step by step tomorrow morning with another colleague and see, what we learn from it.
Jan Vlčinský
07/13/2021, 9:26 PM
@Kevin Kho thanks for the advice to follow the tutorial. It was not easy but we have learnt few concepts, which might help us in making the flows using EFS. We shall see with a day or two.
👍 1
k
Kevin Kho
07/13/2021, 9:27 PM
What was hard? The networking?
j
Jan Vlčinský
07/14/2021, 8:00 AM
Yes, networking, using proper avaliability zone, understanding security groups as definition of what to get in/out but also as locations, from which is EFS accessed (the 2nd security group use surprised me).
We used single availability zone EFS and it did not match the zone of ECS.
Bring your towel and join one of the fastest growing data communities. Welcome to our second-generation open source orchestration platform, a completely rethought approach to dataflow automation.