Hey folks,
we’re experiencing some intermittent issue with Fargate Agent in EC2 mode.
we are on
prefect==0.13.13
From time to time, tasks fail to start and we get the following error:
Copy code
f6edaa77-6dc6-4b85-953a-25ccdb4fb366 finished in state <Failed: "list index out of range">
where
f6edaa77-6dc6-4b85-953a-25ccdb4fb366
is the task run id.
We can’t find any meaningful log on ECS, but I suspect this error is caused by the fact that the container fails to start.
Any suggestions?
Thanks!
k
Kevin Kho
05/24/2021, 1:37 PM
Hi @ale, this only happens sometimes? Is it always the same task?
a
ale
05/24/2021, 1:42 PM
It is happening randomly to several flows (which are ecs tasks in our case)
k
Kevin Kho
05/24/2021, 1:44 PM
And nothing in the Flow gets run at all right?
a
ale
05/24/2021, 2:41 PM
Exactly
k
Kevin Kho
05/24/2021, 2:45 PM
Ok I’ll ask the team and get back to you
a
ale
05/24/2021, 3:10 PM
Thanks @kevin! 🙌
k
Kevin Kho
05/24/2021, 7:27 PM
Do you see anything on the CloudWatch logs side?
a
ale
05/24/2021, 7:46 PM
The only logs I see are related to Prefect Agent, but nothing related to ECS service.
I was looking for some ECS logs for tasks that failed to start...but I found nothing 😭
k
Kevin Kho
05/24/2021, 8:02 PM
Man sorry there aren’t any clear suggestions other than to upgrade because that agent is deprecated
Bring your towel and join one of the fastest growing data communities. Welcome to our second-generation open source orchestration platform, a completely rethought approach to dataflow automation.