# ask-community
j
Hey 👋 We just hit this internal server error on one of our scheduled runs. We have hit restart, but wondering whether there is any further context that can be provided?
Failed to set task state with error: ClientError([{'path': ['set_task_run_states'], 'message': 'An unknown error occurred.', 'extensions': {'code': 'INTERNAL_SERVER_ERROR'}}])
Traceback (most recent call last):
  File "/usr/local/lib/python3.8/site-packages/prefect/engine/cloud/task_runner.py", line 91, in call_runner_target_handlers
    state = self.client.set_task_run_state(
  File "/usr/local/lib/python3.8/site-packages/prefect/client/client.py", line 1917, in set_task_run_state
    result = self.graphql(
  File "/usr/local/lib/python3.8/site-packages/prefect/client/client.py", line 569, in graphql
    raise ClientError(result["errors"])
prefect.exceptions.ClientError: [{'path': ['set_task_run_states'], 'message': 'An unknown error occurred.', 'extensions': {'code': 'INTERNAL_SERVER_ERROR'}}]
I've put the task and flow run URLs in the 🧵.
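For reference, the failing call is Prefect 1.x's Client.graphql issuing the set_task_run_states mutation, so the 500 is coming back from the Cloud API rather than from our task code. If it helps anyone hitting the same thing, here's a rough sketch of retrying a Cloud GraphQL call when it hits a transient server error; the query, the task run id, and the retry settings are purely illustrative, not something our flow actually runs:

import time

from prefect import Client
from prefect.exceptions import ClientError

def graphql_with_retry(client, query, attempts=3, delay=5.0):
    # Retry a Prefect Cloud GraphQL call a few times on transient server errors.
    for attempt in range(1, attempts + 1):
        try:
            return client.graphql(query)
        except ClientError:
            if attempt == attempts:
                raise
            time.sleep(delay)  # back off before retrying the 500

# Illustrative usage: look up a task run's current state by id.
client = Client()
result = graphql_with_retry(
    client,
    'query { task_run(where: {id: {_eq: "<task-run-id>"}}) { id state } }',
)

In our case the engine makes the set_task_run_states call itself, so a wrapper like this wouldn't fix the scheduled run; it's just to show what the client is doing when it hits the 500.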
k
Is this a big Flow in terms of number of tasks? Did the restart work?
j
Very big flow; it will likely take a couple of hours for us to hit the problem tasks from the last run. Although I'm assuming it's not those tasks that were the problem, but rather something on the Prefect Cloud side?
d
For context, yeah it's a massive flow, circa 1400 statically-defined tasks and ~2700 edges. We've had issues with registering the large static DAG in the past, but rarely Cloud-side 500 errors. Wondering if you can see anything from your side in the logs?
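(By "statically defined" I mean everything is declared up front in the Flow context at build time. A toy sketch with made-up task names, repeated on the order of 1400 times in the real flow:

from prefect import Flow, task

@task
def extract(source):
    # placeholder body; the real flow has ~1400 tasks along these lines
    return [source]

@task
def load(rows):
    print(f"loaded {len(rows)} rows")

with Flow("big-static-dag") as flow:
    # each task call here adds a task and an edge to the DAG at build time
    rows = extract("table_a")
    load(rows)

Nothing is generated dynamically at runtime, which is why the DAG is already this large at registration time.)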
k
I remember that, but that should only affect the registration side, and I think we're past it. The only logs I can see are the same ones you see. I do see the error. I'll need to find some other team members to dig deeper. Will get back to you.
🙏 1
Wait sorry it still says running. Did just one task fail but the Flow continued (apart from that error)?
j
I have cancelled the problem flow run and started a new one. I had restarted the problem run from a given node in the DAG, but it didn't behave as expected (that node stayed pending while downstream nodes were running), so I cancelled it altogether.
k
Ok gotcha
Will DM