Adrien Besnard
01/30/2024, 10:21 AM
Adrien Besnard
01/30/2024, 10:21 AM
Discovered type 'kubernetes' for work pool 'default'.
Worker 'KubernetesWorker e764a855-8b77-47b6-8aa0-362493a75652' started!
10:12:52.074 | INFO | prefect.flow_runs.worker - Worker 'KubernetesWorker e764a855-8b77-47b6-8aa0-362493a75652' submitting flow run '28a16102-2bf6-42d4-8b9d-1da08abd4a30'
10:12:52.750 | INFO | prefect.flow_runs.worker - Creating Kubernetes job...
10:12:52.879 | INFO | prefect.flow_runs.worker - Job 'fabulous-dugong-jdsqj': Pod has status 'Pending'.
10:12:52.938 | INFO | prefect.flow_runs.worker - Completed submission of flow run '28a16102-2bf6-42d4-8b9d-1da08abd4a30'
10:13:52.850 | ERROR | prefect.flow_runs.worker - Job 'fabulous-dugong-jdsqj': Pod never started.
10:13:53.200 | INFO | prefect.flow_runs.worker - Job event 'SuccessfulCreate' at 2024-01-30 10:12:52+00:00: Created pod: fabulous-dugong-jdsqj-2hv4c
10:13:53.202 | INFO | prefect.flow_runs.worker - Pod event 'Scheduled' at 2024-01-30 10:12:52.808564+00:00: Successfully assigned data-platform/fabulous-dugong-jdsqj-2hv4c to ip-10-2-0-84.eu-west-1.compute.internal
10:13:53.205 | INFO | prefect.flow_runs.worker - Pod event 'Pulled' at 2024-01-30 10:12:53+00:00: Container image "406151221390.dkr.ecr.eu-west-1.amazonaws.com/data-platform/metabase-reports:0.0.1-dev.464_7e69dd0" already present on machine
10:13:53.206 | INFO | prefect.flow_runs.worker - Pod event 'Created' at 2024-01-30 10:12:53+00:00: Created container prefect-job
10:13:53.208 | INFO | prefect.flow_runs.worker - Pod event 'Started' at 2024-01-30 10:12:53+00:00: Started container prefect-job
10:13:53.408 | INFO | prefect.flow_runs.worker - Reported flow run '28a16102-2bf6-42d4-8b9d-1da08abd4a30' as crashed: Flow run infrastructure exited with non-zero status code -1.
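The timestamps in the worker log above show the pod reported as "never started" almost exactly one minute after it was first seen as Pending, even though the pod events show the container starting at 10:12:53. A minimal sketch that measures that gap from the two log lines follows. The helper name `seconds_between` is made up for illustration, and reading the gap as a watch timeout expiring is an assumption based on the later mention of watch timeouts in this thread.

```python
from datetime import datetime

# Matches the time prefix of the worker log lines, e.g. 10:12:52.879
LOG_TIME_FORMAT = "%H:%M:%S.%f"


def seconds_between(start: str, end: str) -> float:
    """Return the elapsed seconds between two same-day log timestamps."""
    t0 = datetime.strptime(start, LOG_TIME_FORMAT)
    t1 = datetime.strptime(end, LOG_TIME_FORMAT)
    return (t1 - t0).total_seconds()


pending_at = "10:12:52.879"        # "Pod has status 'Pending'."
never_started_at = "10:13:52.850"  # "Pod never started."

gap = seconds_between(pending_at, never_started_at)
print(f"{gap:.3f} s between 'Pending' and 'Pod never started'")
```

The sketch only handles timestamps from the same day, which is all the log excerpt needs.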
Adrien Besnard
01/30/2024, 10:21 AM
Adrien Besnard
01/30/2024, 10:30 AM
Adrien Besnard
01/30/2024, 10:48 AM
Brandon
01/30/2024, 12:21 PM
Nate
01/30/2024, 2:01 PM
Nate
01/30/2024, 2:17 PM
prefect-kubernetes that you have running when encountering the error?
Adrien Besnard
01/30/2024, 2:24 PM
Nate
01/30/2024, 2:25 PM
Adrien Besnard
01/30/2024, 2:29 PM
5:28 $ kubectl exec -it prefect-worker-56d4bdc94d-dcz67 -- bash
root@prefect-worker-56d4bdc94d-dcz67:/opt/prefect# pip list | grep prefect
prefect 2.14.17
prefect-kubernetes 0.3.3
[notice] A new release of pip is available: 23.3.1 -> 23.3.2
[notice] To update, run: pip install --upgrade pip
root@prefect-worker-56d4bdc94d-dcz67:/opt/prefect#
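The same version check can be scripted rather than read off `pip list`. This is a sketch only: the `AFFECTED` table and both helper names are hypothetical, and the single entry in the table reflects the release this thread later says to avoid (prefect-kubernetes v0.3.3), not an official list of affected versions.

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Optional

# Assumption for illustration: the only release flagged in this thread.
AFFECTED = {"prefect-kubernetes": {"0.3.3"}}


def installed_version(package: str) -> Optional[str]:
    """Return the installed version of a package, or None if it is missing."""
    try:
        return version(package)
    except PackageNotFoundError:
        return None


def is_affected(package: str, found: Optional[str]) -> bool:
    """True if the given version is listed as affected for that package."""
    return found in AFFECTED.get(package, set())


for name in ("prefect", "prefect-kubernetes"):
    found = installed_version(name)
    status = "AFFECTED" if is_affected(name, found) else "ok"
    print(f"{name}: {found or 'not installed'} ({status})")
```

Run inside the worker container, this would report the versions the worker process actually imports, which is what matters when the image and the local environment differ.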
Nate
01/30/2024, 2:32 PM
Adrien Besnard
01/30/2024, 2:38 PM
Adrien Besnard
01/30/2024, 2:42 PM
Brandon
01/30/2024, 3:08 PM
Nate
01/30/2024, 5:27 PM
Adrien Besnard
01/30/2024, 7:33 PM
Nate
01/30/2024, 8:01 PM
prefect-kubernetes to track this specific issue - since drastically increasing the watch timeouts definitely doesn't seem acceptable
Jacob Hurlbut
01/30/2024, 9:12 PM
Nate
01/30/2024, 9:16 PM
Jacob Hurlbut
01/30/2024, 9:30 PM
Jacob Hurlbut
01/30/2024, 9:36 PM
prefecthq/prefect:2.14.18-python3.12-kubernetes image
Nate
01/30/2024, 9:40 PM
Uriel Mandujano
01/30/2024, 9:55 PM
prefect-kubernetes v0.3.3 that we shipped in prefect v2.14.17. it's possible that something in that change is affecting flow runs. do the prefect-worker logs or pod logs have any more information about what's going on?
Jacob Hurlbut
01/30/2024, 10:19 PM
Uriel Mandujano
01/31/2024, 4:41 PM
prefect-kubernetes incoming. For now, anyone using the helm chart should pin the helm chart they're using to avoid prefect-kubernetes v0.3.3 (which are 2024.1.30 and 2024.1.25) and anyone directly using prefect-kubernetes should avoid v0.3.3. our current goal is to include the fix in tomorrow's release!
Uriel Mandujano
02/01/2024, 9:57 PM
prefect-kubernetes v0.3.4 and released helm chart v2024.2.1 which bundles the new prefect-kubernetes release with prefect v2.14.19. We expect this release to address the issue so please try it out at your convenience. Feel free to reply to this issue if you notice any more odd behavior coming from your prefect kubernetes workers!
Adrien Besnard
02/06/2024, 8:26 AM
Ben Zehavi
02/15/2024, 1:58 PM