Soham
07/22/2024, 11:07 PM
Process 27834 exited with status code: -11
Marvin
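07/22/2024, 11:08 PM
"Process 27834 exited with status code: -11" typically indicates a segmentation fault: a negative status code means the process was terminated by a signal, and signal 11 is SIGSEGV. This is a serious error that usually stems from low-level issues such as a crash in native code. Potential causes include: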
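1. Resource Limitations: The worker might be exhausting memory, stack space, or another critical resource. Note that an out-of-memory kill by the kernel usually shows up as -9 (SIGKILL) rather than -11.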
2. Incompatible or Corrupted Dependencies: Specific versions of libraries or corrupted installations might cause segmentation faults.
3. Underlying System Issues: Issues in the underlying system (e.g., Docker, Kubernetes) or misconfigurations could also be culprits.
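To make the meaning of the status code concrete, here is a minimal Python sketch of how a negative return code maps to a signal name. The helper name `describe_exit` is hypothetical and not part of Prefect; it only relies on the standard library convention that `subprocess` reports "killed by signal N" as `-N`.

```python
import signal


def describe_exit(returncode: int) -> str:
    """Translate a subprocess return code into a readable description.

    Hypothetical helper for illustration; not a Prefect API.
    """
    if returncode < 0:
        # A negative code means the process was terminated by a signal.
        signum = -returncode
        try:
            name = signal.Signals(signum).name
        except ValueError:
            # The number does not correspond to a known signal on this platform.
            name = "unknown signal"
        return f"killed by signal {signum} ({name})"
    return f"exited normally with code {returncode}"


if __name__ == "__main__":
    print(describe_exit(-11))
```

Running the same helper on other codes from the worker logs can help separate segmentation faults from other kinds of termination.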
While there are no direct references to status code -11 in the Prefect docs and issues, here are some relevant resources to check:
- Prefect Troubleshooting Guide: https://docs.prefect.io/latest/guides/troubleshooting/
- Prefect Shell Integration: https://docs.prefect.io/latest/integrations/prefect-shell/
- Kubernetes Worker: https://docs.prefect.io/latest/integrations/prefect-kubernetes/worker/
Additionally, these GitHub issues related to crashes and internal errors may provide insights:
1. Setting auto remove to true in Podman container blocks crashes agent
2. TaskFailedToStart error: CannotPullContainerError
3. Docker containers kept open when they crash
For a more precise diagnosis:
- Check System Logs: Look at logs on the worker node for more detailed error messages around the time of the crash.
- Monitor Resource Usage: Ensure your worker has enough resources (CPU, RAM, Disk I/O) when running the flow.
- Verify Dependencies: Ensure all dependencies are compatible and correctly installed.
I suggest starting with these steps for a detailed investigation.
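As a starting point for the log and resource checks, here is a rough sketch using only the Python standard library. It assumes a Unix-like worker (the `resource` module is not available on Windows) and that you can add a few lines near the top of your flow's entry point. The function names `enable_crash_diagnostics` and `peak_memory_mib` are hypothetical, not Prefect APIs.

```python
import faulthandler
import resource
import sys


def enable_crash_diagnostics(log_file=None) -> None:
    """Dump Python tracebacks for all threads if the process receives a
    fatal signal such as SIGSEGV.

    log_file must be a real file object with a file descriptor and must
    stay open for as long as the handler is enabled; defaults to stderr.
    """
    target = log_file if log_file is not None else sys.stderr
    faulthandler.enable(file=target, all_threads=True)


def peak_memory_mib() -> float:
    """Return the peak resident set size of this process in MiB."""
    usage = resource.getrusage(resource.RUSAGE_SELF)
    # ru_maxrss is reported in kilobytes on Linux and in bytes on macOS.
    divisor = 1024 * 1024 if sys.platform == "darwin" else 1024
    return usage.ru_maxrss / divisor


if __name__ == "__main__":
    enable_crash_diagnostics()
    print(f"Peak memory so far: {peak_memory_mib():.1f} MiB")
```

If you would rather not touch the code, setting the environment variable `PYTHONFAULTHANDLER=1` or launching with `python -X faulthandler` enables the same traceback dump. The traceback shows which Python call was active at the time of the crash, which often points to the native library involved.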