Lev Zemlyanov
03/19/2025, 1:24 AMMarvin
03/19/2025, 1:24 AMfinished_job_ttl
is set appropriately. This setting determines how long finished jobs are retained before being cleaned up by the Kubernetes controller. If not set, jobs may linger indefinitely. Reference.
2. Cancellation Issues: If the infrastructure does not support cancellation or if there are mismatched identifiers, the pods may not terminate. This can happen if the worker cannot find the infrastructure associated with the run or if there are errors during cancellation. Reference.
3. Data Transfer Delays: There have been reports of multi-minute delays between tasks in Kubernetes, especially when large amounts of data are passed between tasks. This could indirectly affect pod shutdown if tasks are delayed. GitHub Issue.
4. Kubernetes Agent Issues: Ensure that you are using a stable version of the Prefect Kubernetes agent. There have been issues with certain versions causing errors. GitHub Issue.
Given your use of Prefect in a Kubernetes environment, you might want to review these settings and issue reports to troubleshoot further. If the problem persists, consider reaching out to the Prefect community or support for more tailored assistance.Lev Zemlyanov
03/19/2025, 1:26 AMMarvin
03/19/2025, 1:27 AM