hi, for one of my work pools, i have 5 workers deployed on kubernetes pods, for a kubernetes work pool. Occasionally, and only for this work pool, all the workers lose connection. I can't see anything in the pod logs that indicate why the workers have lost connection, or even that they have lost connection (it only shows me the logs from the jobs it has run). What can I do to diagnose this issue?
Arthur
08/15/2024, 9:00 AM
Hi, were' seeing this increasingly regularly -- workers just lose connection and stop scheduling/removing pods with no error message. Can anyone advise?
Bring your towel and join one of the fastest growing data communities. Welcome to our second-generation open source orchestration platform, a completely rethought approach to dataflow automation.