Hey everyone I had a flow that ran 12 hours late I ve never Prefect Community #ask-community

Hey everyone, I had a flow that ran 12 hours late…...

Christian Sanchez

10/10/2023, 12:19 AM

Hey everyone, I had a flow that ran 12 hours late… I’ve never had a flow run late at all so seeing it run 12 hours late is a bit crazy to me. What are the cause of late run like this?

Bianca Hoch

10/10/2023, 3:10 PM

Hi Christian, from my experience, flow runs generally get stuck in a "Late" status when the worker/agent responsible for spinning up the infra for the flow run is not actively polling for new work

Christian Sanchez

10/10/2023, 9:01 PM

Thanks for the response, Bianca. Is there a reason that the agent is not polling for new work? I’ve just updated the database to use PostgreSQL instead of the standard SQLite so we’ll see if this helps. I was getting a lot of “database is locked” errors with SQLite.

Bianca Hoch

10/12/2023, 2:45 PM

Usually when the agent stops polling for work, some problem has occurred which caused the process to exit

Bianca Hoch

10/12/2023, 2:46 PM

Like the infrastructure or machine you're using to host the agent crashing or being spun down

Bianca Hoch

10/12/2023, 2:48 PM

The best way to mitigate this is to daemonize the agent in some way, to ensure that the lights are kept on. Here's a few examples for daemonization: • How to run a Prefect 2 worker as a systemd service on Linux • Daemonizing the Agent with Docker

Bianca Hoch

10/12/2023, 2:50 PM

I think the agent logs would help clarify why the flow run ran so late. Agent logs are sent to stdout by default, so you may be able to pinpoint what happened by looking at the timestamps

Bianca Hoch

10/12/2023, 2:56 PM

I was getting a lot of “database is locked” errors with SQLite.

Ah, gotcha. I'm not entirely familiar with that error, but I think it may be related to issues with multi-threading. IE: one threads locks the database, and another thread will be blocked from writing

Bianca Hoch

10/12/2023, 2:58 PM

FWIW, Prefect Cloud is always available as an alternative to self hosting. If it gets to be a bit troublesome to host the server and database, Cloud removes the onus of having to manage thoes resources yourself. ☁️

Christian Sanchez

11/05/2023, 6:55 PM

I didn’t think about checking the logs. I can do that too next time. Everything seems to be working fine now, might have just been a fluke.

3 Views

Open in Slack

Previous Next