Your assessment that Prefect is heading in a direction that meets the requirements of real-time, low-latency generative AI systems has some merit, especially given the enhancements in recent releases such as 3.0.0rc10.
Improvements and Enhancements in Prefect 3.0 include:
- Inference and Persistence: Improvements like “infer task result persistence from other settings” streamline the management and orchestration of complex pipelines.
- Nested Task Support: Adding support for nested tasks in flow run graphs is particularly useful for managing large, intricate workflows typical in AI pipelines.
- Background Task Management: Enhancements in managing background tasks can aid in handling real-time processing demands efficiently.
However, the official documentation search hit a snag, so the points above could not be checked against detailed, targeted information. To be certain, review the Prefect 3.0 release notes in detail.
In summary, whether Prefect is the "right tool" might still need validation through smaller pilot projects or proof-of-concept setups for your specific use case. But it's certainly worthy of consideration.