well I do use chromadb in a lot of places currently. i haven't updated the slackbot implementation in quite a while so tpuf was just the vectorstore i was checking out when I deployed the slackbot
i will eventually rewrite the slackbot using controlflow or pydantic ai, I just haven't gotten around to it.
imo the RAG performance bottleneck is not "who gives the best cosine similarity between documents" but instead how to
• enrich documents with metadata at ingest time
• empower the AI to filter by this metadata at query time
• combine traditional search with semantic search
who gives me cosine similarity is not the most consequential in my eyes, they're all quite similar