How to Build an Over-Engineered Retrieval System
towardsdatascience.com·4d
Flag this post

you’ll stumble upon when doing AI engineering work is that there’s no real blueprint to follow.

Yes, for the most basic parts of retrieval (the “R” in RAG), you can chunk documents, use semantic search on a query, re-rank the results, and so on. This part is well known.

But once you start digging into this area, you begin to ask questions like: how can we call a system intelligent if it’s only able to read a few chunks here and there in a document? So, how do we make sure it has enough information to actually answer intelligently?

Soon, you’ll find yourself going down a rabbit hole, trying to discern what others are doing in their own orgs, because none of this is properly documented, and people are still building their own setups.

This will lead you to implement various opti…

Similar Posts

Loading similar posts...