Docling reposted this
Sunday Coffee & Code: RAG improvements, and a few detours Most of my recent Coffee & Code time has been going into one very specific thing, improving the RAG layer behind my Microsoft Agent Framework–based RFP multi-agent solution. Here’s the path so far. Step 1: Chroma + Docling (via MCP) ● This was the first “real” implementation. Simple vector search over RFP input, fed by Docling. ● It worked well enough to support one live RFP response, which is an MVP bar for me - whether something is more than an experiment. Step 2: LightRAG ● I then went looking for ways to improve retrieval quality and came across LightRAG (https://lnkd.in/gCftxBBr), which combines vector search, knowledge graph, and re-ranking. ● Technically, it was impressive. I got it running, built the knowledge graph, and the architecture made a lot of sense. ● But it also did something unsettling: the knowledge graph started containing things that were never in the source documents (for example, “Noah Carter – World Athletics Championship”). Whether this was a hallucinated artifact or not, it was a reminder to take a closer look. Step 3: OpenRAG Last week I came across IBM's OpenRAG stack - https://www.openr.ag/ - (Langflow + Docling + OpenSearch Project) and it checked a lot of boxes for me: ● LangFlow (where I built my first real agent back in 2024) ● Docling (already central to my document pipeline) ● OpenSearch (scalable, production-grade search) ● Langfuse support (observability - also used it before and it provides valuable insights) The install was quick (Docker, Inc based), though I did hit a startup issue on my Amazon Web Services (AWS) EC2 instance that took some fiddling to resolve (GitHub issue - https://lnkd.in/g38qb8QK - fixed with my PR: https://lnkd.in/g594VTQ6). Once running, though, this stack is genuinely interesting: configurable models, Langfuse tracing, and full access to LangFlow flows sitting right under the RAG layer. This feels much closer to something I could operate, tune, and trust over time (as well as getting support for). Next steps: Now it’s about stabilising the OpenRAG deployment and wiring it cleanly into my RAG Manager so the agent can start using it for real RFP work. As always, this about exploring the latest tools as well as answering a simple consulting question: Can I use this in a real, production, workflow and trust it? More to come.