I build the infrastructure layer beneath AI coding agents — the part that gives them architectural awareness, dependency intelligence, and runtime guardrails so they can operate safely in complex enterprise systems.
Today's AI coding tools are powerful but architecturally blind. They generate code without understanding what breaks downstream. In enterprise systems — where workflows, automations, approvals, and integrations are deeply interconnected — a single unsafe change can cascade across production.
I'm building open-source infrastructure that closes this gap: giving AI agents real project context, blast radius awareness, and consequence analysis before they write a single line of code.
Open-source graph engine and CLI that maps dependency lineage and blast radius inside large ServiceNow codebases.
Enterprise platforms like ServiceNow are notoriously opaque — thousands of interconnected scripts, workflows, and business rules with no clear visibility into what depends on what. fluent-graph turns that into a queryable directed graph.
What it does in practice:
- Parses and maps dependency chains across entire ServiceNow instances
- Calculates blast radius for any proposed change — showing every downstream workflow, automation, and integration that could break
- Exposes hidden dependencies that aren't documented anywhere
- Gives engineering teams a "before you deploy" safety check that didn't exist before
Built with: Node.js, TypeScript, directed graph modeling, CLI architecture
MCP server that connects AI coding agents to fluent-graph — giving Claude, Cursor, and any MCP-compatible tool architectural awareness before generating or modifying code.
This is the bridge between AI coding tools and real system architecture. One configuration connects an agent to the full dependency graph of a project.
What changes for the AI agent:
- Before modifying a script, the agent queries blast radius and sees every downstream dependency that could break
- Instead of hallucinating assumptions about system structure, the agent reasons over real graph data
- Unsafe modifications get flagged before they're generated, not after they hit production
Why this matters: Most AI coding failures in enterprise aren't syntax errors — they're context failures. The agent didn't know that changing one business rule would break an approval chain three layers deep. fluent-graph-mcp solves that at the infrastructure level.
Built with: Node.js, TypeScript, Model Context Protocol (MCP), graph query interface
Production-ready RAG system for health insurance benefits Q&A — deployed entirely through reusable AWS CDK infrastructure.
Employees ask natural-language questions about their benefits plans and get grounded, policy-specific answers. No general knowledge, no hallucinated coverage details — the LLM reasons only over retrieved chunks from the actual plan documents.
What it enforces:
- Answers grounded exclusively in uploaded policy text — the LLM never generates from general knowledge
- PII automatically masked before any question reaches the model (names, SSNs, phone numbers, addresses)
- Off-topic queries blocked at the guardrail layer before they touch the LLM
- Per-plan filtering — a PPO employee only gets PPO answers, no cross-plan bleed
What I learned building it:
- Traced one sentence through two completely different vector pipelines (ChromaDB locally vs S3 Vectors on AWS) — wrote about the internals here
- Discovered why metadata filtering is non-optional in multi-document RAG — without it, similarity search returns chunks from the wrong plan
- Built the agent layer (perceive→reason→act) when single-retrieval RAG couldn't handle multi-part benefits comparisons — wrote about that transition here
Built with: AWS CDK, S3 Vectors, Amazon Titan Embed V2, Nova Micro, Bedrock Guardrails, Lambda, API Gateway
📦 https://github.com/yesprasad/AWS-Bedrock-CDK-Benefits-QnA)
I write about what I learn while building — RAG internals, vector search, agentic systems, and infrastructure decisions.
From ChromaDB to S3 Vectors — What I Learned About How Embeddings Actually Work Traced one sentence through two completely different vector pipelines. Discovered why ChromaDB uses SQLite + HNSW together, why metadata filtering is non-optional in multi-document RAG, and what changes when you move from local inference to Bedrock Titan V2. Read →
From RAG to Agentic RAG — Why I Built It, Whether It's Real, and What Comes Next Rebuilt a wellness benefits RAG system into a full agent with perceive→reason→act architecture. Covers when RAG actually needs an agent layer, the ReAct loop, and real production deployments at Uber and eBay doing the same pattern. Read →
Benefits Q&A — Production RAG on AWS CDK Built a HIPAA-aware RAG system for health insurance benefits using S3 Vectors, Titan Embed V2, Nova Micro, and Bedrock Guardrails — deployed entirely through reusable CDK infrastructure. PII masking, topic blocking, per-plan filtering. Read →
I'm less interested in what model you use and more interested in the infrastructure around it:
- Dependency intelligence — Can an AI agent understand what depends on what before it acts?
- Runtime consequence analysis — Can we calculate blast radius in real time, before deployment?
- Execution safety — What guardrails make autonomous engineering systems trustworthy enough for production?
- Context engineering — How do we feed AI agents the right project context at the right time?
- Graph-aware reasoning — How do directed graphs change the way AI agents plan and execute?
The long-term vision: foundational infrastructure for autonomous engineering systems that are safe enough to trust with real production systems.
If you're building in the AI infrastructure, agentic systems, or developer tooling space — I'd like to hear what you're working on.
- GitHub: github.com/yesprasad
- LinkedIn: linkedin.com/in/eshwarprasadyaddanapudi
