🤖 AI Infrastructure Engineer · 🎓 B.S. CS · MBA Candidate
Obsessed with the unglamorous substrate that separates autonomous deployments from demos.
-
🐉 Drako — the security layer your AI agents are missing. Scans your codebase before you ship, enforces policy at runtime, and blocks bad tool calls before they execute. 97 deterministic rules, zero LLMs in the eval loop, <2ms overhead.
-
🧠 Graymatter — persistent memory layer for AI agents. Turns stateless agents into stateful ones with a single Go binary. Hybrid retrieval (vector + keyword + recency), ~90% token reduction after a few sessions. Zero infra, zero dependencies—a true drop-in solution. Auto-wires with any vendor (Claude Code, Cursor, Codex, OpenCode, Antigravity) and any MCP-compatible client.
-
⚡ Forge — Universal multi-agent harness. Adapts LangGraph, CrewAI, AutoGen (zero migration, auto-detected) with per-agent cost tracking, hybrid memory retrieval, observability, and self-evolution. 11 composable packages, zero framework lock-in.
-
🌐 Stratum — dual-tier LLM inference router for Gemma 4 on 16GB consumer GPUs. Zero-inference complexity scorer (6 dimensions, precompiled regexes) decides in <1ms whether a request goes to vLLM for speed or llama.cpp for quality, backed by live VRAM monitoring and adaptive latency SLA enforcement via EMA. Full model quality at fraction of the hardware cost.
-
🔬 Epica — formal belief revision runtime for LLM agents, grounded in AGM postulates. Where other memory systems store contradictions silently, Epica resolves them: minimal contraction removes what must go, Noisy-OR propagates confidence causally, C=(P,I,G,R) contracts enforce policy before the agent acts. K2-K5 verified. T-ECE 0.07 < 0.08. MCP 2026 native. PyO3 SDK.
-
🧭 Meridian — phase-aware inference scheduler for reasoning models, grounded in recent EAT/RPDI research on reasoning termination. Reframes a single request as two workloads with opposite SLOs, then schedules them accordingly: output-first dual-queue dispatch, EAT/RPDI-gated </ think> forcing, three-tier KV eviction, and transport-agnostic disaggregated KV offload hooks. Rust core, PyO3-backed Python/vLLM integration, and a stock-vs-Meridian A/B harness centered on TTOT, output ITL, and user-visible degradation events.
-
Long-horizon drift as a typed invariant problem — coherence is not correctness; agents drift silently over 100-step trajectories while sounding confident. I'm building CLT-based drift bounds enforced at the belief mutation layer, not as output-level checks that arrive too late.
-
Causal failure attribution as infrastructure — in multi-agent systems, every downstream confidence collapse has an upstream cause. I treat causal provenance as a first-class primitive: every belief revision needs a diff, every failure needs an attributable path back to the state that originated it.
-
Formally bounded autonomy over formally verified outputs — proving what an agent won't do matters more than guardrailing what it says. AGM revision semantics and typed behavioral contracts at the execution layer; enforcement that disappears into the runtime at sub-millisecond latency or it becomes a feature flag.


