Rising 4th-year BE Computer Engineering @ PESMCOE, SPPU Β· CGPA 9.34/10
Building AI systems that work outside the lab β from research prototype to deployed package.
LLMs forget. HierMem fixes that.
HierMem is an OS-inspired hierarchical paged memory architecture for long-horizon LLM conversations. Inspired by virtual memory paging β the same design that lets your computer run programs larger than physical RAM β it uses a stateless curator agent, a priority-tagged constraint store, and a 4-level memory hierarchy (L0βL3) to prevent context degradation, constraint forgetting, and hallucination.
βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
β Curator (~1000 tokens, constant) β reads L0 index only β
β Retrieval β KEYWORD / SEMANTIC / HIERARCHICAL / HYBRID β
β Assembler β 4-zone attention-optimised context β
β Main LLM β generates from bounded ~6000-token budget β
β Post-Proc β extracts constraints, updates L0βL3 archive β
βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
| Metric | Result |
|---|---|
| Memory compression ratio | 4.7Γ vs raw context |
| Constraint survival rate | 93.3% over long sessions |
| LLM-as-Judge score | 8.4β8.7 / 10 (vs 5.4β7.6 baselines) |
| Outperforms | RAG Β· RAG+Summary Β· MemGPT-style Β· Raw LLM |
|
Step-level diagnosis and targeted fine-tuning for LLM reasoning failures β diagnoses where models fail, not just if they fail.
|
Deepfake detection pipeline designed to stay robust across domains and changing distributions.
|
|
Production-grade fraud detection β not a demo, a deployed pipeline.
|
Decentralised agents that negotiate and evolve decision policies autonomously.
|
AquaIntel β 7Ship routing system with live weather integration.
|
AI health platform built around generative AI orchestration.
|
Team Vulcans Β· Software & AI Lead
Building the autonomous navigation and AI stack for PESMCOE's entry in ABU Robocon 2026.
- Modified A* planning for partial observability and dynamic obstacles
- PPO-based control with reward shaping for rule-compliant navigation
- ROS2 architecture + Gazebo simulation environments
- Validated in NVIDIA Isaac Sim before real-world hardware integration
ROS2 NVIDIA Isaac Sim Gazebo PPO Python MATLAB
LANGUAGES = ["Python", "TypeScript", "C++", "SQL", "MATLAB"]
ML_AI = ["PyTorch", "Hugging Face TRL", "QLoRA/PEFT", "LiteLLM",
"LangChain", "XGBoost", "sentence-transformers", "SigLIP"]
LLM_INFRA = ["Ollama", "llama.cpp", "GGUF", "ChromaDB",
"RAG pipelines", "LLM-as-Judge evaluation"]
ROBOTICS = ["ROS2", "NVIDIA Isaac Sim", "Gazebo", "A*", "PPO", "Ray RLlib"]
BACKEND = ["FastAPI", "Flask", "PostgreSQL", "MongoDB", "Firebase"]
CLOUD = ["Google Cloud Platform", "AWS"]
TOOLS = ["Git", "Docker", "pytest", "Streamlit", "Weights & Biases"]- π¬ Extending HierMem β Needle-in-a-Haystack and multi-step reasoning benchmarks
- π€ Autonomous navigation stack for ABU Robocon 2026
- π Rising 4th year BE CS Β· Open to research internships & AI engineering roles (2025β26)
- π¬ Reach me: [email protected] Β· linkedin.com/in/yash-doke