yashdoke7/README.md

AI/ML Engineer · LLM Infrastructure · Robotics · Production ML

LinkedIn PyPI PyPI Downloads HuggingFace Email Profile views

Rising 4th-year BE Computer Engineering @ PESMCOE, SPPU · CGPA 9.34/10

Building AI systems that work outside the lab, from research prototype to deployed package.


🧠 Flagship Research β€” HierMem

LLMs forget. HierMem fixes that.

HierMem is an OS-inspired hierarchical paged memory architecture for long-horizon LLM conversations. Inspired by virtual memory paging (the same design that lets your computer run programs larger than physical RAM), it uses a stateless curator agent, a priority-tagged constraint store, and a 4-level memory hierarchy (L0→L3) to prevent context degradation, constraint forgetting, and hallucination.

┌─────────────────────────────────────────────────────────────┐
│  Curator (~1000 tokens, constant)  →  reads L0 index only   │
│  Retrieval  →  KEYWORD / SEMANTIC / HIERARCHICAL / HYBRID   │
│  Assembler  →  4-zone attention-optimised context           │
│  Main LLM   →  generates from bounded ~6000-token budget    │
│  Post-Proc  →  extracts constraints, updates L0→L3 archive  │
└─────────────────────────────────────────────────────────────┘
| Metric | Result |
| --- | --- |
| Memory compression ratio | 4.7× vs raw context |
| Constraint survival rate | 93.3% over long sessions |
| LLM-as-Judge score | 8.4–8.7 / 10 (vs 5.4–7.6 baselines) |
| Outperforms | RAG · RAG+Summary · MemGPT-style · Raw LLM |
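
The stages in the diagram above can be illustrated with a toy context assembler. This is a minimal sketch, not HierMem's actual API: the `Page`, `ConstraintStore`, `select_pages`, and `assemble_context` names, the whitespace token counter, the `constraint_budget` parameter, and the keyword-overlap retrieval are all assumptions made for illustration. It only shows the general idea of reading a small index, paging in what fits, and keeping high-priority constraints inside a bounded budget.

```python
from dataclasses import dataclass, field


def count_tokens(text: str) -> int:
    # Assumption: whitespace-separated words stand in for real tokenizer counts.
    return len(text.split())


@dataclass
class Page:
    """One archived chunk of conversation (hypothetical structure)."""
    page_id: int
    summary: str   # short index entry, the only part the curator reads
    content: str   # full text, paged in only when selected


@dataclass
class ConstraintStore:
    """Priority-tagged constraints; higher priorities are kept first."""
    items: list = field(default_factory=list)  # list of (priority, text)

    def add(self, priority: int, text: str) -> None:
        self.items.append((priority, text))

    def top(self, budget: int) -> list:
        chosen, used = [], 0
        for _, text in sorted(self.items, key=lambda item: -item[0]):
            cost = count_tokens(text)
            if used + cost <= budget:
                chosen.append(text)
                used += cost
        return chosen


def select_pages(query: str, pages: list) -> list:
    # KEYWORD-style retrieval over summaries only: rank by word overlap.
    words = set(query.lower().split())
    scored = [(len(words & set(p.summary.lower().split())), p) for p in pages]
    return [p for score, p in sorted(scored, key=lambda sp: -sp[0]) if score > 0]


def assemble_context(query, pages, constraints, budget=6000, constraint_budget=500):
    """Build a bounded prompt: constraints first, then pages, then the query."""
    parts = constraints.top(constraint_budget)
    used = sum(count_tokens(p) for p in parts) + count_tokens(query)
    for page in select_pages(query, pages):
        cost = count_tokens(page.content)
        if used + cost > budget:
            continue  # page stays archived; it does not fit the budget
        parts.append(page.content)
        used += cost
    parts.append(query)
    return "\n".join(parts), used
```

Because the selector looks only at `summary` fields, its input size stays roughly constant as the archive grows, which is the property the diagram attributes to the curator.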

GitHub PyPI PyPI Downloads Dataset


🔬 Projects

Step-level diagnosis and targeted fine-tuning for LLM reasoning failures: diagnoses where models fail, not just if they fail.

  • Fine-tuned Qwen 2.5 3B with QLoRA 4-bit
  • 63% reduction in step-level failure rate
  • Ground-truth backtracking + error propagation analysis
  • Runs fully locally via llama.cpp, with zero API cost

PyTorch QLoRA PEFT GGUF Ollama
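
To make "where models fail, not just if they fail" concrete, the sketch below compares a model's intermediate steps against a reference chain, locates the first wrong step, and counts the later errors that follow it. This is a simplified stand-in rather than the project's pipeline: the `first_failure` and `diagnose` names and the exact-match comparison of step values are assumptions.

```python
def first_failure(model_steps, truth_steps):
    """Return the index of the first step that deviates from ground truth, or None."""
    for i, (got, want) in enumerate(zip(model_steps, truth_steps)):
        if got != want:
            return i
    if len(model_steps) < len(truth_steps):
        return len(model_steps)  # the chain stopped early
    return None


def diagnose(model_steps, truth_steps):
    """Summarise the root error and how far it propagated."""
    root = first_failure(model_steps, truth_steps)
    if root is None:
        return {"root_step": None, "propagated": 0, "failure_rate": 0.0}
    wrong = sum(1 for got, want in zip(model_steps, truth_steps) if got != want)
    wrong += max(0, len(truth_steps) - len(model_steps))  # missing steps count as wrong
    return {
        "root_step": root,
        "propagated": wrong - 1,  # wrong steps other than the root error
        "failure_rate": wrong / len(truth_steps),
    }
```

A report of this shape separates a single arithmetic slip that poisons every later step from several independent mistakes, which a final-answer accuracy score cannot do.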

Deepfake detection pipeline designed to stay robust across domains and changing distributions.

  • 92% AUC-ROC across DFDC, CelebDF, WildDeepfake
  • SigLIP transformer embeddings + blockwise frame representation
  • Continual learning via replay buffers + knowledge distillation
  • Prevents catastrophic forgetting across dataset shifts

PyTorch SigLIP Transformers Continual Learning
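
The replay-buffer idea mentioned above can be sketched in a few lines. The source does not say how the buffer is filled, so reservoir sampling is used here as an assumed policy; the `ReplayBuffer` class and its `mixed_batch` method are hypothetical names, and the knowledge-distillation part of the pipeline is not shown.

```python
import random


class ReplayBuffer:
    """Fixed-size memory of past samples, filled by reservoir sampling."""

    def __init__(self, capacity, seed=0):
        self.capacity = capacity
        self.items = []
        self.seen = 0
        self.rng = random.Random(seed)

    def add(self, sample):
        self.seen += 1
        if len(self.items) < self.capacity:
            self.items.append(sample)
            return
        # Every sample seen so far is retained with probability capacity / seen.
        slot = self.rng.randrange(self.seen)
        if slot < self.capacity:
            self.items[slot] = sample

    def mixed_batch(self, new_samples, replay_count):
        """Combine fresh samples with a random draw of stored ones."""
        k = min(replay_count, len(self.items))
        return list(new_samples) + self.rng.sample(self.items, k)
```

Training on mixed batches keeps examples from earlier datasets in front of the model while it adapts to a new one, which is how replay counters catastrophic forgetting.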

Production-grade fraud detection: not a demo, a deployed pipeline.

  • 96.2% accuracy, 0.92 AUC-ROC
  • Processes 50K+ transactions/day
  • XGBoost + neural networks with advanced feature engineering
  • Real-time FastAPI classification with geolocation tracking

XGBoost FastAPI Neural Networks sklearn

Decentralised agents that negotiate and evolve decision policies autonomously.

  • PPO + quadratic voting with reputation weighting
  • Agents negotiate rule proposals in a simulated digital economy
  • PostgreSQL logging of governance events and adaptive reward strategies
  • Delivered as a functional 24-hour hackathon proof-of-concept

Ray RLlib PettingZoo PPO PostgreSQL
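
Quadratic voting with reputation weighting, as listed above, can be illustrated as follows. This is a sketch under stated assumptions rather than the hackathon code: casting v votes is taken to cost v² credits, reputation is applied as a simple multiplier, and the `votes_for_credits` and `tally` names and the ballot layout are hypothetical.

```python
import math


def votes_for_credits(credits):
    # Quadratic voting: casting v votes costs v**2 credits, so v = sqrt(credits).
    return math.sqrt(credits)


def tally(ballots, reputation):
    """Sum reputation-weighted votes per proposal.

    ballots: {agent: {proposal: signed credits}}, negative credits vote against.
    reputation: {agent: weight}, defaulting to 1.0 for unknown agents.
    """
    totals = {}
    for agent, spend in ballots.items():
        weight = reputation.get(agent, 1.0)
        for proposal, credits in spend.items():
            votes = math.copysign(votes_for_credits(abs(credits)), credits)
            totals[proposal] = totals.get(proposal, 0.0) + weight * votes
    return totals
```

The square-root relationship makes each additional vote more expensive than the last, so an agent with many credits cannot dominate a single proposal as cheaply as under linear voting.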

AquaIntel ⭐ 7

Ship routing system with live weather integration.

  • Enhanced A* algorithm with adaptive grid weighting
  • Integrates wind, depth risk, and fuel efficiency in real-time
  • Reduced route update time from 3.6 h to 36 minutes (6× faster)
  • Full-stack with route visualisation via Leaflet.js

TypeScript A* Algorithm OpenWeatherMap Leaflet.js
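
A weighted-grid A* search of the kind described above can be sketched as below. The project itself is written in TypeScript and its weighting scheme is not given here, so this Python version is only illustrative: each cell cost is assumed to already combine wind, depth risk, and fuel terms, `None` marks an impassable cell, and the `a_star` signature is hypothetical.

```python
import heapq


def a_star(cost, start, goal):
    """Cheapest 4-connected path on a grid where cost[r][c] is the price of entering a cell.

    A cell with cost None is impassable (for example, water that is too shallow).
    Returns (path, total_cost), or (None, inf) when the goal is unreachable.
    """
    rows, cols = len(cost), len(cost[0])
    min_cost = min(c for row in cost for c in row if c is not None)

    def heuristic(cell):
        # Manhattan distance scaled by the cheapest cell keeps the estimate admissible.
        return min_cost * (abs(cell[0] - goal[0]) + abs(cell[1] - goal[1]))

    frontier = [(heuristic(start), 0.0, start)]
    best = {start: 0.0}
    parent = {start: None}
    while frontier:
        _, g, cell = heapq.heappop(frontier)
        if cell == goal:
            path = []
            while cell is not None:
                path.append(cell)
                cell = parent[cell]
            return path[::-1], g
        if g > best[cell]:
            continue  # stale queue entry
        r, c = cell
        for nxt in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            nr, nc = nxt
            if not (0 <= nr < rows and 0 <= nc < cols) or cost[nr][nc] is None:
                continue
            new_g = g + cost[nr][nc]
            if new_g < best.get(nxt, float("inf")):
                best[nxt] = new_g
                parent[nxt] = cell
                heapq.heappush(frontier, (new_g + heuristic(nxt), new_g, nxt))
    return None, float("inf")
```

When weather changes, only the affected cell costs need to be updated before searching again, which is one plausible reading of "adaptive grid weighting".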

AI health platform built around generative AI orchestration.

  • Gemini 2.0 Flash for multimodal food recognition
  • NLP-driven dietary Q&A + longitudinal health trend analysis
  • Serverless GCP backend with Firestore real-time persistence
  • WCAG 2.1 accessible UI with Firebase Auth + server-side token validation

Gemini API Google Cloud Firebase Vanilla CSS


🤖 Robotics: ABU Robocon 2026

Team Vulcans · Software & AI Lead

Building the autonomous navigation and AI stack for PESMCOE's entry in ABU Robocon 2026.

  • Modified A* planning for partial observability and dynamic obstacles
  • PPO-based control with reward shaping for rule-compliant navigation
  • ROS2 architecture + Gazebo simulation environments
  • Validated in NVIDIA Isaac Sim before real-world hardware integration

ROS2 NVIDIA Isaac Sim Gazebo PPO Python MATLAB
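
Reward shaping for rule-compliant navigation, mentioned above, might look like the following. The team's actual reward function is not described, so this sketch swaps in potential-based shaping (a standard technique that adds `gamma * phi(next_state) - phi(state)` to the environment reward) plus a flat penalty for rule violations. The `potential` and `shaped_reward` names, the Manhattan-distance potential, and the penalty value are all assumptions.

```python
def potential(position, goal):
    # Assumption: negative Manhattan distance to the goal serves as the potential.
    return -(abs(position[0] - goal[0]) + abs(position[1] - goal[1]))


def shaped_reward(env_reward, state, next_state, goal,
                  gamma=0.99, rule_violation=False, penalty=5.0):
    """Environment reward plus a potential-based shaping term and a rule penalty."""
    shaping = gamma * potential(next_state, goal) - potential(state, goal)
    reward = env_reward + shaping
    if rule_violation:
        reward -= penalty  # hypothetical penalty for breaking a competition rule
    return reward
```

Steps toward the goal earn a small positive bonus and steps away earn a negative one, giving PPO a denser learning signal than a sparse success reward alone.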


⚙️ Tech Stack

LANGUAGES   = ["Python", "TypeScript", "C++", "SQL", "MATLAB"]

ML_AI       = ["PyTorch", "Hugging Face TRL", "QLoRA/PEFT", "LiteLLM",
               "LangChain", "XGBoost", "sentence-transformers", "SigLIP"]

LLM_INFRA   = ["Ollama", "llama.cpp", "GGUF", "ChromaDB",
               "RAG pipelines", "LLM-as-Judge evaluation"]

ROBOTICS    = ["ROS2", "NVIDIA Isaac Sim", "Gazebo", "A*", "PPO", "Ray RLlib"]

BACKEND     = ["FastAPI", "Flask", "PostgreSQL", "MongoDB", "Firebase"]

CLOUD       = ["Google Cloud Platform", "AWS"]

TOOLS       = ["Git", "Docker", "pytest", "Streamlit", "Weights & Biases"]

🎯 Currently

  • 🔬 Extending HierMem with Needle-in-a-Haystack and multi-step reasoning benchmarks
  • 🤖 Autonomous navigation stack for ABU Robocon 2026
  • 📖 Rising 4th-year BE CS · Open to research internships & AI engineering roles (2025–26)
  • 📬 Reach me: [email protected] · linkedin.com/in/yash-doke

"The gap between research and production is where I work."


Pinned

  1. llm-hiermem Public

    🧠 Hierarchical paged context management for LLMs, inspired by OS virtual memory. A stateless curator agent + priority constraint store + multi-level archive (L0→L3) prevents context degradation, c…

    Python 2

  2. llm-reasoning-pipeline Public

    🔬 Step-level LLM reasoning evaluation pipeline: diagnoses where models fail, not just if they fail. Includes backtrack error attribution, RAG mitigation, and targeted LoRA fine-tuning.

    Python 1

  3. ai-generated-video-detection Public

    AI-Generated Video Detection using transformer-based models (UNITE & UNITE-CL). The pipeline extracts video frames, generates embeddings (SigLIP), and classifies content as real or AI-generated. Su…

    Python 1

  4. AquaIntel Public

    🚢 Ship Routing System is a pathfinding-based application that calculates the shortest route between ports using the A* algorithm. It simulates real-world navigation by integrating live weather data…

    TypeScript 7 1

  5. Harsh-4210/Self_Evolving_Multi_Agent_Governance Public

    A decentralized multi-agent system for self-evolving governance, negotiation, and conflict resolution in a digital economy.

    TypeScript 1 1