Jbb-525/README.md

Hi there 👋

I build end-to-end AI systems, from RL research and agentic pipelines to production deployment.

Agentic Systems

Financial Agentic RAG: LangGraph agent routing financial report queries across Qdrant vector search and SQLite, with CRAG-style relevance loops and SSE streaming to a React frontend.

VoiceNav: LLM browser automation agent with Text vs Vision planner ablation (+17pp task success), two-layer failure attribution, and CDP-based live browser streaming.

Research

Web World Model: RL training system for web navigation agents. SFT + GRPO on Qwen2.5-3B on 2×A100 with vLLM rollout serving, 150K synthetic samples; +44% over CoT, 4.4× faster than Tree Search.

Spatial VLM Investigator: Spatial reasoning in VLMs via CoT and RL fine-tuning. GRPO beats SFT on OOD generalization (3.17% vs 12.03% ID-OOD gap).

Academic Knowledge Graph: End-to-end KG pipeline (crawling → ontology construction → BERT+BiLSTM+CRF NER → Neo4j → semantic retrieval).

Knowledge & Infrastructure

ML Inference on Kubernetes: PyTorch training Job + inference Deployment on GKE, shared PersistentVolume, liveness/readiness probes, LoadBalancer REST API.

Pinned

  1. Yvonne511/spatial-vlm-investigator Public

    Jupyter Notebook 4

  2. Construction-of-Academic-Knowledge-Graph-for-Semantic-Retrieval Public

    Construct a fine-grained academic knowledge graph with a stable hierarchical structure for semantic retrieval.

    Jupyter Notebook 2 1

  3. webworldmodel Public

    A fine-tuned world model for efficient look-ahead planning in autonomous web agents.

    Python

  4. Agentic-RAG-chatbot-for-Financial-Report-QA Public

    An agentic Retrieval-Augmented Generation system for querying financial reports, built with LangGraph.

    Python