The official repository for SurgRAW, a Chain-of-Thought-driven, multi-agent framework delivering transparent and interpretable insights for robotic-assisted surgery.
Note: The full codebase and MCQ dataset will be released soon.
SurgRAW employs specialized prompts and a hierarchical orchestration system across five core surgical intelligence tasks:
- Instrument Recognition
- Action Recognition
- Action Prediction
- Patient Data Extraction
- Outcome Assessment
- Chain-of-Thought Agents – Task-specific prompts guide VLM agents through structured reasoning, reducing hallucinations and improving explainability.
- Hierarchical Orchestration – A Department Coordinator routes queries to visual-semantic or cognitive-inference agents, mirroring real surgical workflows (a minimal routing sketch follows this list).
- Panel Discussion – An Action Evaluator cross-checks visual-semantic predictions using a knowledge graph and rubric-based evaluation for logical consistency.
- Retrieval-Augmented Generation (RAG) – Cognitive-inference tasks are grounded in external medical knowledge for reliable, domain-specific responses.
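To make the orchestration concrete, below is a minimal Python sketch of coordinator-style routing. All names (`department_coordinator`, the agent and retrieval stubs) are hypothetical placeholders, not the released SurgRAW API:

```python
# Illustrative sketch only: every name here is a hypothetical placeholder,
# not the released SurgRAW API. It shows the coordinator idea: route each
# query to the visual-semantic or cognitive-inference branch, then
# cross-check visual predictions in a panel-discussion step.

VISUAL_SEMANTIC = {"instrument_recognition", "action_recognition", "action_prediction"}
COGNITIVE_INFERENCE = {"patient_data_extraction", "outcome_assessment"}


def visual_semantic_agent(frame_path: str, question: str) -> str:
    # Placeholder for a CoT-prompted VLM call on the surgical frame.
    return f"[visual-semantic answer to '{question}' for {frame_path}]"


def action_evaluator(draft_answer: str, question: str) -> str:
    # Placeholder for the panel-discussion step (knowledge-graph and rubric check).
    return draft_answer


def retrieve_medical_knowledge(question: str) -> str:
    # Placeholder retrieval step over an external medical knowledge source.
    return "retrieved medical context"


def cognitive_inference_agent(frame_path: str, question: str, context: str) -> str:
    # Placeholder for a RAG-grounded reasoning call.
    return f"[cognitive-inference answer to '{question}' grounded in: {context}]"


def department_coordinator(task: str, frame_path: str, question: str) -> str:
    """Route a query to the agent family responsible for its task category."""
    if task in VISUAL_SEMANTIC:
        draft = visual_semantic_agent(frame_path, question)
        return action_evaluator(draft, question)  # logical-consistency check
    if task in COGNITIVE_INFERENCE:
        context = retrieve_medical_knowledge(question)
        return cognitive_inference_agent(frame_path, question, context)
    raise ValueError(f"Unknown task category: {task}")


print(department_coordinator("instrument_recognition", "frames/frame_0001.png",
                             "Which instrument is visible?"))
```

In the full framework, each placeholder corresponds to a CoT-prompted VLM agent, the Action Evaluator, or the RAG retrieval step described in the bullets above.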
We evaluate SurgRAW on SurgCoTBench — the first reasoning-based dataset covering the entire surgical workflow.
- 12 robotic procedures
- 2,277 frames
- 14,176 vision–query pairs
- 5 task categories aligned with the SurgRAW framework
Release Plan: SurgCoTBench and the corresponding Chain-of-Thought prompts will be made available with our paper.
You may also use SurgCoTBench or any dataset that includes the following columns in its .xlsx file:
`image_path`, `question`, `ground_truth`
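As a minimal sketch (not part of the released code), such a dataset can be loaded and validated with pandas before running the pipeline; the file path below is hypothetical, and reading `.xlsx` files requires `openpyxl`:

```python
import pandas as pd

# Load the .xlsx dataset (requires openpyxl) and verify the expected columns.
df = pd.read_excel("data/my_dataset.xlsx")  # hypothetical path

required = {"image_path", "question", "ground_truth"}
missing = required - set(df.columns)
if missing:
    raise ValueError(f"Dataset is missing columns: {missing}")

# Each row pairs a surgical frame with a query for the pipeline.
for _, row in df.iterrows():
    print(row["image_path"], row["question"])
```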
This repository currently showcases:
- The SurgRAW agentic framework architecture
- Collaboration metrics
Dataset and full CoT prompt releases will follow publication. Collaborations are warmly welcomed.
Follow these steps to set up the SurgRAW environment:
# 1️⃣ Create a new conda environment
conda create -n SurgRAW python=3.12 -y
# 2️⃣ Activate the environment
conda activate SurgRAW
# 3️⃣ Install required Python packages
pip install -r requirements.txt
Ensure `requirements.txt` is in the project root.
For GPU support, install the PyTorch wheels that match your CUDA version, following the official PyTorch installation instructions.
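To verify the install, a quick sanity check using standard PyTorch calls (nothing SurgRAW-specific) confirms whether the CUDA wheel can see your GPU:

```python
# Verify the installed PyTorch build and GPU visibility.
import torch

print(torch.__version__)           # installed PyTorch version
print(torch.cuda.is_available())   # True only if the CUDA wheel matches your driver
```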
Run the orchestration pipeline on your `.xlsx` dataset using the provided script (which calls `final_orchestrator` under the hood).
python run_orchestration.py --xlsx_file /path/to/your/input.xlsx --log_dir /path/to/save/logs
Arguments
- `--xlsx_file` – Path to the Excel file with columns: `image_path`, `COT_Process`, `question_mcq`, `ground_truth` (optional)
- `--log_dir` – Directory where per-row logs (`*.txt`) will be written
Example
python run_orchestration.py --xlsx_file data/SurgCoTBench_sample.xlsx --log_dir logs/
Each row produces a dedicated log file named like:
`<image_name>_<COT_FileNamingConvention>_SurgCOT.txt`
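For reference, here is a small illustrative sketch of how such a log name can be assembled from a dataset row; the actual naming logic lives in `run_orchestration.py`:

```python
from pathlib import Path

def log_filename(image_path: str, cot_naming: str) -> str:
    """Build a per-row log name of the form <image_name>_<COT_FileNamingConvention>_SurgCOT.txt."""
    image_name = Path(image_path).stem            # e.g. "frame_0042" from "frames/frame_0042.png"
    return f"{image_name}_{cot_naming}_SurgCOT.txt"

# Example with hypothetical values:
print(log_filename("frames/frame_0042.png", "InstrumentRecognition"))
# -> frame_0042_InstrumentRecognition_SurgCOT.txt
```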
If you find this work useful, please cite our paper:
@article{low2025surgraw,
title={SurgRAW: Multi-agent workflow with chain-of-thought reasoning for surgical intelligence},
author={Low, Chang Han and Wang, Ziyue and Zhang, Tianyi and Zeng, Zhitao and Zhuo, Zhu and Mazomenos, Evangelos B and Jin, Yueming},
journal={arXiv preprint arXiv:2503.10265},
year={2025}
}
Have fun with our work!