ROS1 integration package for real-time 6D object pose estimation. Integrates FoundationPose (pose estimation) with Grounded SAM (open-vocabulary segmentation) for robotic manipulation tasks. Optimized for the Human Support Robot (HSR), with timestamp synchronization and temporal filtering for stable, accurate pose estimation.
Note: This package provides ROS1 integration and optimization. FoundationPose and Grounded SAM are external dependencies developed by their respective authors (see Acknowledgments).
This package provides:
- ROS1 Noetic Integration - Native ROS1 wrapper for FoundationPose and Grounded SAM
- Timestamp Synchronization - Eliminates pose drift on moving robots (30 cm → <2 cm height error)
- Temporal Filtering - Consensus-based pose smoothing for stable trajectories
- Real-Time Performance - 1.5-2.5 seconds per pose estimate (8-20x speedup from baseline)
- High Accuracy - Median position error <5 cm, orientation error <10°
- Open-Vocabulary Detection - Integrates Grounded SAM for any object via natural language
- YOLO+SAM Option - Fast alternative for COCO dataset objects
- HSR Optimized - Designed specifically for Human Support Robot
Left: Optimized system with Grounded SAM + timestamp synchronization (height error <2cm)
Right: Without timestamp synchronization (height drift ~30cm)
Left to Right: Without segmentation → Without timestamp sync → Optimized system
| Metric | Value |
|---|---|
| Position Error | < 5 cm (median) |
| Orientation Error | < 10° (median) |
| Height Error | < 2 cm (median) |
| Processing Time | 1.5-2.5 s/frame |
| Throughput | 60-80 poses/min (parallel) |
- ROS1 Noetic (required)
- NVIDIA GPU with CUDA support
- Conda (Miniconda or Anaconda)
- FoundationPose installed
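Before running the setup scripts, it can help to sanity-check the GPU-enabled PyTorch install in your active conda environment. A minimal check (this snippet is illustrative and not part of the package):

```python
# Quick sanity check for a CUDA-enabled PyTorch install.
# Run inside the conda environment you plan to use.
import torch

print("PyTorch:", torch.__version__)           # Grounded SAM requires 1.13.1
print("CUDA available:", torch.cuda.is_available())
if torch.cuda.is_available():
    print("GPU:", torch.cuda.get_device_name(0))
```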
# Clone into your catkin workspace
cd ~/catkin_ws/src
git clone https://github.com/OfficialBishal/ofpgs_ros.git
# Install dependencies
cd ofpgs_ros
# Option A: Grounded SAM (open-vocabulary, recommended)
./setup/setup_grounded_sam.sh
# Option B: YOLO + SAM (faster, COCO objects only)
./setup/setup_sam.sh
# Build ROS package
cd ~/catkin_ws
catkin_make
source devel/setup.bash

# Launch with Grounded SAM (open-vocabulary)
roslaunch ofpgs_ros foundationpose_with_grounded_sam.launch
# OR launch with YOLO + SAM (faster)
roslaunch ofpgs_ros foundationpose_with_sam.launch

# View estimated pose
rostopic echo /foundationpose_pose_estimation/pose
# Visualize in RViz
rviz -d $(rospack find ofpgs_ros)/rviz/hsr.rviz

Edit `config/foundationpose_config.yaml`:
# Object to detect (Grounded SAM uses this as text prompt)
object_name: "cracker_box" # or "mustard_bottle", or any custom object
# Mesh file path
mesh_file: "meshes/cracker_box/mesh.obj"
# Grounded SAM parameters (open-vocabulary)
grounded_sam:
  box_threshold: 0.80    # Detection confidence
  text_threshold: 0.80   # Text matching confidence

# FoundationPose parameters
foundationpose:
  est_refine_iter: 1     # Refinement iterations (1-3)
  debug: 0               # Debug level (0-3)

To add a custom object:
- Add mesh file: Place your object mesh at `meshes/{object_name}/mesh.obj`
- Update config: Set `object_name` in `config/foundationpose_config.yaml`
- Run: Grounded SAM automatically uses the object name as the text prompt (e.g., "cracker_box" → "cracker box")
No retraining required! Grounded SAM enables detection of any describable object.
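As an illustration, the assumed prompt derivation is a straightforward string transformation; the helper below is hypothetical and not part of the package's API:

```python
def object_name_to_prompt(object_name: str) -> str:
    """Turn a config object_name like 'cracker_box' into a natural-language
    text prompt like 'cracker box' for Grounded SAM."""
    return object_name.replace("_", " ")

assert object_name_to_prompt("cracker_box") == "cracker box"
assert object_name_to_prompt("mustard_bottle") == "mustard bottle"
```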
Subscribed topics:
- `/camera/rgb/image_raw` - RGB image
- `/camera/depth/image_raw` - Depth image
- `/camera/rgb/camera_info` - Camera intrinsics
Published topics:
- `~pose` - Estimated 6D pose (geometry_msgs/PoseStamped)
- `~markers` - Visualization markers (visualization_msgs/MarkerArray)
- `/segmentation/{object_name}_mask` - Segmentation mask
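A minimal rospy consumer for the estimated pose, using the resolved topic name from the Quick Start section (sketch only; message and topic names come from this README):

```python
#!/usr/bin/env python3
import rospy
from geometry_msgs.msg import PoseStamped

def pose_callback(msg: PoseStamped) -> None:
    # Each message carries the object's 6D pose plus the capture timestamp.
    p, q = msg.pose.position, msg.pose.orientation
    rospy.loginfo("pose @ %.3f: xyz=(%.3f, %.3f, %.3f) quat=(%.3f, %.3f, %.3f, %.3f)",
                  msg.header.stamp.to_sec(), p.x, p.y, p.z, q.x, q.y, q.z, q.w)

if __name__ == "__main__":
    rospy.init_node("pose_listener")
    # ~pose resolves to /foundationpose_pose_estimation/pose under the default node name
    rospy.Subscriber("/foundationpose_pose_estimation/pose", PoseStamped, pose_callback)
    rospy.spin()
```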
RGB-D Image → Grounded SAM (open-vocabulary) → Segmentation Mask
      ↓
FoundationPose → 6D Pose
      ↓
Temporal Filter → Stable Pose
Key Components:
- Grounded SAM: Open-vocabulary object detection + segmentation (external dependency)
- FoundationPose: Dense RGB-D pose estimation (external dependency)
- Temporal Filtering (this package): Consensus-based pose smoothing using SE(3) clustering (see the sketch after this list)
- Timestamp Synchronization (this package): Accurate poses on moving robots (HSR) - eliminates 30cm drift
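The consensus filter's actual implementation lives in this package's source; as an illustration only, here is a minimal sketch of one way sliding-window consensus smoothing could work, representing each pose as a (position, quaternion) pair. The window size and inlier threshold are placeholders, not the package's real parameters:

```python
import numpy as np
from collections import deque

class ConsensusPoseFilter:
    """Illustrative sliding-window consensus filter: poses far from the
    window's median position are rejected as outliers; inliers are averaged."""

    def __init__(self, window: int = 10, inlier_dist: float = 0.03):
        self.poses = deque(maxlen=window)  # (position[3], quaternion[4]) pairs
        self.inlier_dist = inlier_dist     # meters; placeholder threshold

    def update(self, position, quaternion):
        self.poses.append((np.asarray(position, float), np.asarray(quaternion, float)))
        positions = np.stack([p for p, _ in self.poses])
        median = np.median(positions, axis=0)
        inliers = [(p, q) for p, q in self.poses
                   if np.linalg.norm(p - median) < self.inlier_dist]
        if not inliers:                    # no consensus yet; pass through
            return position, quaternion
        avg_pos = np.mean([p for p, _ in inliers], axis=0)
        # Average quaternions after sign-aligning to the first inlier; a fair
        # approximation when the inlier rotations are tightly clustered.
        ref = inliers[0][1]
        quats = np.stack([q if np.dot(q, ref) >= 0 else -q for _, q in inliers])
        avg_q = quats.mean(axis=0)
        return avg_pos, avg_q / np.linalg.norm(avg_q)
```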
| Feature | Grounded SAM | YOLO+SAM |
|---|---|---|
| Vocabulary | Open-vocabulary (any object) | COCO classes only |
| Speed | ~680ms | ~370ms |
| Use Case | Custom/novel objects | Standard objects |
| Setup | Requires PyTorch 1.13.1 | Standard PyTorch |
Recommendation: Use Grounded SAM for flexibility, YOLO+SAM for speed.
| Issue | Solution |
|---|---|
| No pose published | Check mask quality, verify mesh dimensions |
| Import errors | Activate the correct conda environment (`conda activate grounded_sam` or `conda activate sam`) |
| CUDA out of memory | Use the `vit_b` SAM model instead of `vit_h` |
| Pose drift | Verify timestamp synchronization is enabled (it is on by default) |
| PyTorch version error (Grounded SAM) | Must use PyTorch 1.13.1 |
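The "Pose drift" fix works because RGB, depth, and camera info are paired by capture timestamp before pose estimation; on a moving robot, mixing frames taken at different times shifts the back-projected depth relative to the RGB mask, which is the source of the ~30 cm height drift noted above. A minimal sketch of the standard ROS1 pattern using message_filters (topic names from the Topics section; this illustrates the idea, not the package's exact node):

```python
import rospy
import message_filters
from sensor_msgs.msg import Image, CameraInfo

def synced_callback(rgb: Image, depth: Image, info: CameraInfo) -> None:
    # All three messages now share (approximately) the same capture time, so
    # the depth back-projection matches the RGB frame the mask came from.
    rospy.loginfo("synced frame at t=%.3f", rgb.header.stamp.to_sec())

rospy.init_node("sync_example")
subs = [message_filters.Subscriber("/camera/rgb/image_raw", Image),
        message_filters.Subscriber("/camera/depth/image_raw", Image),
        message_filters.Subscriber("/camera/rgb/camera_info", CameraInfo)]
# slop = maximum timestamp difference (seconds) tolerated between paired messages
sync = message_filters.ApproximateTimeSynchronizer(subs, queue_size=10, slop=0.05)
sync.registerCallback(synced_callback)
rospy.spin()
```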
See `readmes/` for detailed setup guides:
- `readmes/README_FOUNDATIONPOSE_SETUP.md` - Complete setup guide
- `readmes/README_GROUNDED_SAM_SETUP.md` - Grounded SAM installation
- `readmes/README_SAM_SETUP.md` - YOLO+SAM installation
See docs/report/main.pdf for detailed methodology, experiments, and results.
If you use this package in your research, please cite:
@software{ofpgs_ros,
  author = {Shrestha, Bishal},
  title = {OFPGS-ROS: Optimized ROS1 Integration for FoundationPose with Grounded SAM},
  year = {2024},
  url = {https://github.com/OfficialBishal/ofpgs_ros},
  note = {ROS1 integration package. FoundationPose and Grounded SAM are external dependencies.}
}

Please also cite the original works:
- FoundationPose: Wen et al., CVPR 2024
- Grounded SAM: IDEA Research
- SAM: Kirillov et al., ICCV 2023
This package integrates the following open-source projects (developed by their respective authors):
- FoundationPose - Wen et al., CVPR 2024 (6D pose estimation)
- Grounded SAM - IDEA Research (open-vocabulary segmentation)
- Segment Anything (SAM) - Kirillov et al., ICCV 2023 (segmentation)
This package provides:
- ROS1 integration and wrapper nodes
- Timestamp synchronization for moving robots
- Temporal filtering and consensus-based pose smoothing
- Performance optimizations (8-20x speedup)
- HSR-specific configuration and testing
MIT License - see LICENSE for details.
Keywords: ROS1, ROS Noetic, 6D pose estimation, object pose, robotic manipulation, open-vocabulary segmentation, Grounded SAM, FoundationPose, HSR, Human Support Robot, computer vision, robotics, real-time pose estimation