# Learning to Tune Like an Expert: Interpretable and Scene-Aware Navigation via MLLM Reasoning and CVAE-Based Adaptation
This is the official project repository for LE-Nav ([DEMO]).

LE-Nav is an interpretable and adaptive navigation framework for service robots operating in dynamic, human-centric environments. Traditional navigation systems often struggle in such unstructured settings because their parameters are fixed and they generalize poorly. LE-Nav addresses this by combining multi-modal large language models (MLLMs) with conditional variational autoencoders (CVAEs) to achieve zero-shot scene understanding and expert-level parameter tuning.
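For intuition on the MLLM half, the sketch below queries a vision-language model for a zero-shot scene score through the `openai` client installed in the next step. The model name, prompt, and numeric reply format are illustrative assumptions, not LE-Nav's actual interface.

```python
# Hypothetical sketch: zero-shot scene scoring with an MLLM.
# The model name, prompt, and reply format are assumptions,
# not LE-Nav's actual interface.
import base64
from openai import OpenAI

client = OpenAI(api_key="YOUR_API_KEY")

def score_scene(image_path: str) -> float:
    """Ask a vision-language model how crowded/dynamic a scene is (0 to 1)."""
    with open(image_path, "rb") as f:
        image_b64 = base64.b64encode(f.read()).decode("utf-8")
    response = client.chat.completions.create(
        model="gpt-4o",  # assumed model; substitute whichever MLLM you use
        messages=[{
            "role": "user",
            "content": [
                {"type": "text",
                 "text": "Rate how crowded and dynamic this scene is on a "
                         "scale from 0 to 1. Reply with only the number."},
                {"type": "image_url",
                 "image_url": {"url": f"data:image/jpeg;base64,{image_b64}"}},
            ],
        }],
    )
    return float(response.choices[0].message.content.strip())
```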
Download the repository and create a conda environment.
```bash
conda create --name readscene python=3.9
conda activate readscene
```
Install dependencies.
```bash
pip install openai
conda install pytorch==2.1.0 torchvision==0.16.0 torchaudio==2.1.0 pytorch-cuda=12.1 -c pytorch -c nvidia
pip install numpy==1.22.4
conda install tensorboard
pip install ultralytics
```
Collect the training data for your planner and customize your `config.yaml` accordingly.
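As a hedged illustration of that configuration step, the snippet below loads and sanity-checks a `config.yaml` (assuming PyYAML is available, e.g. pulled in by `ultralytics`). Every key name here is a placeholder guess, not the repository's actual schema.

```python
# Hypothetical sketch: loading and validating config.yaml before training.
# All key names are assumptions about the schema, not LE-Nav's actual one.
import yaml

with open("config.yaml") as f:
    cfg = yaml.safe_load(f)

# Keys such a config might plausibly carry (placeholders):
#   data_dir:    path to the collected expert tuning data
#   param_names: planner parameters to learn (e.g. max_vel_x, inflation_radius)
#   latent_dim:  CVAE latent dimensionality
for key in ("data_dir", "param_names", "latent_dim"):
    assert key in cfg, f"config.yaml is missing required key: {key}"
```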
Then train the CVAE:

```bash
python train_cvae.py
```
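To make the training step concrete, here is a minimal PyTorch sketch of a conditional VAE that maps a scene descriptor to planner parameters. The architecture, dimensions, and loss weighting are assumptions for illustration; `train_cvae.py` is the actual implementation.

```python
# Hypothetical sketch of a CVAE over planner parameters, conditioned on a
# scene descriptor. Dimensions and loss weighting are assumptions; see
# train_cvae.py for the real implementation.
import torch
import torch.nn as nn
import torch.nn.functional as F

class CVAE(nn.Module):
    def __init__(self, param_dim=8, cond_dim=4, latent_dim=2, hidden=64):
        super().__init__()
        self.encoder = nn.Sequential(
            nn.Linear(param_dim + cond_dim, hidden), nn.ReLU())
        self.mu = nn.Linear(hidden, latent_dim)
        self.logvar = nn.Linear(hidden, latent_dim)
        self.decoder = nn.Sequential(
            nn.Linear(latent_dim + cond_dim, hidden), nn.ReLU(),
            nn.Linear(hidden, param_dim))

    def forward(self, params, cond):
        h = self.encoder(torch.cat([params, cond], dim=-1))
        mu, logvar = self.mu(h), self.logvar(h)
        z = mu + torch.randn_like(mu) * torch.exp(0.5 * logvar)  # reparameterization trick
        recon = self.decoder(torch.cat([z, cond], dim=-1))
        return recon, mu, logvar

def cvae_loss(recon, params, mu, logvar, beta=1.0):
    rec = F.mse_loss(recon, params)                                 # reconstruction term
    kld = -0.5 * torch.mean(1 + logvar - mu.pow(2) - logvar.exp())  # KL to N(0, I)
    return rec + beta * kld
```

At inference time only the decoder is needed: sample z from N(0, I), concatenate it with the MLLM-derived scene descriptor, and decode a full parameter set for the planner.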
Fill in the path and your API key in the ROS node script (`image_infer_node.py`), then launch it:
```bash
source ~/your_ws/devel/setup.bash
rosrun your_package path/to/image_infer_node.py
```
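If it helps to see the shape of such a node, below is a hedged `rospy` sketch that subscribes to a camera topic, runs inference, and republishes tuned parameters. The topic names, the `Float32MultiArray` output, and the `infer()` hook are all assumptions about the real `image_infer_node.py`.

```python
#!/usr/bin/env python
# Hypothetical sketch of an image-inference ROS node. Topic names, message
# types, and the infer() hook are assumptions, not the actual contents of
# image_infer_node.py.
import rospy
from cv_bridge import CvBridge
from sensor_msgs.msg import Image
from std_msgs.msg import Float32MultiArray

bridge = CvBridge()
param_pub = None

def infer(cv_image):
    """Placeholder for MLLM scene scoring followed by CVAE parameter decoding."""
    return [0.5] * 8  # assumed number of planner parameters

def image_callback(msg):
    cv_image = bridge.imgmsg_to_cv2(msg, desired_encoding="bgr8")
    param_pub.publish(Float32MultiArray(data=infer(cv_image)))

if __name__ == "__main__":
    rospy.init_node("image_infer_node")
    param_pub = rospy.Publisher("/lenav/planner_params",
                                Float32MultiArray, queue_size=1)
    rospy.Subscriber("/camera/color/image_raw", Image, image_callback)
    rospy.spin()
```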