Dream to Control: Learning Behaviors by Latent Imagination
Mastering Atari with Discrete World Models
We are targeting RLBench with DreamerV1. There are many dependency issues and runtime problems. The idea was originally based on the assumption that DreamerV2 would not be suitable for continuous action spaces; when that turned out to be a mistake, the decision was re-based on V1 being more useful for comparisons.
RLBench is a benchmark consisting of robot-arm tasks, for example reaching for a small black ball on a table (the default in our main.py).
Our preliminary question is whether we can get a DreamerV1 agent to complete these tasks, given that they yield a sparse reward only upon task completion. We assume that it can, because DreamerV1 was also reported to complete similarly sparse tasks from the DeepMind Control Suite.
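To make the sparse-reward setting concrete, here is a minimal interaction sketch, assuming the RLBench Python API of this era (the action mode, action size, and headless flag are assumptions that may differ between RLBench versions):

```python
import numpy as np
from rlbench.environment import Environment
from rlbench.action_modes import ActionMode, ArmActionMode
from rlbench.tasks import ReachTarget

action_mode = ActionMode(ArmActionMode.ABS_JOINT_VELOCITY)
env = Environment(action_mode, headless=True)
env.launch()
task = env.get_task(ReachTarget)

descriptions, obs = task.reset()
action_size = 8  # 7 joint velocities + 1 gripper action (assumed)
for _ in range(100):
    action = np.random.uniform(-1.0, 1.0, size=action_size)
    obs, reward, terminate = task.step(action)
    # reward stays 0.0 on every step until the target is reached (sparse reward)
    if terminate:
        break
env.shutdown()
```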
We build on dreamer-pytorch by @juliusfrost. If RLBench works, we want to use a shared world model across tasks.
So the plan is to:
- verify the original (DreamerV1/TensorFlow) MuJoCo/dm_control tasks with dreamer-pytorch,
- run multiple RLBench tasks with dreamer-pytorch,
- share a world model across tasks (see the sketch below).
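As a conceptual sketch of the last step (not code from this repo; WorldModel.loss, buffer.sample, and policy.update are hypothetical placeholders): one world model is optimized on batches from every task's replay buffer, while each task keeps its own actor-critic.

```python
import torch

def train_shared_world_model(world_model, policies, buffers, updates, batch_size=50):
    """Hypothetical loop: one world model shared by all tasks, one policy and
    one replay buffer per task."""
    optim = torch.optim.Adam(world_model.parameters(), lr=6e-4)
    for _ in range(updates):
        for policy, buffer in zip(policies, buffers):
            batch = buffer.sample(batch_size)   # sequences of (obs, action, reward)
            loss = world_model.loss(batch)      # reconstruction + reward + KL, as in Dreamer
            optim.zero_grad()
            loss.backward()
            optim.step()
            policy.update(world_model, batch)   # actor-critic trained in imagination
```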
Note: RLBench has a strict OpenGL > 3 driver dependency.
Install the base Python dependencies:
pip install -r requirements.txt
Atari is used for platform checks/tests.
In order to import ROMS, you need to download Roms.rar from the Atari 2600 VCS ROM Collection and extract the .rar file. Once you've done that, run:
python -m atari_py.import_roms <path to folder>
This should print out the names of ROMs as it imports them. The ROMs will be copied to your atari_py installation directory so that you can run:
python atari_py_test.py /home/$(whoami)/venv/lib/python3.9/site-packages/atari_py/atari_roms/pong.bin
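For reference, a minimal version of such a smoke test might look like this (a sketch; the actual atari_py_test.py may differ):

```python
import random
import sys

import atari_py

# Load the ROM given on the command line and play random actions.
ale = atari_py.ALEInterface()
ale.loadROM(sys.argv[1])
actions = list(ale.getMinimalActionSet())

ale.reset_game()
total_reward = 0.0
for _ in range(1000):
    total_reward += ale.act(random.choice(actions))
    if ale.game_over():
        ale.reset_game()
print("atari_py works; accumulated reward:", total_reward)
```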
There is no pip install rlbench yet (just a matter of time, I guess); RLBench is installed from source as described below.
RLBench uses CoppeliaSim (formerly known as V-REP, hence the name PyRep). CoppeliaSim is a 3D simulator with pluggable physics engines, but MuJoCo is not among them. The benefit is that no MuJoCo license is needed; the downside is that MuJoCo is the better physics engine for robotics tasks.
Note: most runtime bugs that we experience come from PyRep/CoppeliaSim, for example a failure to compute a nonlinear path.
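In practice we have to guard environment steps against such failures; a sketch of the pattern, assuming the exception types (pyrep.errors.ConfigurationPathError, rlbench.backend.exceptions.InvalidActionError) are the ones raised:

```python
from pyrep.errors import ConfigurationPathError
from rlbench.backend.exceptions import InvalidActionError

def safe_step(task, action):
    """Step the task, treating a failed path computation as a terminal
    transition instead of crashing the whole training run."""
    try:
        obs, reward, terminate = task.step(action)
    except (ConfigurationPathError, InvalidActionError):
        # PyRep/CoppeliaSim could not compute a (nonlinear) path for the arm:
        # end the episode with zero reward and let the sampler reset.
        return None, 0.0, True
    return obs, reward, terminate
```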
For now, note that:
CoppeliaSim_Edu_V4_1_0_Ubuntu*/
CoppeliaSim_Edu_V4_1_0_Ubuntu*..
RLBench/
PyRep/
are in .gitignore. You need to download these yourself and put the expanded folders in the SharedWorldModels folder (and reference CoppeliaSim in your .bashrc) before you can download and install PyRep, and only then download and install RLBench.
Please see the PyRep README here: https://github.com/stepjam/PyRep
PyRep requires version 4.2 of CoppeliaSim, which requires OpenGL > 3:
glxinfo | grep "OpenGL version"
This in turn requires a DISPLAY (see below).
Download CoppeliaSim from the Coppelia Robotics downloads page: https://www.coppeliarobotics.com/downloads
Once you have downloaded CoppeliaSim, you can pull PyRep from git:
git clone https://github.com/stepjam/PyRep.git
cd PyRep
Add the following to your ~/.bashrc file (NOTE: the 'EDIT ME' in the first line):
export COPPELIASIM_ROOT=EDIT/ME/PATH/TO/COPPELIASIM/INSTALL/DIR
export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:$COPPELIASIM_ROOT
export QT_QPA_PLATFORM_PLUGIN_PATH=$COPPELIASIM_ROOT
Remember to source your bashrc (source ~/.bashrc) or zshrc (source ~/.zshrc) after this.
Install the PyRep python library:
pip install -r requirements.txt
pip install -e .
Try running one of the examples in the examples/ folder.
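A minimal headless PyRep smoke test, following the pattern of the PyRep examples (the scene path is a placeholder):

```python
from pyrep import PyRep

pr = PyRep()
# headless=True avoids the need for a real DISPLAY.
pr.launch('/path/to/scene.ttt', headless=True)  # placeholder scene file
pr.start()                 # start the simulation
for _ in range(100):
    pr.step()              # advance the physics by one time step
pr.stop()                  # stop the simulation
pr.shutdown()              # close CoppeliaSim
```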
Although you can use CoppeliaSim on any platform, communication via PyRep is currently only supported on Linux.
rlpyt provides dreamer-pytorch with the samplers and runners for the RLBench environments and tasks, so that they can be run in CoppeliaSim. Note: rlpyt allows dreamer-pytorch to sample in parallel, but this has so far proven problematic with CoppeliaSim. A sketch of the environment interface rlpyt expects follows the install commands below.
git clone https://github.com/astooke/rlpyt.git
cd rlpyt
pip install -r requirements.txt
pip install -e .
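For reference, the environment interface rlpyt expects looks roughly like this (a sketch of the idea, not the wrapper actually used in this repo; front_rgb, the spaces, and the horizon are assumptions):

```python
import numpy as np
from rlpyt.envs.base import Env, EnvStep
from rlpyt.spaces.float_box import FloatBox

class RLBenchRlpytEnv(Env):
    """Hypothetical rlpyt wrapper around an RLBench task environment."""

    def __init__(self, task_env, action_size=8, img_shape=(64, 64, 3)):
        self._task = task_env
        self._action_space = FloatBox(low=-1.0, high=1.0, shape=(action_size,))
        self._observation_space = FloatBox(low=0, high=255, shape=img_shape)

    def reset(self):
        _, obs = self._task.reset()
        return obs.front_rgb  # use the front camera image as the observation

    def step(self, action):
        obs, reward, terminate = self._task.step(np.asarray(action))
        return EnvStep(obs.front_rgb, np.float32(reward), terminate, None)

    @property
    def horizon(self):
        return 200  # episode length cap (assumed)
```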
Now clone RLBench into the main SharedWorldModels folder:
git clone https://github.com/stepjam/RLBench.git
Being a git repo itself, its changes will not be tracked by the SharedWorldModels repo (we gitignore it). The relative paths in the following assume that RLBench is at SharedWorldModels/RLBench.
On experiment/dev branches I move:
SharedWorldModels/RLBench/rlbench/tasks/reach_target.py
to SharedWorldModels/rlbench_changes, then from SharedWorldModels/RLBench/rlbench/tasks/, I do
ln -s ../../../rlbench_changes/reach_target.py reach_target.py
so that the reach_target.py file is read via a symlink from the SharedWorldModels/rlbench_changes folder. This allows making changes in the RLBench repo that are tracked by the SWM repo.
To finish or to update the RLBench installation, in SharedWorldModels/RLBench:
pip install -r requirements.txt #only when needed
pip install -e .
There are several options to get a DISPLAY (or forego one):
- Run locally (need a CUDA device, preferably with 16GB GPU memory).
- Use a remote desktop with VNC (not on tfpool).
- ssh -X (optionally with -C for compression; slow)
- Run headless:
  4.1 Use xvfb-run python main.py (slow) (on tfpool via poolmgr; Sascha Frank may not be helpful).
  4.2 Use VirtualGL (not seen to work yet).
If you plan to run on a headless machine, to run with a virtual framebuffer, e.g.:
sudo apt-get install xvfb
xvfb-run python3 my_pyrep_app.py
# or if you are using jupyter
# xvfb-run jupyter notebook
You can run RLBench headlessly with VirtualGL. VirtualGL is an open-source toolkit that gives any Unix or Linux remote display software the ability to run OpenGL applications with full 3D hardware acceleration.
This is not known to work yet.
First ensure that you have the NVIDIA proprietary driver installed, i.e. you should get output when running nvidia-smi. Now run the following commands:
sudo apt-get install xorg libxcb-randr0-dev libxrender-dev libxkbcommon-dev libxkbcommon-x11-0 libavcodec-dev libavformat-dev libswscale-dev
sudo nvidia-xconfig -a --use-display-device=None --virtual=1280x1024
wget https://sourceforge.net/projects/virtualgl/files/2.5.2/virtualgl_2.5.2_amd64.deb/download -O virtualgl_2.5.2_amd64.deb
sudo dpkg -i virtualgl*.deb
rm virtualgl*.deb
You will now need to reboot, and then start the X server:
sudo reboot
nohup sudo X &
Now we are good to go! To render the application with the first GPU, you can do the following:
export DISPLAY=:0.0
python my_pyrep_app.py
To render with the second GPU, you would instead set the display as export DISPLAY=:0.1, and so on.
Note: VirtualGL may be installed on servers with sudo access rights. It is not available on the tfpool.
To monitor GPU usage, you can install nvtop:
sudo apt install cmake libncurses5-dev libncursesw5-dev
git clone https://github.com/Syllo/nvtop.git
mkdir -p nvtop/build && cd nvtop/build
cmake ..
make
sudo make install
GCloud
To run with RLBench, run python main.py and add arguments as needed (many hyperparameters are set in the code).
You can use TensorBoard:
tensorboard --logdir=data
To run tests:
pytest tests
To start, run Danijar Hafner's original DreamerV1. It is based on TensorFlow and MuJoCo.
Note: MuJoCo requires a license that is tied to a specific computer and to a .edu email address.
The instructions below for MuJoCo, patchelf, and Mesa can be read off the error trace of trying:
pip install mujoco_py
in ~/.bashrc:
export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:/home/$(whoami)/.mujoco/mujoco200/bin
mkdir -p ~/.mujoco
cp -r mujoco200_linux/ ~/.mujoco/
cd ~/.mujoco/
mv mujoco200_linux/ mujoco200
Place your license key (the mjkey.txt file from your email) at ~/.mujoco/mjkey.txt.
sudo apt install patchelf
or from source:
./configure
make
sudo make install
sudo apt install libosmesa6-dev libgl1-mesa-glx libglfw3
For the time being, TF 2.5 requires numpy < 1.20, so reinstall numpy accordingly:
pip uninstall numpy
pip install 'numpy<1.20'
Now it should work:
pip install mujoco_py
$ python3
>>> import mujoco_py
>>> import gym
>>> env = gym.make('FetchReach-v1')  # a MuJoCo-backed robotics environment
>>> env.render()                     # should open a viewer (requires a DISPLAY)
Finally, install the DeepMind Control Suite:
pip install dm_control
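A quick sanity check that dm_control works, using the standard suite API:

```python
import numpy as np
from dm_control import suite

# Load a standard task and take a few random actions.
env = suite.load(domain_name="walker", task_name="walk")
spec = env.action_spec()
time_step = env.reset()
for _ in range(10):
    action = np.random.uniform(spec.minimum, spec.maximum, size=spec.shape)
    time_step = env.step(action)
print("dm_control works; last reward:", time_step.reward)
```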