WebArena is a standalone, self-hostable web environment for building autonomous agents
Note This README is still under constructions. Stay tuned!
- [8/4/2023] Added the instructions and the docker resources to host your own WebArena Environment. Check out this page for details.
- [7/29/2023] Added a well commented script to walk through the environment setup.
# Python 3.10+
conda create -n webarena python=3.10; conda activate webarena
pip install -r requirements.txt
playwright install
pip install -e .
# optional, dev only
pip install -e ".[dev]"
mypy --install-types --non-interactive browser_env
pip install pre-commit
pre-commit installCheck out this script for a quick walkthrough on how to set up the environment and interact with it.
- Setup the
environas described in the quick walkthrough python scripts/generate_test_data.pywill generate individual config file for each test example in config_filesbash prepare.shto obtain the auto-login cookies for all websites- export OPENAI_API_KEY=your_key
python run.py --instruction_path agent/prompts/jsons/p_cot_id_actree_2s.json --test_start_idx 0 --test_end_idx 1 --model gpt-3.5-turbo --result_dir your_result_dirto run the first example with GPT-3.5 reasoning agent. The trajectory will be saved inyour_result_dir/0.html
If you use our environment or data, please cite our paper:
@article{zhou2023webarena,
title={WebArena: A Realistic Web Environment for Building Autonomous Agents},
author={Zhou, Shuyan and Xu, Frank F and Zhu, Hao and Zhou, Xuhui and Lo, Robert and Sridhar, Abishek and Cheng, Xianyi and Bisk, Yonatan and Fried, Daniel and Alon, Uri and others},
journal={arXiv preprint arXiv:2307.13854},
year={2023}
}
