Beanstalk Data and Processing Artifacts

This repo contains all the data and tools needed to process data for Beanstalk. The dataset can be found on Zenodo here. See the artifact overview for usage instructions and dataset explainer for details on dataset structure if necessary.

Environment Setup

Python with JAX, matplotlib, and tqdm packages are needed. GPU backend for JAX is not required, but highly recommended to run processing scripts quickly. If you have these already, you can skip the rest of this section

The environment packaged as Docker build for consistency:

# Defaults to GPU enabled systems.
# Use the 'Dockerfile.cpu' build file if using CPU fallback
docker build -t beanstalk .

If using the Docker environment, run this in the root of this repo directory before proceeding with following steps:

# If using CPU fallback, remove '--gpus all'
docker run -it --gpus all --rm -v "`pwd`:/home/evaluator" beanstalk

Generating Figures (From Pre-Processed Data)

Extract {data,summary,simulations}.zip packaged in Zenodo/Github releases to the root directory of this repo.

To generate figures, run ./gen_figures.sh (ignore any runtime warnings). The output should mirror figures.zip .

Reproducing Results from Raw Data

The contained directories data, summary, and simulations above can be reproduced from the raw cluster data, following these steps:

Extract Raw Data from data-raw.zip to the root directory of the repo.
Data: Run ./gen_data.sh (approx. 2 min to run).
Summary: Run ./summarize.sh (approx. 2 min to run on GPU).
Simulations: Run ./run_simulations.sh (approx. 20 min to run 10000 replicates on GPU). If necessary, replicates can be configured with first argument to the script.
Figures: Run ./gen_figures.sh (as in the previous section).

NOTES:

GPU support for JAX is recommended; CPU backends can be alternatively be used, but may take significantly longer to execute.

The manage.py script manages all data scripts (see -h option for more information on what parameters can be configured for experiments).

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Beanstalk Data and Processing Artifacts

Environment Setup

Generating Figures (From Pre-Processed Data)

Reproducing Results from Raw Data

About

Uh oh!

Releases 7

Packages

Uh oh!

Contributors 2

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 50 Commits
code_examples		code_examples
plot		plot
tools		tools
.gitignore		.gitignore
ARTIFACT-OVERVIEW.md		ARTIFACT-OVERVIEW.md
DATASET-EXPLAINER.md		DATASET-EXPLAINER.md
Dockerfile		Dockerfile
Dockerfile.cpu		Dockerfile.cpu
Makefile		Makefile
README.md		README.md
gen_data.sh		gen_data.sh
gen_figures.sh		gen_figures.sh
manage.py		manage.py
run_simulations.sh		run_simulations.sh
summarize.sh		summarize.sh

arjunr2/beanstalk

Folders and files

Latest commit

History

Repository files navigation

Beanstalk Data and Processing Artifacts

Environment Setup

Generating Figures (From Pre-Processed Data)

Reproducing Results from Raw Data

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases 7

Packages 0

Uh oh!

Contributors 2

Uh oh!

Languages

Packages