A minimal, self-hosted utility for Are.na built with HTMX, FastAPI, and Wget.
- Websites are archived as HTML and WARC files.
- If no WARC filename is provided, the Are.na block ID will be used.
- Archives are saved in:
/tmp/arena_archiveswhen running with Docker./tmp(inside the project repository) when running locally.
- Mirror → creates a 1:1 copy of the entire website.
- Snapshot → archives only a single page.
- Use this tool to preserve old web content that might disappear one day.
- It’s intentionally minimal—expect the bare essentials.
- Some sites may cause infinite loops when mirroring (this error is not currently handled).
- For very large websites and archives, consider using a more robust archiving tool.
- Clone repo
- Have Python and and UV package manager and WGET installed on your machine
- Start the server using
make server - Access on http://127.0.0.1:8000
- See docs at http://127.0.0.1:8000/docs
- run
make docker-create-volume - run
make docker-build - run
make docker-run - Access on http://127.0.0.1:8000
- See docs at http://127.0.0.1:8000/docs
See https://github.com/pivilartisant/arena-web-archiver/issues
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.