We help AI engineers and ML researchers evaluate their models and build environments!
This quickstart will allow you to:
- Pull down an evaluation set
- Initialize an eval environment on our hosted infra
- Run an agent through a task on the environment
- Evaluate the environment
- Close the remote environment
- View the trajectory and telemetry (steps, tool calls, score, and logs) in our trace viewer
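
The loop in the list above can be pictured with a toy, self-contained sketch. Every name here (`ToyEnvironment`, `ToyAgent`, `run_task`) is hypothetical and stands in for the real SDK objects. It is not the hud-python API, and quickstart.py remains the reference for the actual calls.

```python
import asyncio
from dataclasses import dataclass, field


@dataclass
class ToyEnvironment:
    """Hypothetical stand-in for a remote eval environment (not the hud-python API)."""
    target: int
    value: int = 0
    closed: bool = False
    trajectory: list = field(default_factory=list)

    async def step(self, action: int) -> int:
        # Apply the action and record it so the run can be inspected later.
        self.value += action
        self.trajectory.append({"action": action, "observation": self.value})
        return self.value

    async def evaluate(self) -> float:
        # Score 1.0 if the task goal was reached, else 0.0.
        return 1.0 if self.value == self.target else 0.0

    async def close(self) -> None:
        self.closed = True


class ToyAgent:
    """Hypothetical agent: moves toward the target one unit at a time."""

    def predict(self, observation: int, target: int) -> tuple[int, bool]:
        if observation == target:
            return 0, True
        return (1 if observation < target else -1), False


async def run_task(task: dict, max_steps: int = 10) -> dict:
    env = ToyEnvironment(target=task["target"])  # initialize the environment
    agent = ToyAgent()
    try:
        observation = env.value
        for _ in range(max_steps):  # run the agent through the task
            action, done = agent.predict(observation, env.target)
            if done:
                break
            observation = await env.step(action)
        score = await env.evaluate()  # evaluate the environment
    finally:
        await env.close()  # always release the environment, even on errors
    return {"score": score, "steps": len(env.trajectory), "closed": env.closed}


if __name__ == "__main__":
    evalset = [{"target": 3}]  # stand-in for a pulled evaluation set
    print(asyncio.run(run_task(evalset[0])))
```

Closing the environment in a `finally` block is a design choice of this sketch: it keeps a failed agent step from leaving a remote environment running.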

This quickstart contains:
- pyproject.toml - dependencies for the hud-python SDK
- quickstart.py - a simple walkthrough of the full agent loop
- .env.example - a template for your environment variables
- README.md - this file

To get started:
- Sign up at app.hud.so
- Get your API key from app.hud.so/project/api-keys
- Configure environment variables: copy .env.example to .env and add your:
  - HUD API key
  - OpenAI API key
  - Claude API key
- Run the quickstart:
cd quickstart && uv run quickstart.py
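
If the run fails on authentication, a small check like the one below can confirm that the keys from the configuration step are present. It is only a sketch: the variable names `HUD_API_KEY`, `OPENAI_API_KEY`, and `ANTHROPIC_API_KEY` are assumptions, so use whatever names .env.example actually lists. The helper functions are illustrative and not part of the quickstart.

```python
import os
from pathlib import Path

# Assumed variable names; check .env.example for the names the quickstart really uses.
REQUIRED_KEYS = ("HUD_API_KEY", "OPENAI_API_KEY", "ANTHROPIC_API_KEY")


def parse_env_file(path: str) -> dict:
    """Parse simple KEY=VALUE lines, skipping blank lines and # comments."""
    values = {}
    env_path = Path(path)
    if not env_path.exists():
        return values
    for raw in env_path.read_text().splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Drop surrounding whitespace and optional quotes around the value.
        values[key.strip()] = value.strip().strip('"').strip("'")
    return values


def missing_keys(path: str = ".env", required=REQUIRED_KEYS) -> list:
    """Return required keys that are empty or unset in both the file and the process environment."""
    file_values = parse_env_file(path)
    return [k for k in required if not (file_values.get(k) or os.environ.get(k))]


if __name__ == "__main__":
    missing = missing_keys()
    print("All keys set." if not missing else f"Missing: {', '.join(missing)}")
```

Run it from the directory that holds your .env file. It reads only the file and the current process environment and never prints key values.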