Adaptive Incident Choreographer (AIC): an OpenEnv incident-response environment (FastAPI) with verifiable rewards + 0–1 task graders, trained via TRL GRPO + Unsloth (real Colab T4 run).
Updated Apr 26, 2026 - Python