Horizon RL

Building long-horizon AI agents

Pinned

  1. strands-sglang Public

    SGLang model provider for Strands Agents for on-policy agentic RL training.

    Python 48 4

  2. Think-RM Public

    [NeurIPS 2025] Think-RM: Enabling Long-Horizon Reasoning in Generative Reward Models

    Python 16 1

  3. strands-env Public

    Standardizing environment infrastructure with Strands Agents — step, observe, reward.

    Python 41 7

  4. uncertainty-router Public

    [NeurIPS 2025] Ask a Strong LLM Judge when Your Reward Model is Uncertain

    Python 7

  5. OpenKimi Public

    Reproduction of the Kimi K1.5/K2 RL algorithm and rollout system.

    Python 14 2

Repositories

Showing 7 of 7 repositories

