Thanks to visit codestin.com
Credit goes to Github.com

Skip to content
@lmarena

LMArena

An open platform to evaluate, benchmark, compare, and test frontier AI models

Popular repositories Loading

  1. arena-hard-auto arena-hard-auto Public

    Arena-Hard-Auto: An automatic LLM benchmark.

    Python 973 137

  2. copilot-arena copilot-arena Public

    TypeScript 345 25

  3. p2l p2l Public

    Prompt-to-Leaderboard

    Python 269 24

  4. PPE PPE Public

    Jupyter Notebook 60 12

  5. arena-rank arena-rank Public

    Source Code of LMArena Leaderboard Methodology

    Python 48 3

  6. search-arena search-arena Public

    ⚔️ Official code of "Search Arena: Analyzing Search-Augmented LLMs".

    Jupyter Notebook 46 7

Repositories

Showing 10 of 11 repositories

Most used topics

Loading…