
sam-paech

Organizations

@EQ-bench


Provider-agnostic, open-source evaluation infrastructure for language models

Python 634 74 Updated Oct 27, 2025
Python 19 Updated Jun 18, 2025

Official repo for Learning to Reason for Long-Form Story Generation

Python 72 10 Updated Apr 19, 2025

An easy-to-understand framework for LLM samplers that rewind and revise generated tokens

Python 145 10 Updated Feb 20, 2025

A framework for few-shot evaluation of language models.

Python 2 Updated Mar 5, 2024

[ACL24] EmoBench: Evaluating the Emotional Intelligence of Large Language Models

Python 96 4 Updated May 16, 2025

A benchmark for emotional intelligence in large language models

Python 370 23 Updated Jul 26, 2024