sam-paech

Sam Paech sam-paech

Independent AI tinkerer

Achievements

Provider-agnostic, open-source evaluation infrastructure for language models

Python 634 74 Updated Oct 27, 2025

Python 19 Updated Jun 18, 2025

Official repo for Learning to Reason for Long-Form Story Generation

Python 72 10 Updated Apr 19, 2025

An easy-to-understand framework for LLM samplers that rewind and revise generated tokens

Python 145 10 Updated Feb 20, 2025

Forked from EleutherAI/lm-evaluation-harness

A framework for few-shot evaluation of language models.

Python 2 Updated Mar 5, 2024

[ACL24] EmoBench: Evaluating the Emotional Intelligence of Large Language Models

Python 96 4 Updated May 16, 2025

A benchmark for emotional intelligence in large language models

Python 370 23 Updated Jul 26, 2024