holarissun
🎯 Focusing

Python 75 7 Updated Apr 27, 2024

Projects related to Annual Computer Poker Competition

C 15 11 Updated Sep 19, 2016

OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.

C++ 4,932 1,063 Updated Dec 21, 2025

A Collection of Competitive Text-Based Games for Language Model Evaluation and Reinforcement Learning

Python 331 77 Updated Oct 29, 2025

🤖 An Open Source Texas Hold'em AI

Python 338 74 Updated Oct 22, 2023

Poker-Hand-Evaluator: An efficient poker hand evaluation algorithm and its implementation, supporting 7-card poker and Omaha poker evaluation

C 478 104 Updated Nov 25, 2025

BibTool is a tool for manipulating BibTeX databases. BibTeX provides a means to integrate citations into LaTeX documents. BibTool allows the manipulation of BibTeX files which goes beyond the possi…

C 232 32 Updated Sep 15, 2025

Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.

Python 3,290 261 Updated Dec 27, 2025

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 4,334 334 Updated Dec 24, 2025

Partially Observable Process Gym

Python 209 17 Updated Jun 12, 2025
Python 345 20 Updated Jul 29, 2025
Python 52 3 Updated Oct 2, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 17,839 2,917 Updated Dec 27, 2025
Python 9 2 Updated Dec 4, 2025
172 60 Updated Aug 26, 2020

SimPER: A Minimalist Approach to Preference Alignment without Hyperparameters (ICLR 2025)

Python 15 Updated Aug 22, 2025

Hackable and optimized Transformers building blocks, supporting a composable construction.

Python 10,228 751 Updated Dec 25, 2025

Active reward modeling with last layer Fisher Information (ICML'25)

Python 7 Updated Jul 9, 2025

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 25,906 1,818 Updated Oct 13, 2025

This repo is meant to serve as a guide for Machine Learning/AI technical interviews.

Jupyter Notebook 7,374 1,335 Updated Nov 28, 2025

Codebase for Paper Reusing Embeddings: Reproducible Reward Model Research in Large Language Model Alignment without GPUs

Python 21 2 Updated Apr 24, 2025

Self-playing Adversarial Language Game Enhances LLM Reasoning, NeurIPS 2024

Python 142 24 Updated Feb 24, 2025

Reusable BatchBALD implementation

Jupyter Notebook 79 15 Updated Feb 28, 2024

OpenDILab Decision AI Engine. The Most Comprehensive Reinforcement Learning Framework B.P.

Python 3,562 421 Updated Dec 7, 2025

AlphaFold 3 inference pipeline.

Python 7,372 1,050 Updated Dec 25, 2025

RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RN…

Python 14,257 982 Updated Dec 19, 2025

Official implementation of the ICLR 2025 paper: Rethinking Bradley-Terry Models in Preference-based Reward Modeling: Foundations, Theory, and Alternatives

Python 70 4 Updated Apr 2, 2025