
nslyubaykin
  • Moscow/Russia

Organizations

@dunnolab


Vintix: Action Model via In-Context Reinforcement Learning (ICML 2025)

Python 43 3 Updated May 23, 2025

Awesome In-Context RL: A curated list of In-Context Reinforcement Learning

251 14 Updated Sep 8, 2025

Public Baseline for AIJ Multi-Agent RL Contest

Jupyter Notebook 3 2 Updated Sep 12, 2024

Simulator for AIJ Multi-Agent RL Contest

Python 4 2 Updated Sep 9, 2024

High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC

Python 602 37 Updated Feb 10, 2024

ReLAx - Reinforcement Learning Applications Library

Python 15 1 Updated Feb 19, 2023

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 39,756 6,888 Updated Nov 10, 2025

Seminar on how to use transformers library for text generation task

Jupyter Notebook 3 Updated May 3, 2022

(Linear-chain) Conditional random field in PyTorch.

Python 969 153 Updated Jun 9, 2024

High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC

Python 1,273 161 Updated Aug 3, 2023