Thanks to visit codestin.com
Credit goes to Github.com

Skip to content
View rajcscw's full-sized avatar

Highlights

  • Pro

Organizations

@dashifyML @fluidml

Block or report rajcscw

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, and more!

Python 8,354 672 Updated Jan 28, 2026

Recipes to scale inference-time compute of open models

Python 1,124 131 Updated May 22, 2025

Open One-Stop Moderation Tools for Safety Risks, Jailbreaks, and Refusals of LLMs

Python 104 12 Updated Dec 2, 2024

MoRA: High-Rank Updating for Parameter-Efficient Fine-Tuning

Python 363 24 Updated Aug 7, 2024

Dateset Reset Policy Optimization

Python 31 2 Updated Apr 12, 2024

Structured Outputs

Python 13,317 659 Updated Jan 23, 2026

Manage scalable open LLM inference endpoints in Slurm clusters

Python 279 27 Updated Jul 11, 2024
Python 128 9 Updated Feb 6, 2024

A beautiful, simple, clean, and responsive Jekyll theme for academics

HTML 14,982 12,771 Updated Jan 27, 2026

Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models

Python 3,193 619 Updated Jul 19, 2024

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 20,535 2,171 Updated Jan 27, 2026

Supercharge Your Model Training

Python 5,457 461 Updated Nov 12, 2025

FluidML is a lightweight framework for developing machine learning pipelines.

Python 22 3 Updated Oct 20, 2023

Evaluate your dialog model with 17 metrics! (see paper)

Python 97 19 Updated Aug 7, 2020

A list of semi to fully remote-friendly companies (jobs) in tech.

Nunjucks 39,966 3,922 Updated Jan 23, 2026

Shared repository for open-sourced projects from the Google AI Language team.

Python 1,744 358 Updated Jan 22, 2026

[NeurIPS 2022] 🛒WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents

Python 474 93 Updated Sep 6, 2024

Research code for pixel-based encoders of language (PIXEL)

Python 345 39 Updated Jul 15, 2025

Simple (but often Strong) Baselines for POMDPs in PyTorch, ICML 2022

Python 341 48 Updated Aug 22, 2024

ZenML 🙏: One AI Platform from Pipelines to Agents. https://zenml.io.

Python 5,177 570 Updated Jan 27, 2026

Containers for machine learning

Go 9,217 655 Updated Jan 28, 2026

arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv

Python 6,668 382 Updated Jun 2, 2025

A Real-World Benchmark for Reinforcement Learning based Recommender System

Python 232 29 Updated Feb 3, 2024

Automatic metrics for GEM tasks

Python 67 20 Updated Oct 25, 2022

Active Imitation Learing with Noisy Guidance

Python 10 4 Updated May 29, 2020

MetaDict is a powerful dict subclass enabling (nested) attribute-style item access/assignment and IDE autocompletion support.

Python 38 Updated Jul 19, 2025

Python Implementation of Reinforcement Learning: An Introduction

Python 14,526 4,969 Updated Aug 9, 2024

AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data…

MDX 24,000 2,577 Updated Jan 27, 2026

Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk

C++ 14,136 1,220 Updated Oct 29, 2025
Next