rajcscw

Rajkumar Ramamurthy rajcscw

Research Scientist

47 followers · 43 following

Germany
@rajkumar_rrk

Achievements

x2 x2

Achievements

x2 x2

Highlights

Organizations

Stars

OpenPipe / ART

Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, and more!

Python 8,354 672 Updated Jan 28, 2026

huggingface / search-and-learn

Recipes to scale inference-time compute of open models

Python 1,124 131 Updated May 22, 2025

allenai / wildguard

Open One-Stop Moderation Tools for Safety Risks, Jailbreaks, and Refusals of LLMs

Python 104 12 Updated Dec 2, 2024

kongds / MoRA

MoRA: High-Rank Updating for Parameter-Efﬁcient Fine-Tuning

Python 363 24 Updated Aug 7, 2024

Cornell-RL / drpo

Dateset Reset Policy Optimization

Python 31 2 Updated Apr 12, 2024

dottxt-ai / outlines

Structured Outputs

Python 13,317 659 Updated Jan 23, 2026

huggingface / llm-swarm

Manage scalable open LLM inference endpoints in Slurm clusters

Python 279 27 Updated Jul 11, 2024

Cornell-RL / tril

Python 128 9 Updated Feb 6, 2024

alshedivat / al-folio

A beautiful, simple, clean, and responsive Jekyll theme for academics

HTML 14,982 12,771 Updated Jan 27, 2026

ffaltings / InteractiveTextGeneration

Python 34 5 Updated Mar 25, 2023

google / BIG-bench

Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models

Python 3,193 619 Updated Jul 19, 2024

huggingface / peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 20,535 2,171 Updated Jan 27, 2026

mosaicml / composer

Supercharge Your Model Training

Python 5,457 461 Updated Nov 12, 2025

fluidml / fluidml

FluidML is a lightweight framework for developing machine learning pipelines.

Python 22 3 Updated Oct 20, 2023

ricsinaruto / dialog-eval

Evaluate your dialog model with 17 metrics! (see paper)

Python 97 19 Updated Aug 7, 2020

remoteintech / remote-jobs

A list of semi to fully remote-friendly companies (jobs) in tech.

Nunjucks 39,966 3,922 Updated Jan 23, 2026

google-research / language

Shared repository for open-sourced projects from the Google AI Language team.

Python 1,744 358 Updated Jan 22, 2026

princeton-nlp / WebShop

[NeurIPS 2022] 🛒WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents

Python 474 93 Updated Sep 6, 2024

xplip / pixel

Research code for pixel-based encoders of language (PIXEL)

Python 345 39 Updated Jul 15, 2025

twni2016 / pomdp-baselines

Simple (but often Strong) Baselines for POMDPs in PyTorch, ICML 2022

Python 341 48 Updated Aug 22, 2024

zenml-io / zenml

ZenML 🙏: One AI Platform from Pipelines to Agents. https://zenml.io.

Python 5,177 570 Updated Jan 27, 2026

replicate / cog

Containers for machine learning

Go 9,217 655 Updated Jan 28, 2026

google-research / arxiv-latex-cleaner

arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv

Python 6,668 382 Updated Jun 2, 2025

fuxiAIlab / RL4RS

A Real-World Benchmark for Reinforcement Learning based Recommender System

Python 232 29 Updated Feb 3, 2024

GEM-benchmark / GEM-metrics

Automatic metrics for GEM tasks

Python 67 20 Updated Oct 25, 2022

xkianteb / leaqi

Active Imitation Learing with Noisy Guidance

Python 10 4 Updated May 29, 2020

LarsHill / metadict

MetaDict is a powerful dict subclass enabling (nested) attribute-style item access/assignment and IDE autocompletion support.

Python 38 Updated Jul 19, 2025

ShangtongZhang / reinforcement-learning-an-introduction

Python Implementation of Reinforcement Learning: An Introduction

Python 14,526 4,969 Updated Aug 9, 2024

deepset-ai / haystack

AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data…

MDX 24,000 2,577 Updated Jan 27, 2026

spotify / annoy

Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk

C++ 14,136 1,220 Updated Oct 29, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Rajkumar Ramamurthy rajcscw

Achievements

Achievements

Highlights

Organizations

Block or report rajcscw

Stars

OpenPipe / ART

huggingface / search-and-learn

allenai / wildguard

kongds / MoRA

Cornell-RL / drpo

dottxt-ai / outlines

huggingface / llm-swarm

Cornell-RL / tril

alshedivat / al-folio

ffaltings / InteractiveTextGeneration

google / BIG-bench

huggingface / peft

mosaicml / composer

fluidml / fluidml

ricsinaruto / dialog-eval

remoteintech / remote-jobs

google-research / language

princeton-nlp / WebShop

xplip / pixel

twni2016 / pomdp-baselines

zenml-io / zenml

replicate / cog

google-research / arxiv-latex-cleaner

fuxiAIlab / RL4RS

GEM-benchmark / GEM-metrics

xkianteb / leaqi

LarsHill / metadict

ShangtongZhang / reinforcement-learning-an-introduction

deepset-ai / haystack

spotify / annoy