Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View sanowl's full-sized avatar
👽
👽

Block or report sanowl

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

verl: Volcano Engine Reinforcement Learning for LLMs

Python 14,673 2,339 Updated Oct 24, 2025
Python 1 Updated Oct 23, 2025
Python 12 Updated Oct 15, 2025

Lock, Stock, and Two Smoking MicroVMs. Create and manage the lifecycle of MicroVMs backed by containerd.

Go 1,151 52 Updated Sep 22, 2025

Official PyTorch implementation for ICLR2025 paper "Scaling up Masked Diffusion Models on Text"

Python 326 23 Updated Dec 22, 2024

Official implementation of "Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding"

Python 590 51 Updated Oct 23, 2025

Official Repository of "Learning to Reason under Off-Policy Guidance"

Python 351 40 Updated Oct 4, 2025

RLP: Reinforcement as a Pretraining Objective

192 13 Updated Oct 5, 2025

The Agentic Commerce Protocol (ACP) is an interaction model and open standard for connecting buyers, their AI agents, and businesses to complete purchases seamlessly. The specification is currently…

798 90 Updated Oct 3, 2025

CE-GPPO: Controlling Entropy via Gradient-Preserving Clipping Policy Optimization in Reinforcement Learning

Python 10 Updated Oct 10, 2025
Python 53 4 Updated Oct 9, 2025

[NeurIPS'25] HyRF: Hybrid Radiance Fields for Efficient and High-quality Novel View Synthesis

58 3 Updated Sep 24, 2025

MAPO: MIXED ADVANTAGE POLICY OPTIMIZATION

Python 38 Updated Sep 24, 2025

Code for Evolving Language Models without Labels: Majority Drives Selection, Novelty Promotes Variation (EVOL-RL).

Python 39 4 Updated Oct 16, 2025

This repository contains the official implementation of "FastVLM: Efficient Vision Encoding for Vision Language Models" - CVPR 2025

Python 6,800 475 Updated May 5, 2025

Trio – a friendly Python library for async concurrency and I/O

Python 6,912 371 Updated Oct 20, 2025

🌐 Make websites accessible for AI agents. Automate tasks online with ease.

Python 71,688 8,491 Updated Oct 24, 2025

[EMNLP 2025] WebAgent-R1: Training Web Agents via End-to-End Multi-Turn Reinforcement Learning

Python 48 2 Updated Oct 23, 2025

laptop

Shell 2,550 83 Updated Sep 1, 2025

NVIDIA Isaac GR00T N1.5 - A Foundation Model for Generalist Robots.

Jupyter Notebook 5,100 792 Updated Oct 13, 2025

My C++ solutions for LeetCode questions.

C++ 148 41 Updated Jul 23, 2023

Official Repo for SvS: A Self-play with Variational Problem Synthesis strategy for RLVR training

Python 39 3 Updated Aug 25, 2025

The core CUA for Project Navi

Python 2 1 Updated Jun 10, 2025

The Cursor for Writing

TypeScript 271 48 Updated Oct 23, 2025

An algorithm that implements intelligence based on a Method pool (a collection containing multiple types of functions). 一种基于方法池(包含多种类型的函数的集合)实现智能的算法

Python 35 Updated Oct 21, 2025

Geospatial Mechanistic Interpretability of Large Language Models

Jupyter Notebook 8 Updated May 12, 2025

A library for mechanistic interpretability of GPT-style language models

Python 2,686 460 Updated Oct 23, 2025

A repository for awesome resources in mechanistic interpretability

8 1 Updated Jan 18, 2023

Lime: Explaining the predictions of any machine learning classifier

JavaScript 12,022 1,849 Updated Jul 25, 2024
Next