wh-forker
Popular repositories
- PPO-PyTorch-4 Public Forked from nikhilbarhate99/PPO-PyTorch
Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
Python 5
- MetaNN Public Forked from yhqjohn/MetaNN
MetaModule provides extensions of PyTorch Module for meta learning
Python 3
- brawlcord Public Forked from brawlcord/brawlcord
A Discord bot to play a simplified version of the game Brawl Stars, developed by Supercell.
Python 1
- CVPR2021-Competition-Unrestricted-Adversarial-Attacks-on-ImageNet Public Forked from qilong-zhang/CVPR2021-Competition-Unrestricted-Adversarial-Attacks-on-ImageNet
6th-place solution by team "green hand" for CVPR-2021 AIC-VI: Unrestricted Adversarial Attacks on ImageNet
Python 1
- open_brats2020 Public Forked from lescientifik/open_brats2020
Top-10 BraTS 2020 solution
Python 1
Repositories
- Awesome-RL-for-LRMs Public Forked from TsinghuaC3I/Awesome-RL-for-LRMs
A Survey of Reinforcement Learning for Large Reasoning Models
- Vision-Zero Public Forked from wangqinsi1/Vision-Zero
This is the official Python version of Vision-Zero: Scalable VLM Self-Improvement via Strategic Gamified Self-Play.
- GRIT Public Forked from eric-ai-lab/GRIT
Official code for NeurIPS 2025 paper "GRIT: Teaching MLLMs to Think with Images"
- Vision-R1 Public Forked from Osilly/Vision-R1
This is the first paper to explore how to effectively use R1-like RL for MLLMs and introduce Vision-R1, a reasoning MLLM that leverages cold-start initialization and RL training to incentivize reasoning capability.
- Pixel-Reasoner Public Forked from TIGER-AI-Lab/Pixel-Reasoner
Pixel-Level Reasoning Model trained with RL [NeurIPS25]
- Awesome-RL-Reasoning-Recipes Public Forked from TsinghuaC3I/Awesome-RL-for-LRMs
Awesome RL Reasoning Recipes ("Triple R")