Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View vanzll's full-sized avatar

Block or report vanzll

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. Uni-RLHF-Platform Uni-RLHF-Platform Public

    Forked from pickxiguapi/Uni-RLHF-Platform

    Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024)

    Python

  2. EBC EBC Public

    [ICML'25] Diversifying Policy Behaviors via Extrinsic Behavioral Curiosity

    Python 14

  3. Johnny221B/OSCAR Johnny221B/OSCAR Public

    This is official github for our paper

    Python 3

  4. FM_IRL FM_IRL Public

    Official pytorch Implementation of FM-IRL.

    Python 4

  5. acodercat/cave-agent acodercat/cave-agent Public

    Stateful runtime management for LLM agents—inject, manipulate, and retrieve Python objects across turns.

    Python 73 2

  6. bennidict23/GoRL bennidict23/GoRL Public

    An Algorithm-Agnostic Framework for Online Reinforcement Learning with Generative Policies

    Python 21