Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View ShareableXue's full-sized avatar

Block or report ShareableXue

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Popular repositories Loading

  1. sudoku_trl_grpo sudoku_trl_grpo Public

    Forked from 828Tina/sudoku_trl_grpo

    基于trl框架对Qwen模型做grpo训练,从而完成4*4数独游戏的训练任务

    Python

  2. trl trl Public

    Forked from huggingface/trl

    Train transformer language models with reinforcement learning.

    Python