Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View tongyu0924's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report tongyu0924

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

PMC-Patients: A Large-scale Dataset of Patient Summaries and Relations for Benchmarking Retrieval-based Clinical Decision Support Systems.

Python 70 2 Updated Dec 20, 2023

DiffusionNFT: Online Diffusion Reinforcement with Forward Process

Python 346 9 Updated Sep 22, 2025

A curated collection of papers on portrait style transfer

27 3 Updated Oct 11, 2025

GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models

Python 3,071 311 Updated Oct 11, 2025

A large-scale (194k), Multiple-Choice Question Answering (MCQA) dataset designed to address realworld medical entrance exam questions.

Jupyter Notebook 236 33 Updated Nov 28, 2022

Code and data for MedQA

Python 324 25 Updated Dec 1, 2022

Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.

Python 5,786 314 Updated Sep 30, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 14,722 2,346 Updated Oct 25, 2025

An Awesome List of Agentic Model trained with Reinforcement Learning

HTML 522 16 Updated Oct 13, 2025

[EMNLP 2025] Med-PRM: Medical Reasoning Models with Stepwise, Guideline-verified Process Rewards

Python 45 2 Updated Sep 15, 2025

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.

Python 47,384 3,876 Updated Oct 23, 2025

MIMIC Code Repository: Code shared by the research community for the MIMIC family of databases

Jupyter Notebook 3,012 1,632 Updated Jul 1, 2025

《Designing Data-Intensive Application》DDIA 第一版 / 第二版 中文翻译

Python 22,017 4,440 Updated Sep 24, 2025

PubMedQA: A Dataset for Biomedical Research Question Answering

Python 373 51 Updated Apr 18, 2023

[EMNLP2025] "LightRAG: Simple and Fast Retrieval-Augmented Generation"

Python 22,099 3,304 Updated Oct 25, 2025

Reference implementation for DPO (Direct Preference Optimization)

Python 2,764 230 Updated Aug 11, 2024

[arxiv'25] MedAgentGYM: Training LLM Agents for Code-Based Medical Reasoning at Scale

Python 62 3 Updated Aug 5, 2025

LLM search engine faster than perplexity!

TypeScript 365 47 Updated Aug 19, 2025

Comprehensive toolkit for Reinforcement Learning from Human Feedback (RLHF) training, featuring instruction fine-tuning, reward model training, and support for PPO and DPO algorithms with various c…

Python 174 19 Updated Mar 18, 2024

MedReason: Eliciting Factual Medical Reasoning Steps in LLMs via Knowledge Graphs

Python 231 19 Updated Jun 19, 2025

A Graph RAG System for Evidenced-based Medical Information Retrieval [ACL 2025]

Python 621 105 Updated Oct 18, 2025

Awesome papers about unifying LLMs and KGs

2,491 172 Updated May 2, 2025
Python 220 38 Updated Sep 19, 2025

Official Implementation of ICML 2025 Paper: "Graph-constrained Reasoning: Faithful Reasoning on Knowledge Graphs with Large Language Models".

Python 167 14 Updated May 20, 2025

每个人都能看懂的大模型知识分享,LLMs春/秋招大模型面试前必看,让你和面试官侃侃而谈

Jupyter Notebook 4,583 446 Updated Oct 13, 2025

free and open OpenAI Deep Research

Python 685 90 Updated Feb 18, 2025

Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM

Python 7,869 682 Updated Oct 11, 2025

A curated list of SLAM resources

1,027 157 Updated Oct 13, 2023

[CVPR 2024] BIVDiff: A Training-free Framework for General-Purpose Video Synthesis via Bridging Image and Video Diffusion Models

Python 75 2 Updated Sep 11, 2024
Next