
kooWZ
  • Fudan University
  • Shanghai, China
  • 11:04 (UTC +08:00)

Highlights

  • Pro

Organizations

@FDUCSLG @DanXi-Dev


A Python module to repair invalid JSON from LLMs

Python 3,588 146 Updated Oct 22, 2025

Official PyTorch Implementation of "Diffusion Transformers with Representation Autoencoders"

Python 1,301 28 Updated Oct 15, 2025
5 Updated Nov 30, 2022

[ICCV 2025] SimVQ: Addressing Representation Collapse in Vector Quantized Models with One Linear Layer

Python 300 7 Updated Dec 29, 2024

Exercises and projects for Jane Street's OCaml Workshop

OCaml 654 170 Updated Apr 4, 2022

[NeurIPS 2025] TTRL: Test-Time Reinforcement Learning

Python 870 64 Updated Sep 26, 2025

An implementation of the Muon optimizer in PyTorch featuring the latest research improvements.

Python 5 Updated Oct 15, 2025
Python 25 1 Updated Apr 26, 2023

MiniCPM-V 4.5: A GPT-4o Level MLLM for Single Image, Multi Image and High-FPS Video Understanding on Your Phone

Python 22,112 1,657 Updated Sep 24, 2025

OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling

Python 377 6 Updated Oct 15, 2025

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Python 1,880 91 Updated Aug 15, 2024

Implementation of the MuonClip optimizer in PyTorch/JAX based on the Kimi K2 technical report

Python 5 Updated Jul 24, 2025

Flash-Muon: An Efficient Implementation of Muon Optimizer

Python 197 13 Updated Jun 15, 2025

Muon is an optimizer for hidden layers in neural networks

Python 1,918 91 Updated Jul 12, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 14,628 2,328 Updated Oct 23, 2025

Run corplink(feilian) in a container

Shell 42 15 Updated Oct 15, 2025

Pixel-Space Generative Models

Python 272 15 Updated May 11, 2025

FlexTok: Resampling Images into 1D Token Sequences of Flexible Length

Jupyter Notebook 256 12 Updated Jun 2, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 60,630 7,344 Updated Oct 22, 2025

VoCoT: Unleashing Visually Grounded Multi-Step Reasoning in Large Multi-Modal Models

Python 75 1 Updated Jul 13, 2024

An open-source implementation for fine-tuning Qwen-VL series by Alibaba Cloud.

Python 1,295 161 Updated Oct 22, 2025

Efficient Triton Kernels for LLM Training

Python 5,767 418 Updated Oct 20, 2025

A minimal codebase for finetuning large multimodal models, supporting llava-1.5/1.6, llava-interleave, llava-next-video, llava-onevision, llama-3.2-vision, qwen-vl, qwen2-vl, phi3-v etc.

Python 345 38 Updated Feb 25, 2025

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

7,924 285 Updated May 15, 2025
Rust 5 Updated Sep 26, 2018

Official PyTorch implementation for "Large Language Diffusion Models"

Python 3,088 207 Updated Oct 21, 2025
Jupyter Notebook 1,055 132 Updated Sep 18, 2024

Aggregates sources from many providers, self-checked and self-verified, for personal use

244 15 Updated Jul 12, 2025