-
Westlake University
- Hang Zhou, Zhe Jiang, China
-
11:49
(UTC +08:00) - https://akawincent.github.io/
- https://www.zhihu.com/people/wincent-84
- @pu_wen99907
Lists (9)
Sort Name ascending (A-Z)
Stars
A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.
Complete Claude Code configuration collection - agents, skills, hooks, commands, rules, MCPs. Battle-tested configs from an Anthropic hackathon winner.
The official repo for "SpatialBot: Precise Spatial Understanding with Vision Language Models.
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.
Multi-agent orchestration workflow (Claude Code Codex Gemini OpenCode)
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Public code for XFactor: Introduces the first geometry-free model to achieve true self-supervised / pose-free Novel View Synthesis (NVS) by learning transferable latent camera pose representations.
"E-RayZer: Self-supervised 3D Reconstruction as Spatial Visual Pre-training" official implementation.
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3, Qwen3-MoE, DeepSeek-R1, GLM4.5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, …
Code for ICCV'2025 (Best student paper honorable mention) "RayZer: A Self-supervised Large View Synthesis Model"
Examples and guides for using the Gemini API
When do we not need larger vision models?
verl: Volcano Engine Reinforcement Learning for LLMs
"MoCA: Mixture-of-Components Attention for Scalable Compositional 3D Generation"
Godot Engine – Multi-platform 2D and 3D game engine
Part-X-MLLM: Part-aware 3D Multimodal Large Language Model
[NeurIPS 2025]"DynamicVerse: A Physically-Aware Multimodal Framework for 4D World Modeling"
The repository provides code for running inference and finetuning with the Meta Segment Anything Model 3 (SAM 3), links for downloading the trained model checkpoints, and example notebooks that sho…
The repository provides code for running inference with the SAM 3D Body Model (3DB), links for downloading the trained model checkpoints and datasets, and example notebooks that show how to use the…
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Unified framework for robot learning built on NVIDIA Isaac Sim