Starred repositories
Machine Learning Engineering Open Book
Data manipulation and transformation for audio signal processing, powered by PyTorch
GPU programming related news and material links
Fast and memory-efficient exact attention
Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞
Portfolio analytics for quants, written in Python
FlagGems is an operator library for large language models implemented in the Triton Language.
⚡️ Lightning-fast backtesting engine to find your trading edge
Python Backtesting library for trading strategies
[CVPR 2026] Official implementation of BiCo: Composing Concepts from Images and Videos via Concept-prompt Binding
A feature-rich command-line audio/video downloader
shihaobai / ms-swift
Forked from modelscope/ms-swift
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3, Qwen3-MoE, DeepSeek-R1, GLM4.5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, …
Nsight Python is a Python kernel profiling interface based on NVIDIA Nsight Tools
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.5, DeepSeek-R1, GLM4.5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, Phi4, …
Train speculative decoding models effortlessly and port them smoothly to SGLang serving.
cuTile is a programming model for writing parallel kernels for NVIDIA GPUs
[ICLR2025, ICML2025, NeurIPS2025 Spotlight] Quantized Attention achieves speedup of 2-5x compared to FlashAttention, without losing end-to-end metrics across language, image, and video models.
🤗 A PyTorch-native and Flexible Inference Engine with Hybrid Cache Acceleration and Parallelism for DiTs.
A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")
Complete solutions to Programming Massively Parallel Processors, 4th Edition
LightTTS is a lightweight TTS inference framework optimized for CosyVoice2 and CosyVoice3, enabling fast and scalable speech synthesis in Python with support for stream and bistream modes.
MiMo: Unlocking the Reasoning Potential of Language Model – From Pretraining to Posttraining
slime is an LLM post-training framework for RL Scaling.
A unified inference and post-training framework for accelerated video generation.
xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism