- Bay Area, CA
- UTC -08:00
- https://zhyncs.com
- https://orcid.org/0009-0006-7743-2508
- @zhyncs42
Stars
SkyRL: A Modular Full-stack RL Library for LLMs
Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models
CUDA Tile IR is an MLIR-based intermediate representation and compiler infrastructure for CUDA kernel optimization, focusing on tile-based computation patterns and optimizations targeting NVIDIA te…
A TUI-based utility for real-time monitoring of InfiniBand traffic and performance metrics on the local node
Lightweight coding agent that runs in your terminal
Helpful kernel tutorials and examples for tile-based GPU programming
A high-performance inference engine for LLMs, optimized for diverse AI accelerators.
Perplexity open source garden for inference technology
Autonomous GPU Kernel Generation via Deep Agents
Ship correct and fast LLM kernels to PyTorch
A Python-embedded DSL that makes it easy to write fast, scalable ML kernels with minimal boilerplate.
A unified inference and post-training framework for accelerated video generation.
Tile primitives for speedy kernels
Building the Virtuous Cycle for AI-driven LLM Systems
Post-training with Tinker
Verify the precision of all Kimi K2 API vendors