You must be logged in to sponsor zhyncs
Become a sponsor to Yineng Zhang
- πΌ Senior Director at Together AI β I run the inference team at Together AI.
- π§βπ» I have initiated and led the end-to-end DeepSeek V3/R1 effort on SGLang β from day-0 support and performance optimization to large-scale EP deployment and GB200 NVL72 integration β driving roadmap, coordination, and execution across community collaborations that pushed the frontier of open-source inference engines.
- π€ Interviewed by The New York Times (Article 1, Article 2), Featured speaker at AI Engineer World's Fair 2025, AMD AI DevDay 2025 and PyTorch Conference 2025.
- π Co-author of the FlashInfer paper (MLSys 2025 Best Paper) and committer to FlashInfer. Previously, I was Lead Software Engineer at Baseten (co-authored the DeepSeek V3 and Qwen 3 launches) and led CTR GPU inference and vector retrieval system development at Meituan.
- π« Contact: [email protected] | Telegram | LinkedIn | Homepage