- 👋 Hi, I’m @Zishan-Shao (you can call me Bruce; I know my name is hard to pronounce)
- 👀 I’m broadly interested in Large Language Models (LLMs) and ML systems, spanning LLM efficiency (model compression, hardware-aware acceleration) and mechanistic analysis (decode-time dynamics).
- 🌱 I’m currently working on decode-time evaluation methodologies and efficient inference serving.
- 💞️ I’m looking to collaborate on hardware-efficient LLM serving or mechanistic interpretability.
- Hobbies: competitive powerlifting (100 kg+ bench press, 160 kg+ squat and deadlift), combat sports (MMA, boxing, BJJ), and occasionally exploring Teyvat in Genshin Impact.
- 📫 How to reach me: [email protected], [email protected]
- @duke University
- Durham
- https://zishan-shao.github.io/
- https://orcid.org/0009-0003-7873-8857
Pinned
- decodeshare (Public): [ICML 2026 Spotlight] Official implementation of "DecodeShare: Tracing the Shared Subspace of LLM Decode-Time Decisions"
Python 1