xzAscC/xzAscC

Hi, I'm Xudong Zhu 👋

Arch Linux Neovim/Fish/OpenCode Research Email Google Scholar OpenReview Website

PhD student at The Ohio State University working on understanding and controlling large language models. I also vibe code research prototypes, developer tools, and AI systems.

Recent Projects

  • Understanding Linear Steering (ongoing)
    Investigating the geometry, linearity, and causal structure of steering directions in LLM representation space.

  • AbsTopK: Rethinking Sparse Autoencoders For Bidirectional Features (arXiv, OpenReview)
    Developed a principled proximal-gradient framework that unifies SAE variants (ReLU, JumpReLU, TopK) and reveals that non-negativity constraints prevent bidirectional feature representation. Proposed AbsTopK, a magnitude-based sparse operator that recovers complete semantic axes and improves interpretability and steering in LLMs.

  • From Emergence to Control: Probing and Modulating Self-Reflection in Language Models (arXiv)
    Showed that linear directions in representation space can enable and control self-reflection behavior in pretrained LLMs without finetuning.
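
To make the AbsTopK idea above concrete, here is a minimal pure-Python sketch of a magnitude-based top-k operator next to a non-negative top-k baseline. It is an illustration only, not the paper's implementation: the function names `abs_top_k` and `relu_top_k`, the list-based interface, and the example values are assumptions made for this sketch.

```python
from typing import List


def abs_top_k(z: List[float], k: int) -> List[float]:
    """Keep the k entries with the largest absolute value, signs preserved.

    Hypothetical helper: a sketch of a magnitude-based sparse operator.
    """
    k = max(k, 0)
    order = sorted(range(len(z)), key=lambda i: abs(z[i]), reverse=True)
    keep = set(order[:k])
    return [v if i in keep else 0.0 for i, v in enumerate(z)]


def relu_top_k(z: List[float], k: int) -> List[float]:
    """Non-negative baseline: keep the k largest values, clipping negatives to zero.

    Hypothetical helper: assumes TopK is applied to signed pre-activations
    and followed by a ReLU-style clip.
    """
    k = max(k, 0)
    order = sorted(range(len(z)), key=lambda i: z[i], reverse=True)
    keep = set(order[:k])
    return [max(v, 0.0) if i in keep else 0.0 for i, v in enumerate(z)]


# Made-up pre-activations; the strongest one (-1.5) is negative.
z = [0.9, -1.5, 0.1, -0.2, 0.4]
signed = abs_top_k(z, 2)     # keeps 0.9 and -1.5
clipped = relu_top_k(z, 2)   # keeps 0.9 and 0.4, drops -1.5
print(signed, clipped)
```

In this toy input the strongest activation is negative. The magnitude-based operator retains it with its sign, while the non-negative baseline discards it, which is the kind of lost direction the project description attributes to non-negativity constraints.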

GitHub Stats


If you are interested in collaboration, feel free to open an issue or connect with me.

Acknowledgments

GitHub stats cards are powered by github-readme-stats. Many thanks to the authors for building and maintaining it.
