Thanks to visit codestin.com
Credit goes to mohsenhariri.github.io

Mohsen Hariri

Hi, I'm Mohsen!

and I love math ❤️.

🎓 Google Scholar 🐙 GitHub 💼 LinkedIn 🪪 CV/Résumé

📄 Papers 📝 Posts 🎯 Research Interests

News

Oct 20, 2025

🗃️ Check More for Keys, Less for Values on arXiv.

Oct 18, 2025

📦 Julia & Python pkgs for the Bayesian framework are out!

Oct 18, 2025

🎲 Bayesian framework for LLM evaluation is out!

Oct 15, 2025

📦 vLLM × DFloat11: run your model with 30% less memory!

Sep 17, 2025

✨ DF11 accepted to NeurIPS 2025!

Recent Papers

Don’t Pass@𝑘: A Bayesian Framework for Large Language Model Evaluation

Mohsen Hariri, Amirhossein Samandar, Michael Hinczewski, Vipin Chaudhary • Oct 21, 2025

Quantize What Counts: More For Keys, Less For Values ☝️🔑👇🔢

Mohsen Hariri, Alan Luo, Weicong Chen, Shaochen Zhong, Tianyi Zhang, Qifan Wang, Xia Hu, Xiaotian Han, Vipin Chaudhary • Oct 20, 2025

70% Size, 100% Accuracy: Lossless LLM Compression for Efficient GPU Inference via Dynamic-Length Float

Tianyi Zhang, Mohsen Hariri, Shaochen Zhong, Vipin Chaudhary, Yang Sui, Xia Hu, Anshumali Shrivastali • Oct 19, 2025

Recent Posts

Entropy of bfloat16 During Training: How Optimizers Shape Weight Distributions

Nov 17, 2025 • Training, Information Theory, Optimizers

Entropy of bfloat16: 8 Bits Are Doing 2.6 Bits of Work

Oct 28, 2025 • LLMs, Information Theory, Efficiency

Simulating LLM Evaluation Datasets Using Psychometric Models

Oct 23, 2025 • Simulation, LLMs, Reasoning

Recent Slides

Virtual Agentic Lab!

Jan 18, 2026 • AI Agents, LLMs, Science

10-slide paper summary of Swanson et al. (doi:10.1038/s41586-025-09442-9)

LLM Research Directions

Jan 18, 2026 • LLMs, Reasoning Models, Test-time scaling

SCIPE Workshop on LLMs - Day 3

Tool Use (Function Calling) & RAG

Jan 17, 2026 • LLMs, Tools, Function Calling

SCIPE Workshop on LLMs - Day 2