DEM1TASSE

✨

Focusing

Demi Wang DEM1TASSE

✨

Focusing

Master @cmulti | Ex @msra @bytedance @cal

34 followers · 25 following

Carnegie Mellon University
Pittsburgh, United States
dem1tasse.github.io
@demisama_

Achievements

Highlights

Stars

rasbt / LLMs-from-scratch

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 76,698 11,297 Updated Oct 22, 2025

ServiceNow / AgentLab

AgentLab: An open-source framework for developing, testing, and benchmarking web agents on diverse tasks, designed for scalability and reproducibility.

Python 434 91 Updated Oct 27, 2025

ServiceNow / BrowserGym

🌎💪 BrowserGym, a Gym environment for web task automation

Python 942 132 Updated Oct 27, 2025

OSU-NLP-Group / SkillWeaver

SkillWeaver is a framework to enable web agent self-improvement through environment exploration and skill synthesis.

Python 98 8 Updated Apr 14, 2025

zorazrw / agent-skill-induction

Agent Skill Induction: "Inducing Programmatic Skills for Agentic Tasks"

Python 31 5 Updated Apr 24, 2025

neubig / test-repo

Python 2 Updated Oct 21, 2025

speedyapply / 2026-AI-College-Jobs

2026 AI/ML internship & new graduate job list updated daily

3,797 158 Updated Oct 27, 2025

DEM1TASSE / Miko

AI-powered desktop companion to boost your efficiency

Python 2 1 Updated Jul 27, 2025

microsoft / agent-lightning

The absolute trainer to light up AI agents.

Python 2,947 220 Updated Oct 28, 2025

LLM360 / Reasoning360

A repo for open research on building large reasoning models

Python 108 14 Updated Oct 27, 2025

zhyang2226 / AR-Lopti

[AI4MATH@ICML2025] Do Not Let Low-Probability Tokens Over-Dominate in RL for LLMs

Python 40 1 Updated May 20, 2025

hemingkx / TokenSkip

[EMNLP 2025] TokenSkip: Controllable Chain-of-Thought Compression in LLMs

Python 186 11 Updated Jun 28, 2025

denkiwakame / arxiv2notion

Chrome extension for clipping arXiv articles to Notion.

JavaScript 130 18 Updated Oct 2, 2025

hiyouga / LLaMA-Factory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 60,886 7,359 Updated Oct 27, 2025

ElliottYan / LUFFY

Official Repository of "Learning to Reason under Off-Policy Guidance"

Python 355 41 Updated Oct 4, 2025

qixucen / atom

[NeurIPS 2025] Atom of Thoughts for Markov LLM Test-Time Scaling

Python 591 51 Updated Jun 16, 2025

Jiayi-Pan / TinyZero

Minimal reproduction of DeepSeek R1-Zero

Python 12,317 1,518 Updated Apr 24, 2025

zwxandy / Awesome-Efficient-CoT-Reasoning-Summary

🔥 How to efficiently and effectively compress the CoTs or directly generate concise CoTs during inference while maintaining the reasoning performance is an important topic!

63 4 Updated May 22, 2025