Stars
Qwen3 is the large language model series developed by the Qwen team at Alibaba Cloud.
MAGI-1: Autoregressive Video Generation at Scale
(CVPR 2025) From Slow Bidirectional to Fast Autoregressive Video Diffusion Models
[ICLR 2025 Spotlight] MagicPIG: LSH Sampling for Efficient LLM Generation
A large-scale information-rich web dataset, featuring millions of real clicked query-document labels
Scalable and robust tree-based speculative decoding algorithm
Automatically Discovering Fast Parallelization Strategies for Distributed Deep Neural Network Training
[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
[NeurIPS'23] H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models.
SkyAGI: Emerging human-behavior simulation capability in LLMs
Instruct-tune LLaMA on consumer hardware
Code and documentation to train Stanford's Alpaca models and generate the data.
Guides, papers, lessons, notebooks and resources for prompt engineering, context engineering, RAG, and AI Agents.
DSPy: The framework for programming, rather than prompting, language models
Running large language models on a single GPU for throughput-oriented scenarios.
Exploring finetuning public checkpoints on filtered 8K sequences from the Pile
A framework for few-shot evaluation of language models.
User-friendly secure computation engine based on secure multi-party computation
Python package built to ease deep learning on graphs, on top of existing DL frameworks.
Reformer, the efficient Transformer, in PyTorch
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Examples of using sparse attention, as in "Generating Long Sequences with Sparse Transformers"
PyTorch implementation of the Image Transformer for unconditional image generation
Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.
Implementations of several fast approximate algorithms for geometric optimal transport (OT)