Stars
[NeurIPS 2025 Spotlight] A Native Multimodal LLM for 3D Generation and Understanding
An efficient implementation of the NSA (Native Sparse Attention) kernel
Use PEFT or full-parameter training to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3, Qwen3-MoE, DeepSeek-R1, GLM4.5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, …)
🚀 Efficient implementations of state-of-the-art linear attention models
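For orientation, the core trick these kernels optimize is the linear-attention recurrence S_t = S_{t-1} + k_t v_t^T with readout o_t = S_t^T q_t. A minimal NumPy sketch of that recurrence (illustrative only; the library itself ships fused Triton kernels):

```python
import numpy as np

def linear_attention(q, k, v):
    """q, k, v: (seq_len, d) arrays. Runs in O(seq_len * d^2) with O(d^2) state."""
    seq_len, d = q.shape
    S = np.zeros((d, d))           # running sum of k_t v_t^T outer products
    out = np.zeros_like(v)
    for t in range(seq_len):
        S += np.outer(k[t], v[t])  # fold the new key/value pair into the state
        out[t] = S.T @ q[t]        # read the state out with the current query
    return out

q = k = v = np.random.randn(8, 4)
print(linear_attention(q, k, v).shape)  # (8, 4)
```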
beep boop: personal website hosted at tinabmai.com
Repo for "LoLCATs: On Low-Rank Linearizing of Large Language Models"
🐳 Efficient Triton implementations for "Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention"
Residual Quantization Autoencoder, used for interpreting LLMs
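The "residual quantization" part is standard multi-stage vector quantization: each stage quantizes the reconstruction error left by the previous one. A minimal sketch with illustrative codebooks (not this repo's interpretability pipeline):

```python
import numpy as np

def residual_quantize(x, codebooks):
    """x: (d,) vector. codebooks: list of (num_codes, d) arrays. Returns one index per stage."""
    residual = x.copy()
    codes = []
    for cb in codebooks:
        idx = int(np.argmin(np.sum((cb - residual) ** 2, axis=1)))  # nearest code
        codes.append(idx)
        residual -= cb[idx]  # the next stage only sees the leftover error
    return codes

rng = np.random.default_rng(0)
print(residual_quantize(rng.standard_normal(8),
                        [rng.standard_normal((16, 8)) for _ in range(3)]))
```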
Code repo for a paper on a general theory of associative memory models
Recipes to scale inference-time compute of open models
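The simplest such recipe is best-of-N sampling against a reward model; the sketch below shows the shape of the idea, with `generate` and `score` as hypothetical stand-ins for an LLM sampler and a verifier (the actual recipes are more elaborate):

```python
def best_of_n(prompt, generate, score, n=8):
    """Spend inference compute by sampling n candidates and keeping the best-scored one.
    `generate` and `score` are hypothetical stand-ins for an LLM sampler and a reward model."""
    candidates = [generate(prompt) for _ in range(n)]
    return max(candidates, key=lambda c: score(prompt, c))
```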
Curated list of datasets and tools for post-training.
A toolkit for describing model features and intervening on those features to steer behavior.
Recreating and refactoring weareninja.com's "space warp" effect
Entropy-Based Sampling and Parallel CoT Decoding
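The underlying idea: use the entropy of the next-token distribution to decide how to decode. A toy sketch (threshold and branching are illustrative, not the repo's exact policy):

```python
import numpy as np

def entropy_gated_sample(logits, threshold=2.0, rng=np.random.default_rng()):
    z = logits - logits.max()               # numerically stable softmax
    p = np.exp(z) / np.exp(z).sum()
    entropy = -np.sum(p * np.log(p + 1e-12))
    if entropy < threshold:
        return int(np.argmax(p))            # confident: decode greedily
    return int(rng.choice(len(p), p=p))     # uncertain: sample (or branch a parallel CoT)
```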
Repository for Manifold Integrated Gradients: Riemannian Geometry for Feature Attribution
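For reference, vanilla integrated gradients (which the paper generalizes from straight-line paths to paths respecting the data manifold's Riemannian geometry) attributes features by averaging gradients along a path from a baseline. A sketch with a hypothetical `grad_fn` callback:

```python
import numpy as np

def integrated_gradients(x, baseline, grad_fn, steps=50):
    """grad_fn(z) -> dF/dz (hypothetical model-gradient callback)."""
    alphas = np.linspace(0.0, 1.0, steps)
    avg_grad = np.mean([grad_fn(baseline + a * (x - baseline)) for a in alphas], axis=0)
    return (x - baseline) * avg_grad  # Riemann-sum approximation of the path integral
```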
A library for making RepE control vectors
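RepE-style control vectors are commonly built as the difference of mean hidden states over contrastive prompt sets, then added into the residual stream at inference. A sketch of that construction (hypothetical inputs, not this library's API):

```python
import numpy as np

def control_vector(pos_states, neg_states):
    """pos_states, neg_states: (num_prompts, d) hidden states from one layer."""
    v = pos_states.mean(axis=0) - neg_states.mean(axis=0)
    return v / np.linalg.norm(v)  # unit direction; scale it at injection time

# At inference, steer by adding alpha * v into the layer's residual stream.
```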
A "Thorn in a HaizeStack" test for evaluating long-context adversarial robustness.
The simplest, fastest repository for training/finetuning medium-sized GPTs.
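Its appeal is that the whole training loop fits on one screen; stripped to a skeleton (`model` and `get_batch` are stand-ins, and the real repo adds mixed precision, an LR schedule, and checkpointing), it looks roughly like:

```python
import torch

def train(model, get_batch, steps=1000, lr=3e-4, device="cuda"):
    opt = torch.optim.AdamW(model.parameters(), lr=lr)
    model.train()
    for _ in range(steps):
        x, y = (t.to(device) for t in get_batch())  # (B, T) tokens and shifted targets
        logits = model(x)                           # (B, T, vocab_size)
        loss = torch.nn.functional.cross_entropy(
            logits.view(-1, logits.size(-1)), y.view(-1))
        opt.zero_grad(set_to_none=True)
        loss.backward()
        opt.step()
```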
Simple, powerful and flexible site generation framework with everything you love from Next.js.
Orchestrate zero-shot computer vision models
LlamaIndex is the leading framework for building LLM-powered agents over your data.
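The canonical quickstart, for flavor (recent versions; assumes a local ./data folder of documents and an OpenAI API key for the default embeddings/LLM):

```python
from llama_index.core import VectorStoreIndex, SimpleDirectoryReader

documents = SimpleDirectoryReader("data").load_data()  # ingest local files
index = VectorStoreIndex.from_documents(documents)     # embed and index them
query_engine = index.as_query_engine()                 # turn the index into a RAG endpoint
print(query_engine.query("What did the author do growing up?"))
```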