Thanks to visit codestin.com
Credit goes to Github.com

Skip to content
View masonwang025's full-sized avatar

Highlights

  • Pro

Organizations

@stanford-oval

Block or report masonwang025

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[NeurIPS 2025 Spotlight] A Native Multimodal LLM for 3D Generation and Understanding

Python 523 28 Updated Oct 20, 2025

Load MDX content from anywhere

TypeScript 3,047 147 Updated Dec 9, 2025

An efficient implementation of the NSA (Native Sparse Attention) kernel

Python 126 4 Updated Jun 24, 2025

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3, Qwen3-MoE, DeepSeek-R1, GLM4.5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, …

Python 11,838 1,084 Updated Dec 25, 2025

🚀 Efficient implementations of state-of-the-art linear attention models

Python 4,118 338 Updated Dec 24, 2025
Python 114 3 Updated Sep 17, 2025

beep boop personal website hosted at tinabmai.com

TypeScript 16 3 Updated Nov 28, 2025

Repo for "LoLCATs: On Low-Rank Linearizing of Large Language Models"

Python 249 25 Updated Jan 31, 2025

🐳 Efficient Triton implementations for "Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention"

Python 944 48 Updated Mar 19, 2025

Residual Quantization Autoencoder, used for interpreting LLMs

Python 13 2 Updated Jan 1, 2025

repo for code for paper on general theory associative memory models

Python 21 3 Updated Jun 15, 2022

Recipes to scale inference-time compute of open models

Python 1,120 131 Updated May 22, 2025

Curated list of datasets and tools for post-training.

4,110 335 Updated Nov 10, 2025

Run Slurm in Kubernetes

Go 338 49 Updated Dec 24, 2025

A toolkit for describing model features and intervening on those features to steer behavior.

Python 223 20 Updated Dec 12, 2025
JavaScript 95 14 Updated Nov 2, 2024

Recreating and refactoring weareninja.com's "space warp" effect

TypeScript 31 4 Updated Oct 21, 2025
TypeScript 1,907 269 Updated Dec 16, 2025

Entropy Based Sampling and Parallel CoT Decoding

Python 3,432 325 Updated Nov 13, 2024

Repository for Manifold Integrated Gradients: Riemannian Geometry for Feature Attribution

Jupyter Notebook 4 1 Updated May 13, 2024

A library for making RepE control vectors

Jupyter Notebook 673 53 Updated Sep 24, 2025

Go ahead and axolotl questions

Python 10,995 1,224 Updated Dec 25, 2025

Thorn in a HaizeStack test for evaluating long-context adversarial robustness.

Python 26 1 Updated Aug 3, 2024

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 51,389 8,613 Updated Nov 12, 2025

Simple, powerful and flexible site generation framework with everything you love from Next.js.

TypeScript 13,458 1,414 Updated Dec 23, 2025

Orchestrate zero-shot computer vision models

HTML 392 14 Updated Aug 20, 2024

LLM training in simple, raw C/CUDA

Cuda 28,459 3,338 Updated Jun 26, 2025

LlamaIndex is the leading framework for building LLM-powered agents over your data.

Python 46,007 6,661 Updated Dec 25, 2025
Next