Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View mathematicallfs's full-sized avatar

Highlights

  • Pro

Block or report mathematicallfs

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

The best ChatGPT that $100 can buy.

Python 34,814 3,929 Updated Oct 30, 2025

CUDA-L1: Improving CUDA Optimization via Contrastive Reinforcement Learning

Python 201 17 Updated Oct 28, 2025

toy reproduction of Auxiliary-Loss-Free Load Balancing Strategy for Mixture-of-Experts

Python 24 2 Updated Sep 1, 2024

[CVPR 2025 Oral & Award Candidate] Difix3D+: Improving 3D Reconstructions with Single-Step Diffusion Models

Python 887 64 Updated Jun 28, 2025
Python 140 7 Updated May 6, 2025

Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation" (CVPR'25 Spotlight).

Python 10,872 979 Updated Aug 5, 2025

A MAD laboratory to improve AI architecture designs 🧪

Python 132 14 Updated Dec 17, 2024

LongRecipe: Recipe for Efficient Long Context Generalization in Large Language Models

Python 76 4 Updated Oct 16, 2024

Some preliminary explorations of Mamba's context scaling.

Python 216 10 Updated Feb 8, 2024

official code for paper Probing the Decision Boundaries of In-context Learning in Large Language Models. https://arxiv.org/abs/2406.11233 [NeurIPS 2024]

Python 19 3 Updated Jul 27, 2025

GPT-2 (124M) quality in 5B tokens

Python 1 Updated Jun 6, 2024

Official implementation of IJCAI 2024 paper "Cross-Domain Feature Augmentation for Domain Generalization"

Python 15 1 Updated Aug 20, 2024

LLM training in simple, raw C/CUDA

Cuda 27,997 3,253 Updated Jun 26, 2025

[NeurIPS 2024] Uncertainty of Thoughts: Uncertainty-Aware Planning Enhances Information Seeking in Large Language Models

Python 102 6 Updated Aug 5, 2024

Collection of papers on state-space models

604 22 Updated Sep 6, 2025

Understand and test language model architectures on synthetic tasks.

Python 234 38 Updated Sep 25, 2025

A simple and efficient Mamba implementation in pure PyTorch and MLX.

Python 1,352 114 Updated Dec 4, 2024

FlashFFTConv: Efficient Convolutions for Long Sequences with Tensor Cores

C++ 330 29 Updated Dec 28, 2024

Codes for "MixupE: Understanding and Improving Mixup from Directional Derivative Perspective" UAI 2023 Oral

Python 29 Updated Aug 30, 2023

[ICCV 2023] Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation

Python 4,366 399 Updated Oct 25, 2023

Official code for ''From Optimization Dynamics to Generalization Bounds via Łojasiewicz Gradient Inequality'' (TMLR)

Python 6 Updated Oct 5, 2022

Official code for "In Search of Robust Measures of Generalization" (NeurIPS 2020)

Python 28 4 Updated Dec 22, 2020

A visual no-code/code-free web crawler/spider易采集:一个可视化浏览器自动化测试/数据采集/爬虫软件,可以无代码图形化的设计和执行爬虫任务。别名:ServiceWrapper面向Web应用的智能化服务封装系统。

JavaScript 43,239 5,309 Updated Oct 30, 2025

可视化Bilibili本地视频XML弹幕转换ASS字幕转换器

Python 195 5 Updated Jan 2, 2024

A library for users to write (experiment in research) configurations in Python Dict or JSON format, read and write parameter value via dot . in code, while can read parameters from the command line…

Python 2,029 259 Updated Aug 22, 2024

Training-free data valuation on deep neural network applications. (ICML-2022)

Python 26 Updated Jul 13, 2022

RobustBench: a standardized adversarial robustness benchmark [NeurIPS 2021 Benchmarks and Datasets Track]

Python 750 100 Updated Mar 31, 2025