Stars
- All languages
- AppleScript
- Assembly
- Awk
- Brainfuck
- C
- C#
- C++
- CSS
- Clojure
- CoffeeScript
- Common Lisp
- Cuda
- Cython
- D
- Dockerfile
- Elixir
- Emacs Lisp
- Erlang
- Fortran
- Frege
- GLSL
- Gherkin
- Go
- HTML
- Haskell
- Haxe
- Java
- JavaScript
- Jinja
- Julia
- Jupyter Notebook
- Kotlin
- Lean
- Less
- LilyPond
- LiveScript
- Lua
- MATLAB
- MDX
- Makefile
- Markdown
- Mathematica
- NSIS
- OCaml
- Objective-C
- OpenEdge ABL
- PHP
- Pascal
- Perl
- Pug
- PureBasic
- Python
- R
- Rocq Prover
- Roff
- Ruby
- Rust
- SAS
- SCSS
- Scala
- Shell
- Stan
- Standard ML
- Svelte
- Swift
- TLA
- TSQL
- TeX
- TypeScript
- Vala
- Vim Script
AI Crash Course to help busy builders catch up to the public frontier of AI research in 2 weeks
ControlArena is a collection of settings, model organisms and protocols - for running control experiments.
A system for assigning and grading notebooks
slime is an LLM post-training framework for RL Scaling.
Kimi K2 is the large language model series developed by Moonshot AI team
A Collection of Competitive Text-Based Games for Language Model Evaluation and Reinforcement Learning
Code and implementations for the ACL 2025 paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhiheng Xi et al.
Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse environments
General technology for enabling AI capabilities w/ LLMs and MLLMs
[NeurIPS 2023] Reflexion: Language Agents with Verbal Reinforcement Learning
AdalFlow: The library to build & auto-optimize LLM applications.
SVGBench: A challenging LLM benchmark that tests knowledge, coding, physical reasoning capabilities of LLMs.
code for paper "Large Language Models as End-to-end Combinatorial Optimization Solvers"
The most open diffusion language model for code generation — releasing pretraining, evaluation, inference, and checkpoints.
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
Monet is an Emacs package that implements the Claude Code IDE protocol, enabling Claude to interact with your Emacs environment through a WebSocket connection.
Evolutionary Scale Modeling (esm): Pretrained language models for proteins
Research and development (R&D) is crucial for the enhancement of industrial productivity, especially in the AI era, where the core aspects of R&D are mainly focused on data and models. We are commi…
The official implementation of "ML-Master: Towards AI-for-AI via Integration of Exploration and Reasoning"
MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering
Official implementation of "SG-NeRF: Neural Surface Reconstruction with Scene Graph Optimization" (ECCV 2024)
A command-line productivity tool powered by AI large language models like GPT-4, will help you accomplish your tasks faster and more efficiently.
MTEB: Massive Text Embedding Benchmark
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models