bamos

Organizations

@VT-Magnum-Research @adobe-research @locuslab @cmu-rdr @cparse


AI Crash Course to help busy builders catch up to the public frontier of AI research in 2 weeks

5,183 731 Updated Feb 27, 2025

ControlArena is a collection of settings, model organisms and protocols - for running control experiments.

Python 111 71 Updated Oct 24, 2025

A system for assigning and grading notebooks

Python 1,355 328 Updated May 9, 2025

slime is an LLM post-training framework for RL Scaling.

Python 2,244 227 Updated Oct 25, 2025

Kimi K2 is the large language model series developed by Moonshot AI team

8,381 549 Updated Sep 11, 2025
Python 195 29 Updated Oct 23, 2025

A Collection of Competitive Text-Based Games for Language Model Evaluation and Reinforcement Learning

Python 300 67 Updated Oct 7, 2025

Code and implementations for the ACL 2025 paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhiheng Xi et al.

Python 629 86 Updated Sep 11, 2025

Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse environments

Python 731 167 Updated Oct 25, 2025

A Gym for Agentic LLMs

Python 333 13 Updated Oct 22, 2025

General technology for enabling AI capabilities w/ LLMs and MLLMs

Python 4,157 340 Updated Jun 30, 2025

[NeurIPS 2023] Reflexion: Language Agents with Verbal Reinforcement Learning

Python 2,905 278 Updated Jan 14, 2025

AdalFlow: The library to build & auto-optimize LLM applications.

Python 3,837 351 Updated Oct 8, 2025

SVGBench: A challenging LLM benchmark that tests knowledge, coding, physical reasoning capabilities of LLMs.

Python 52 3 Updated Oct 25, 2025

Code for the paper "Large Language Models as End-to-end Combinatorial Optimization Solvers"

Python 29 2 Updated Oct 24, 2025
Python 4 Updated Sep 26, 2025

The most open diffusion language model for code generation — releasing pretraining, evaluation, inference, and checkpoints.

Python 353 19 Updated Oct 8, 2025
Python 18 Updated Oct 20, 2025

Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.

Python 323,893 52,838 Updated May 21, 2025

Monet is an Emacs package that implements the Claude Code IDE protocol, enabling Claude to interact with your Emacs environment through a WebSocket connection.

Emacs Lisp 51 4 Updated Sep 26, 2025

Claude Code Emacs integration

Emacs Lisp 507 36 Updated Oct 10, 2025

Evolutionary Scale Modeling (esm): Pretrained language models for proteins

Python 3,839 752 Updated Feb 7, 2024

Research and development (R&D) is crucial for the enhancement of industrial productivity, especially in the AI era, where the core aspects of R&D are mainly focused on data and models. We are commi…

Python 8,861 929 Updated Oct 24, 2025

The official implementation of "ML-Master: Towards AI-for-AI via Integration of Exploration and Reasoning"

Python 184 26 Updated Sep 28, 2025

MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering

Python 1,039 162 Updated Oct 17, 2025

Official implementation of "SG-NeRF: Neural Surface Reconstruction with Scene Graph Optimization" (ECCV 2024)

Python 38 Updated Jul 12, 2024

A command-line productivity tool powered by AI large language models like GPT-4 that helps you accomplish your tasks faster and more efficiently.

Python 11,465 925 Updated Jul 17, 2025
Python 116 5 Updated Aug 10, 2025

MTEB: Massive Text Embedding Benchmark

Python 2,931 485 Updated Oct 25, 2025

DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models

Python 1,813 295 Updated Jan 16, 2024