samvelyan

Mikayel Samvelyan samvelyan

Research Scientist at Google DeepMind

175 followers · 13 following

Achievements

Highlights

Organizations

Stars

openai / weak-to-strong

Python 2,547 308 Updated May 19, 2024

SakanaAI / AI-Scientist

The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬

Jupyter Notebook 11,946 1,743 Updated Dec 19, 2025

NetHack-LE / nle

Forked from facebookresearch/nle

The NetHack Learning Environment

C 102 14 Updated Dec 31, 2025

meta-llama / llama3

The official Meta Llama 3 GitHub site

Python 29,174 3,504 Updated Jan 26, 2025

eliahuhorwitz / Academic-project-page-template

A project page template for academic papers. Demo at https://eliahuhorwitz.github.io/Academic-project-page-template/

JavaScript 4,361 901 Updated Sep 4, 2025

microsoft / promptbench

A unified evaluation framework for large language models

Python 2,771 219 Updated Oct 13, 2025

meta-llama / PurpleLlama

Set of tools to assess and improve LLM security.

Python 3,978 689 Updated Jan 16, 2026

facebookresearch / BenchMARL

BenchMARL is a library for benchmarking Multi-Agent Reinforcement Learning (MARL). BenchMARL allows to quickly compare different MARL algorithms, tasks, and models while being systematically ground…

Python 555 110 Updated Nov 10, 2025

jennyzzt / awesome-open-ended

Awesome Open-ended AI

392 39 Updated Jan 12, 2026

OpenRL-Lab / TiZero

Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023) 足球游戏智能体

Python 65 7 Updated Sep 10, 2023

OpenRL-Lab / openrl

Unified Reinforcement Learning Framework

Python 805 80 Updated Sep 6, 2024

PKU-MARL / HARL

Official implementation of HARL algorithms based on PyTorch.

Python 848 119 Updated Apr 27, 2025

Farama-Foundation / chatarena

ChatArena (or Chat Arena) is a Multi-Agent Language Game Environments for LLMs. The goal is to develop communication and collaboration capabilities of AIs.

Python 1,527 148 Updated Aug 11, 2025

google-deepmind / pysc2

StarCraft II Learning Environment

Python 8,245 1,171 Updated Jul 23, 2024

marlbenchmark / on-policy

This is the official implementation of Multi-Agent PPO (MAPPO).

Python 1,850 359 Updated Jul 18, 2024

google-research / tuning_playbook

A playbook for systematically maximizing the performance of deep learning models.

29,691 2,413 Updated Jun 18, 2024

google-research / football

Check out the new game server:

Python 3,540 1,346 Updated Jun 17, 2025

facebookresearch / e3b

Official repo for the E3B algorithm described in the paper "Exploration via Elliptical Episodic Bonuses".

Python 87 14 Updated Mar 22, 2024

instadeepai / marl-eval

A tool for aggregating and plotting MARL experiment data.

Python 80 7 Updated Jan 20, 2025

ucl-dark / skillhack

SkillHack: A Benchmark for Skill Transfer in Open-Ended Reinforcement Learning

Python 17 Updated Oct 23, 2022

adaptive-intelligent-robotics / QDax

Accelerated Quality-Diversity

Python 335 56 Updated Oct 30, 2025

oxwhirl / smacv2

Python 284 48 Updated Feb 15, 2024

google-research / rliable

[NeurIPS'21 Outstanding Paper] Library for reliable evaluation on RL and ML benchmarks, even with only a handful of seeds.

Jupyter Notebook 866 48 Updated Aug 12, 2024

google-deepmind / mujoco_menagerie

A collection of high-quality models for the MuJoCo physics engine, curated by Google DeepMind.

Python 2,932 404 Updated Jan 11, 2026

ngoodger / nle-language-wrapper

Nethack Learning Environment Wrapper for Language Interface

Python 41 9 Updated Sep 11, 2023

facebookresearch / LaMCTS

The release codes of LA-MCTS with its application to Neural Architecture Search.

Python 482 70 Updated Nov 28, 2022

andyljones / boardlaw

Scaling scaling laws with board games.

Python 53 12 Updated Jul 17, 2023

proroklab / VectorizedMultiAgentSimulator

VMAS is a vectorized differentiable simulator designed for efficient Multi-Agent Reinforcement Learning benchmarking. It is comprised of a vectorized 2D physics engine written in PyTorch and a set …

Python 510 98 Updated Nov 10, 2025

facebookresearch / dcd

Implementations of robust Dual Curriculum Design (DCD) algorithms for unsupervised environment design.

Python 138 33 Updated Aug 20, 2024

minihack-editor / minihack-editor.github.io

Simple editor for making minihack DES files.

JavaScript 2 Updated Jul 22, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Mikayel Samvelyan samvelyan

Achievements

Achievements

Highlights

Organizations

Block or report samvelyan

Stars

openai / weak-to-strong

SakanaAI / AI-Scientist

NetHack-LE / nle

meta-llama / llama3

eliahuhorwitz / Academic-project-page-template

microsoft / promptbench

meta-llama / PurpleLlama

facebookresearch / BenchMARL

jennyzzt / awesome-open-ended

OpenRL-Lab / TiZero

OpenRL-Lab / openrl

PKU-MARL / HARL

Farama-Foundation / chatarena

google-deepmind / pysc2

marlbenchmark / on-policy

google-research / tuning_playbook

google-research / football

facebookresearch / e3b

instadeepai / marl-eval

ucl-dark / skillhack

adaptive-intelligent-robotics / QDax

oxwhirl / smacv2

google-research / rliable

google-deepmind / mujoco_menagerie

ngoodger / nle-language-wrapper

facebookresearch / LaMCTS

andyljones / boardlaw

proroklab / VectorizedMultiAgentSimulator

facebookresearch / dcd

minihack-editor / minihack-editor.github.io