Stars
Code for "FlowMM: Generating Materials with Riemannian Flow Matching" and "FlowLLM: Flow Matching for Material Generation with Large Language Models as Base Distributions"
Official Implementation of "DeLLMa: Decision Making Under Uncertainty with Large Language Models"
Modular Reinforcement Learning (RL) library (implemented in PyTorch, JAX, and NVIDIA Warp) with support for Gymnasium/Gym, NVIDIA Isaac Lab, MuJoCo Playground and other environments
A PPO agent leveraging reinforcement learning performs Penetration Testing in a simulated computer network environment. The agent is trained to scan for vulnerabilities in the network and exploit t…
[NeurIPS-2023] Annual Conference on Neural Information Processing Systems
Generate automated tests for your Node.js app via LLMs without developers having to write a single line of code.
A curated list of large language model tools for cybersecurity research.
A fast multithreaded C++ implementation of NLTK BLEU with Python wrapper.
CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery
IKEA Furniture Assembly Environment for Long-Horizon Complex Manipulation Tasks
Reinforcement Learning for Molecular Design Guided by Quantum Mechanics
Benchmarks for Model-Based Optimization
Official implementation of Harnessing Mixed Offline Reinforcement Learning Datasets via Trajectory Reweighting
Official codebase for Redeeming Intrinsic Rewards via Constrained Policy Optimization
A minimal PyTorch implementation of probabilistic diffusion models for 2D datasets.
Official code repo for paper: Hybrid RL: Using both offline and online data can make RL efficient.
Learning representations for RL in Healthcare under a POMDP assumption
Code for "Counterfactual Off-Policy Evaluation with Gumbel-Max Structural Causal Models" (ICML 2019)
Real NVP in PyTorch: a Minimal Working Example | Normalizing Flow
Reverse Engineering Resources About All Platforms (Windows/Linux/macOS/Android/iOS/IoT) And Every Aspect! (More than 3500 open source tools and 2300 posts & videos)
Simple Minimalistic Gridworld Environment for OpenAI Gym (Simple-MiniGrid)
Reward Propagation using Graph Convolutional Networks
PyTorch implementation of "Maximum a Posteriori Policy Optimization" with Retrace for discrete gym environments