Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View williamd4112's full-sized avatar

Block or report williamd4112

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 18 1 Updated Dec 30, 2025

Code for “FlowMM Generating Materials with Riemannian Flow Matching” and "FlowLLM: Flow Matching for Material Generation with Large Language Models as Base Distributions"

Python 173 32 Updated Oct 29, 2024

Official Implementation of "DeLLMa: Decision Making Under Uncertainty with Large Language Models"

Python 69 10 Updated Oct 21, 2024
Python 10 4 Updated Feb 16, 2022

Modular Reinforcement Learning (RL) library (implemented in PyTorch, JAX, and NVIDIA Warp) with support for Gymnasium/Gym, NVIDIA Isaac Lab, MuJoCo Playground and other environments

Python 961 121 Updated Jan 17, 2026

A PPO agent leveraging reinforcement learning performs Penetration Testing in a simulated computer network environment. The agent is trained to scan for vulnerabilities in the network and exploit t…

Python 28 7 Updated Apr 2, 2025

[NeurIPS-2023] Annual Conference on Neural Information Processing Systems

Python 224 18 Updated Dec 22, 2024

Generate automated tests for your Node.js app via LLMs without developers having to write a single line of code.

JavaScript 1,814 110 Updated Jan 18, 2026

A curated list of large language model tools for cybersecurity research.

479 55 Updated Apr 10, 2024
Python 35 5 Updated Jul 10, 2021

A fast multithreaded C++ implementation of NLTK BLEU with Python wrapper.

Python 46 1 Updated May 3, 2024

CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery

Python 83 18 Updated Jul 27, 2022

IKEA Furniture Assembly Environment for Long-Horizon Complex Manipulation Tasks

Python 545 62 Updated Mar 3, 2023

Reinforcement Learning for Molecular Design Guided by Quantum Mechanics

Python 127 25 Updated Jul 23, 2023

Benchmarks for Model-Based Optimization

Python 97 23 Updated Apr 21, 2024

Official implementation of Harnessing Mixed Offline Reinforcement Learning Datasets via Trajectory Reweighting

Python 17 Updated Feb 14, 2024

Official codebase for Redeeming Intrinsic Rewards via Constrained Policy Optimization

Python 83 5 Updated Apr 13, 2023

A minimal PyTorch implementation of probabilistic diffusion models for 2D datasets.

Jupyter Notebook 977 77 Updated May 7, 2024

Official code repo for paper: Hybrid RL: Using both offline and online data can make RL efficient.

Python 25 3 Updated Feb 16, 2023

Learning representations for RL in Healthcare under a POMDP assumption

Jupyter Notebook 57 14 Updated Jan 21, 2025
Python 228 45 Updated May 23, 2024

Code for "Counterfactual Off-Policy Evaluation with Gumbel-Max Structural Causal Models" (ICML 2019)

Python 45 11 Updated Sep 28, 2020

Real NVP PyTorch a Minimal Working Example | Normalizing Flow

Jupyter Notebook 142 27 Updated Nov 6, 2020

Reverse Engineering Resources About All Platforms(Windows/Linux/macOS/Android/iOS/IoT) And Every Aspect! (More than 3500 open source tools and 2300 posts&videos)

4,836 892 Updated Sep 1, 2021

Simple Grid Environment for Gymnasium

Python 65 15 Updated Feb 16, 2025

Simple Minimalistic Gridworld Environment for OpenAI Gym (Simple-MiniGrid)

Python 7 2 Updated Mar 16, 2022

Diffusion-Based Approximate Value Functions

Python 4 1 Updated Nov 3, 2018

Reward Propagation using Graph Convolutional Networks

Python 13 12 Updated Jun 19, 2021

Pytorch implementation of "Maximum a Posteriori Policy Optimization" with Retrace for Discrete gym environments

Python 29 5 Updated Sep 10, 2020
Next