williamd4112

zhangwei hong williamd4112

47 followers · 22 following

williamd4112.github.io

Achievements

Stars

iamxjy / BOAD-SWE-Agent

Python 18 1 Updated Dec 30, 2025

facebookresearch / flowmm

Code for “FlowMM Generating Materials with Riemannian Flow Matching” and "FlowLLM: Flow Matching for Material Generation with Large Language Models as Base Distributions"

Python 173 32 Updated Oct 29, 2024

DeLLMa / DeLLMa

Official Implementation of "DeLLMa: Decision Making Under Uncertainty with Large Language Models"

Python 69 10 Updated Oct 21, 2024

Daniella1 / urdf_files_dataset

Python 447 60 Updated Apr 6, 2024

aspuru-guzik-group / curiosity

Python 10 4 Updated Feb 16, 2022

Toni-SM / skrl

Modular Reinforcement Learning (RL) library (implemented in PyTorch, JAX, and NVIDIA Warp) with support for Gymnasium/Gym, NVIDIA Isaac Lab, MuJoCo Playground and other environments

Python 961 121 Updated Jan 17, 2026

yyzpiero / RL4RedTeam

A PPO agent leveraging reinforcement learning performs Penetration Testing in a simulated computer network environment. The agent is trained to scan for vulnerabilities in the network and exploit t…

Python 28 7 Updated Apr 2, 2025

yunqing-me / AttackVLM

[NeurIPS-2023] Annual Conference on Neural Information Processing Systems

Python 224 18 Updated Dec 22, 2024

Pythagora-io / pythagora

Generate automated tests for your Node.js app via LLMs without developers having to write a single line of code.

JavaScript 1,814 110 Updated Jan 18, 2026

tenable / awesome-llm-cybersecurity-tools

A curated list of large language model tools for cybersecurity research.

479 55 Updated Apr 10, 2024

apple / ml-uwac

Python 35 5 Updated Jul 10, 2021

Danial-Alh / fast-bleu

A fast multithreaded C++ implementation of NLTK BLEU with Python wrapper.

Python 46 1 Updated May 3, 2024

rll-research / cic

CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery

Python 83 18 Updated Jul 27, 2022

clvrai / furniture

IKEA Furniture Assembly Environment for Long-Horizon Complex Manipulation Tasks

Python 545 62 Updated Mar 3, 2023

gncs / molgym

Reinforcement Learning for Molecular Design Guided by Quantum Mechanics

Python 127 25 Updated Jul 23, 2023

brandontrabucco / design-bench

Benchmarks for Model-Based Optimization

Python 97 23 Updated Apr 21, 2024

Improbable-AI / harness-offline-rl

Official implementation of Harnessing Mixed Offline Reinforcement Learning Datasets via Trajectory Reweighting

Python 17 Updated Feb 14, 2024

Improbable-AI / eipo

Official codebase for Redeeming Intrinsic Rewards via Constrained Policy Optimization

Python 83 5 Updated Apr 13, 2023

tanelp / tiny-diffusion

A minimal PyTorch implementation of probabilistic diffusion models for 2D datasets.

Jupyter Notebook 977 77 Updated May 7, 2024

yudasong / HyQ

Official code repo for paper: Hybrid RL: Using both offline and online data can make RL efficient.

Python 25 3 Updated Feb 16, 2023

MLforHealth / rl_representations

Learning representations for RL in Healthcare under a POMDP assumption

Jupyter Notebook 57 14 Updated Jan 21, 2025

wenhao-gao / mol_opt

Python 228 45 Updated May 23, 2024

clinicalml / gumbel-max-scm

Code for "Counterfactual Off-Policy Evaluation with Gumbel-Max Structural Causal Models" (ICML 2019)

Python 45 11 Updated Sep 28, 2020

senya-ashukha / real-nvp-pytorch

Real NVP PyTorch a Minimal Working Example | Normalizing Flow

Jupyter Notebook 142 27 Updated Nov 6, 2020

alphaSeclab / awesome-reverse-engineering

Reverse Engineering Resources About All Platforms(Windows/Linux/macOS/Android/iOS/IoT) And Every Aspect! (More than 3500 open source tools and 2300 posts&videos)

4,836 892 Updated Sep 1, 2021

damat-le / gym-simplegrid

Simple Grid Environment for Gymnasium

Python 65 15 Updated Feb 16, 2025

rafelps / gym-simple-minigrid

Forked from Farama-Foundation/Minigrid

Simple Minimalistic Gridworld Environment for OpenAI Gym (Simple-MiniGrid)

Python 7 2 Updated Mar 16, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

zhangwei hong williamd4112

Achievements

Achievements

Block or report williamd4112

Stars

iamxjy / BOAD-SWE-Agent

facebookresearch / flowmm

DeLLMa / DeLLMa

Daniella1 / urdf_files_dataset

aspuru-guzik-group / curiosity

Toni-SM / skrl

yyzpiero / RL4RedTeam

yunqing-me / AttackVLM

Pythagora-io / pythagora

tenable / awesome-llm-cybersecurity-tools

apple / ml-uwac

Danial-Alh / fast-bleu

rll-research / cic

clvrai / furniture

gncs / molgym

brandontrabucco / design-bench

Improbable-AI / harness-offline-rl

Improbable-AI / eipo

tanelp / tiny-diffusion

yudasong / HyQ

MLforHealth / rl_representations

wenhao-gao / mol_opt

clinicalml / gumbel-max-scm

senya-ashukha / real-nvp-pytorch

alphaSeclab / awesome-reverse-engineering

damat-le / gym-simplegrid

rafelps / gym-simple-minigrid

mklissa / DAVF

mklissa / phi_gcn

acyclics / MPO