Stars
Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)
Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
Implementation of benchmark RL algorithms
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
Implementations of IQL, QMIX, VDN, COMA, QTRAN, MAVEN, CommNet, DyMA-CL, and G2ANet on SMAC, the decentralised micromanagement scenario of StarCraft II
PyTorch implementation of the MARL algorithm MADDPG, which corresponds to the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments".
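Data and open-source code files for the Power System Technology (电网技术) paper "Robust Pricing Strategy for Electricity Retailers Considering Real-Time Market Linkage" (DWJS21-2157)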