I am α and w, the learning rate and the weight parameters
A machine for turning coffee into buggy code
Pinned Loading
-
Which-Experiences-Are-Influential-for-RL-Agents
Which-Experiences-Are-Influential-for-RL-Agents PublicSource files to replicate experiments in my RLC 2025 paper.
Python 7
-
Dropout-Q-Functions-for-Doubly-Efficient-Reinforcement-Learning
Dropout-Q-Functions-for-Doubly-Efficient-Reinforcement-Learning PublicSource files to replicate experiments in my ICLR 2022 paper.
-
Learning-Robust-Options-by-Conditional-Value-at-Risk-Optimization
Learning-Robust-Options-by-Conditional-Value-at-Risk-Optimization PublicSource files to replicate experiments in my NeurIPS 2019 paper.
-
Multi-Agent-Reinforcement-Learning-in-Stochastic-Games
Multi-Agent-Reinforcement-Learning-in-Stochastic-Games PublicUnofficial PyBrain extension for multi-agent reinforcement learning in general sum stochastic games.
-
Dialogue-State-Tracking-using-LSTM
Dialogue-State-Tracking-using-LSTM PublicSource files to replicate experiments in my IWSDS 2016 paper.
-
Mujoco-Wasm-Playground
Mujoco-Wasm-Playground PublicMy MuJoCo WASM playground for educational purposes
JavaScript
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.