Thanks to visit codestin.com
Credit goes to github.com

TakuyaHiraoka

Follow

🅰️

I am α and w, the learning rate and the weight parameters

Takuya Hiraoka TakuyaHiraoka

🅰️

I am α and w, the learning rate and the weight parameters

Follow

A machine for turning coffee into buggy code

48 followers · 19 following

Achievements

Achievements

Pinned Loading

Which-Experiences-Are-Influential-for-RL-Agents Which-Experiences-Are-Influential-for-RL-Agents Public

Source files to replicate experiments in my RLC 2025 paper.

Python 7
Dropout-Q-Functions-for-Doubly-Efficient-Reinforcement-Learning Dropout-Q-Functions-for-Doubly-Efficient-Reinforcement-Learning Public

Source files to replicate experiments in my ICLR 2022 paper.

Python 71 4
Learning-Robust-Options-by-Conditional-Value-at-Risk-Optimization Learning-Robust-Options-by-Conditional-Value-at-Risk-Optimization Public

Source files to replicate experiments in my NeurIPS 2019 paper.

Python 10 1
Multi-Agent-Reinforcement-Learning-in-Stochastic-Games Multi-Agent-Reinforcement-Learning-in-Stochastic-Games Public

Unofficial PyBrain extension for multi-agent reinforcement learning in general sum stochastic games.

Python 69 25
Dialogue-State-Tracking-using-LSTM Dialogue-State-Tracking-using-LSTM Public

Source files to replicate experiments in my IWSDS 2016 paper.

Python 22 11
Mujoco-Wasm-Playground Mujoco-Wasm-Playground Public

My MuJoCo WASM playground for educational purposes

JavaScript