TakuyaHiraoka
🅰️
I am α and w, the learning rate and the weight parameters

Pinned

  1. Which-Experiences-Are-Influential-for-RL-Agents Public

    Source files to replicate experiments in my RLC 2025 paper.

    Python 7

  2. Dropout-Q-Functions-for-Doubly-Efficient-Reinforcement-Learning Public

    Source files to replicate experiments in my ICLR 2022 paper.

    Python 71 4

  3. Learning-Robust-Options-by-Conditional-Value-at-Risk-Optimization Public

    Source files to replicate experiments in my NeurIPS 2019 paper.

    Python 10 1

  4. Multi-Agent-Reinforcement-Learning-in-Stochastic-Games Public

    Unofficial PyBrain extension for multi-agent reinforcement learning in general-sum stochastic games.

    Python 69 25

  5. Dialogue-State-Tracking-using-LSTM Public

    Source files to replicate experiments in my IWSDS 2016 paper.

    Python 22 11

  6. Mujoco-Wasm-Playground Public

    My MuJoCo WASM playground for educational purposes.

    JavaScript