Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View pinghsieh's full-sized avatar
  • National Yang Ming Chiao Tung University
  • Hsinchu, Taiwan

Highlights

  • Pro

Block or report pinghsieh

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

This is the official implementation for the paper "Plan2Align: Predictive Planning Based Test-Time Preference Alignment in Paragraph-Level Machine Translation"

Python 3 Updated May 22, 2025

A SpaceX Rocket Lander environment for OpenAI gym using Box2D

Python 304 43 Updated Jan 19, 2021

Asilomar 2020 code for Deep Actor-Critic Learning for Distributed Power Control in Wireless Mobile Networks

Python 42 14 Updated Jul 27, 2020

[Reimplementation Ross et al 2011] An implementation of DAGGER using ConvNets for driving from pixels.

Python 84 22 Updated Feb 22, 2018

Pytorch implementation of Neural Processes for functions and images 🎆

Jupyter Notebook 236 47 Updated Feb 8, 2022

An ns-3 module for simulations of power line communication networks

Python 24 21 Updated Nov 16, 2022

Contains high quality implementations of Deep Reinforcement Learning algorithms written in PyTorch

Jupyter Notebook 1,082 328 Updated May 19, 2021

User facing library for accessing the Ushiriki Policy Engine webservice API

Python 6 9 Updated Sep 17, 2025

Modularized Implementation of Deep RL Algorithms in PyTorch

Python 3,394 695 Updated Apr 16, 2024

ns3-gym - The Playground for Reinforcement Learning in Networking Research

C++ 663 216 Updated Jun 17, 2025

Implementation of selected reinforcement learning algorithms in Tensorflow. A3C, DDPG, REINFORCE, DQN, etc.

Python 152 48 Updated May 28, 2023

Implementations of Reinforcement Learning Models in Tensorflow

Python 487 135 Updated Oct 31, 2017

Simple Reinforcement learning tutorials, 莫烦Python 中文AI教学

Python 9,418 5,018 Updated Mar 31, 2024

Implementation of the Deep Deterministic Policy Gradient (DDPG) using PyTorch

Python 626 159 Updated Aug 13, 2018

A simple framework for experimenting with Reinforcement Learning in Python.

Python 327 108 Updated Feb 27, 2024

Pytorch Implementation of DQN / DDQN / Prioritized replay/ noisy networks/ distributional values/ Rainbow/ hierarchical RL

Jupyter Notebook 3,156 595 Updated Nov 4, 2021

Implementation of 'A Distributional Perspective on Reinforcement Learning' and 'Distributional Reinforcement Learning with Quantile Regression' based on OpenAi DQN baselines.

Python 133 27 Updated May 5, 2019
TeX 9 2 Updated Jan 29, 2015

Bayesian optimization for Python

Python 246 61 Updated Mar 2, 2022

A fork of OpenAI Baselines, implementations of reinforcement learning algorithms

Python 4,320 727 Updated Sep 4, 2022

Author's PyTorch implementation of TD3 for OpenAI gym tasks

Python 2,019 477 Updated Jul 14, 2023

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

Python 16,623 4,953 Updated Aug 1, 2024

Collection of Deep Reinforcement Learning algorithms

Python 300 191 Updated Mar 19, 2019

This repository contains notebook implementations of the following Neural Process variants: Conditional Neural Processes (CNPs), Neural Processes (NPs), Attentive Neural Processes (ANPs).

Jupyter Notebook 1,011 154 Updated Jan 19, 2021

Spearmint Bayesian optimization codebase

Python 1,562 329 Updated Dec 27, 2019

Heterogeneous Multi-output Gaussian Processes

Jupyter Notebook 54 17 Updated May 4, 2020

MATLAB implementation of my Bayesian Optimization algorithms

MATLAB 12 1 Updated Mar 17, 2018

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKT…

Python 3,873 843 Updated May 29, 2022

Atari - Deep Reinforcement Learning algorithms in TensorFlow

Python 139 35 Updated Mar 27, 2024

A TensorFlow implementation of Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures.

Python 1,017 162 Updated Mar 13, 2019
Next