pinghsieh

Ping-Chun Hsieh pinghsieh

20 followers · 0 following

National Yang Ming Chiao Tung University
Hsinchu, Taiwan

Achievements

Highlights

Stars

NYCU-RL-Bandits-Lab / Plan2Align

This is the official implementation for the paper "Plan2Align: Predictive Planning Based Test-Time Preference Alignment in Paragraph-Level Machine Translation"

Python 3 Updated May 22, 2025

EmbersArc / gym-rocketlander

A SpaceX Rocket Lander environment for OpenAI gym using Box2D

Python 304 43 Updated Jan 19, 2021

sinannasir / Power-Control-asilomar

Asilomar 2020 code for Deep Actor-Critic Learning for Distributed Power Control in Wireless Mobile Networks

Python 42 14 Updated Jul 27, 2020

avisingh599 / imitation-dagger

[Reimplementation Ross et al 2011] An implementation of DAGGER using ConvNets for driving from pixels.

Python 84 22 Updated Feb 22, 2018

EmilienDupont / neural-processes

Pytorch implementation of Neural Processes for functions and images 🎆

Jupyter Notebook 236 47 Updated Feb 8, 2022

ns3-plc-module / plc

An ns-3 module for simulations of power line communication networks

Python 24 21 Updated Nov 16, 2022

qfettes / DeepRL-Tutorials

Contains high quality implementations of Deep Reinforcement Learning algorithms written in PyTorch

Jupyter Notebook 1,082 328 Updated May 19, 2021

IBM / ushiriki-policy-engine-library

User facing library for accessing the Ushiriki Policy Engine webservice API

Python 6 9 Updated Sep 17, 2025

ShangtongZhang / DeepRL

Modularized Implementation of Deep RL Algorithms in PyTorch

Python 3,394 695 Updated Apr 16, 2024

tkn-tub / ns3-gym

ns3-gym - The Playground for Reinforcement Learning in Networking Research

C++ 663 216 Updated Jun 17, 2025

yrlu / reinforcement_learning

Implementation of selected reinforcement learning algorithms in Tensorflow. A3C, DDPG, REINFORCE, DQN, etc.

Python 152 48 Updated May 28, 2023

yukezhu / tensorflow-reinforce

Implementations of Reinforcement Learning Models in Tensorflow

Python 487 135 Updated Oct 31, 2017

MorvanZhou / Reinforcement-learning-with-tensorflow

Simple Reinforcement learning tutorials, 莫烦Python 中文AI教学

Python 9,418 5,018 Updated Mar 31, 2024

ghliu / pytorch-ddpg

Implementation of the Deep Deterministic Policy Gradient (DDPG) using PyTorch

Python 626 159 Updated Aug 13, 2018

david-abel / simple_rl

A simple framework for experimenting with Reinforcement Learning in Python.

Python 327 108 Updated Feb 27, 2024

higgsfield / RL-Adventure

Pytorch Implementation of DQN / DDQN / Prioritized replay/ noisy networks/ distributional values/ Rainbow/ hierarchical RL

Jupyter Notebook 3,156 595 Updated Nov 4, 2021

Silvicek / distributional-dqn

Implementation of 'A Distributional Perspective on Reinforcement Learning' and 'Distributional Reinforcement Learning with Quantile Regression' based on OpenAi DQN baselines.

Python 133 27 Updated May 5, 2019

iosband / psrl_2013

TeX 9 2 Updated Jan 29, 2015

josejimenezluna / pyGPGO

Bayesian optimization for Python

Python 246 61 Updated Mar 2, 2022

hill-a / stable-baselines

Forked from openai/baselines

A fork of OpenAI Baselines, implementations of reinforcement learning algorithms

Python 4,320 727 Updated Sep 4, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Ping-Chun Hsieh pinghsieh

Achievements

Achievements

Highlights

Block or report pinghsieh

Stars

NYCU-RL-Bandits-Lab / Plan2Align

EmbersArc / gym-rocketlander

sinannasir / Power-Control-asilomar

avisingh599 / imitation-dagger

EmilienDupont / neural-processes

ns3-plc-module / plc

qfettes / DeepRL-Tutorials

IBM / ushiriki-policy-engine-library

ShangtongZhang / DeepRL

tkn-tub / ns3-gym

yrlu / reinforcement_learning

yukezhu / tensorflow-reinforce

MorvanZhou / Reinforcement-learning-with-tensorflow

ghliu / pytorch-ddpg

david-abel / simple_rl

higgsfield / RL-Adventure

Silvicek / distributional-dqn

iosband / psrl_2013

josejimenezluna / pyGPGO

hill-a / stable-baselines

sfujim / TD3

openai / baselines

pemami4911 / deep-rl

google-deepmind / neural-processes

HIPS / Spearmint

pmorenoz / HetMOGP

econtal / gp-optimization-matlab

ikostrikov / pytorch-a2c-ppo-acktr-gail

brendanator / atari-rl

google-deepmind / scalable_agent