Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View zhouliang-yu's full-sized avatar
:octocat:
Focusing
:octocat:
Focusing

Highlights

  • Pro

Organizations

@Sphere-AI-Lab

Block or report zhouliang-yu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
zhouliang-yu/README.md

👋 Hi there! I'm Zhouliang (郁昼亮), an PhD student at the Scalable Principles for Learning and Reasoning Lab (SphereLab) of the Chinese University of Hong Kong, in the Computer Science & Engineering department, advised by Prof. Weiyang Liu, working on reinforcement learning for formal reasoning.

Previously, I spent a wonderful year at HKGAI, HKUST, as a PhD student advised by Prof. Yike Guo. Before that, I received my bachelor's degree from the Chinese University of Hong Kong, Shenzhen.

🎯 Research Focus

During the long-term future (maybe 2024-2027), I will be (almost) entirely focused on exploration-based reinforcement learning for formal mathematics reasoning via (agentic) large language models. Despite not being my major research focus, I am actively learning RL infrastructure to support Large Model Training.

🌍 Other broader interests include their applications in model-based embodied AI and scientific discovery via formal verification (e.g., Scientist AI, PhysLean, however, I have not yet published in this domain).

📧 Contact Me:My Email

Popular repositories Loading

  1. csc3002 csc3002 Public

    course code of csc3002

    C++ 1

  2. zhouliang-yu zhouliang-yu Public

    Config files for my GitHub profile.

  3. operatSystem operatSystem Public

    This repo contains my project code in csc3150

    C

  4. DC3 DC3 Public

    Forked from locuslab/DC3

    DC3: A Learning Method for Optimization with Hard Constraints

    Python

  5. MIPS-assembler MIPS-assembler Public

    C++

  6. Mips-Simulator Mips-Simulator Public

    C++