Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View yingjiahao14's full-sized avatar
๐Ÿ’ญ
I may be slow to respond.
๐Ÿ’ญ
I may be slow to respond.

Block or report yingjiahao14

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please donโ€™t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse
yingjiahao14/README.md

Jiahao Ying

Ph.D. Candidate @ Singapore Management University

Researching robust, fair, effective, and generalizable methods for Automated Evaluation and Improvement of LLMs.

Profile Views


๐Ÿ‘‹ About Me

  • Third-year Ph.D. candidate at Singapore Management University, advised by Yixin Cao & Qianru Sun.
  • I focus on LLM Evaluation and LLM Improvement .

๐Ÿ”ฌ Research Interests

  • LLM Evaluation

    • Automated Evaluation Data Generation
    • Generalizable Evaluation
    • Reliable Evaluator Development
  • LLM Improvement

    • Adaptive Learning Strategies

๐Ÿ”— Useful Links


๐Ÿ“ซ Contact

If youโ€™re interested in LLM evaluation/improvement or potential collaborations, feel free to reach out.


Last updated: Nov 11, 2025.

Pinned Loading

  1. Automating-DatasetUpdates Automating-DatasetUpdates Public

    github for the paper "Have Seen Me Before? Automating Dataset Updates Towards Reliable and Timely Evaluation"

    HTML 10

  2. KRE KRE Public

    dataset for the paper

    Python 7

  3. Dual-Eval Dual-Eval Public

    Repository for the paper "Disentangling Language Medium and Cultural Context for Evaluating Multilingual Large Language Models"

    Python 2

  4. ALEX-nlp/MUI-Eval ALEX-nlp/MUI-Eval Public

    Repository for the paper: Revisiting LLM Evaluation through Mechanism Interpretability: a New Metric and Model Utility Law

    Python 12 2