Mingqi Gao kite99520

Master's student at Peking University; B.S. in Computer Science from Peking University

21 followers · 13 following

Peking University
Beijing

Stars

chinasaokolo / csGraduateFellowships

Forked from chinasatokolo/csGraduateFellowships

A curated list of fellowships for graduate students in Computer Science and related fields.

80 4 Updated Aug 11, 2025

lihualei71 / transferUQ

An R package to quantify uncertainty for transfer errors

R 3 Updated May 20, 2023

aangelopoulos / ppi_py

A package for statistically rigorous scientific discovery using machine learning. Implements prediction-powered inference.

Python 269 31 Updated Sep 5, 2025

valeman / awesome-conformal-prediction

A professionally curated list of awesome Conformal Prediction videos, tutorials, books, papers, PhD and MSc theses, articles and open-source libraries.

1,101 92 Updated Dec 14, 2025

aangelopoulos / conformal-prediction

Lightweight, useful implementation of conformal prediction on real data.

Jupyter Notebook 995 114 Updated Nov 14, 2025

chinasatokolo / csGraduateFellowships

A curated list of fellowships for graduate students in Computer Science and related fields.

794 73 Updated Oct 22, 2025

kite99520 / NLGCorrEval

Python 1 Updated Jun 1, 2025

PKU-ONELab / LLM-evaluator-reliability

The official repository for our ACL 2024 paper: Are LLM-based Evaluators Confusing NLG Quality Criteria?

Python 8 1 Updated Feb 23, 2025

SakanaAI / AI-Scientist

The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬

Jupyter Notebook 11,852 1,732 Updated Dec 19, 2025

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 65,847 12,099 Updated Dec 21, 2025

lmarena / arena-hard-auto

Arena-Hard-Auto: An automatic LLM benchmark.

Python 972 137 Updated Jun 21, 2025

tatsu-lab / alpaca_farm

A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.

Python 839 63 Updated Jul 1, 2024

PKU-ONELab / Themis

The official repository for our EMNLP 2024 paper, Themis: A Reference-free NLG Evaluation Language Model with Flexibility and Interpretability.

Python 21 1 Updated Feb 23, 2025

statsmodels / statsmodels

Statsmodels: statistical modeling and econometrics in Python

Python 11,155 3,309 Updated Dec 17, 2025

google-research / mt-metrics-eval

Tools for evaluating the performance of MT metrics on data from recent WMT metrics shared tasks.

Python 123 26 Updated Oct 13, 2025

pln-fing-udelar / fast-krippendorff

Fast computation of Krippendorff's alpha agreement measure in Python.

Python 153 17 Updated Dec 1, 2025

ur-whitelab / chemcrow-public

Chemcrow

Python 856 134 Updated Dec 19, 2024

xlang-ai / OpenAgents

[COLM 2024] OpenAgents: An Open Platform for Language Agents in the Wild

Python 4,648 512 Updated Nov 18, 2024

sympy / sympy

A computer algebra system written in pure Python

Python 14,201 4,950 Updated Dec 20, 2025

Yale-LILY / ROSE

Python 39 2 Updated Jun 7, 2023

google-research-datasets / seahorse

Seahorse is a dataset for multilingual, multi-faceted summarization evaluation. It consists of 96K summaries with human ratings along 6 quality dimensions: comprehensibility, repetition, grammar, a…

89 13 Updated Feb 27, 2024

meta-llama / llama-cookbook

Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama mode…

Jupyter Notebook 18,096 2,671 Updated Nov 3, 2025

shmsw25 / FActScore

A package to evaluate factuality of long-form generation. Original implementation of our EMNLP 2023 paper "FActScore: Fine-grained Atomic Evaluation of Factual Precision in Long Form Text Generation"

Python 411 60 Updated Apr 13, 2025

yuh-zha / AlignScore

ACL2023 - AlignScore, a metric for factual consistency evaluation.

Python 147 27 Updated Mar 11, 2024

kite99520 / DialSummFactCorr

Resources for paper "Reference Matters: Benchmarking Factual Error Correction for Dialogue Summarization with Fine-grained Evaluation Framework"

Macaulay2 3 Updated Dec 4, 2023

tatsu-lab / alpaca_eval

An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.

Jupyter Notebook 1,925 295 Updated Aug 9, 2025

Hannibal046 / Awesome-LLM

Awesome-LLM: a curated list of Large Language Models

25,837 2,222 Updated Jul 31, 2025

kite99520 / Fact_CLS

2 Updated May 18, 2023

Significant-Gravitas / AutoGPT

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Python 180,406 46,188 Updated Dec 20, 2025

ranaroussi / yfinance

Download market data from Yahoo! Finance's API

Python 20,227 2,945 Updated Dec 20, 2025
