This repository contains the official implementation for the AAAI25 paper "From Words to Worth: Newborn Article Impact Prediction with LLM".


ssocean/NAIP

NAIP Logo

Framework for Newborn Article Impact Prediction & Quality Estimation.

  📊 NAIP-v1-weights   |   📈 NAIP-v2-weights   |   🤗 Hugging Face Demo  
   📑 v1 Homepage    |   📑 v2 Homepage

Overview

The NAIP series leverages large language models (LLMs) to efficiently assess the potential impact and quality of research articles through analysis of their intrinsic content. NAIP-v1 focuses on regressing a field- and time-normalized score (TNCSIsp) as a quantitative indicator of scientific impact, while NAIP-v2 aims to model human preferences in the peer-review process by learning from pairwise review data.
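As a minimal illustration of what a bounded impact score looks like (this is our sketch of the squashing step only, not the authors' actual architecture, which feeds the title and abstract through an LLM with a regression head):

```python
import math

def bounded_impact_score(raw_score: float) -> float:
    """Map an unbounded regression output into the (0, 1) range.

    Illustrative only: the real NAIP-v1 pipeline produces the raw
    score from an LLM regression head; here we just show how any
    scalar output can be kept within the 0-1 score range.
    """
    return 1.0 / (1.0 + math.exp(-raw_score))

print(bounded_impact_score(0.0))  # → 0.5
```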

NAIP Framework Overview

| Version | Input            | Output                  | Model Weights | Homepage | Paper     |
|---------|------------------|-------------------------|---------------|----------|-----------|
| v1      | Title & Abstract | Impact Estimation (0–1) | Link          | Link     | AAAI 2025 |
| v2      | Title & Abstract | Quality Estimation      | Link          | Link     | arXiv     |

🚀 Update Log

  • 250930 – Introducing NAIPv2: extending the series with an emphasis on quality estimation.
  • 241210 – The paper has been accepted by AAAI 2025!
  • 241204 – Hugging Face Spaces support 🥰
    • We've set up an online demo on Hugging Face Spaces; now you can easily give it a try without writing a single line of code!
  • 241126 – V1.0: We're thrilled to announce the end of Early Access and the official release of V1.0! ✨
    • The codebase is now more organized and easier to navigate! 🧹
    • Updated and streamlined README with detailed instructions for setup and usage. 💡
    • Decoupled the dataset, added more LoRA adapter weight download links, and more! 🔄
    • Known issue: the functionality for building the NAID dataset has not been tested on other machines, which may lead to problems. We plan to replace it with a more powerful framework in another codebase.
  • 240808 – Early Access
    • We have released the Early Access version of our code!

Quick Deployment (for most researchers)

First, clone the repository and run the following commands in the console:

git clone https://github.com/ssocean/NAIP.git
cd NAIP
pip install -r requirements.txt
  • To try v1, please use demo_v1.py.
  • To try v2, please use demo_v2.py.
  • You may need to download the corresponding model weights.
  • When providing the title and abstract, please avoid line breaks, LaTeX symbols, or other special formatting.
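The last point above can be automated. A minimal sketch (the function name is ours, not part of the NAIP codebase) that flattens line breaks and strips common LaTeX markup before passing a title or abstract to the demos:

```python
import re

def sanitize_field(text: str) -> str:
    """Flatten line breaks and strip common LaTeX markup from a title
    or abstract before feeding it to the NAIP demo scripts.

    Hypothetical helper, not part of the NAIP repository.
    """
    text = text.replace("\n", " ").replace("\r", " ")      # no line breaks
    text = re.sub(r"\$[^$]*\$", "", text)                  # drop inline math
    text = re.sub(r"\\[a-zA-Z]+\{([^}]*)\}", r"\1", text)  # \emph{x} -> x
    text = re.sub(r"\\[a-zA-Z]+", "", text)                # bare commands
    return re.sub(r"\s+", " ", text).strip()               # collapse spaces

print(sanitize_field("A study of\n\\emph{impact} with $O(n)$ cost"))
# → A study of impact with cost
```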

Note
The inference logic of v1 and v2 is identical. They are separated only to demonstrate two different loading methods:

  • demo_v1.py shows how to load the full model weights.
  • demo_v2.py shows how to load the LoRA adapter on top of the base model.

How to Reproduce

NAIPv1

Prepare a train.sh script like the one below to fine-tune NAIPv1:

DATA_PATH="NAIP/v1_resource/NAIDv1/NAID_train_extrainfo.csv"
TEST_DATA_PATH="NAIP/v1_resource/NAIDv1/NAID_test_extrainfo.csv"

OMP_NUM_THREADS=1 accelerate launch NAIP/v1_resource/v1_finetune.py \
    --total_epochs 5 \
    --learning_rate 1e-4 \
    --data_path $DATA_PATH \
    --test_data_path $TEST_DATA_PATH \
    --runs_dir official_runs/LLAMA3 \
    --checkpoint  path_to_huggingface_LLaMA3

Run it with sh train.sh. Similarly, prepare test.sh as below for evaluation:

python NAIP/v1_resource/v1_test.py \
 --data_path NAIP/NAID/NAID_test_extrainfo.csv \
 --weight_dir path_to_runs_dir

Then, run sh test.sh.

NAIPv2

Check NAIP/v2_resource/shell/fine-tune.sh and modify it according to your setup.

🛠️ Free Support for Academic Use

To ensure that research comparisons with NAIP are carried out under consistent and reproducible conditions, we provide free technical assistance for researchers who may encounter challenges in environment setup or code reproduction.

You may send a .jsonl file containing the "title" and "abstract" fields, and we will return the corresponding prediction results.

A .jsonl template is provided at ./assets/free_inference_template.jsonl.

  • In urgent cases, results can usually be provided within one day.

  • This support is intended solely to facilitate rigorous and reproducible evaluation within the research community and is not available for commercial use or requests.

  • 📩 Contact: [[email protected]]
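For reference, a request file with the two required fields can be produced in a few lines (the output file name and the example records below are ours, not from the repository):

```python
import json

# Hypothetical example records; only "title" and "abstract" are required.
papers = [
    {"title": "From Words to Worth",
     "abstract": "We predict the impact of newborn articles with LLMs."},
    {"title": "NAIPv2",
     "abstract": "Debiased pairwise learning for paper quality estimation."},
]

# Write one JSON object per line (.jsonl format).
with open("free_inference_request.jsonl", "w", encoding="utf-8") as f:
    for paper in papers:
        f.write(json.dumps(paper, ensure_ascii=False) + "\n")
```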

📚 Citation

If you find this work useful, please cite:

@inproceedings{zhao2025NAIPv1,
  title={From Words to Worth: Newborn Article Impact Prediction with LLM},
  author={Zhao, Penghai and Xing, Qinghua and Dou, Kairan and Tian, Jinyu and Tai, Ying and Yang, Jian and Cheng, Ming-Ming and Li, Xiang},
  booktitle={Proceedings of the AAAI Conference on Artificial Intelligence},
  volume={39},
  number={1},
  pages={1183--1191},
  year={2025}
}

@article{zhao2025NAIPv2,
  title={NAIPv2: Debiased Pairwise Learning for Efficient Paper Quality Estimation},
  author={Zhao, Penghai and Tian, Jinyu and Xing, Qinghua and Zhang, Xin and Li, Zheng and Qian, Jianjun and Cheng, Ming-Ming and Li, Xiang},
  journal={arXiv preprint arXiv:2509.25179},
  year={2025}
}
