Rearank is a listwise reasoning reranking agent powered by a large language model (LLM). It improves information retrieval by reasoning explicitly about search results before reordering them. Built on Qwen2.5-7B, Rearank achieves reranking performance comparable to GPT-4 while requiring only a small number of annotated samples for training.
Rearank stands out with several technical advancements:
- Reinforcement Learning: Leverages reinforcement learning to dramatically improve its reasoning capabilities for reranking.
- Superior Performance: Demonstrates significant improvements over existing baseline models in information retrieval tasks.
- Explicit Reasoning: Designed to reason explicitly about each passage when reranking (see the sketch below).
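To make this concrete, here is a hedged illustration of what a listwise response with an explicit reasoning trace might look like, together with a minimal parser. The response format and the `parse_ranking` helper are assumptions for illustration only; the repository's actual prompt and parsing logic live in its own modules.

```python
import re

# Hedged sketch: Rearank-style listwise output is assumed to be a reasoning
# trace followed by an ordering such as "[2] > [1] > [3]". parse_ranking is
# a hypothetical helper; the repository's own parsing may differ.
def parse_ranking(response: str) -> list[int]:
    """Return passage indices in ranked order, e.g. '[2] > [1]' -> [2, 1]."""
    return [int(m) for m in re.findall(r"\[(\d+)\]", response)]

print(parse_ranking("<think>passage 2 answers the query directly ...</think> [2] > [1] > [3]"))
# -> [2, 1, 3]
```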
Getting started with Rearank is straightforward:
```bash
# Clone the repository
git clone https://github.com/yourusername/Rearank.git
cd Rearank

# Install dependencies
pip install -r requirements.txt
```

For the BEIR and TREC-DL datasets, Pyserini will automatically download the required files. For the Bright dataset, you'll need to manually download and extract the files (including qrels, queries, and passages) using:

```bash
wget -O bright.tar.gz "https://huggingface.co/datasets/le723z/bright_tar/resolve/main/bright.tar.gz?download=true" && tar -xvf bright.tar.gz -C data/ && rm bright.tar.gz
```
Here's how you can use Rearank for your reranking tasks:
```python
import os

from rank_gpt import process_rank_results_in_batches, bm25_retrieve
from utils import get_hits_from_run_bright
from agent import get_agent
from trec_eval import eval_rerank

# Initialize the Rearank agent
agent = get_agent(model_name="le723z/Rearank-7B", api_key=None)
enable_thinking = True  # Set to True to emit explicit reasoning traces

# Example usage with different datasets. BRIGHT is assumed to be a
# predefined collection of BRIGHT dataset names that need special handling.
for data in ['dl19']:  # You can iterate through multiple datasets, e.g. 'dl19', 'bright', etc.
    if data in BRIGHT:
        bm25_results = get_hits_from_run_bright(os.getcwd(), data)
    else:
        bm25_results = bm25_retrieve(data, top_k_retrieve=100)

    # Evaluate the original BM25 results
    original_metrics, _ = eval_rerank(data, bm25_results)
    print(f"Original BM25 metrics for {data}: {original_metrics}")

    # Rerank the results with Rearank
    rerank_results = process_rank_results_in_batches(
        agent,
        bm25_results,
        batch_size=16,    # number of queries to process in parallel
        window_size=20,   # size of the reranking window
        step=10,          # step size of the sliding window
        enable_thinking=enable_thinking,
    )

    # Evaluate the reranked results
    rerank_metrics, _ = eval_rerank(data, rerank_results)
    print(f"Rearank metrics for {data}: {rerank_metrics}")
```
```bash
# Evaluate on TREC-DL 19, 20, and BEIR datasets
python run_evaluation.py --model_name le723z/Rearank-7B --skip_existing --standard --enable_thinking --log_name cotprompt
```

Notes:
- `--standard` evaluates on the TREC-DL 19, 20, and BEIR datasets, which requires Pyserini.
- `--bright` evaluates on the BRIGHT dataset, which needs to be downloaded manually (see above).

To train Rearank, follow the steps below.
Before starting, ensure all dependencies are installed as described in the VERL repository. Then run:

```bash
cd verl/
bash examples/grpo_trainer/deeprerank.sh
```
The core of Rearank's training is its custom reward function, implemented in listwiserank.py. This function is invoked during each rollout by the naive reward manager in batch.py.
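For intuition, here is a minimal sketch of an NDCG-style listwise ranking reward, assuming the reward scores the permutation the model emits against graded relevance labels from the qrels. This is an illustration only; `ndcg_reward` and `relevance_by_passage` are hypothetical names, and the actual reward logic is in listwiserank.py.

```python
import math

# Hedged sketch of an NDCG-style listwise reward (hypothetical names; the
# actual reward is implemented in listwiserank.py).
def dcg(relevances):
    """Discounted cumulative gain of a list of graded relevance labels."""
    return sum(rel / math.log2(rank + 2) for rank, rel in enumerate(relevances))

def ndcg_reward(predicted_order, relevance_by_passage, k=10):
    """NDCG@k of the permutation the model emitted.

    predicted_order: passage ids in the order the model ranked them.
    relevance_by_passage: graded relevance labels, e.g. parsed from qrels.
    """
    gains = [relevance_by_passage.get(pid, 0) for pid in predicted_order[:k]]
    ideal = sorted(relevance_by_passage.values(), reverse=True)[:k]
    ideal_dcg = dcg(ideal)
    return dcg(gains) / ideal_dcg if ideal_dcg > 0 else 0.0
```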
If Rearank proves useful in your research, please consider citing our paper:
```bibtex
@misc{zhang2025rearankreasoningrerankingagent,
  title={REARANK: Reasoning Re-ranking Agent via Reinforcement Learning},
  author={Le Zhang and Bo Wang and Xipeng Qiu and Siva Reddy and Aishwarya Agrawal},
  year={2025},
  eprint={2505.20046},
  archivePrefix={arXiv},
  primaryClass={cs.IR},
  url={https://arxiv.org/abs/2505.20046},
}
```

This repository builds on the great work of Pyserini and RankGPT.
For any questions or feedback, feel free to reach out directly to [email protected].