Thanks to visit codestin.com
Credit goes to github.com

Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
18 changes: 10 additions & 8 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -20,13 +20,13 @@ Anserini is packaged in a self-contained fatjar, which also provides the simples
Assuming you've already got Java 21 installed (Yes, you need _exactly_ this version), fetch the fatjar:

```bash
wget https://repo1.maven.org/maven2/io/anserini/anserini/1.1.1/anserini-1.1.1-fatjar.jar
wget https://repo1.maven.org/maven2/io/anserini/anserini/1.2.0/anserini-1.2.0-fatjar.jar
```

Do a BM25 run on the venerable MS MARCO passage corpus using the dev queries:

```bash
java -cp anserini-1.1.1-fatjar.jar io.anserini.search.SearchCollection \
java -cp anserini-1.2.0-fatjar.jar io.anserini.search.SearchCollection \
-index msmarco-v1-passage \
-topics msmarco-v1-passage.dev \
-output run.msmarco-v1-passage.dev.bm25.txt \
Expand All @@ -36,7 +36,7 @@ java -cp anserini-1.1.1-fatjar.jar io.anserini.search.SearchCollection \
To evaluate:

```bash
java -cp anserini-1.1.1-fatjar.jar trec_eval -c -M 10 -m recip_rank msmarco-v1-passage.dev \
java -cp anserini-1.2.0-fatjar.jar trec_eval -c -M 10 -m recip_rank msmarco-v1-passage.dev \
run.msmarco-v1-passage.dev.bm25.txt
```

Expand All @@ -50,7 +50,7 @@ Anserini's got you covered.
For example, same as above (MS MARCO passage, dev queries) using the BGE model (en, v1.5):

```bash
java -cp anserini-1.1.1-fatjar.jar io.anserini.search.SearchHnswDenseVectors \
java -cp anserini-1.2.0-fatjar.jar io.anserini.search.SearchHnswDenseVectors \
-index msmarco-v1-passage.bge-base-en-v1.5.hnsw \
-topics msmarco-v1-passage.dev \
-encoder BgeBaseEn15 \
Expand All @@ -61,7 +61,7 @@ java -cp anserini-1.1.1-fatjar.jar io.anserini.search.SearchHnswDenseVectors \
To evaluate:

```bash
java -cp anserini-1.1.1-fatjar.jar trec_eval -c -M 10 -m recip_rank msmarco-v1-passage.dev \
java -cp anserini-1.2.0-fatjar.jar trec_eval -c -M 10 -m recip_rank msmarco-v1-passage.dev \
run.msmarco-v1-passage.dev.bge.txt
```

Expand All @@ -77,7 +77,7 @@ Anserini's also got you covered.
For example, same as above (MS MARCO passage, dev queries) using SPLADE++ EnsembleDistil:

```bash
java -cp anserini-1.1.1-fatjar.jar io.anserini.search.SearchCollection \
java -cp anserini-1.2.0-fatjar.jar io.anserini.search.SearchCollection \
-index msmarco-v1-passage.splade-pp-ed \
-topics msmarco-v1-passage.dev \
-encoder SpladePlusPlusEnsembleDistil \
Expand All @@ -88,22 +88,23 @@ java -cp anserini-1.1.1-fatjar.jar io.anserini.search.SearchCollection \
To evaluate:

```bash
java -cp anserini-1.1.1-fatjar.jar trec_eval -c -M 10 -m recip_rank msmarco-v1-passage.dev \
java -cp anserini-1.2.0-fatjar.jar trec_eval -c -M 10 -m recip_rank msmarco-v1-passage.dev \
run.msmarco-v1-passage.dev.splade-pp-ed.txt
```

You should get an MRR (`recip_rank`) of 0.3828.

</details>

See [detailed instructions](docs/fatjar-regressions/fatjar-regressions-v1.1.1.md) for the current fatjar release of Anserini (v1.1.1) to reproduce regression experiments on the MS MARCO V2.1 corpora for TREC 2024 RAG, on MS MARCO V1 Passage, and on BEIR, all directly from the fatjar!
See [detailed instructions](docs/fatjar-regressions/fatjar-regressions-v1.2.0.md) for the current fatjar release of Anserini (v1.2.0) to reproduce regression experiments on the MS MARCO V2.1 corpora for TREC 2024 RAG, on MS MARCO V1 Passage, and on BEIR, all directly from the fatjar!

❗ Beware, Anserini ships with many prebuilt indexes, which are automatically downloaded upon request: these indexes can take up a lot of space.
See [this guide on prebuilt indexes](docs/prebuilt-indexes.md) for more details.

<details>
<summary>Older instructions</summary>

+ [Anserini v1.1.1](docs/fatjar-regressions/fatjar-regressions-v1.1.1.md)
+ [Anserini v1.1.0](docs/fatjar-regressions/fatjar-regressions-v1.1.0.md)
+ [Anserini v1.0.0](docs/fatjar-regressions/fatjar-regressions-v1.0.0.md)
+ [Anserini v0.39.0](docs/fatjar-regressions/fatjar-regressions-v0.39.0.md)
Expand Down Expand Up @@ -673,6 +674,7 @@ Beyond that, there are always [open issues](https://github.com/castorini/anserin

## 📜️ Release History

+ v1.2.0: August 12, 2025 [[Release Notes](docs/release-notes/release-notes-v1.2.0.md)]
+ v1.1.1: July 1, 2025 [[Release Notes](docs/release-notes/release-notes-v1.1.1.md)]
+ v1.1.0: July 1, 2025 [[Release Notes](docs/release-notes/release-notes-v1.1.0.md)] [[Known Issues](docs/known-issues/known-issues-v1.1.0.md)]
+ v1.0.0: April 25, 2025 [[Release Notes](docs/release-notes/release-notes-v1.0.0.md)]
Expand Down
8 changes: 7 additions & 1 deletion docs/regressions-log.md
Original file line number Diff line number Diff line change
Expand Up @@ -3,11 +3,17 @@
The following change log details commits to regression tests that alter effectiveness and the addition (or removal) of new regression tests.
This documentation is useful for figuring why results may have changed over time.

### July 9, 2025

+ commit [`35ab54`](https://github.com/castorini/anserini/commit/35ab54df4618f167c62d2252ef97b7a3510a4bf8)

Added SPLADE-v3 regressions for MS MARCO v2.1.

### July 5, 2025

+ commit [`c42dfa`](https://github.com/castorini/anserini/commit/c42dfa41ab600e32b1a39f09d4b4d087ef27954b)

Add RAG 24 official NIST qrels + associated regressions.
Added RAG 24 official NIST qrels + associated regressions.

### June 23, 2025

Expand Down
90 changes: 90 additions & 0 deletions docs/release-notes/release-notes-v1.2.0.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,90 @@
# Anserini Release Notes (v1.2.0)

+ **Release date:** August 12, 2025
+ **Lucene version:** Lucene 9.9.1

## Summary of Changes

+ Added SPLADE-v3 regressions for MS MARCO v2.1.
+ Added BRIGHT regressions.
+ Added TREC 2024 RAG official qrels and associated regressions.
+ Added TREC 2025 RAG topics.
+ Added `-dryRun` and `-printCommands` to `io.anserini.reproduce.RunRepro`.
+ Added auto-generated `prebuilt-indexes.md`.
+ Updated prebuilt index pointers to Hugging Face as appropriate.
+ Replaced `ThreadPoolExecutor` with `newWorkStealingPool`.

## Contributors (This Release)

Sorted by number of commits:

+ Lily Ge ([lilyjge](https://github.com/lilyjge))
+ Jimmy Lin ([lintool](https://github.com/lintool))
+ Daniel Guo ([clides](https://github.com/clides))
+ Adefioye ([Adefioye](https://github.com/Adefioye))
+ Edward Huang ([ed-ward-huang](https://github.com/ed-ward-huang))
+ hari495 ([hari495](https://github.com/hari495))
+ MindlessTruffle ([MindlessTruffle](https://github.com/MindlessTruffle))
+ Niruhan Viswarupan ([niruhan](https://github.com/niruhan))
+ OmarKhaled0K ([OmarKhaled0K](https://github.com/OmarKhaled0K))
+ Shivani Upadhyay ([UShivani3](https://github.com/UShivani3))
+ Steven Gu ([br0mabs](https://github.com/br0mabs))
+ Suraj Subrahmanyan ([suraj-subrahmanyan](https://github.com/suraj-subrahmanyan))

## All Contributors

All contributors with five or more commits, sorted by number of commits, [according to GitHub](https://github.com/castorini/Anserini/graphs/contributors):

+ Jimmy Lin ([lintool](https://github.com/lintool))
+ Peilin Yang ([Peilin-Yang](https://github.com/Peilin-Yang))
+ Lily Ge ([lilyjge](https://github.com/lilyjge))
+ Ogundepo Odunayo ([ToluClassics](https://github.com/ToluClassics))
+ Arthur Chen ([ArthurChen189](https://github.com/ArthurChen189))
+ Xueguang Ma ([MXueguang](https://github.com/MXueguang))
+ Ahmet Arslan ([iorixxx](https://github.com/iorixxx))
+ Tommaso Teofili ([tteofili](https://github.com/tteofili))
+ Edwin Zhang ([edwinzhng](https://github.com/edwinzhng))
+ Rodrigo Nogueira ([rodrigonogueira4](https://github.com/rodrigonogueira4))
+ Jheng-Hong Yang ([justram](https://github.com/justram))
+ Emily Wang ([emmileaf](https://github.com/emmileaf))
+ Royal Sequiera ([rosequ](https://github.com/rosequ))
+ Eric Zhang ([16BitNarwhal](https://github.com/16BitNarwhal))
+ Yuqi Liu ([yuki617](https://github.com/yuki617))
+ Victor Yang ([Victor0118](https://github.com/Victor0118))
+ Vincent ([vincent-4](https://github.com/vincent-4))
+ Chris Kamphuis ([Chriskamphuis](https://github.com/Chriskamphuis))
+ Boris Lin ([borislin](https://github.com/borislin))
+ Nikhil Gupta ([nikhilro](https://github.com/nikhilro))
+ Jasper Xian ([jasper-xian](https://github.com/jasper-xian))
+ Ronak Pradeep ([ronakice](https://github.com/ronakice))
+ Yuhao Xie ([Kytabyte](https://github.com/Kytabyte))
+ Shane Ding ([shaneding](https://github.com/shaneding))
+ Stephanie Hu ([stephaniewhoo](https://github.com/stephaniewhoo))
+ Daniel Guo ([clides](https://github.com/clides))
+ Kuang Lu ([lukuang](https://github.com/lukuang))
+ Xinyu (Crystina) Zhang ([crystina-z](https://github.com/crystina-z))
+ Mofe Adeyemi ([Mofetoluwa](https://github.com/Mofetoluwa))
+ Adam Yang ([adamyy](https://github.com/adamyy))
+ Joel Mackenzie ([JMMackenzie](https://github.com/JMMackenzie))
+ Salman Mohammed ([salman1993](https://github.com/salman1993))
+ Xinyu Mavis Liu ([x389liu](https://github.com/x389liu))
+ Luchen Tan ([LuchenTan](https://github.com/LuchenTan))
+ Shivani Upadhyay ([UShivani3](https://github.com/UShivani3))
+ Manveer Tamber ([manveertamber](https://github.com/manveertamber))
+ Johnson Han ([x65han](https://github.com/x65han))
+ Hang Cui ([HangCui0510](https://github.com/HangCui0510))
+ Akintunde Oladipo ([theyorubayesian](https://github.com/theyorubayesian))
+ Matt Yang ([matthewyryang](https://github.com/matthewyryang))
+ Kelvin Jiang ([kelvin-jiang](https://github.com/kelvin-jiang))
+ Zhiying Jiang ([bazingagin](https://github.com/bazingagin))
+ Brayden Zhong ([b8zhong](https://github.com/b8zhong))
+ Aileen Lin ([AileenLin](https://github.com/AileenLin))
+ Michael Tu ([tuzhucheng](https://github.com/tuzhucheng))
+ Dayang Shi ([dyshi](https://github.com/dyshi))
+ Yuqing Xie ([amyxie361](https://github.com/amyxie361))
+ Ryan Clancy ([ryan-clancy](https://github.com/ryan-clancy))
+ Vivek Alamuri ([valamuri2020](https://github.com/valamuri2020))
+ Peng Shi ([Impavidity](https://github.com/Impavidity))
+ Steven Chen ([wu-ming233](https://github.com/wu-ming233))
+ Nandan Thakur ([thakur-nandan](https://github.com/thakur-nandan))
+ Zeynep Akkalyoncu Yilmaz ([zeynepakkalyoncu](https://github.com/zeynepakkalyoncu))