18 changes: 10 additions & 8 deletions README.md
@@ -20,13 +20,13 @@ Anserini is packaged in a self-contained fatjar, which also provides the simples
Assuming you've already got Java 21 installed (yes, you need _exactly_ this version), fetch the fatjar:

```bash
wget https://repo1.maven.org/maven2/io/anserini/anserini/1.2.0/anserini-1.2.0-fatjar.jar
wget https://repo1.maven.org/maven2/io/anserini/anserini/1.2.1/anserini-1.2.1-fatjar.jar
```
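
Since the instructions call for exactly Java 21, a quick version check before running the fatjar can save a confusing failure later. The following is a minimal sketch, not part of Anserini: `java_major` is a hypothetical helper, and it assumes `java -version` prints a first line of the form `openjdk version "21.0.2" ...`, which is common but can vary across vendors.

```bash
# Hypothetical helper (not part of Anserini): extract the major version number
# from `java -version` style output, e.g. 'openjdk version "21.0.2" 2024-01-16'.
java_major() {
  printf '%s\n' "$1" | sed -n 's/.*version "\([0-9][0-9]*\).*/\1/p' | head -n 1
}

# Only warn; do not abort, since the output format is an assumption.
if command -v java >/dev/null 2>&1; then
  major="$(java_major "$(java -version 2>&1)")"
  if [ "$major" != "21" ]; then
    echo "Anserini expects Java 21, found: ${major:-unknown}" >&2
  fi
fi
```

The check only prints a warning, so it is safe to paste ahead of the commands that follow.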

Do a BM25 run on the venerable MS MARCO passage corpus using the dev queries:

```bash
java -cp anserini-1.2.0-fatjar.jar io.anserini.search.SearchCollection \
java -cp anserini-1.2.1-fatjar.jar io.anserini.search.SearchCollection \
-index msmarco-v1-passage \
-topics msmarco-v1-passage.dev \
-output run.msmarco-v1-passage.dev.bm25.txt \
@@ -36,7 +36,7 @@ java -cp anserini-1.2.0-fatjar.jar io.anserini.search.SearchCollection \
To evaluate:

```bash
java -cp anserini-1.2.0-fatjar.jar trec_eval -c -M 10 -m recip_rank msmarco-v1-passage.dev \
java -cp anserini-1.2.1-fatjar.jar trec_eval -c -M 10 -m recip_rank msmarco-v1-passage.dev \
run.msmarco-v1-passage.dev.bm25.txt
```

@@ -50,7 +50,7 @@ Anserini's got you covered.
For example, same as above (MS MARCO passage, dev queries) using the BGE model (en, v1.5):

```bash
java -cp anserini-1.2.0-fatjar.jar io.anserini.search.SearchHnswDenseVectors \
java -cp anserini-1.2.1-fatjar.jar io.anserini.search.SearchHnswDenseVectors \
-index msmarco-v1-passage.bge-base-en-v1.5.hnsw \
-topics msmarco-v1-passage.dev \
-encoder BgeBaseEn15 \
@@ -61,7 +61,7 @@ java -cp anserini-1.2.0-fatjar.jar io.anserini.search.SearchHnswDenseVectors \
To evaluate:

```bash
java -cp anserini-1.2.0-fatjar.jar trec_eval -c -M 10 -m recip_rank msmarco-v1-passage.dev \
java -cp anserini-1.2.1-fatjar.jar trec_eval -c -M 10 -m recip_rank msmarco-v1-passage.dev \
run.msmarco-v1-passage.dev.bge.txt
```

@@ -77,7 +77,7 @@ Anserini's also got you covered.
For example, same as above (MS MARCO passage, dev queries) using SPLADE-v3:

```bash
java -cp anserini-1.2.0-fatjar.jar io.anserini.search.SearchCollection \
java -cp anserini-1.2.1-fatjar.jar io.anserini.search.SearchCollection \
-index msmarco-v1-passage.splade-v3 \
-topics msmarco-v1-passage.dev \
-encoder SpladeV3 \
@@ -88,22 +88,23 @@ java -cp anserini-1.2.0-fatjar.jar io.anserini.search.SearchCollection \
To evaluate:

```bash
java -cp anserini-1.2.0-fatjar.jar trec_eval -c -M 10 -m recip_rank msmarco-v1-passage.dev \
java -cp anserini-1.2.1-fatjar.jar trec_eval -c -M 10 -m recip_rank msmarco-v1-passage.dev \
run.msmarco-v1-passage.dev.splade-v3.txt
```

You should get an MRR (`recip_rank`) of 0.4000.
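
To check a run against the expected score without reading the output by eye, the aggregate value can be extracted from the evaluation output. This is a minimal sketch, assuming the usual `trec_eval` layout of three whitespace-separated columns (metric, query id, value) with `all` marking the summary row. `metric_value` is a hypothetical helper, and the sample line is canned rather than real output.

```bash
# Hypothetical helper: print the aggregate value of one metric from
# trec_eval-style output ("<metric> <query-id> <value>", summary row uses "all").
metric_value() {
  awk -v m="$1" '$1 == m && $2 == "all" { print $3 }'
}

# Canned sample standing in for the output of the evaluation command above.
sample="$(printf 'recip_rank\tall\t0.4000')"
score="$(printf '%s\n' "$sample" | metric_value recip_rank)"

if [ "$score" = "0.4000" ]; then
  echo "MRR matches the expected value"
else
  echo "MRR differs: got ${score:-nothing}" >&2
fi
```

In practice the evaluation command's output would be piped into `metric_value recip_rank` in place of the canned sample.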

</details>

See [detailed instructions](docs/fatjar-regressions/fatjar-regressions-v1.2.0.md) for the current fatjar release of Anserini (v1.2.0) to reproduce regression experiments on the MS MARCO V2.1 corpora for TREC 2024 RAG, on MS MARCO V1 Passage, and on BEIR, all directly from the fatjar!
See [detailed instructions](docs/fatjar-regressions/fatjar-regressions-v1.2.1.md) for the current fatjar release of Anserini (v1.2.1) to reproduce regression experiments on the MS MARCO V2.1 corpora for TREC 2024 RAG, on MS MARCO V1 Passage, and on BEIR, all directly from the fatjar!

❗ Beware, Anserini ships with many prebuilt indexes, which are automatically downloaded upon request: these indexes can take up a lot of space.
See [this guide on prebuilt indexes](docs/prebuilt-indexes.md) for more details.

<details>
<summary>Older instructions</summary>

+ [Anserini v1.2.0](docs/fatjar-regressions/fatjar-regressions-v1.2.0.md)
+ [Anserini v1.1.1](docs/fatjar-regressions/fatjar-regressions-v1.1.1.md)
+ [Anserini v1.1.0](docs/fatjar-regressions/fatjar-regressions-v1.1.0.md)
+ [Anserini v1.0.0](docs/fatjar-regressions/fatjar-regressions-v1.0.0.md)
@@ -679,6 +680,7 @@ Beyond that, there are always [open issues](https://github.com/castorini/anserin

## 📜️ Release History

+ v1.2.1: August 20, 2025 [[Release Notes](docs/release-notes/release-notes-v1.2.1.md)]
+ v1.2.0: August 12, 2025 [[Release Notes](docs/release-notes/release-notes-v1.2.0.md)]
+ v1.1.1: July 1, 2025 [[Release Notes](docs/release-notes/release-notes-v1.1.1.md)]
+ v1.1.0: July 1, 2025 [[Release Notes](docs/release-notes/release-notes-v1.1.0.md)] [[Known Issues](docs/known-issues/known-issues-v1.1.0.md)]
@@ -1,15 +1,15 @@
# Anserini Fatjar Regressions (v1.3.0)
# Anserini Fatjar Regressions (v1.2.1)

Fetch the fatjar:

```bash
wget https://repo1.maven.org/maven2/io/anserini/anserini/1.3.0/anserini-1.3.0-fatjar.jar
wget https://repo1.maven.org/maven2/io/anserini/anserini/1.2.1/anserini-1.2.1-fatjar.jar
```

Let's start out by setting the `ANSERINI_JAR` and the `OUTPUT_DIR`:

```bash
export ANSERINI_JAR="anserini-1.3.0-fatjar.jar"
export ANSERINI_JAR="anserini-1.2.1-fatjar.jar"
export OUTPUT_DIR="."
```

@@ -362,7 +362,7 @@ To print out the commands that will generate the above runs without performing t

## BRIGHT

❗ Beware, running these experiments will automatically download 12 indexes totaling 1.1 GB.
❗ Beware, running these experiments will automatically download 24 indexes totaling 1.7 GB.

BRIGHT is a retrieval benchmark described [here](https://arxiv.org/abs/2407.12883).
Review comment (Member): Do we want the SPLADE v3 cached numbers as well?

The following table reports nDCG@10 scores.
80 changes: 80 additions & 0 deletions docs/release-notes/release-notes-v1.2.1.md
@@ -0,0 +1,80 @@
# Anserini Release Notes (v1.2.1)

+ **Release date:** August 20, 2025
+ **Lucene version:** Lucene 9.9.1

## Summary of Changes

+ Added SPLADE-v3 regressions for BRIGHT (ONNX and cached queries).
+ Added `-computeIndexSize` option to `RunRepro`.
+ Added functionality to compute index size in `IndexReaderUtils`.
+ Fixed bug with `newWorkStealingPool` in `SearchCollection`.
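
To get a feel for what an index-size computation involves, the sketch below sums the bytes of every file under a directory. It is only an illustration in shell and makes no claim about how `IndexReaderUtils` or `-computeIndexSize` work internally. `index_size_bytes` is a hypothetical helper, and the demo uses a throwaway directory rather than a real index.

```bash
# Hypothetical helper: total size in bytes of all regular files under a directory.
index_size_bytes() {
  # Concatenate every file and count bytes; tr strips padding some wc builds add.
  find "$1" -type f -exec cat {} + | wc -c | tr -d ' '
}

# Demo on a temporary directory with two small files (5 + 7 bytes).
demo_dir="$(mktemp -d)"
printf 'hello' > "$demo_dir/segments"
mkdir "$demo_dir/sub"
printf 'seven77' > "$demo_dir/sub/postings"
echo "size: $(index_size_bytes "$demo_dir") bytes"   # prints "size: 12 bytes"
rm -rf "$demo_dir"
```

This approach reads every byte, so on a large prebuilt index a tool such as `du -sh` gives a faster approximate figure.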

## Contributors (This Release)

Sorted by number of commits:

+ Jimmy Lin ([lintool](https://github.com/lintool))
+ Lily Ge ([lilyjge](https://github.com/lilyjge))
+ JJGreen0 ([JJGreen0](https://github.com/JJGreen0))
+ Suraj Subrahmanyan ([suraj-subrahmanyan](https://github.com/suraj-subrahmanyan))
+ Daniel Guo ([clides](https://github.com/clides))

## All Contributors

All contributors with five or more commits, sorted by number of commits, [according to GitHub](https://github.com/castorini/Anserini/graphs/contributors):

+ Jimmy Lin ([lintool](https://github.com/lintool))
+ Peilin Yang ([Peilin-Yang](https://github.com/Peilin-Yang))
+ Lily Ge ([lilyjge](https://github.com/lilyjge))
+ Ogundepo Odunayo ([ToluClassics](https://github.com/ToluClassics))
+ Arthur Chen ([ArthurChen189](https://github.com/ArthurChen189))
+ Xueguang Ma ([MXueguang](https://github.com/MXueguang))
+ Ahmet Arslan ([iorixxx](https://github.com/iorixxx))
+ Tommaso Teofili ([tteofili](https://github.com/tteofili))
+ Edwin Zhang ([edwinzhng](https://github.com/edwinzhng))
+ Rodrigo Nogueira ([rodrigonogueira4](https://github.com/rodrigonogueira4))
+ Jheng-Hong Yang ([justram](https://github.com/justram))
+ Emily Wang ([emmileaf](https://github.com/emmileaf))
+ Royal Sequiera ([rosequ](https://github.com/rosequ))
+ Yuqi Liu ([yuki617](https://github.com/yuki617))
+ Eric Zhang ([16BitNarwhal](https://github.com/16BitNarwhal))
+ Chris Kamphuis ([Chriskamphuis](https://github.com/Chriskamphuis))
+ Victor Yang ([Victor0118](https://github.com/Victor0118))
+ Vincent ([vincent-4](https://github.com/vincent-4))
+ Boris Lin ([borislin](https://github.com/borislin))
+ Nikhil Gupta ([nikhilro](https://github.com/nikhilro))
+ Jasper Xian ([jasper-xian](https://github.com/jasper-xian))
+ Ronak Pradeep ([ronakice](https://github.com/ronakice))
+ Yuhao Xie ([Kytabyte](https://github.com/Kytabyte))
+ Daniel Guo ([clides](https://github.com/clides))
+ Shane Ding ([shaneding](https://github.com/shaneding))
+ Stephanie Hu ([stephaniewhoo](https://github.com/stephaniewhoo))
+ Kuang Lu ([lukuang](https://github.com/lukuang))
+ Xinyu (Crystina) Zhang ([crystina-z](https://github.com/crystina-z))
+ Joel Mackenzie ([JMMackenzie](https://github.com/JMMackenzie))
+ Mofe Adeyemi ([Mofetoluwa](https://github.com/Mofetoluwa))
+ Adam Yang ([adamyy](https://github.com/adamyy))
+ Salman Mohammed ([salman1993](https://github.com/salman1993))
+ Xinyu Mavis Liu ([x389liu](https://github.com/x389liu))
+ Manveer Tamber ([manveertamber](https://github.com/manveertamber))
+ Luchen Tan ([LuchenTan](https://github.com/LuchenTan))
+ Shivani Upadhyay ([UShivani3](https://github.com/UShivani3))
+ Akintunde Oladipo ([theyorubayesian](https://github.com/theyorubayesian))
+ Hang Cui ([HangCui0510](https://github.com/HangCui0510))
+ Johnson Han ([x65han](https://github.com/x65han))
+ Zhiying Jiang ([bazingagin](https://github.com/bazingagin))
+ Matt Yang ([matthewyryang](https://github.com/matthewyryang))
+ Kelvin Jiang ([kelvin-jiang](https://github.com/kelvin-jiang))
+ Dayang Shi ([dyshi](https://github.com/dyshi))
+ Brayden Zhong ([b8zhong](https://github.com/b8zhong))
+ Aileen Lin ([AileenLin](https://github.com/AileenLin))
+ Michael Tu ([tuzhucheng](https://github.com/tuzhucheng))
+ JJGreen0 ([JJGreen0](https://github.com/JJGreen0))
+ Zeynep Akkalyoncu Yilmaz ([zeynepakkalyoncu](https://github.com/zeynepakkalyoncu))
+ Peng Shi ([Impavidity](https://github.com/Impavidity))
+ Yuqing Xie ([amyxie361](https://github.com/amyxie361))
+ Vivek Alamuri ([valamuri2020](https://github.com/valamuri2020))
+ Nandan Thakur ([thakur-nandan](https://github.com/thakur-nandan))
+ Steven Chen ([wu-ming233](https://github.com/wu-ming233))
+ Ryan Clancy ([ryan-clancy](https://github.com/ryan-clancy))