diff --git a/README.md b/README.md
index 3102cd7d91..888a0637e1 100644
--- a/README.md
+++ b/README.md
@@ -555,7 +555,7 @@ Substitute the appropriate `$MODEL` from the table below.

 | Corpora | Size | Checksum |
 |:---------------------------------------------------------------------------------------------------------------------------------------|-------:|:-----------------------------------|
-| [Post-Processed](https://huggingface.co/datasets/castorini/collections-bright/resolve/main/bright-corpus.tar) | 1.2 GB | `d8c829f0e4468a8ce62768b6a1162158` |
+| [Post-Processed](https://huggingface.co/datasets/castorini/collections-bright/resolve/main/bright-corpus.tar) | 297 MB | `d8c829f0e4468a8ce62768b6a1162158` |

 The [BRIGHT](https://arxiv.org/abs/2407.12883) corpus used here was processed from Hugging Face with these [scripts](https://github.com/ielab/llm-rankers/tree/main/Rank-R1/bright).