Thanks to visit codestin.com
Credit goes to github.com

Skip to content

Why doesn't io.anserini.reproduce.RunMsMarco show download size? #3030

@lintool

Description

@lintool

Why is the download size "-"?

$ bin/run.sh io.anserini.reproduce.RunMsMarco -computeIndexSize -dryRun -printCommands -collection msmarco-v1-passage
WARNING: Using incubator modules: jdk.incubator.vector
Indexes referenced by this run (10 total):
name                                                    size on disk  download size  path                                                                                                                                               
------------------------------------------------------  ------------  ------------  ---------------------------------------------------------------------------------------------------------------------------------------------------
msmarco-v1-passage                                            2.5 GB        2.0 GB  /u4/jimmylin/.cache/pyserini/indexes/lucene-inverted.msmarco-v1-passage.20221004.252b5e.678876e8c99a89933d553609a0fd8793                           
msmarco-v1-passage.d2q-t5                                   970.8 MB      770.4 MB  /u4/jimmylin/.cache/pyserini/indexes/lucene-inverted.msmarco-v1-passage.d2q-t5.20221004.252b5e.cfd6acef0912647603457b1e98ca5bc0                    
msmarco-v1-passage.splade-pp-ed                               2.2 GB             -  /u4/jimmylin/.cache/pyserini/indexes/lucene-inverted.msmarco-v1-passage.splade-pp-ed.20230524.a59610.2c008fc36131e27966a72292932358e6              
msmarco-v1-passage.splade-v3                                  3.1 GB             -  /u4/jimmylin/.cache/pyserini/indexes/lucene-inverted.msmarco-v1-passage.splade-v3.20250329.4f4c68.52f4b59d236547f570555715ed314311                 
msmarco-v1-passage.cosdpr-distil.hnsw                        26.2 GB             -  /u4/jimmylin/.cache/pyserini/indexes/lucene-hnsw.msmarco-v1-passage.cosdpr-distil.20240108.825148.df4c60fa1f3804fa409499824d12d035                 
msmarco-v1-passage.cosdpr-distil.hnsw-int8                   32.6 GB             -  /u4/jimmylin/.cache/pyserini/indexes/lucene-hnsw-int8.msmarco-v1-passage.cosdpr-distil.20240108.825148.119124ad358bb81e6a203b04d1b99a9c            
msmarco-v1-passage.bge-base-en-v1.5.hnsw                     26.2 GB             -  /u4/jimmylin/.cache/pyserini/indexes/lucene-hnsw.msmarco-v1-passage.bge-base-en-v1.5.20240117.53514b.00a577f689d90f95e6c5611438b0af3d              
msmarco-v1-passage.bge-base-en-v1.5.hnsw-int8                32.6 GB             -  /u4/jimmylin/.cache/pyserini/indexes/lucene-hnsw-int8.msmarco-v1-passage.bge-base-en-v1.5.20240117.53514b.7830712459cf124c96fd058bb0a405b7         
msmarco-v1-passage.cohere-embed-english-v3.0.hnsw            34.6 GB             -  /u4/jimmylin/.cache/pyserini/indexes/lucene-hnsw.msmarco-v1-passage.cohere-embed-english-v3.0.20240228.eacd13.c7294ca988ae1b812d427362ffca1ee2     
msmarco-v1-passage.cohere-embed-english-v3.0.hnsw-int8       43.1 GB             -  /u4/jimmylin/.cache/pyserini/indexes/lucene-hnsw-int8.msmarco-v1-passage.cohere-embed-english-v3.0.20240228.eacd13.dbaca578cc8495f504cdd0a7187f4c36
total                                                       204.0 GB        2.8 GB  -                                                                                                                                                  

Total size across 10 of 10 indexes: 204.0 GB

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions