This repository contains the R source code for reproducing the results in the article, 'A Probabilistic Model of Uncertainty in Rank-Biased Overlap for Tied Data' (Lukáš Chládek, 2025).
The article is available at: https://resolver.tudelft.nl/uuid:bf1898c1-3c4c-4625-b953-5a6a398c5ad2
Please use the following BibTeX citation:
@mastersthesis{Chladek2025,
title = {A Probabilistic Account of the Uncertainty Due to Ties in Rank-Biased Overlap},
author = {Lukáš Chládek}, year = 2025, month = {June}, address = {Delft, The Netherlands},
note = {Available at \url{https://resolver.tudelft.nl/uuid:bf1898c1-3c4c-4625-b953-5a6a398c5ad2}},
school = {Delft University of Technology},
type = {BSc thesis}
} R is required.
# Libraries required by Corsi and Urbano (2024)
install.packages("dplyr")
install.packages("extraDistr")
install.packages("future.apply")
install.packages("ggplot2")
install.packages("glue")
install.packages("latex2exp")
install.packages("mvtnorm")
install.packages("rio")
# Additional libraries required for this evaluation
install.packages("combinat")
install.packages("purrr")
install.packages("argparse")
install.packages("rlang")
install.packages("progressr")
install.packages("beepr")This requires R and Gourd.
Please note that R's random number generator may give different results depending on the software version. The results in the paper use R version 4.3.0.
-
Install R and Gourd
If these are not yet installed, use the instructions at https://www.r-project.org/ and https://github.com/ConSol-Lab/gourd.
-
Install R dependencies
Run the installation commands from "Using the R libraries" above.
-
Clone this repository
git clone https://github.com/lchladek/RBO && cd RBO -
Enter the
evaluationdirectorycd evaluation -
Run the evaluation
gourd run local
This requires an R environment as well as Gourd installed on a SLURM-capable cluster.
Please note that R's random number generator may give different results depending on the software version. The results in the paper use R version 4.3.0.
- Install R and Gourd
If your cluster does not use the module command to enable R (not DelftBlue):
install Gourd normally using the instructions at https://github.com/ConSol-Lab/gourd.
You may also need to install R - consult the user manual for your HPC facility.
Then continue to step 2.
If your cluster uses the module command (DelftBlue): please build the the Gourd version with a modules-support tag.
R is preinstalled.
# install the rust toolchain
curl --proto '=https' --tlsv1.2 -sSf https://sh.rustup.rs | sh
# clone the Gourd repository
git clone --branch modules-support --single-branch https://github.com/ConSol-Lab/gourd
cd gourd
# build
cargo build --release
# copy the program to your PATH
# on most clusters, you will need to change this to a directory in your /home
cp ./target/release/{gourd, gourd-wrapper} /usr/bin/-
Install R dependencies
Run the installation commands from "Using the R libraries" above in an interactive R shell.
If your cluster uses the
modulecommand, first load the modules required for R. For DelftBlue:module load 2024r1 r/4.3.0. During installation, you will be asked to create a personal R package library: do so with the default path. -
Clone this repository
git clone https://github.com/lchladek/RBO && cd RBO -
Enter the
evaluationdirectorycd evaluation -
Configure the SLURM parameters
Edit the
gourd.tomlaccording to the accounting parameters of your cluster and the desired output file paths. See the example below with 'lchladek' as the username.If your cluster uses the
modulecommand, uncomment the 'modules' line and if necessary change the module names.
# Put all outputs and metrics in the same folder
output_path = "/scratch/lchladek/experiments"
metrics_path = "/scratch/lchladek/experiments"
experiments_folder = "/scratch/lchladek/experiments"
[slurm]
experiment_name = "RBO experimental evaluation"
output_folder = "/scratch/lchladek/slurm-output"
partition = "compute-p2"
account = "education-eemcs-courses-cse3000"
mail_type = "ALL"
#mail_user = "[email protected]"
#modules = ["2024r1", "r/4.3.0" ]
-
Run the evaluation
Use
gourd run slurm. Check on the status usinggourd status. If something does not work, refer to the to the Gourd documentation.
All source code is distributed under the terms of the MIT license. Different copyright applies to each subdirectory.
Copyright 2025 Lukáš Chládek [email protected]
The MIT license is located at /src/LICENSE.txt and /evaluation/LICENSE.txt.
This subdirectory is a modified copy of the SIGIR-AP24 repository by Corsi and Urbano, 2024. Please cite their work: https://doi.org/10.1145/3673791.369842
Copyright 2024 Matteo Corsi [email protected] and Julián Urbano [email protected]
The MIT license is located at /sigir_ap24/LICENSE.txt.