YCSB-IVS: Benchmarking Databases with Varying Value Sizes

YCSB-IVS introduces a novel benchmarking technique to evaluate database performance as value sizes vary over time. This enhancement builds on the original YCSB framework, enabling experiments with dynamic value size growth and providing new insights into database behavior under evolving workloads.

Overview

Our benchmarking approach evaluates three widely-used databases:

MongoDB
MariaDB + InnoDB
MariaDB + RocksDB

Key Features

Analysis of database performance under dynamic value size variations.
Comparison of latency, throughput, and scalability across different database systems.
Comprehensive scripts and figures for replicating our analysis.

Cloning Details

To get started, clone the YCSB-IVS repository:

git clone https://github.com/dliyanage/YCSB-IVS.git
cd YCSB-IVS

For details on running YCSB (the core tool behind YCSB-IVS), refer to the installation and build guide:
Official YCSB README

The rest of this document outlines the additions made on top of the original YCSB version as of 1 Feb 2025, specifically for YCSB-IVS experiments.

Repository Structure

Directory	Description
`./experiment_scripts`	Bash scripts for running workload experiments.
`./analysis/Data`	Output data files relevant for analysis that are generated from our experiments by running bash scripts in `./experiment_scripts`. Refer to `./analysis/README.md`.
`./analysis/Scripts`	Analysis scripts written in R (Jupyter notebooks).
`./analysis/Figures`	Generated output figures and visualizations referenced in our paper.

Experimental Scripts

The ./experiment_scripts directory contains all the necessary bash scripts for running workload experiments and saving the results as CSV files.

Examples:

MongoDB Workload:
Use the ./experiment_scripts/experiment_mongodb.sh script to execute the benchmarking workloads in MongoDB with varying value sizes.
MongoDB Baseline:
Use the ./experiment_scripts/experiment_mongodb_baseline.sh script to run baseline executions with fixed value sizes for comparison.

Please refer to the general instructions on configuring experiments in the README file at ./experiment_scripts/README.md.

Analysis Data

All output files generated during experiments are stored in the ./analysis/Data directory. These files are prepared for analysis and visualization.

To understand the analysis process and view results, refer to the README.md within the ./analysis directory. This document provides step-by-step details on our analysis methodology and generates outputs included in our publication.

Analysis Workflow

The R Jupyter notebook located in the ./analysis/Scripts folder includes:

Step-by-step explanations of our analysis process.
Intermediate results and final output figures.

How to Use This Repository

Clone the repository:

git clone https://github.com/dliyanage/YCSB-IVS.git
cd YCSB-IVS

Explore the data:
Navigate to the ./analysis/Data directory to view raw data files.
Run the analysis:
Open the R notebook in the ./analysis/Scripts folder to reproduce the analysis and figures.
View results:
Output figures are stored in the ./analysis/Figures directory.

Citation

If you use this work, please cite:
Benchmarking Databases with Varying Value Sizes [Experiment, Analysis, and Benchmark]." VLDB 2025.

For further information, visit the YCSB-IVS GitHub repository.

YCSB-IVS expands the capabilities of the original YCSB framework to simulate realistic scenarios of value size growth, offering a powerful tool for evaluating database scalability and performance.

Name		Name	Last commit message	Last commit date
Latest commit History 1,336 Commits
accumulo1.9		accumulo1.9
aerospike		aerospike
analysis		analysis
arangodb		arangodb
asynchbase		asynchbase
azurecosmos		azurecosmos
azuretablestorage		azuretablestorage
bin		bin
binding-parent		binding-parent
cassandra		cassandra
cloudspanner		cloudspanner
core		core
couchbase		couchbase
couchbase2		couchbase2
crail		crail
distribution		distribution
doc		doc
dynamodb		dynamodb
elasticsearch		elasticsearch
elasticsearch5		elasticsearch5
experiment_scripts		experiment_scripts
foundationdb		foundationdb
geode		geode
googlebigtable		googlebigtable
googledatastore		googledatastore
griddb		griddb
hbase1		hbase1
hbase2		hbase2
ignite		ignite
infinispan		infinispan
jdbc-binding/conf		jdbc-binding/conf
jdbc		jdbc
kudu		kudu
maprdb		maprdb
maprjsondb		maprjsondb
memcached		memcached
mongodb		mongodb
nosqldb		nosqldb
orientdb		orientdb
postgrenosql		postgrenosql
rados		rados
redis		redis
rest		rest
riak		riak
rocksdb		rocksdb
rocksdb_dumpandload		rocksdb_dumpandload
s3		s3
scylla		scylla
seaweedfs		seaweedfs
solr7		solr7
tablestore		tablestore
tarantool		tarantool
voltdb		voltdb
workloads		workloads
zookeeper		zookeeper
.editorconfig		.editorconfig
.gitignore		.gitignore
.travis.yml		.travis.yml
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE.txt		LICENSE.txt
NOTICE.txt		NOTICE.txt
README.md		README.md
checkstyle.xml		checkstyle.xml
mariadb.properties		mariadb.properties
pom.xml		pom.xml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

YCSB-IVS: Benchmarking Databases with Varying Value Sizes

Overview

Key Features

Cloning Details

Repository Structure

Experimental Scripts

Examples:

Analysis Data

Analysis Workflow

How to Use This Repository

Citation

About

Uh oh!

Releases

Packages

Contributors 3

Uh oh!

Languages

License

dliyanage/YCSB-IVS

Folders and files

Latest commit

History

Repository files navigation

YCSB-IVS: Benchmarking Databases with Varying Value Sizes

Overview

Key Features

Cloning Details

Repository Structure

Experimental Scripts

Examples:

Analysis Data

Analysis Workflow

How to Use This Repository

Citation

About

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Uh oh!

Languages

Packages