Vector processing benchmarks

This repository contains a collection of vector processing benchmarks for Python, R, and Julia packages. The tests cover the most common operations: loading and saving a GeoPackage file, sampling points in a polygon, creating buffers, transforming the coordinate reference system (CRS), calculating the distance between points, and intersecting geometries.

Note that all operations were performed in a Cartesian coordinate system, except for the s2 package, where calculations were performed on the sphere (this may account for its longer calculation times). For more information, see the "Spherical geometry in sf using s2geometry" article and the presentation at the FOSS4G 2021 conference.
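To illustrate why planar and spherical calculations differ in cost, the sketch below computes a distance both ways using only the Python standard library. The haversine formula and the mean Earth radius value are assumptions chosen for illustration; s2 uses its own spherical geometry routines, which are not reproduced here.

```python
import math

EARTH_RADIUS_M = 6_371_008.8  # mean Earth radius in metres; assumed for illustration


def planar_distance(x1, y1, x2, y2):
    """Euclidean distance in a projected (Cartesian) coordinate system."""
    return math.hypot(x2 - x1, y2 - y1)


def spherical_distance(lon1, lat1, lon2, lat2, radius=EARTH_RADIUS_M):
    """Great-circle distance on a sphere (haversine formula), in metres."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlam = math.radians(lon2 - lon1)
    a = (math.sin(dphi / 2) ** 2
         + math.cos(phi1) * math.cos(phi2) * math.sin(dlam / 2) ** 2)
    return 2 * radius * math.asin(math.sqrt(a))


# Planar: a single hypot call per pair of points.
print(planar_distance(0.0, 0.0, 3.0, 4.0))
# Spherical: several trigonometric calls per pair of points.
print(spherical_distance(0.0, 0.0, 0.0, 1.0))
```

The spherical version needs several trigonometric evaluations per pair of points, which is one plausible reason why calculations on the sphere can take longer.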

The detailed results are available at https://kadyb.github.io/vector-benchmark/report.html.

For high-performance data frame processing in R, check out data.table and collapse.

You may also be interested in the raster processing benchmarks.

Software

Python:

geopandas

R:

sf

terra

geos

s2

Julia:

GeometryOps.jl

Reproduction

Generate the data in R using the code in the data/ folder.
Run all benchmarks using the batch script (run_benchmarks.sh), or run single benchmark files.

Batch script

cd vector-benchmark
./run_benchmarks.sh

Single benchmark

Rscript sf/buffer.R

python3 geopandas/buffer.py
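The benchmark scripts themselves are not reproduced here. As a minimal sketch of how a single operation could be timed in Python, the hypothetical helper `time_operation` below runs a callable several times and reports the median wall-clock time. The repetition count and the use of the median are assumptions, not necessarily what the repository's scripts do.

```python
import statistics
import time


def time_operation(func, repeats=5):
    """Run func() `repeats` times and return the median elapsed time in seconds."""
    timings = []
    for _ in range(repeats):
        start = time.perf_counter()
        func()
        timings.append(time.perf_counter() - start)
    return statistics.median(timings)


# Stand-in workload; a real benchmark would call e.g. a buffer operation here.
elapsed = time_operation(lambda: sum(i * i for i in range(100_000)))
print(f"median: {elapsed:.4f} s")
```

Taking the median rather than a single run reduces the influence of one-off effects such as cold caches.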

Dataset

The dataset is synthetically generated and consists of 500,000 points in a planar coordinate system.
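As a rough illustration of what such a dataset looks like, the sketch below draws uniformly distributed points in a planar bounding box using the Python standard library. The extent, the seed, and the function name `generate_points` are hypothetical; the actual data is generated by the R code in the data/ folder, which may use a different distribution and extent.

```python
import random


def generate_points(n, xmin, ymin, xmax, ymax, seed=None):
    """Return n (x, y) tuples drawn uniformly from a planar bounding box."""
    rng = random.Random(seed)
    return [(rng.uniform(xmin, xmax), rng.uniform(ymin, ymax)) for _ in range(n)]


# 500,000 points as in the benchmark; the extent is an arbitrary assumption (metres).
points = generate_points(500_000, 0.0, 0.0, 100_000.0, 100_000.0, seed=1)
print(len(points))
```

Fixing the seed makes the generated points reproducible between runs.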

Hardware configuration

CPU: AMD Ryzen 9 5900X @ 3.7 GHz
RAM: 64 GB
OS: Ubuntu 22.04.5 LTS
