Prototype implementation of a PKI extension that allows querying the set of all certificates associated with a given geographical area. This is analogous to F-PKI, which provides the same functionality for domains, i.e. allows querying the set of all certificates for a given domain.
Go files relevant for the server and client implementation:
.
├── cmd # the go programs
| ├── geopki-client # program for querying a geopki server
| └── geopki-server # the geopki server program
|
├── database # SQL table schemes and function definitions
| ├── constraints # constraints for ensuring the consistency of the database
| | └── node-integrity.sql # ensures the integrity of all node rows
| |
| ├── functions # required and useful SQL functions
| | └── ... # set of functions for a scheme with z subtrees
| |
| └── tables # SQL tables schemas
| ├── certificates.sql # the table schema for the certificates table
| └── nodes.sql # the table schema for the nodes table
|
├── internal # code internal to this project, not exposing any useful APIs for other projects
| └── geopki-server # internal code related to the server
| ├── consistency.go # server endpoints serving the consistency tree
| ├── constants.go # server constants such as the relative grid size and the MMD
| ├── ingestion.go # server endpoints for ingestion of new certificates
| ├── query.go # main server endpoints for queries
| ├── server.go # code for bootstrapping the server
| ├── state.go # defines the struct for keeping server state
| └── utilities.go # helper functions to issue new SMHs or to verify the insertion key
|
├── pkg # code potentially re-usable for projects building on this one
| ├── bitstring # code related to the bit strings
| | ├── bit_string_pair.go # code for a bit string object supporting higher level operations
| | └── raw_bit_string_pair.go # code for a more low level bit string object supporting
| | # operations directly on its bits
| ├── comm # communication related code
| | ├── query.go # code building a query client-side and parsing it server-side
| | ├── request.pb.go # protobuf query request message (./proto/request.proto)
| | └── response.pb.go # protobuf query response message (./proto/response.proto)
| |
| ├── crypto # cryptography related code
| | ├── certificate.go # model of the certificate object, currently JSON
| | ├── consistency.go # code for proving the consistency of the map server
| | ├── hash.go # hashing related code, at the moment mostly just the default hash value
| | ├── node.go # in-memory representation of a tree node corresponding to a DB row
| | ├── smh.go # signed map head object supporting signing and signature verification
| | └── verification.go # code for verifying server responses client-side
| └── database # helper functions for accessing the database
| ├── ingestion.go # code for ingesting new data but also removing expired certificates
| └── query.go # code querying the database for SMT nodes and certificates
|
├── proto # protobuf messages
| ├── request.proto # protobuf message for a query request sent by a client
| └── response.proto # protobuf message for a query response sent by a server
|
├── go.mod # standard go dependency management file
└── go.sum # standard go dependency management file
.
├── cmd # the go programs
| └── bitstring # program for measuring the performance of the bit string computations given a volume
|
└── performance
└── bitstring-performance # measures the performance for approximating a sphere using bitstrings
├── bitstring-performance.py # measures the performance by calling cmd/bitstring-performance
├── bitstring-performance-plot.py # plots the output of bitstring-performance.py (bitstring-performance.csv)
├── bitstring-performance.csv # output of bitstring-performance.py
├── f-count.png # plot output of bitstring-performance-plot.py
└── f-time.png # plot output of bitstring-performance-plot.py
.
└── cmd # the go programs
    └── db-address-exporter # generates SQL INSERT files based on the dataset in
                            # https://gitlab.inf.ethz.ch/OU-PERRIG/theses/msc_nico_hauser/data/-/tree/main/osm-dataset

database/scripts/input
├── db-address-exporter.py # generates SQL INSERT files based on the dataset in
|                          # https://gitlab.inf.ethz.ch/OU-PERRIG/theses/msc_nico_hauser/data/-/tree/main/osm-dataset
|                          # *requires a lot of RAM*, worked with 64GB
├── db-input-analyzer.py # plots the number of bit strings used for DBs with and without z subtrees
├── db-input-visualizer.py # plots the polygon of a specific OSM location
├── db-sanity-check-generator.py # generates sql files for a dummy table to verify the measurements
└── graph-db-address-exporter.py # generates CSV files that can be imported into neo4j
database/scripts/performance
├── db-types # the scripts for running performance tests on various schemes
| ├── postgres-baseline-run.py # performance test on postgres, baseline table
| ├── postgres-bitstring-int-run.py # performance test on postgres using integer range queries
| ├── postgres-bitstring-txt-run.py # performance test on postgres using prefix matching
| └── postgres-spatial-run.py # performance test on postgres using a spatial index
|
└── performance-evaluation.py # runs one script in db-types several times and collects the results
.
├── cmd # the programs used by the performance measurement scripts
| ├── geopki-client-ingestion # inserts the passed certificates and measures the time
| ├── geopki-client-performance # collects metrics on sequential requests
| ├── geopki-client-throughput # spawns multiple goroutines and sends many parallel requests
| ├── geopki-public-key # retrieves the public key of a geopki-server
| └── release # initiates a table swap after the ingestion of new certificates
|
└── performance # scripts for measuring the performance
├── client # collects metrics on sequential requests
| ├── client-performance.py # runs the performance test
| └── client-performance-plot.py # plots the result
|
├── ingestion # measures the time it takes to ingest new certificates
| ├── ingestion.py # runs the performance test
| └── ingestion-plot.py # plots the result
|
├── throughput # measures the number of parallel requests the server can process
| ├── throughput.py # runs the performance test
| └── throughput-plot.py # plots the result
|
├── sampling-map-generator.py # generates .parquet sampling maps
└── sampling-map-converter.py # converts .parquet sampling maps to .json sampling maps
For running the geopki client or server, go is required. Most dependencies are automatically downloaded and managed by go. The only exception is the Geospatial Data Abstraction Library (GDAL), which has to be installed separately. Go bindings are used to interact with GDAL, with the exception of the wasm client, which uses S2.
For running the python scripts, python and the relevant dependencies have to be installed. Note that python is not required to run the geopki client or the server, but scripts such as the performance measurements are written in python.
- Install python3 (tested on version 3.11.3), should probably work on later versions too
- Install pip dependencies using `pip3 install -r requirements.txt`
- Setup the database
  - Setup a postgres instance
  - Create a new database
  - Create the required tables
    - Run the code in `database/tables/certificates.sql` to create the `certificates` table
    - Run the code in `database/tables/nodes.sql` to create the `nodes` table
  - Define the required functions
    - Define functions for computing the difference, intersection and union of arrays
      - Run the code in `database/functions/array_difference.sql`
      - Run the code in `database/functions/array_intersect.sql`
      - Run the code in `database/functions/array_union.sql`
    - Define the function for updating the hash of a node after ingestion of new data / deletion of expired data
      - Run the code in `database/functions/update_children_hashes.sql`
- (Optional) Import the dataset as described here
- Setup a trillian instance
  - Follow the instructions in the trillian repo: https://github.com/google/trillian/tree/v1.5.2/examples/deployment
    - The local deployment with docker is straightforward
  - Create a new tree using `go run ./cmd/createtree --admin_server=127.0.0.1:8090`
  - Copy the log id from the output
    - The output should contain a line like `Initialised Log (2947592571015490951) with new SignedTreeHead`, where `2947592571015490951` corresponds to the log id.
- Define environment variables
  - `DATABASE_URL`
    - Set the database url for the postgres database using `export DATABASE_URL=postgresql://<username>:<password>@127.0.0.1:5432/<postgres database>`, replacing the `<>` placeholders with the values set when setting up the postgres database.
  - `PRIVATE_KEY`
    - Set the private key of the geopki server that is used to sign the map and consistency heads using `export PRIVATE_KEY=MHcCAQEEIGv4NvMEZL3JjuQ8BnWVkTwkwCtXZhpkozMu1iCUvXBdoAoGCCqGSM49AwEHoUQDQgAEvvlGuoiglGCYNXJ0rpbKwQuXIQnIHE2mCDrDtlm7KxF4//6w3quLK/4Q8DwkM27zkOpnjv701tdFuBbf5EloqA==`
    - The format is base64 encoded bytes that are accepted by go's `ParseECPrivateKey` function, which according to the documentation means any EC private key in SEC 1, ASN.1 DER form (see the sketch after this setup list).
    - The easiest way to obtain a correctly formatted value is to run the server once without having the value set. The server will then print a randomly generated key to the console.
  - `CERT_INSERT_KEY`
    - The server exposes the endpoint `/v1/insert` for showcasing the ingestion and deletion functionality. Since this endpoint causes a lot of computation and currently allows the insertion of arbitrary data, it is protected by a secret. In the future no such endpoint should exist; instead, the map server should crawl the CT logs on its own.
    - Use `export CERT_INSERT_KEY=xxxxxxxxx` to set that value.
- Run the server instance using `go run ./cmd/geopki-server --address=0.0.0.0 --port=1234 --trillian-address=127.0.0.1:8090 --clog-id=<trillian log id>`.
  - All arguments except `clog-id` can be omitted if the default values shown above should be used.
  - If you forgot the log id, you can use `cmd/list-trees` (in the geopki repo) to list all trees: `go run ./cmd/list-trees --admin_server=127.0.0.1:8090`
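For reference, the following is a minimal sketch of how a valid `PRIVATE_KEY` value can be generated and round-tripped. It assumes a P-256 key; any curve accepted by `ParseECPrivateKey` should work.

```go
package main

import (
	"crypto/ecdsa"
	"crypto/elliptic"
	"crypto/rand"
	"crypto/x509"
	"encoding/base64"
	"fmt"
	"log"
)

func main() {
	// Generate a fresh EC key; P-256 is an assumption, any curve
	// supported by x509.ParseECPrivateKey should work.
	key, err := ecdsa.GenerateKey(elliptic.P256(), rand.Reader)
	if err != nil {
		log.Fatal(err)
	}
	// Marshal to SEC 1, ASN.1 DER form, the format the server expects.
	der, err := x509.MarshalECPrivateKey(key)
	if err != nil {
		log.Fatal(err)
	}
	encoded := base64.StdEncoding.EncodeToString(der)
	fmt.Println("export PRIVATE_KEY=" + encoded)

	// Round-trip check: this mirrors the decoding the server performs.
	raw, err := base64.StdEncoding.DecodeString(encoded)
	if err != nil {
		log.Fatal(err)
	}
	if _, err := x509.ParseECPrivateKey(raw); err != nil {
		log.Fatal(err)
	}
}
```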
To import the dataset, i.e. osm-dataset/world.parquet, the binary ./cmd/db-address-exporter is used.
It requires the following parameters:
- `--input` to specify the `.parquet` certificate dataset
- `--nodes` to specify the output folder where to write the `.sql` files that insert data into the `nodes` table
- `--certs` to specify the output folder where to write the `.sql` files that insert data into the `certificates` table
- `--sampling-map` to specify the path where the `.csv` sampling map should be written (used in the example below and required for the performance measurements)
- `--cert-size-dist` to specify the path where the `.csv` certificate size distribution should be written
- `--altitude-dist` to specify the path where the `.csv` altitude distribution of certificates should be written
Example:
go run ./cmd/db-address-exporter/ \
--input=../location-correlation/dist/world.parquet \
--nodes=/db-import/nodes/ \
--certs=/db-import/certificates/ \
--sampling-map=./performance/sampling-map.csv \
--cert-size-dist=./performance/cert-sizes/cert-sizes.csv \
--altitude-dist=./database/measurements/altitude.csv
Alternatively, the script ./database/scripts/input/db-address-exporter.py can be used for the same purpose.
In contrast, this script is slower, consumes more RAM, and only writes sql files.
Therefore, it is not recommended for importing data for the actual system.
However, for measuring the performance of the different indices, it is the only way to generate the data in the respective format.
The script accepts three positional arguments, the input path pointing to the OSM .parquet certificates dataset and two output paths, one for the .sql files for the nodes table and one for the certificates table.
It then accepts an optional --spatial flag that generates the data for the evaluation of the spatial index.
If the flag is omitted, the database format for integer range queries is generated.
Example:
python3 database/scripts/input/db-address-exporter.py \
../location-correlation/dist/world.parquet \
/db-import/nodes/ \
/db-import/certificates/ \
--spatial
The client program can be run using go run ./cmd/geopki-client --address=http://127.0.0.1:1234 --longitude=<lon> --latitude=<lat> --altitude=<alt> --radius=<rad> --include-certificates.
When run without the --include-certificates flag, only the certificate hashes are obtained.
To compile the web demo, use make demo.
This compiles ./cmd/geopki-client-wasm and copies the binary to ./demo/geopki-web-client/geopki-client.wasm.
Afterwards you only need to run a http server that serves ./demo/geopki-web-client.
Example:
python3 -m http.server 9000 --directory ./demo/geopki-web-client
Note that the location can usually only be accessed if the page is served via HTTPS.
This section explains how to reproduce the results from the thesis.
First and foremost, the sampling map produced by ./cmd/db-address-exporter is required.
The output for the dataset from the thesis (/osm-dataset/world.parquet in the data repository) should result in the sampling map /sampling-maps/sampling-map.csv in the data repository.
The next section explains how this file is used to derive the various sampling maps in /sampling-maps/.
There are three types of sampling maps, differentiated by their extension.
- `sampling-map.csv` is the output of `./cmd/db-address-exporter`
- `sampling-map-bitstrings-f1.parquet` and `sampling-map-bitstrings-f01.parquet` are generated by `performance/sampling-map-generator.py` and contain, for each location in `sampling-map.csv`, the pre-computed bit strings for a given relative grid size. The order of rows is shuffled.
- `sampling-map-bitstrings-f1.json` and `sampling-map-bitstrings-f01.json` are generated by `performance/sampling-map-converter.py` and correspond to the parquet versions but are in the JSON format.
The following sections detail the interface of the sampling map scripts.
performance/sampling-map-generator.py accepts three positional arguments: the path to sampling-map.csv, an output path ending in .parquet, and a float specifying the relative grid size.
Examples:
python3 performance/sampling-map-generator.py /sampling-maps/sampling-map.csv \
/sampling-maps/sampling-map-bitstrings-f1.parquet \
1
python3 performance/sampling-map-generator.py /sampling-maps/sampling-map.csv \
/sampling-maps/sampling-map-bitstrings-f01.parquet \
0.1
performance/sampling-map-converter.py accepts two positional arguments: the path to a .parquet sampling map and an output path ending in .json.
It converts the given .parquet sampling map to a json one.
Examples:
python3 performance/sampling-map-converter.py /sampling-maps/sampling-map-bitstrings-f1.parquet \
/sampling-maps/sampling-map-bitstrings-f1.json
python3 performance/sampling-map-converter.py /sampling-maps/sampling-map-bitstrings-f01.parquet \
/sampling-maps/sampling-map-bitstrings-f01.json
To measure the database throughput for the spatial index, prefix matching and integer range queries, the script database/scripts/performance/performance-evaluation.py is used.
It accepts two positional arguments, a path to a sampling map and an output path.
The output path must end with .csv.
The script then requires a set of database parameters:
- `--db-host=127.0.0.1` to specify the database host address
- `--db-port=5432` to specify the database port
- `--db-name=geopki` to specify the database name
- `--db-user=geopki` to specify the database user
- `--db-pass=...` to specify the database password. This parameter is optional and the program asks for user input if it is omitted.
With --repetitions=30, the number of repetitions for the measurement is specified.
The --excluding-bit-string-computation flag decides whether the bit string computation is part of the measured time or not.
By default it is included but for all measurements shown in the thesis it is not.
The parameter --batch-size=1 specifies the number of queries to be issued per iteration and finally there are flags that determine the test that is run:
- `--postgres-spatial` for the spatial index
- `--postgres-bitstrings-txt` for prefix matching
- `--postgres-bitstrings-int` for integer range queries
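To illustrate the query shape behind prefix matching, here is a minimal sketch using Go's `database/sql`. It assumes the `lib/pq` driver and the `bit_string_51_txt` column created in the prefix-matching setup shown further below; the prefix value, connection string, and the `count(*)` projection are illustrative placeholders, not what the test scripts actually execute.

```go
package main

import (
	"database/sql"
	"fmt"
	"log"

	_ "github.com/lib/pq" // assumed postgres driver
)

func main() {
	// Placeholder connection string; match your DATABASE_URL.
	db, err := sql.Open("postgres",
		"postgresql://postgres@127.0.0.1:5432/geopki?sslmode=disable")
	if err != nil {
		log.Fatal(err)
	}
	defer db.Close()

	prefix := "0101" // placeholder bit-string prefix
	var n int
	// LIKE 'prefix%' over the COLLATE "C" column allows an index range scan.
	err = db.QueryRow(
		`SELECT count(*) FROM nodes WHERE bit_string_51_txt LIKE $1 || '%'`,
		prefix).Scan(&n)
	if err != nil {
		log.Fatal(err)
	}
	fmt.Printf("%d nodes under prefix %s\n", n, prefix)
}
```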
To reproduce the measurements from the thesis, use the following commands:
Set up the database using the spatial data as described here.
python3 database/scripts/performance/performance-evaluation.py \
~/sampling-map.csv \
~/evaluation/database/spatial.csv \
--db-host=127.0.0.1 \
--db-port=5432 \
--db-name=geopki \
--db-user=postgres \
--db-pass="$DATABASE_PASS" \
--batch-size=1 \
--repetitions=30 \
--postgres-spatial
Set up the database using the data as described here.
Then use the following SQL commands to change the format of the database to use a text-based index.
The goal is to add the column bit_string_51_txt equal to the text representation of bit_string_51 and create a clustered index on it.
ALTER TABLE nodes ADD COLUMN bit_string_51_txt character varying(51) COLLATE "C" NOT NULL GENERATED ALWAYS AS (bit_string_51::character varying(51)) STORED;
CREATE INDEX nodes_bit_string_text_pattern_ops_idx ON nodes(bit_string_51_txt COLLATE "C");
ALTER TABLE IF EXISTS nodes CLUSTER ON nodes_bit_string_text_pattern_ops_idx;
CLUSTER nodes USING nodes_bit_string_text_pattern_ops_idx;

Finally, run the following commands:
For f=1
python3 database/scripts/performance/performance-evaluation.py \
/sampling-maps/sampling-map-bitstrings-f1.parquet \
/evaluation/database/bitstrings-txt-f1.csv \
--db-host=127.0.0.1 \
--db-port=5432 \
--db-name=geopki \
--db-user=postgres \
--db-pass="$DATABASE_PASS" \
--batch-size=1 \
--excluding-bit-string-computation \
--repetitions=30 \
--postgres-bitstrings-txt
For f=0.1
python3 database/scripts/performance/performance-evaluation.py \
/sampling-maps/sampling-map-bitstrings-f01.parquet \
/evaluation/database/bitstrings-txt-f01.csv \
--db-host=127.0.0.1 \
--db-port=5432 \
--db-name=geopki \
--db-user=postgres \
--db-pass="$DATABASE_PASS" \
--batch-size=1 \
--excluding-bit-string-computation \
--repetitions=30 \
--postgres-bitstrings-txt
Set up the database using the data as described here.
For f=1
python3 database/scripts/performance/performance-evaluation.py \
/sampling-maps/sampling-map-bitstrings-f1.parquet \
/evaluation/database/bitstrings-int-f1.csv \
--db-host=127.0.0.1 \
--db-port=5432 \
--db-name=geopki \
--db-user=postgres \
--db-pass="$DATABASE_PASS" \
--batch-size=1 \
--excluding-bit-string-computation \
--repetitions=30 \
--postgres-bitstrings-int
For f=0.1
python3 database/scripts/performance/performance-evaluation.py \
/sampling-maps/sampling-map-bitstrings-f01.parquet \
/evaluation/database/bitstrings-int-f01.csv \
--db-host=127.0.0.1 \
--db-port=5432 \
--db-name=geopki \
--db-user=postgres \
--db-pass="$DATABASE_PASS" \
--batch-size=1 \
--excluding-bit-string-computation \
--repetitions=30 \
--postgres-bitstrings-int
We performed two depth measurements: the first measures the depth of the locations where certificates are placed, the second the depth of sparse SMT nodes.
To collect statistics on the maximum depth of locations, the script ./database/scripts/depth/smt-depth-analysis.py is used.
It accepts one positional argument, the output path.
The script then requires a set of database parameters:
- `--db-host=127.0.0.1` to specify the database host address
- `--db-port=5432` to specify the database port
- `--db-name=geopki` to specify the database name
- `--db-user=geopki` to specify the database user
- `--db-pass=...` to specify the database password. This parameter is optional and the program asks for user input if it is omitted.
Finally, the script requires the --sampling-map parameter which specifies the locations to query.
This parameter should point to the .csv sampling map.
Our measurements can be found in ./database/measurements/depth.csv.zip.
Example:
python3 database/scripts/depth/smt-depth-analysis.py \
./database/measurements/depth.csv \
--sampling-map=/sampling-maps/sampling-map.csv \
--db-host=127.0.0.1 \
--db-port=5432 \
--db-name=geopki \
--db-user=postgres \
--db-pass="$DATABASE_PASS"
To collect statistics on sparse SMT nodes, the SQL query in ./database/scripts/depth/leaves.sql is used.
You can e.g. use pgAdmin4 to run the query and store the csv result at ./database/measurements/leaves.csv.
Our measurements can be found in ./database/measurements/leaves.csv.zip.
The altitude distribution of certificates is directly written by ./cmd/db-address-exporter.
Our measurements can be found in ./database/measurements/altitude.csv.zip.
To plot the results use python3 database/measurements/plot.py.
The script assumes the following files to be in the same directory as plot.py:
- `performance-evaluation-spatial.csv` (Spatial Index)
- `performance-evaluation-bitstring-txt-f1.csv` (Prefix Matching)
- `performance-evaluation-bitstring-int-f1.csv` (Integer Range Queries)
- `leaves.csv` (Sparse SMT Node Depths, contained in `./database/measurements/leaves.csv.zip`)
- `depth.csv` (Certificate Depths, contained in `./database/measurements/depth.csv.zip`)
- `altitude.csv` (Altitude SMT Node Depths, contained in `./database/measurements/altitude.csv.zip`)
To plot cert-sizes.csv of ./cmd/db-address-exporter (contained in ./performance/cert-sizes/cert-sizes-measurements.zip), the script ./performance/cert-sizes/cert-sizes.py is used.
It accepts two positional arguments, the path to cert-sizes.csv and an output directory.
It then plots the data and writes the images to the output directory.
Example:
python3 performance/cert-sizes/cert-sizes.py performance/cert-sizes/cert-sizes.csv performance/cert-sizes
This section details how to measure the performance of the full system. All scripts require a running geopki server; this section describes how to run an instance.
After an instance is running, three different types of measurements can be performed. They are detailed in the following sections.
To obtain metrics on individual requests and responses, queries are sequentially sent to the server.
The modified geopki-client ./cmd/geopki-client-performance is used.
Compared to the CLI version, it collects the relevant metrics and prints them to the console.
For measurements, the python script performance/client/client-performance.py is used.
It accepts three positional arguments, the path to sampling-map.csv, an output path ending in .csv and the Base64 encoded public key of the server.
The Base64 encoded public key can be obtained using the helper utility ./cmd/geopki-public-key:
go run ./cmd/geopki-public-key --address=http://127.0.0.1:1234
In addition to the positional arguments, the script accepts the parameter --address=http://127.0.0.1:1234 to specify the server's address, as well as --locations=1000 to specify the number of queries.
To reproduce the measurements from the thesis, use the following parameters:
For f=1
make clean
make client-performance
python3 performance/client/client-performance.py \
/sampling-maps/sampling-map.csv \
/evaluation/client-performance-f1.csv \
"$PUBLIC_KEY" \
--address=http://127.0.0.1:1234 \
--locations=100000
For f=0.1, manually modify cmd/geopki-client-performance/geopki-client-performance.go, set F_GROW=0.1, and re-build the binary using make.
make clean
make client-performance
python3 performance/client/client-performance.py \
/sampling-maps/sampling-map.csv \
/evaluation/client-performance-f01.csv \
"$PUBLIC_KEY" \
--address=http://127.0.0.1:1234 \
--locations=100000
To plot the results, use performance/client/client-performance-plot.py.
The script accepts two positional arguments, the output of performance/client/client-performance.py and an output folder.
It plots the measurement and writes the images to the given output folder.
The thesis measurements are contained in ./performance/client/client-performance-measurements.zip.
Example:
python3 performance/client/client-performance-plot.py performance/client/client-performance.csv performance/client
To measure the throughput the server can handle, the specialized client ./cmd/geopki-client-throughput is used.
It spawns the given number of goroutines and sends pre-computed queries to the server during the specified time.
Bit string computations are all performed beforehand (by using the pre-computed sampling maps) and the cryptographic response verification is not performed.
The program writes the number of successful and failed responses to the console. Moreover, it collects the latencies of the issued queries and writes them to the console as well. Compared to the previous section, where all requests are performed sequentially, this measures the latency under high load.
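The measurement pattern is a standard fan-out; the following minimal sketch illustrates it. The `/v1/query` endpoint path and the plain GET request are placeholders: the real client sends pre-computed protobuf queries from a `.json` sampling map and skips response verification.

```go
package main

import (
	"fmt"
	"net/http"
	"sync"
	"sync/atomic"
	"time"
)

func main() {
	const workers = 8
	const duration = 10 * time.Second
	var ok, failed atomic.Int64
	// One latency slice per goroutine, so no locking is needed.
	latencies := make([][]time.Duration, workers)

	deadline := time.Now().Add(duration)
	var wg sync.WaitGroup
	for i := 0; i < workers; i++ {
		wg.Add(1)
		go func(id int) {
			defer wg.Done()
			for time.Now().Before(deadline) {
				start := time.Now()
				// Placeholder request; the real client sends pre-computed
				// protobuf queries from a .json sampling map.
				resp, err := http.Get("http://127.0.0.1:1234/v1/query")
				if err != nil || resp.StatusCode != http.StatusOK {
					failed.Add(1)
				} else {
					ok.Add(1)
					latencies[id] = append(latencies[id], time.Since(start))
				}
				if resp != nil {
					resp.Body.Close()
				}
			}
		}(i)
	}
	wg.Wait()
	fmt.Printf("successful=%d failed=%d\n", ok.Load(), failed.Load())
}
```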
For measurements, the python script ./performance/throughput/throughput.py is used.
It accepts two positional arguments, the path to a .json sampling map and an output path ending in .csv.
Moreover, the script accepts the parameter --address=http://127.0.0.1:1234 to specify the server's address as well as --repetitions=30 to specify how often the measurement should be repeated.
To reproduce the measurements from the thesis, use the following parameters:
For f=1
make clean
make client-throughput
python3 performance/throughput/throughput.py \
/sampling-maps/sampling-map-bitstrings-f1.json \
/evaluation/throughput-f1.csv \
--address=http://127.0.0.1:1234 \
--repetitions=30
For f=0.1
python3 performance/throughput/throughput.py \
/sampling-maps/sampling-map-bitstrings-f01.json \
/evaluation/throughput-f01.csv \
--address=http://127.0.0.1:1234 \
--repetitions=30
To plot the results, use performance/throughput/throughput-plot.py.
The script accepts two positional arguments, the output of performance/throughput/throughput.py and an output folder.
It plots the measurement and writes the images to the given output folder.
The thesis measurements are contained in ./performance/throughput/throughput-measurements.zip.
Example:
python3 performance/throughput/throughput-plot.py performance/throughput/throughput.csv performance/throughput
For ingestion, the specialized client ./cmd/geopki-client-ingestion is used.
Compared to all other clients, this one does not query the server but instead accesses the ingestion endpoints.
The program simply measures the time it takes to insert the certificates passed to it.
For measurements, the python script ./performance/ingestion/ingestion.py is used.
It accepts two positional arguments, the path to a website density map, e.g. europe-10.parquet, and an output path ending in .csv.
Moreover, the script accepts the parameter --address=http://127.0.0.1:1234 to specify the server's address, --insertion-key=abc to specify the insertion key, as well as --repetitions=30 to specify how often the measurement should be repeated.
Finally, the script accepts an optional --with-altitude flag to determine whether the generated certificates should be placed in the altitude subtrees (if the flag is set) or just the surface tree (if the flag is not set).
The script randomly samples locations from the given density map, weighted by the number of websites in the respective grid cell.
Within the grid cell, the script then samples a point uniformly at random and constructs a square at this location with a side-length of 30m.
If the --with-altitude flag is set, a random altitude is drawn uniformly at random, together with a height that is also drawn uniformly at random from the range [3m, 30m].
The script measures different batch sizes, i.e. different numbers of certificates that are sent to the server at the same time.
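The weighted sampling step can be pictured with the following sketch; the `DensityCell` type, its fields, and the example cells are illustrative assumptions, not the script's actual data model.

```go
package main

import (
	"fmt"
	"math/rand"
	"sort"
)

// DensityCell is a hypothetical stand-in for one row of the density map.
type DensityCell struct {
	MinLon, MinLat, MaxLon, MaxLat float64
	Websites                       int
}

// sampleLocation draws a grid cell with probability proportional to its
// website count, then a uniform point inside that cell.
func sampleLocation(cells []DensityCell, rng *rand.Rand) (lon, lat float64) {
	// Cumulative weights; a real implementation would precompute these once.
	cum := make([]int, len(cells))
	total := 0
	for i, c := range cells {
		total += c.Websites
		cum[i] = total
	}
	// Pick the first cell whose cumulative weight exceeds a uniform draw.
	r := rng.Intn(total)
	i := sort.SearchInts(cum, r+1)
	c := cells[i]
	// Uniform point within the chosen cell.
	lon = c.MinLon + rng.Float64()*(c.MaxLon-c.MinLon)
	lat = c.MinLat + rng.Float64()*(c.MaxLat-c.MinLat)
	return lon, lat
}

func main() {
	rng := rand.New(rand.NewSource(42))
	cells := []DensityCell{
		{8.0, 47.0, 8.1, 47.1, 100}, // dense cell, sampled ~10x more often
		{9.0, 46.0, 9.1, 46.1, 10},
	}
	lon, lat := sampleLocation(cells, rng)
	fmt.Printf("sampled location: %.5f, %.5f\n", lon, lat)
}
```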
To reproduce the measurements from the thesis, use the following parameters:
make clean
make client-ingestion
python3 performance/ingestion/ingestion.py \
/website-density/europe-10.parquet \
/evaluation/ingestion.csv \
--address=http://127.0.0.1:1234 \
--insertion-key=abc \
--repetitions=30 \
--with-altitude
Note that the ingestion only modifies the nodes_next table.
To swap the nodes and nodes_next tables, a release must be issued.
The last section details how to do this.
To plot the results, use performance/ingestion/ingestion-plot.py.
The script accepts two positional arguments, the output of performance/ingestion/ingestion.py and an output folder.
It plots the measurement and writes the images to the given output folder.
The thesis measurements are contained in ./performance/ingestion/ingestion.csv.
Example:
python3 performance/ingestion/ingestion-plot.py performance/ingestion/ingestion.csv performance/ingestion
To issue a release, the specialized client ./cmd/release is used.
It accesses a special endpoint to trigger a release.
The timing measurements of how long the release takes are written to the server's console.
Example:
go run ./cmd/release --address=http://127.0.0.1:1234 --insertion-key=abc