Longbow is a distributed, high-performance vector database built for modern AI workloads. It leverages zero-copy data paths, SIMD optimizations, and advanced storage backends to deliver sub-millisecond latency.
- High Performance: Built on Apache Arrow for zero-copy data transfer.
- Distributed: Consistent hashing and gossip-based membership (SWIM protocol).
- Optimized Storage: Optional `io_uring` WAL backend for high-throughput ingestion.
- Hardware Aware: NUMA-aware memory allocation and SIMD vector distance calculations.
- Smart Client: Resilient Go SDK that handles request routing transparently.
Longbow uses a shared-nothing architecture where every node is identical. Data is sharded across the cluster using consistent hashing.
See Architecture Guide for a deep dive.
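To make the sharding model concrete, here is a minimal consistent-hash ring in Go. This is an illustrative sketch, not Longbow's actual implementation; the `Ring` type, the virtual-node count, and the FNV hash are assumptions for demonstration only.

```go
package main

import (
	"fmt"
	"hash/fnv"
	"sort"
)

// Ring is a minimal consistent-hash ring. Each node is placed at several
// virtual points so keys spread evenly; a key maps to the first node
// clockwise from its hash.
type Ring struct {
	points []uint32          // sorted hash positions on the ring
	owners map[uint32]string // position -> node name
}

func hash32(s string) uint32 {
	h := fnv.New32a()
	h.Write([]byte(s))
	return h.Sum32()
}

func NewRing(nodes []string, vnodes int) *Ring {
	r := &Ring{owners: make(map[uint32]string)}
	for _, n := range nodes {
		for v := 0; v < vnodes; v++ {
			p := hash32(fmt.Sprintf("%s#%d", n, v))
			r.points = append(r.points, p)
			r.owners[p] = n
		}
	}
	sort.Slice(r.points, func(i, j int) bool { return r.points[i] < r.points[j] })
	return r
}

// Owner returns the node responsible for key.
func (r *Ring) Owner(key string) string {
	h := hash32(key)
	i := sort.Search(len(r.points), func(i int) bool { return r.points[i] >= h })
	if i == len(r.points) {
		i = 0 // wrap around the ring
	}
	return r.owners[r.points[i]]
}

func main() {
	ring := NewRing([]string{"node-a", "node-b", "node-c"}, 64)
	fmt.Println(ring.Owner("dataset/users/shard-7"))
}
```

Because only the keys nearest a departing node's points move, rebalancing when nodes join or leave touches a small fraction of the data.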
- Go 1.25+
- Linux (recommended for best performance) or macOS
```bash
git clone https://github.com/23skdu/longbow.git
cd longbow
go build -o bin/longbow ./cmd/longbow
./scripts/start_local_cluster.sh
```

Longbow includes a distributed benchmark tool:

```bash
go build -o bin/bench-tool ./cmd/bench-tool
./bin/bench-tool --mode=ingest --concurrency=4 --duration=10s
```

Longbow is configured via environment variables. See Configuration for details.
Notable flags:
- `STORAGE_USE_IOURING=true` (enable the new `io_uring` Linux storage engine)
- `LONGBOW_GOSSIP_ENABLED=true` (enable distributed discovery)
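As an illustration of how such variables might be consumed at startup, here is a hypothetical sketch; the `Config` struct and field names are not from the Longbow codebase.

```go
package main

import (
	"fmt"
	"os"
)

// Config mirrors the two flags above; the struct itself is illustrative.
type Config struct {
	UseIOUring    bool // STORAGE_USE_IOURING
	GossipEnabled bool // LONGBOW_GOSSIP_ENABLED
}

// LoadConfig reads the environment once at process start.
func LoadConfig() Config {
	return Config{
		UseIOUring:    os.Getenv("STORAGE_USE_IOURING") == "true",
		GossipEnabled: os.Getenv("LONGBOW_GOSSIP_ENABLED") == "true",
	}
}

func main() {
	fmt.Printf("%+v\n", LoadConfig())
}
```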
- Protocol: Apache Arrow Flight (over gRPC/HTTP2).
- Search: High-performance HNSW vector search with hybrid (dense + sparse) support.
- Filtering: Metadata-aware predicate filtering for searches and scans.
- Lifecycle: Support for vector deletion via tombstones.
- Durability: WAL with Apache Parquet format snapshots.
- Storage: In-memory ephemeral storage for zero-copy, high-speed access.
- Observability: Structured JSON logging and 100+ Prometheus metrics.
To ensure high performance under load, Longbow splits traffic into two dedicated gRPC servers:
- Data Server (Port 3000): Handles heavy I/O operations (`DoGet`, `DoPut`, `DoExchange`).
- Meta Server (Port 3001): Handles lightweight metadata operations (`ListFlights`, `GetFlightInfo`, `DoAction`).
Why? Separating these concerns prevents long-running data transfer operations from blocking metadata requests. This ensures that clients can always discover streams and check status even when the system is under heavy write/read load.
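The split is transparent at the protocol level: a client simply dials the right port for each call. Below is a hedged sketch using the Apache Arrow Go Flight bindings; the module version, local addresses, and insecure credentials are assumptions for a local cluster.

```go
package main

import (
	"context"
	"fmt"
	"log"

	"github.com/apache/arrow/go/v14/arrow/flight"
	"google.golang.org/grpc"
	"google.golang.org/grpc/credentials/insecure"
)

func main() {
	ctx := context.Background()

	// Metadata operations go to the meta server on port 3001.
	metaConn, err := grpc.Dial("localhost:3001",
		grpc.WithTransportCredentials(insecure.NewCredentials()))
	if err != nil {
		log.Fatal(err)
	}
	defer metaConn.Close()
	meta := flight.NewFlightServiceClient(metaConn)

	// List available streams; this stays responsive even under heavy load
	// because it never competes with bulk transfers for a connection.
	stream, err := meta.ListFlights(ctx, &flight.Criteria{})
	if err != nil {
		log.Fatal(err)
	}
	for {
		info, err := stream.Recv()
		if err != nil {
			break // io.EOF once the listing is exhausted
		}
		fmt.Println(info.GetFlightDescriptor())
	}

	// Heavy reads/writes (DoGet/DoPut/DoExchange) use a separate
	// connection to the data server on port 3000.
	dataConn, err := grpc.Dial("localhost:3000",
		grpc.WithTransportCredentials(insecure.NewCredentials()))
	if err != nil {
		log.Fatal(err)
	}
	defer dataConn.Close()
	_ = flight.NewFlightServiceClient(dataConn)
}
```

In practice the Go SDK's smart client (see above) handles this routing for you.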
Longbow exposes Prometheus metrics on a dedicated port to ensure observability without impacting the main Flight service.
- Scrape Port: 9090
- Scrape Path: /metrics
| Metric Name | Type | Description |
|---|---|---|
| `longbow_flight_operations_total` | Counter | Total number of Flight operations (DoGet, DoPut, etc.) |
| `longbow_flight_duration_seconds` | Histogram | Latency distribution of Flight operations |
| `longbow_flight_rows_processed_total` | Counter | Total rows processed in scans and searches |
| `longbow_vector_search_latency_seconds` | Histogram | Latency of k-NN search operations |
| `longbow_vector_index_size` | Gauge | Current number of vectors in the index |
| `longbow_tombstones_total` | Gauge | Number of active deleted-vector tombstones |
| `longbow_index_queue_depth` | Gauge | Depth of the asynchronous indexing queue |
| `longbow_memory_fragmentation_ratio` | Gauge | Ratio of system memory reserved vs. used |
| `longbow_wal_bytes_written_total` | Counter | Total bytes written to the WAL |
| `longbow_snapshot_duration_seconds` | Histogram | Duration of the Parquet snapshot process |
| `longbow_evictions_total` | Counter | Total number of evicted records (LRU) |
| `longbow_ipc_decode_errors_total` | Counter | Count of IPC decoding errors or panics |
For a detailed explanation of all 100+ metrics, see Metrics Documentation.
Standard Go runtime metrics are also exposed.
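As a quick smoke test (an illustrative snippet, assuming a node running locally with the defaults above), you can fetch the endpoint and print just the `longbow_*` series:

```go
package main

import (
	"bufio"
	"fmt"
	"log"
	"net/http"
	"strings"
)

func main() {
	// Port 9090 and path /metrics match the scrape settings above.
	resp, err := http.Get("http://localhost:9090/metrics")
	if err != nil {
		log.Fatal(err)
	}
	defer resp.Body.Close()

	// Print only longbow_* lines to keep the output short.
	scanner := bufio.NewScanner(resp.Body)
	for scanner.Scan() {
		line := scanner.Text()
		if strings.HasPrefix(line, "longbow_") {
			fmt.Println(line)
		}
	}
}
```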
```bash
go run cmd/longbow/main.go
```

Or build and run with Docker:

```bash
docker build -t longbow .
docker run -p 3000:3000 -p 3001:3001 -p 9090:9090 longbow
```