segment-performance.md

Prisma Streams Segment Performance

Status: informational. This repo does not ship a dedicated segment perf tool yet.

Current behavior:

Segment sealing defaults to 16 MiB / 100,000 rows. Auto-tune presets through 1024 MiB use 8 MiB / 50,000 rows to keep segment build memory bounded on small hosts.
The segmenter keeps a trailing window over the latest 8 sealed segments for the same stream and computes a cheap compressed/logical ratio from stored metadata (size_bytes / payload_bytes).
If recent segments compressed below 50% of the logical target, the next segment's logical byte target is raised so the expected compressed segment reaches at least 50% of DS_SEGMENT_MAX_BYTES.
The boost is capped at 5x the base logical target, which is equivalent to treating anything stronger than 10:1 compression as 10:1 for this heuristic.
This heuristic is best-effort:
- it never reduces the logical byte target below DS_SEGMENT_MAX_BYTES
- cut eligibility uses the same raised logical-byte target, so the segmenter does not begin cutting early just because backlog crossed the base byte threshold
- it does not decode old segments
- it still seals immediately when DS_SEGMENT_TARGET_ROWS is reached
The read path uses per-block bloom filters plus a single forward pass over cached segment bytes for keyed scans.
For unkeyed offset reads inside sealed segments, the reader now uses the segment footer's block index to jump directly to the first block whose firstOffset can satisfy the request. It does not decode earlier blocks just to walk forward to the requested offset.
For remote segments, the reader now downloads the full segment object once, stores it in the local segment cache, and serves block/footer reads from that local file instead of issuing remote range reads.
Cold remote segment reads issue a direct object GET and treat a missing-object GET failure as null; they do not perform an existence-probe round trip before fetching the segment body.
A bounded disk cache (DS_SEGMENT_CACHE_MAX_BYTES) stores recently used segment objects, and keyed reads now populate that cache on first touch instead of walking the remote object block-by-block.
Hot cached segment files are served from Bun.mmap() views, so keyed reads do not repeatedly open() / read() / close() tiny footer and block slices.
If mmap is unavailable for a cached or local segment file, the reader falls back to a single full-file byte buffer for that segment scan. It does not fall back to repeated tiny local file reads.
For uploaded keyed reads, the hot path is now: download the segment once into the local cache, map the cached file, read the footer trailer once to find the data limit, then walk block headers and matching blocks from the mapped bytes. The request path no longer performs remote range reads or repeated local syscall-heavy range reads.
For keyed reads with routing-index coverage, the reader now plans the sealed segment scan before it starts reading bytes:
- only indexed candidate segments are scanned from the indexed uploaded prefix
- the uncovered uploaded tail is scanned sequentially for correctness
- the request path no longer does one SQLite findSegmentForOffset() lookup per sealed segment just to skip non-candidate history
For unversioned JSON streams, HTTP format=json responses now reuse stored payload bytes directly and assemble the response body with native buffer concatenation. The request path no longer decodes and re-serializes each JSON record just to emit an array response.
since + key cursor seeks use the same planned sealed-segment scan, so they also avoid full indexed-prefix walks when the routing index is selective.
_search uses the same planning idea when exact clauses provide a candidate segment set: it scans only candidate indexed segments plus any uncovered tail, instead of iterating the full indexed sealed prefix one segment at a time.
_search append-order reverse scans (sort: ["offset:desc"]) use the segment footer's block index to decode blocks newest-to-oldest and can stop after the requested page is full. They no longer decode every block in the segment before walking records backward.
When bundled companions produce per-segment candidate doc IDs for an append-order reverse search, _search walks those doc IDs newest-to-oldest and decodes only the blocks containing candidate hits. Broad candidates still behave like normal newest-first scans, but rare exact candidates no longer force every intervening block to decode.
For remote segments on that candidate-doc path, _search range-reads the segment footer and only the needed compressed data blocks. It falls back to the full segment source only when the footer is unavailable.
Background routing, lexicon, exact, and bundled-companion builders now yield at bounded per-record or per-block intervals and slow down further while a foreground read or search is active. That keeps hot keyed reads fast even while async indexing is catching up.
Routing, exact, and lexicon compactions also defer briefly after recent foreground traffic, so back-to-back keyed reads do not bounce between fast request work and a newly resumed large compaction pass.

Benchmarking:

bun run experiments/bench/segment_cached_scan_perf.ts runs a repeatable before/after benchmark against a copied real segment fixture from the remote golden-stream-2 cache.
The benchmark reports:
- a legacy syscall-heavy keyed scan baseline over the copied segment
- the new mapped one-pass keyed scan over the same bytes

If you need a repeatable perf benchmark, use the synthetic ingest benchmark and measure read latency at the HTTP layer, or build a small custom driver that:

Appends a known number of records
Forces a segment seal
Issues keyed and unkeyed reads
Measures response latency and bytes read

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Prisma Streams Segment Performance

Uh oh!

FilesExpand file tree

segment-performance.md

Latest commit

History

segment-performance.md

File metadata and controls

Prisma Streams Segment Performance