
Proposal: Branch Read Optimization #10061

Open

N-o-Z wants to merge 2 commits into master from proposal/branch-read-opt

Conversation

@N-o-Z
Member

@N-o-Z N-o-Z commented Jan 28, 2026

Proposal for branch read optimization

@N-o-Z N-o-Z requested review from a team and ozkatz January 28, 2026 02:06
@N-o-Z N-o-Z self-assigned this Jan 28, 2026
@N-o-Z N-o-Z added proposal exclude-changelog PR description should not be included in next release changelog minor-change Used for PRs that don't require issue attached labels Jan 28, 2026
Comment on lines +197 to +200

```
if !branch.Dirty {
	branch.Dirty = true
	// This triggers a branch record update
}
```
Contributor
Too late to update it now. There's already an entry in staging. If another read request comes, it should be able to see it.

Member Author

You are absolutely right: we must set the dirty bit before any write.

@N-o-Z N-o-Z requested a review from itaiad200 January 29, 2026 02:04
@ozkatz
Collaborator

ozkatz commented Feb 2, 2026

Out of curiosity - do we know how common this case is? (i.e. % of reads that go to "clean" branches?) Might help us understand if the added complexity is worth it.

Perhaps a more granular approach could be beneficial: we can go further than a single boolean and maintain a small bitmap with one bit per range, essentially lakeFS' take on dirty pages.
Instead of reading the bool, read the bitmap; if the requested range(s) are 0, you're good to read just the underlying committed data. If a bit is 1, check staging. On write, mark 1 for any range affected by the write.

Contributor

@arielshaqed arielshaqed left a comment

This is a really great idea! Enhancing the structure of branches to boost performance.

But I'm not sure how this exact proposal can work correctly at the branch level; I think it will be easier at the token level. Requesting changes to understand how we know a branch is clean.

1. Attempts to read from the current staging token.
2. Falls back to committed data if not found.

This happens **even when the branch has no staged changes**.
Contributor
As a workaround, users can read `lakefs://repo/branch@/`. Obviously, in order to do this the user needs to know that this is what they want. (Alternatively, this could be an easy way to show the performance difference!)

Member Author

We already know the performance difference: for example, DDB has hard per-partition limits for reads (3000/s) and writes (1000/s).


### High-level idea

Introduce a **branch-level boolean flag**:
Contributor

An alternative might be to measure dirtiness per token. Doing this could additionally reduce read pressure during commits, when read pressure can anyway be higher.

Member Author

The read decision in Graveler is inherently branch-level: if any token (current or sealed) may contain entries, reads must consult staging. Tracking dirtiness per token doesn’t change that requirement and mostly adds state and maintenance complexity.

Commit-time read pressure is transient; the hot-partition issue we’re addressing is steady-state reads on clean branches. A branch-level dirty flag targets that directly with much lower risk.


```
dirty = true  ⟺ (StagingToken has entries) OR (SealedTokens is non-empty)
dirty = false ⟺ (StagingToken is empty) AND (SealedTokens is empty)
```
Contributor

Intermediate cases exist! For instance, while committing a clean branch (assume appropriate flags!) there is an empty staging token but non-empty sealed tokens. This proposal forces lakeFS to consider the branch dirty even though it could know it is clean.

The correctness requirement for `dirty` is that if there are changes to the branch then `dirty` is set. An additional _performance_ requirement is that if `dirty` is set then usually there are changes to the branch.


For existing branches without the `dirty` field:

- **Default value: `true`** (conservative/safe)
Contributor
The current version of Google protobufs has all fields optional, which means no custom default values. I therefore suggest using a `clean` field instead: the default of `false` is precisely what we want.

Member Author
That's a good point. We can invert the logic accordingly without changing the design.

Operations that guarantee the absence of uncommitted changes set `dirty = false`.

Examples:
- successful commit (after sealed tokens are cleared)
Contributor

I don't understand: writes can occur concurrently with the commit, and there might even be other concurrent commits. So there may be sealed tokens, or the staging token could already be dirty. You could work around the first, but I do not see how to work around the second.
(This may be an argument in favour of dirty-per-token.)

Member Author

Clearing `dirty` is not unconditional. It must be done via a conditional branch update that succeeds only if staging and sealed tokens are empty at that moment.
If a concurrent write or another commit introduces staged data, the condition fails and `dirty` remains true. False positives are acceptable; false negatives are not. This is the same concurrency pattern already used for commit ID and token rotation, and per-token dirtiness doesn't eliminate the need for these conditional checks.

@N-o-Z
Member Author

N-o-Z commented Feb 5, 2026

> Out of curiosity - do we know how common this case is? (i.e. % of reads that go to "clean" branches?) Might help us understand if the added complexity is worth it.

We don't really know how common this case is - this is the reason for suggesting the phased implementation which introduced the metrics to provide visibility.

> Perhaps a more granular approach could be beneficial: we can go further than a single boolean and maintain a small bitmap: one bit per range - essentially lakeFS' take on dirty pages. Instead of reading the bool, read the bitmap; and if the requested range(s) are 0, you're good to read just the underlying committed data. if it's 1, check staging. on write, mark 1 for any range affected by the write.

I am worried the bitmap approach might add a lot of complexity to this solution and create additional dangerous pitfalls:

  • Since ranges aren't stable in Graveler (due to compaction and layout evolution), tying correctness to "range IDs" adds fragile, correctness-critical logic.
  • Extra work on the read path: To consult a bitmap you first need to resolve key/prefix -> range(s), which likely requires additional metadata reads and can cost as much as the staging lookup we’re trying to avoid (especially for List).
  • Higher write and coordination cost: Updating a shared bitmap on writes adds contention and write amplification and may just move the hot spot elsewhere.

The branch-level dirty flag addresses the bottleneck (hot partitions) with far less complexity and risk.

Additionally, this flag gives us a simple way to validate how frequently this scenario actually occurs before considering more granular optimizations.
