[Art][Wal]Unbound index allocations #19901

artjomPlaunov · 2025-11-24T13:10:42Z

Follow up to #19477, fix for https://github.com/duckdblabs/duckdb-internal/issues/6613

The previous PR added support for buffering and replaying WAL index deletes, however that introduced a memory over-allocation issue, as the UnboundIndex was storing a vector of BufferedIndexData, which stored each buffered operation in a ColumnDataCollection. This was extremely wasteful because if there are interleavings (insert -> delete -> ...) a single operation to be replayed would be stored in a ColumnDataCollection with an internal allocation of STANDARD_VECTOR_SIZE.

EDIT: See @Mytherin's comment below, this PR fixes the issue by changing the way buffering works, now we use two buffers, one for inserts, and another for deletes. Since the inserts and deletes may be interleaved, however, we need an additional vector data structure that stores replay operations and their intervals within the respective buffer. This all stored in BufferedIndexReplays within UnboundIndex.

Buffering data is much simpler now, as we can just append directly to either the insert or delete ColumnDataCollection, as well as appending a ReplayRange node (or extending the range of the last node, if the replay operation is the same type of operation).

Replaying is more efficient now, as we now maintain two interleaved scans on the respective contiguous ColumnDataCollections, fetching one DataChunk at a time to replay.

… + small changes

…x-allocations

taniabogatsch

Hi! Looks great! I just left a bunch of nits and then this is ready to go in from my side. :)

test/sql/storage/wal/wal_index_large_batch_interleaved.test

src/execution/index/unbound_index.cpp

src/execution/index/bound_index.cpp

…x-allocations

taniabogatsch

Just a few more comments / questions.

src/execution/index/unbound_index.cpp

…x-allocations

taniabogatsch

No more comments from my side! Let's run CI? :)

artjomPlaunov · 2025-11-27T09:55:15Z

Yep, thanks for the review!

Mytherin · 2025-11-27T12:43:21Z

Thanks for the PR!

Perhaps a simpler and more efficient solution here could be to share the ColumnDataCollection between all BufferedIndexData insert / delete nodes. We really only store two different collections:

Insert data, holding new data to be inserted
Delete data, holding row ids to be deleted

We could have two separate ColumnDataCollection nodes for these, and have each BufferedIndexData refer to a range within the ColumnDataCollection. These ranges will then always be consecutive. For example, if we have the following operations:

INSERT INTO tbl VALUES (2);
DELETE FROM tbl WHERE rowid=1;
COMMIT;

INSERT INTO tbl VALUES (3);
DELETE FROM tbl WHERE rowid=2;
COMMIT;

INSERT INTO tbl VALUES (4);
DELETE FROM tbl WHERE rowid=3;
COMMIT;

We would have the following collections:

InsertCollection

i: [2, 3, 4]

DeleteCollection

rowids: [1, 2, 3]

With the following nodes:

BufferedIndexData
    type: INSERT
    start: 0
    end: 1

BufferedIndexData
    type: DELETE
    start: 0
    end: 1
    
BufferedIndexData
    type: INSERT
    start: 1
    end: 2

BufferedIndexData
    type: DELETE
    start: 1
    end: 2

BufferedIndexData
    type: INSERT
    start: 2
    end: 3

BufferedIndexData
    type: DELETE
    start: 2
    end: 3

This has a number of advantages:

We only need to scan two ColumnDataCollections, and we do so in-order, so this will all be memory-adjacent and efficient
Constructing these will also be efficient, as we're not constantly allocating tiny batches
I think we will likely end up using less memory in most cases, as the (wasted) empty space is capped to the empty space in the two ColumnDataCollections - versus having potentially much more (wasted) empty space spread across different chunks and collections
From a code perspective I think this might also be simpler and easier to test fully - given we don't have as many special cases as the proposed solution here adds

artjomPlaunov · 2025-11-27T13:14:42Z

Thanks @Mytherin that's a great idea, going to rewrite it!

…playing buffered index operations

taniabogatsch

Thanks for the changes, looking so shiny now haha - left a few comments. :)

src/execution/index/bound_index.cpp

src/execution/index/unbound_index.cpp

src/include/duckdb/execution/index/unbound_index.hpp

…x-allocations

artjomPlaunov · 2025-12-01T12:08:29Z

@taniabogatsch Thank you for the review! I will run the CI now

Mytherin · 2025-12-01T13:35:24Z

Looks great, thanks for the changes!

[Art][Wal]Unbound index allocations (duckdb/duckdb#19901) Null assertion on denormalized_table argument (duckdb/duckdb#19947)

[Art][Wal]Unbound index allocations (duckdb/duckdb#19901) Null assertion on denormalized_table argument (duckdb/duckdb#19947) Co-authored-by: krlmlr <[email protected]>

artjomPlaunov added 5 commits November 21, 2025 14:25

unbound index buffer allocation

31fa1d8

format

48f8a4a

refactoring

dca9de7

remove logic for handling larger than standard_vector_size operations…

3931d46

… + small changes

clarifying comment

1adb92a

artjomPlaunov force-pushed the unbound-index-allocations branch from 65dca39 to 795dcef Compare November 24, 2025 13:44

Merge remote-tracking branch 'upstream/v1.4-andium' into unbound-inde…

9e8ded0

…x-allocations

artjomPlaunov force-pushed the unbound-index-allocations branch from 795dcef to 9e8ded0 Compare November 24, 2025 14:59

taniabogatsch self-requested a review November 26, 2025 10:11

taniabogatsch reviewed Nov 26, 2025

View reviewed changes

artjomPlaunov added 5 commits November 26, 2025 14:42

nits

5bae84d

memory limit slow test

af02c71

OOM test

15dbdac

Merge remote-tracking branch 'upstream/v1.4-andium' into unbound-inde…

8b03b01

…x-allocations

format

cdb5282

taniabogatsch reviewed Nov 26, 2025

View reviewed changes

src/execution/index/unbound_index.cpp Outdated Show resolved Hide resolved

src/execution/index/unbound_index.cpp Outdated Show resolved Hide resolved

src/execution/index/unbound_index.cpp Outdated Show resolved Hide resolved

artjomPlaunov force-pushed the unbound-index-allocations branch from dc66093 to b8cc6e8 Compare November 26, 2025 16:11

append into small_chunk first, then spill to column_data_collection

bfe0aad

artjomPlaunov force-pushed the unbound-index-allocations branch from b8cc6e8 to bfe0aad Compare November 26, 2025 16:16

Merge remote-tracking branch 'upstream/v1.4-andium' into unbound-inde…

14adebd

…x-allocations

taniabogatsch approved these changes Nov 27, 2025

View reviewed changes

artjomPlaunov marked this pull request as ready for review November 27, 2025 09:55

artjomPlaunov marked this pull request as draft November 27, 2025 13:13

initial rewrite to use two ColumnDataCollections for buffering and re…

e9376c6

…playing buffered index operations

artjomPlaunov force-pushed the unbound-index-allocations branch from 5647eb0 to e9376c6 Compare November 27, 2025 19:13

artjomPlaunov added 2 commits November 27, 2025 23:02

off by one, slice inside of loop

4f91557

edge case

fbc6a04

artjomPlaunov added 6 commits November 28, 2025 11:28

bug fix

5bb89af

OOM test

1b2a748

remove old tests

2c105fc

interval merging test

e88f8ee

wal index replay multi column table test

3e7f172

remove unecessary include

b970ec8

taniabogatsch suggested changes Dec 1, 2025

View reviewed changes

artjomPlaunov added 2 commits December 1, 2025 13:07

small changes

68542b2

Merge remote-tracking branch 'upstream/v1.4-andium' into unbound-inde…

003e2c4

…x-allocations

artjomPlaunov marked this pull request as ready for review December 1, 2025 12:09

artjomPlaunov marked this pull request as draft December 1, 2025 12:14

consts

6d273f2

artjomPlaunov marked this pull request as ready for review December 1, 2025 12:21

taniabogatsch added the Ready For Review label Dec 1, 2025

Mytherin approved these changes Dec 1, 2025

View reviewed changes

taniabogatsch added the Ready To Merge label Dec 1, 2025

pdet merged commit 52fe0d2 into duckdb:v1.4-andium Dec 1, 2025
62 checks passed

github-actions bot pushed a commit to duckdb/duckdb-r that referenced this pull request Dec 1, 2025

vendor: Update vendored sources to duckdb/duckdb@52fe0d2

a77d8a2

[Art][Wal]Unbound index allocations (duckdb/duckdb#19901) Null assertion on denormalized_table argument (duckdb/duckdb#19947)

github-actions bot mentioned this pull request Dec 1, 2025

vendor: Update vendored sources to duckdb/duckdb@52fe0d2bffdc766e7a75a9f966c6db537e3ffdca duckdb/duckdb-r#1793

Merged

artjomPlaunov deleted the unbound-index-allocations branch December 30, 2025 12:17

[Art][Wal]Unbound index allocations #19901

[Art][Wal]Unbound index allocations #19901

Uh oh!

Conversation

artjomPlaunov commented Nov 24, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

taniabogatsch left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

taniabogatsch left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

taniabogatsch left a comment

Choose a reason for hiding this comment

Uh oh!

artjomPlaunov commented Nov 27, 2025

Uh oh!

Mytherin commented Nov 27, 2025

Uh oh!

artjomPlaunov commented Nov 27, 2025

Uh oh!

taniabogatsch left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

artjomPlaunov commented Dec 1, 2025

Uh oh!

Mytherin commented Dec 1, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

artjomPlaunov commented Nov 24, 2025 •

edited

Loading