
Conversation

@tilacog (Contributor) commented Apr 11, 2021

This PR chunks the application of entity modifications in an attempt to fix #2330.

Given that the maximum number of PostgreSQL bind parameters per query is 65535, it makes sense to use N chunks where:

chunk size = 65535 / number of fields per entity

Nonetheless, we must remain wary that parameters other than entity fields are also bound in each query, such as block numbers and auxiliary data.

Therefore, it would be reasonable for us to discuss ways to accommodate those extra bindings, probably by reducing chunk size by an arbitrary amount.
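
For illustration only, here is a minimal sketch of that arithmetic (not the PR's actual code); the constant names and the headroom value are invented for this example:

```rust
// Sketch only: the chunk-size arithmetic from the description above,
// with an arbitrary headroom subtracted for the extra bindings
// (block numbers, auxiliary data, etc.).
const POSTGRES_MAX_BIND_PARAMS: usize = 65_535;
const HEADROOM: usize = 35; // arbitrary reserve, not a measured value

fn chunk_size(fields_per_entity: usize) -> usize {
    assert!(fields_per_entity > 0);
    ((POSTGRES_MAX_BIND_PARAMS - HEADROOM) / fields_per_entity).max(1)
}

fn main() {
    // e.g. an entity type with 12 fields
    println!("up to {} entities per query", chunk_size(12));
}
```
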

@tilacog tilacog requested a review from lutter April 11, 2021 16:23
@tilacog tilacog marked this pull request as draft April 11, 2021 16:30
@tilacog tilacog marked this pull request as ready for review April 11, 2021 17:50
@tilacog tilacog force-pushed the tiago/chunk-apply-entity-modifications branch from 7335e23 to fe15a91 Compare April 13, 2021 21:34
@tilacog tilacog marked this pull request as draft April 13, 2021 21:37
@tilacog tilacog force-pushed the tiago/chunk-apply-entity-modifications branch from fe15a91 to 5957d89 Compare April 13, 2021 21:52
@tilacog (Contributor, Author) commented Apr 13, 2021

I've moved the batch logic to the relational.rs module, where we can use table information.

I hope I got the math right (a rough sketch follows the list):

  • InsertQuery uses one bind for block_range and one bind for each column in the given table.
  • ClampRangeQuery always uses 2 binds: one for block_range and another for the entity ids array. Assuming the whole array counts as a single bind, I understand we shouldn't need to batch it.
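
To make that math concrete, a rough, self-contained sketch of how the batching could look; `Row` and `insert_chunk` are stand-ins for the real Diesel-based InsertQuery machinery in relational.rs, not the actual implementation:

```rust
// Illustrative sketch only, under the per-row bind count described above.
struct Row {
    values: Vec<String>, // one value per column of the table
}

fn insert_chunk(chunk: &[Row]) -> Result<(), String> {
    // In the real code this would build one INSERT ... VALUES statement,
    // binding every column value plus one block_range per row.
    println!("inserting {} rows in one query", chunk.len());
    Ok(())
}

fn insert_in_chunks(rows: &[Row], column_count: usize) -> Result<(), String> {
    let binds_per_row = column_count + 1; // columns + block_range
    let chunk_size = (65_535 / binds_per_row).max(1);
    for chunk in rows.chunks(chunk_size) {
        insert_chunk(chunk)?;
    }
    Ok(())
}

fn main() -> Result<(), String> {
    // e.g. 100,000 rows of a 12-column table
    let rows: Vec<Row> = (0..100_000)
        .map(|i| Row { values: vec![i.to_string(); 12] })
        .collect();
    insert_in_chunks(&rows, 12)
}
```
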

@tilacog tilacog marked this pull request as ready for review April 13, 2021 22:39
@tilacog tilacog requested a review from lutter April 13, 2021 22:39
@lutter (Collaborator) left a comment

Yes, I agree with what you say about ClampRangeQuery - the one thing to check is whether there is some other limit on array size. IIRC, in other contexts it was actually advantageous to break queries with large arrays into smaller ones, because you get O(n^2) behavior from scanning these large arrays, so that, for example, 10 queries with an array of length 1,000 were faster than 1 query with an array of length 10,000.
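
Purely for illustration, that idea could look something like the sketch below; the names are invented and the chunk size of 1,000 is just the example figure from the comment, not a measured optimum:

```rust
// Sketch of capping the id-array size per ClampRangeQuery-style UPDATE,
// rather than sending one huge `id = ANY($1)` array. Not the actual code.
const MAX_IDS_PER_QUERY: usize = 1_000;

fn clamp_in_chunks(ids: &[String]) {
    for chunk in ids.chunks(MAX_IDS_PER_QUERY) {
        // one UPDATE ... WHERE id = ANY($1) per chunk, binding the whole
        // chunk as a single array parameter plus the block number
        println!("clamping block_range for {} ids", chunk.len());
    }
}

fn main() {
    let ids: Vec<String> = (0..10_000).map(|i| i.to_string()).collect();
    clamp_in_chunks(&ids);
}
```
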

@tilacog tilacog closed this Apr 16, 2021
@tilacog tilacog reopened this Apr 16, 2021
@tilacog (Contributor, Author) commented Apr 16, 2021

> Yes, I agree with what you say about ClampRangeQuery - the one thing to check is whether there is some other limit on array size. IIRC, in other contexts it was actually advantageous to break queries with large arrays into smaller ones, because you get O(n^2) behavior from scanning these large arrays, so that, for example, 10 queries with an array of length 1,000 were faster than 1 query with an array of length 10,000.

The docs state that array size limits are ignored, but Postgres will complain if the field size exceeds 1GB.
From what I have (quickly) researched, I couldn't find any info on what the optimal array size for inserting values would be.

Do you believe this could be better addressed and reviewed in a new issue/PR?
If so, I can create one and assign myself to it.

@tilacog tilacog merged commit bdb1a7a into master Apr 20, 2021

Development

Successfully merging this pull request may close these issues.

Large inserts fail
