feat: specify max_row_group_size in geoparquet WriterBuilder #846

Merged
gadomski merged 11 commits into main from feat/row-group-size on Nov 18, 2025

Conversation

@hrodmn
Collaborator

@hrodmn hrodmn commented Nov 16, 2025

Related to

Description

This adds the max_row_group_size parameter to geoparquet::WriterBuilder and sets the default value to 150_000 (per the recommendations in cholmes' guide).

The rustac CLI has a new option, --parquet-max-row-group-size, that sets the max_row_group_size on output geoparquet files.
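To illustrate what the setting controls, here is a minimal, standard-library-only sketch. It is not the actual `stac` crate API: the `WriterBuilder` struct, the `row_group_lengths` helper, and the `DEFAULT_MAX_ROW_GROUP_SIZE` constant below are hypothetical stand-ins, and only the parameter name and the 150_000 default come from this PR. It assumes the writer fills each row group up to the maximum before starting the next one.

```rust
/// Default taken from the PR description.
pub const DEFAULT_MAX_ROW_GROUP_SIZE: usize = 150_000;

/// Hypothetical stand-in for a geoparquet writer builder (not the real API).
#[derive(Debug, Clone, PartialEq)]
pub struct WriterBuilder {
    max_row_group_size: usize,
}

impl Default for WriterBuilder {
    fn default() -> Self {
        Self {
            max_row_group_size: DEFAULT_MAX_ROW_GROUP_SIZE,
        }
    }
}

impl WriterBuilder {
    /// Creates a builder with the default maximum row group size.
    pub fn new() -> Self {
        Self::default()
    }

    /// Overrides the maximum number of rows per row group.
    pub fn max_row_group_size(mut self, size: usize) -> Self {
        self.max_row_group_size = size;
        self
    }

    /// Illustrative helper: how `num_rows` rows would be split into row
    /// groups if each group is filled up to the maximum before the next
    /// one starts.
    pub fn row_group_lengths(&self, num_rows: usize) -> Vec<usize> {
        assert!(
            self.max_row_group_size > 0,
            "max_row_group_size must be positive"
        );
        let mut lengths = Vec::new();
        let mut remaining = num_rows;
        while remaining > 0 {
            let n = remaining.min(self.max_row_group_size);
            lengths.push(n);
            remaining -= n;
        }
        lengths
    }
}

fn main() {
    // With the default, 400_000 rows are split into three row groups.
    let default_groups = WriterBuilder::new().row_group_lengths(400_000);
    println!("{:?}", default_groups);

    // A smaller maximum yields more, smaller row groups.
    let smaller_groups = WriterBuilder::new()
        .max_row_group_size(100_000)
        .row_group_lengths(400_000);
    println!("{:?}", smaller_groups);
}
```

Smaller row groups generally let readers skip more data when filtering, at the cost of more per-group metadata; larger ones do the opposite.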

Checklist

Delete any checklist items that do not apply (e.g. if your change is minor, it may not require documentation updates).

  • Unit tests
  • Documentation, including doctests
  • Git history is linear
  • Commit messages are descriptive
  • (optional) Git commit messages follow conventional commits
  • Code is formatted (cargo fmt)
  • cargo test
  • Changes are added to the CHANGELOG

@hrodmn hrodmn requested a review from gadomski as a code owner November 16, 2025 19:29
@hrodmn hrodmn force-pushed the feat/row-group-size branch from 1b78bd8 to 31fe450 Compare November 16, 2025 19:31
@gadomski gadomski linked an issue Nov 17, 2025 that may be closed by this pull request
Member

@gadomski gadomski left a comment

Thanks for the options fix, this feels nice. Just one tweak that might let us do fewer changes.

@gadomski gadomski self-requested a review November 18, 2025 17:17
@hrodmn hrodmn requested a review from gadomski November 18, 2025 18:57
Member

@gadomski gadomski left a comment

:shipit:

@gadomski gadomski enabled auto-merge (squash) November 18, 2025 19:08
@gadomski gadomski merged commit 2bde538 into main Nov 18, 2025
11 checks passed
@gadomski gadomski deleted the feat/row-group-size branch November 18, 2025 19:15
gadomski added a commit that referenced this pull request Dec 1, 2025
🤖 I have created a release *beep* *boop*
---


<details><summary>pgstac: 0.4.0</summary>

## [0.4.0](pgstac-v0.3.2...pgstac-v0.4.0) (2025-12-01)


### ⚠ BREAKING CHANGES

* move stac_api crate into stac crate
([#869](#869))
* remove unused error enums
([#868](#868))

### Bug Fixes

* remove unused error enums
([#868](#868))
([cf0e815](cf0e815))


### Code Refactoring

* move stac_api crate into stac crate
([#869](#869))
([d0f7405](d0f7405))


### Dependencies

* The following workspace dependencies were updated
  * dependencies
    * stac bumped from 0.14.0 to 0.15.0
</details>

<details><summary>rustac: 0.2.0</summary>

## [0.2.0](rustac-v0.1.2...rustac-v0.2.0) (2025-12-01)


### ⚠ BREAKING CHANGES

* move stac_api crate into stac crate
([#869](#869))
* move api client to stac-io crate
([#864](#864))

### Features

* add bind argument when serving
([#871](#871))
([f3a3517](f3a3517))
* shell completions
([#874](#874))
([717c4ee](717c4ee)),
closes [#650](#650)
* specify max_row_group_size in geoparquet WriterBuilder
([#846](#846))
([2bde538](2bde538))


### Code Refactoring

* move api client to stac-io crate
([#864](#864))
([e06de28](e06de28)),
closes [#764](#764)
* move stac_api crate into stac crate
([#869](#869))
([d0f7405](d0f7405))


### Dependencies

* The following workspace dependencies were updated
  * dependencies
    * stac bumped from 0.14.0 to 0.15.0
    * stac-duckdb bumped from 0.2.0 to 0.3.0
    * stac-io bumped from 0.1.0 to 0.2.0
    * stac-server bumped from 0.3.2 to 0.4.0
    * stac-validate bumped from 0.5.0 to 0.6.0
</details>

<details><summary>stac: 0.15.0</summary>

## [0.15.0](stac-v0.14.0...stac-v0.15.0) (2025-12-01)


### ⚠ BREAKING CHANGES

* move stac_api crate into stac crate
([#869](#869))
* remove unused error enums
([#868](#868))
* consume a geoparquet writer on close
([#867](#867))

### Features

* add geoparquet writer encoder and object writing
([#863](#863))
([ec6e7de](ec6e7de))
* specify max_row_group_size in geoparquet WriterBuilder
([#846](#846))
([2bde538](2bde538))


### Bug Fixes

* allow writing stac-geoparquet with no assets
([#882](#882))
([b049a71](b049a71))
* remove circular dev dependency
([#886](#886))
([dcb9b49](dcb9b49))
* remove unused error enums
([#868](#868))
([cf0e815](cf0e815))


### Code Refactoring

* consume a geoparquet writer on close
([#867](#867))
([bdd95be](bdd95be))
* move stac_api crate into stac crate
([#869](#869))
([d0f7405](d0f7405))
</details>

<details><summary>stac-duckdb: 0.3.0</summary>

## [0.3.0](stac-duckdb-v0.2.2...stac-duckdb-v0.3.0) (2025-12-01)


### ⚠ BREAKING CHANGES

* move stac_api crate into stac crate
([#869](#869))

### Bug Fixes

* remove filename by default
([#855](#855))
([8bba676](8bba676))


### Code Refactoring

* move stac_api crate into stac crate
([#869](#869))
([d0f7405](d0f7405))


### Dependencies

* The following workspace dependencies were updated
  * dependencies
    * stac bumped from 0.14.0 to 0.15.0
  * dev-dependencies
    * stac-validate bumped from 0.5.0 to 0.6.0
</details>

<details><summary>stac-extensions: 0.1.2</summary>

## [0.1.2](stac-extensions-v0.1.1...stac-extensions-v0.1.2) (2025-12-01)


### Dependencies

* The following workspace dependencies were updated
  * dependencies
    * stac bumped from 0.14.0 to 0.15.0
</details>

<details><summary>stac-io: 0.2.0</summary>

## [0.2.0](stac-io-v0.1.2...stac-io-v0.2.0) (2025-12-01)


### ⚠ BREAKING CHANGES

* move stac_api crate into stac crate
([#869](#869))
* remove unused error enums
([#868](#868))
* move api client to stac-io crate
([#864](#864))

### Features

* add geoparquet writer encoder and object writing
([#863](#863))
([ec6e7de](ec6e7de))
* specify max_row_group_size in geoparquet WriterBuilder
([#846](#846))
([2bde538](2bde538))


### Bug Fixes

* remove unused error enums
([#868](#868))
([cf0e815](cf0e815))


### Code Refactoring

* move api client to stac-io crate
([#864](#864))
([e06de28](e06de28)),
closes [#764](#764)
* move stac_api crate into stac crate
([#869](#869))
([d0f7405](d0f7405))


### Dependencies

* The following workspace dependencies were updated
  * dependencies
    * stac bumped from 0.14.0 to 0.15.0
</details>

<details><summary>stac-server: 0.4.0</summary>

## [0.4.0](stac-server-v0.3.6...stac-server-v0.4.0) (2025-12-01)


### ⚠ BREAKING CHANGES

* move stac_api crate into stac crate
([#869](#869))
* remove unused error enums
([#868](#868))

### Bug Fixes

* remove unused error enums
([#868](#868))
([cf0e815](cf0e815))


### Code Refactoring

* move stac_api crate into stac crate
([#869](#869))
([d0f7405](d0f7405))


### Dependencies

* The following workspace dependencies were updated
  * dependencies
    * pgstac bumped from 0.3.0 to 0.4.0
    * stac bumped from 0.14.0 to 0.15.0
    * stac-duckdb bumped from 0.2.0 to 0.3.0
</details>

<details><summary>stac-validate: 0.6.0</summary>

## [0.6.0](stac-validate-v0.5.1...stac-validate-v0.6.0) (2025-12-01)


### ⚠ BREAKING CHANGES

* remove unused error enums
([#868](#868))

### Bug Fixes

* remove circular dev dependency
([#886](#886))
([dcb9b49](dcb9b49))
* remove unused error enums
([#868](#868))
([cf0e815](cf0e815))


### Dependencies

* The following workspace dependencies were updated
  * dependencies
    * stac bumped from 0.14.0 to 0.15.0
  * dev-dependencies
    * stac-io bumped from 0.1.0 to 0.2.0
</details>

<details><summary>stac-wasm: 0.1.0</summary>

## [0.1.0](stac-wasm-v0.0.4...stac-wasm-v0.1.0) (2025-12-01)


### ⚠ BREAKING CHANGES

* remove unused error enums
([#868](#868))

### Features

* stac_wasm.stacJsonToParquet
([#786](#786))
([6b1971a](6b1971a))
* wasm ([#744](#744))
([db5cd21](db5cd21))


### Bug Fixes

* pin arrow-wasm
([#785](#785))
([8f9c28b](8f9c28b))
* remove the package lock
([#745](#745))
([b3337f6](b3337f6))
* remove unused error enums
([#868](#868))
([cf0e815](cf0e815))


### Dependencies

* The following workspace dependencies were updated
  * dependencies
    * stac bumped from 0.14.0 to 0.15.0
</details>

---
This PR was generated with [Release
Please](https://github.com/googleapis/release-please). See
[documentation](https://github.com/googleapis/release-please#release-please).

Development

Successfully merging this pull request may close these issues.

make max_row_group_size configurable when writing geoparquet
