feat: specify max_row_group_size in geoparquet WriterBuilder#846
Merged
feat: specify max_row_group_size in geoparquet WriterBuilder#846
Conversation
1b78bd8 to
31fe450
Compare
gadomski
requested changes
Nov 17, 2025
gadomski
requested changes
Nov 18, 2025
Member
gadomski
left a comment
There was a problem hiding this comment.
Thanks for the options fix, this feels nice. Just one tweak that might let us do fewer changes.
gadomski
requested changes
Nov 18, 2025
This was referenced Dec 1, 2025
Closed
Closed
Merged
gadomski
added a commit
that referenced
this pull request
Dec 1, 2025
🤖 I have created a release *beep* *boop* --- <details><summary>pgstac: 0.4.0</summary> ## [0.4.0](pgstac-v0.3.2...pgstac-v0.4.0) (2025-12-01) ### ⚠ BREAKING CHANGES * move stac_api crate into stac crate ([#869](#869)) * remove unused error enums ([#868](#868)) ### Bug Fixes * remove unused error enums ([#868](#868)) ([cf0e815](cf0e815)) ### Code Refactoring * move stac_api crate into stac crate ([#869](#869)) ([d0f7405](d0f7405)) ### Dependencies * The following workspace dependencies were updated * dependencies * stac bumped from 0.14.0 to 0.15.0 </details> <details><summary>rustac: 0.2.0</summary> ## [0.2.0](rustac-v0.1.2...rustac-v0.2.0) (2025-12-01) ### ⚠ BREAKING CHANGES * move stac_api crate into stac crate ([#869](#869)) * move api client to stac-io crate ([#864](#864)) ### Features * add bind argument when serving ([#871](#871)) ([f3a3517](f3a3517)) * shell completions ([#874](#874)) ([717c4ee](717c4ee)), closes [#650](#650) * specify max_row_group_size in geoparquet WriterBuilder ([#846](#846)) ([2bde538](2bde538)) ### Code Refactoring * move api client to stac-io crate ([#864](#864)) ([e06de28](e06de28)), closes [#764](#764) * move stac_api crate into stac crate ([#869](#869)) ([d0f7405](d0f7405)) ### Dependencies * The following workspace dependencies were updated * dependencies * stac bumped from 0.14.0 to 0.15.0 * stac-duckdb bumped from 0.2.0 to 0.3.0 * stac-io bumped from 0.1.0 to 0.2.0 * stac-server bumped from 0.3.2 to 0.4.0 * stac-validate bumped from 0.5.0 to 0.6.0 </details> <details><summary>stac: 0.15.0</summary> ## [0.15.0](stac-v0.14.0...stac-v0.15.0) (2025-12-01) ### ⚠ BREAKING CHANGES * move stac_api crate into stac crate ([#869](#869)) * remove unused error enums ([#868](#868)) * consume a geoparquet writer on close ([#867](#867)) ### Features * add geoparquet writer encoder and object writing ([#863](#863)) ([ec6e7de](ec6e7de)) * specify max_row_group_size in geoparquet WriterBuilder ([#846](#846)) ([2bde538](2bde538)) ### Bug Fixes * allow writing stac-geoparquet with no assets ([#882](#882)) ([b049a71](b049a71)) * remove circular dev depependency ([#886](#886)) ([dcb9b49](dcb9b49)) * remove unused error enums ([#868](#868)) ([cf0e815](cf0e815)) ### Code Refactoring * consume a geoparquet writer on close ([#867](#867)) ([bdd95be](bdd95be)) * move stac_api crate into stac crate ([#869](#869)) ([d0f7405](d0f7405)) </details> <details><summary>stac-duckdb: 0.3.0</summary> ## [0.3.0](stac-duckdb-v0.2.2...stac-duckdb-v0.3.0) (2025-12-01) ### ⚠ BREAKING CHANGES * move stac_api crate into stac crate ([#869](#869)) ### Bug Fixes * remove filename by default ([#855](#855)) ([8bba676](8bba676)) ### Code Refactoring * move stac_api crate into stac crate ([#869](#869)) ([d0f7405](d0f7405)) ### Dependencies * The following workspace dependencies were updated * dependencies * stac bumped from 0.14.0 to 0.15.0 * dev-dependencies * stac-validate bumped from 0.5.0 to 0.6.0 </details> <details><summary>stac-extensions: 0.1.2</summary> ## [0.1.2](stac-extensions-v0.1.1...stac-extensions-v0.1.2) (2025-12-01) ### Dependencies * The following workspace dependencies were updated * dependencies * stac bumped from 0.14.0 to 0.15.0 </details> <details><summary>stac-io: 0.2.0</summary> ## [0.2.0](stac-io-v0.1.2...stac-io-v0.2.0) (2025-12-01) ### ⚠ BREAKING CHANGES * move stac_api crate into stac crate ([#869](#869)) * remove unused error enums ([#868](#868)) * move api client to stac-io crate ([#864](#864)) ### Features * add geoparquet writer encoder and object writing ([#863](#863)) ([ec6e7de](ec6e7de)) * specify max_row_group_size in geoparquet WriterBuilder ([#846](#846)) ([2bde538](2bde538)) ### Bug Fixes * remove unused error enums ([#868](#868)) ([cf0e815](cf0e815)) ### Code Refactoring * move api client to stac-io crate ([#864](#864)) ([e06de28](e06de28)), closes [#764](#764) * move stac_api crate into stac crate ([#869](#869)) ([d0f7405](d0f7405)) ### Dependencies * The following workspace dependencies were updated * dependencies * stac bumped from 0.14.0 to 0.15.0 </details> <details><summary>stac-server: 0.4.0</summary> ## [0.4.0](stac-server-v0.3.6...stac-server-v0.4.0) (2025-12-01) ### ⚠ BREAKING CHANGES * move stac_api crate into stac crate ([#869](#869)) * remove unused error enums ([#868](#868)) ### Bug Fixes * remove unused error enums ([#868](#868)) ([cf0e815](cf0e815)) ### Code Refactoring * move stac_api crate into stac crate ([#869](#869)) ([d0f7405](d0f7405)) ### Dependencies * The following workspace dependencies were updated * dependencies * pgstac bumped from 0.3.0 to 0.4.0 * stac bumped from 0.14.0 to 0.15.0 * stac-duckdb bumped from 0.2.0 to 0.3.0 </details> <details><summary>stac-validate: 0.6.0</summary> ## [0.6.0](stac-validate-v0.5.1...stac-validate-v0.6.0) (2025-12-01) ### ⚠ BREAKING CHANGES * remove unused error enums ([#868](#868)) ### Bug Fixes * remove circular dev depependency ([#886](#886)) ([dcb9b49](dcb9b49)) * remove unused error enums ([#868](#868)) ([cf0e815](cf0e815)) ### Dependencies * The following workspace dependencies were updated * dependencies * stac bumped from 0.14.0 to 0.15.0 * dev-dependencies * stac-io bumped from 0.1.0 to 0.2.0 </details> <details><summary>stac-wasm: 0.1.0</summary> ## [0.1.0](stac-wasm-v0.0.4...stac-wasm-v0.1.0) (2025-12-01) ### ⚠ BREAKING CHANGES * remove unused error enums ([#868](#868)) ### Features * stac_wasm.stacJsonToParquet ([#786](#786)) ([6b1971a](6b1971a)) * wasm ([#744](#744)) ([db5cd21](db5cd21)) ### Bug Fixes * pin arrow-wasm ([#785](#785)) ([8f9c28b](8f9c28b)) * remove the package lock ([#745](#745)) ([b3337f6](b3337f6)) * remove unused error enums ([#868](#868)) ([cf0e815](cf0e815)) ### Dependencies * The following workspace dependencies were updated * dependencies * stac bumped from 0.14.0 to 0.15.0 </details> --- This PR was generated with [Release Please](https://github.com/googleapis/release-please). See [documentation](https://github.com/googleapis/release-please#release-please).
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Related to
this does not expose the max_row_group_size parameter through all of the CLI and format-based functions, but does make it configurable for any application that uses WriterBuilder directly (e.g. rustac-py's.GeoparquetWriter)Description
This adds the
max_row_group_sizeparameter togeoparquet::WriterBuilderand sets the default value to150_000(per recommendations in cholmes' guide.The rustac cli has a new option
--parquet-max-row-group-sizethat can be used to set the max_row_group_size on output geoparquet files.Checklist
Delete any checklist items that do not apply (e.g. if your change is minor, it may not require documentation updates).
cargo fmt)cargo test