-
Notifications
You must be signed in to change notification settings - Fork 2k
Comparing changes
Open a pull request
base repository: apache/datafusion
base: main
head repository: apache/datafusion
compare: branch-52
- 19 commits
- 48 files changed
- 20 contributors
Commits on Jan 8, 2026
-
[branch-52] Remove dependency on
rust_decimal, remove ignore of `RU……STSEC-2026-0001` (#19666) (#19686) ## Which issue does this PR close? - part of #18566 ## Rationale for this change Let's resolve the audit workflow on branch-52 by removing our dependency on the `rust_decimal` library, per @xudong963 's request here #19666 (review) ## What changes are included in this PR? - Backport #19666 to branch-52 ## Are these changes tested? By CI ## Are there any user-facing changes? No, this is an internal dev tool change
Configuration menu - View commit details
-
Copy full SHA for 04e6cee - Browse repository at this point
Copy the full SHA 04e6ceeView commit details -
[branch-52] Split BatchPartitioner::try_new into hash and round-robin…
… constructors (#19681) Backport of #19668 to branch-52. This PR cherry-picks commit 680ddcc from main. Includes: - Split of BatchPartitioner::try_new into hash and round-robin constructors - Documentation improvements - No behavior changes part of #18566 Co-authored-by: Your Name <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for d5dae86 - Browse repository at this point
Copy the full SHA d5dae86View commit details
Commits on Jan 9, 2026
-
[branch-52] Backport
list_files_cache, and make default ListingFile……sCache table scoped (#19704) ## Which issue does this PR close? - part of #18566 ## Rationale for this change Backport the fix for this regression into 52 release branch: - #19573 ## What changes are included in this PR? Backport these two commits to `branch-52` (cherry-pick was clean) - 1037f0a / #19388 - e6049de / #19616 <details><summary>Commands</summary> <p> ```shell andrewlamb@Andrews-MacBook-Pro-3:~/Software/datafusion$ git cherry-pick 1037f0a [branch-52 1fc70ac] feat: add list_files_cache table function for `datafusion-cli` (#19388) Author: jizezhang <[email protected]> Date: Tue Jan 6 05:23:39 2026 -0800 5 files changed, 446 insertions(+), 31 deletions(-) andrewlamb@Andrews-MacBook-Pro-3:~/Software/datafusion$ git cherry-pick e6049de Auto-merging datafusion/core/src/execution/context/mod.rs [branch-52 aa3d413] Make default ListingFilesCache table scoped (#19616) Author: jizezhang <[email protected]> Date: Thu Jan 8 06:34:10 2026 -0800 10 files changed, 474 insertions(+), 184 deletions(-) ``` </p> </details> ## Are these changes tested? By CI and new tests ## Are there any user-facing changes? A new datafusion-cli function and dropping a external table now clears the listing cache --------- Co-authored-by: jizezhang <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 1ac18a3 - Browse repository at this point
Copy the full SHA 1ac18a3View commit details -
[branch-52] fix: DynamicFilterPhysicalExpr violates Hash/Eq contract (#…
…19659) (#19705) ## Which issue does this PR close? <!-- We generally require a GitHub issue to be filed for all bug fixes and enhancements and this helps us generate change logs for our releases. You can link an issue to this PR using the GitHub syntax. For example `Closes #123` indicates that this PR will close issue #123. --> - part of #18566 ## Rationale for this change I propose back porting the fix for #19641 to 52 release ## What changes are included in this PR? - Backport #19659 ## Are these changes tested? eYes ## Are there any user-facing changes? bug fix Co-authored-by: Kumar Ujjawal <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 69eca52 - Browse repository at this point
Copy the full SHA 69eca52View commit details -
[branch-52] Prepare 52.0.0 release version number and changelog (#19661)
## Which issue does this PR close? <!-- We generally require a GitHub issue to be filed for all bug fixes and enhancements and this helps us generate change logs for our releases. You can link an issue to this PR using the GitHub syntax. For example `Closes #123` indicates that this PR will close issue #123. --> - Part of #18566 ## Rationale for this change <!-- Why are you proposing this change? If this is already explained clearly in the issue then this section is not needed. Explaining clearly why changes are proposed helps reviewers understand your changes and offer better suggestions for fixes. --> ## What changes are included in this PR? <!-- There is no need to duplicate the description in the issue here but it is sometimes worth providing a summary of the individual changes in this PR. --> ## Are these changes tested? <!-- We typically require tests for all PRs in order to: 1. Prevent the code from being accidentally broken by subsequent changes 2. Serve as another way to document the expected behavior of the code If tests are not included in your PR, please explain why (for example, are they covered by existing tests)? --> ## Are there any user-facing changes? <!-- If there are user-facing changes then we may require documentation to be updated before approving the PR. --> <!-- If there are any breaking changes to public APIs, please add the `api change` label. -->
Configuration menu - View commit details
-
Copy full SHA for bc043c3 - Browse repository at this point
Copy the full SHA bc043c3View commit details
Commits on Jan 15, 2026
-
[branch-52] fix: expose
ListFilesEntry(#19818)## Which issue does this PR close? <!-- We generally require a GitHub issue to be filed for all bug fixes and enhancements and this helps us generate change logs for our releases. You can link an issue to this PR using the GitHub syntax. For example `Closes #123` indicates that this PR will close issue #123. --> - part of #19784. ## Rationale for this change <!-- Why are you proposing this change? If this is already explained clearly in the issue then this section is not needed. Explaining clearly why changes are proposed helps reviewers understand your changes and offer better suggestions for fixes. --> This is a follow-up of #19573 ## What changes are included in this PR? <!-- There is no need to duplicate the description in the issue here but it is sometimes worth providing a summary of the individual changes in this PR. --> Backport - #19804 ## Are these changes tested? <!-- We typically require tests for all PRs in order to: 1. Prevent the code from being accidentally broken by subsequent changes 2. Serve as another way to document the expected behavior of the code If tests are not included in your PR, please explain why (for example, are they covered by existing tests)? --> Existing tests ## Are there any user-facing changes? <!-- If there are user-facing changes then we may require documentation to be updated before approving the PR. --> no <!-- If there are any breaking changes to public APIs, please add the `api change` label. -->
Configuration menu - View commit details
-
Copy full SHA for 2073f04 - Browse repository at this point
Copy the full SHA 2073f04View commit details
Commits on Jan 16, 2026
-
[branch-52] Fix Internal error: Assertion failed: !self.finished: Lim…
…itedBatchCoalescer (#19785) (#19836) ## Which issue does this PR close? - part of #19784 - related to #19785 - backport fix of #19781 ## Rationale for this change Backport a regression found by @bert-beyondloops into the 52 release line ## What changes are included in this PR? Backport fix for #19781, #19785 to branch-52 ## Are these changes tested? Yes ## Are there any user-facing changes? bug fix Co-authored-by: Bert Vermeiren <[email protected]> Co-authored-by: Bert Vermeiren <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 18fdd8b - Browse repository at this point
Copy the full SHA 18fdd8bView commit details -
[branch 52] Fix grouping set subset satisfaction (#19855)
Brings #19853 into `branch-52` Co-authored-by: Pierre Lacave <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for eb00fe2 - Browse repository at this point
Copy the full SHA eb00fe2View commit details
Commits on Jan 19, 2026
-
Add BatchAdapter to simplify using PhysicalExprAdapter / Projector (#…
…19877) - part of #19784 - Brings #19716 from @adriangb into `branch-52` Co-authored-by: Adrian Garcia Badaracco <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for c0a6f23 - Browse repository at this point
Copy the full SHA c0a6f23View commit details -
[branch-52] Update version to
52.1.0(#19878)## Which issue does this PR close? - part of #19784 ## Rationale for this change Prepare for release ## What changes are included in this PR? Update release version to 52.1.0 and add changelog ## Are these changes tested? By CI ## Are there any user-facing changes? New version
Configuration menu - View commit details
-
Copy full SHA for 9f3ddce - Browse repository at this point
Copy the full SHA 9f3ddceView commit details
Commits on Feb 12, 2026
-
[branch-52] fix: filter pushdown when merge filter (#20110) (#20289)
## Which issue does this PR close? - related to #20287 ## Rationale for this change see issue #20109 ## What changes are included in this PR? 1. Remap parent filter expressions: When a FilterExec has a projection, remap unsupported parent filter expressions from output schema coordinates to input schema coordinates using `reassign_expr_columns()` before combining them with the current filter's predicates. 2. Preserve projection: When creating the merged FilterExec, preserve the original projection instead of discarding it . ## Are these changes tested? yes, add some test case ## Are there any user-facing changes? --------- ## Which issue does this PR close? <!-- We generally require a GitHub issue to be filed for all bug fixes and enhancements and this helps us generate change logs for our releases. You can link an issue to this PR using the GitHub syntax. For example `Closes #123` indicates that this PR will close issue #123. --> - Closes #. ## Rationale for this change <!-- Why are you proposing this change? If this is already explained clearly in the issue then this section is not needed. Explaining clearly why changes are proposed helps reviewers understand your changes and offer better suggestions for fixes. --> ## What changes are included in this PR? <!-- There is no need to duplicate the description in the issue here but it is sometimes worth providing a summary of the individual changes in this PR. --> ## Are these changes tested? <!-- We typically require tests for all PRs in order to: 1. Prevent the code from being accidentally broken by subsequent changes 2. Serve as another way to document the expected behavior of the code If tests are not included in your PR, please explain why (for example, are they covered by existing tests)? --> ## Are there any user-facing changes? <!-- If there are user-facing changes then we may require documentation to be updated before approving the PR. --> <!-- If there are any breaking changes to public APIs, please add the `api change` label. --> --------- Co-authored-by: Adrian Garcia Badaracco <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 8aaa274 - Browse repository at this point
Copy the full SHA 8aaa274View commit details
Commits on Feb 23, 2026
-
[branch-52] FilterExec should remap indices of parent dynamic filters (…
Configuration menu - View commit details
-
Copy full SHA for 016e2ae - Browse repository at this point
Copy the full SHA 016e2aeView commit details
Commits on Feb 24, 2026
-
[branch-52] fix: validate inter-file ordering in eq_properties() (#20329
) (#20509) - Part of #20287 - Closes #20508 on branch-52 This PR simply - Backports #20329 from @adriangb to the branch-52 line I simply cherry-picked 53b0ffb to the branch-52 branch ```shell andrewlamb@Andrews-MacBook-Pro-3:~/Software/datafusion2$ git cherry-pick 53b0ffb Auto-merging datafusion/core/tests/physical_optimizer/partition_statistics.rs Auto-merging datafusion/datasource/src/file_scan_config.rs [alamb/backport_20329 9286563] fix: validate inter-file ordering in eq_properties() (#20329) Author: Adrian Garcia Badaracco <[email protected]> Date: Sat Feb 14 14:04:01 2026 -0500 5 files changed, 660 insertions(+), 47 deletions(-) ``` Co-authored-by: Adrian Garcia Badaracco <[email protected]> Co-authored-by: Claude Opus 4.6 <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 3a5b41c - Browse repository at this point
Copy the full SHA 3a5b41cView commit details
Commits on Feb 25, 2026
-
Fix name tracker (#19856) (#20539)
- Closes #17508 The previous implementation used UUID-based aliasing as a workaround to prevent duplicate names for literals in Substrait plans. This approach had several drawbacks: - Non-deterministic plan names that made testing difficult (requiring UUID regex filters) - Only addressed literal naming conflicts, not the broader issue of name deduplication - Added unnecessary dependency on the `uuid` crate - Didn't properly handle cases where the same qualified name could appear with different schema representations 1. Enhanced NameTracker: Refactored to detect two types of conflicts: - Duplicate schema names: Tracked via schema_name() to prevent validate_unique_names failures (e.g., two Utf8(NULL) literals) - Ambiguous references: Tracked via qualified_name() to prevent DFSchema::check_names failures when a qualified field (e.g., left.Utf8(NULL)) and unqualified field (e.g., Utf8(NULL)) share the same column name 2. **Removed UUID dependency**: Eliminated the `uuid` crate from `datafusion/substrait` 3. **Removed literal-specific aliasing**: The UUID-based workaround in `project_rel.rs` is no longer needed as the improved NameTracker handles all naming conflicts consistently 4. **Deterministic naming**: Name conflicts now use predictable `__temp__N` suffixes instead of random UUIDs Note: This doesn't fully fix all the issues in #17508 which allow some special casing of `CAST` which are not included here. Yes: - Updated snapshot tests to reflect the new deterministic naming (e.g., `Utf8("people")__temp__0` instead of UUID-based names) - Modified some roundtrip tests to verify semantic equivalence (schema matching and execution) rather than exact string matching, which is more robust - All existing integration tests pass with the new naming scheme Minimal. The generated plan names are now deterministic and more readable (using `__temp__N` suffixes instead of UUIDs), but this is primarily an internal representation change. The functional behavior and query results remain unchanged. ## Which issue does this PR close? <!-- We generally require a GitHub issue to be filed for all bug fixes and enhancements and this helps us generate change logs for our releases. You can link an issue to this PR using the GitHub syntax. For example `Closes #123` indicates that this PR will close issue #123. --> - Closes #. ## Rationale for this change <!-- Why are you proposing this change? If this is already explained clearly in the issue then this section is not needed. Explaining clearly why changes are proposed helps reviewers understand your changes and offer better suggestions for fixes. --> ## What changes are included in this PR? <!-- There is no need to duplicate the description in the issue here but it is sometimes worth providing a summary of the individual changes in this PR. --> ## Are these changes tested? <!-- We typically require tests for all PRs in order to: 1. Prevent the code from being accidentally broken by subsequent changes 2. Serve as another way to document the expected behavior of the code If tests are not included in your PR, please explain why (for example, are they covered by existing tests)? --> ## Are there any user-facing changes? <!-- If there are user-facing changes then we may require documentation to be updated before approving the PR. --> <!-- If there are any breaking changes to public APIs, please add the `api change` label. --> Co-authored-by: Xander <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 4aa7071 - Browse repository at this point
Copy the full SHA 4aa7071View commit details -
[branch-52] fix: HashJoin panic with dictionary-encoded columns in mu…
Configuration menu - View commit details
-
Copy full SHA for af87ef5 - Browse repository at this point
Copy the full SHA af87ef5View commit details -
[branch-52] Fix incorrect
SortExecremoval beforeAggregateExec(#……20247) (#20507) ## Which issue does this PR close? - part of #20287 - Fixes #20287 on branch-52 ## Rationale for this change See issues ## What changes are included in this PR? - backports #20247 ## Are these changes tested? By CI ## Are there any user-facing changes? <!-- If there are user-facing changes then we may require documentation to be updated before approving the PR. --> <!-- If there are any breaking changes to public APIs, please add the `api change` label. --> --------- Co-authored-by: Mustafa Akur <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 7388120 - Browse repository at this point
Copy the full SHA 7388120View commit details -
[branch-52] Update aws-smithy, bytes and time for security audits (#2…
…0546) ## Which issue does this PR close? - Part of #20287 ## Rationale for this change The security audit CI check [failed here](https://github.com/apache/datafusion/actions/runs/22381549301/job/64783156671?pr=20539) on - #20539 This is due to some dependencies being yanked (aws-smithy specifically) ## What changes are included in this PR? Let's update the relevant dependencies with small security related fixes ## Are these changes tested? <!-- We typically require tests for all PRs in order to: 1. Prevent the code from being accidentally broken by subsequent changes 2. Serve as another way to document the expected behavior of the code If tests are not included in your PR, please explain why (for example, are they covered by existing tests)? --> ## Are there any user-facing changes? <!-- If there are user-facing changes then we may require documentation to be updated before approving the PR. --> <!-- If there are any breaking changes to public APIs, please add the `api change` label. -->
Configuration menu - View commit details
-
Copy full SHA for 30545ba - Browse repository at this point
Copy the full SHA 30545baView commit details
Commits on Feb 26, 2026
-
[branch-53] Clamp early aggregation emit to the sort boundary when us…
…ing partial group ordering (#20446) (#20558) ## Which issue does this PR close? - part of #20287 - Fixes #20445 on branch-52 ## Rationale for this change See issues ## What changes are included in this PR? - backports #20446 ## Are these changes tested? By CI Co-authored-by: Jack Kleeman <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 38c89a4 - Browse repository at this point
Copy the full SHA 38c89a4View commit details -
[branch-52] Update version and prepare changelog (#20560)
## Which issue does this PR close? - part of #20287 ## Rationale for this change Prepare for release ## What changes are included in this PR? 1. Update version 2. prepare CHANGELOG. See rendered version here: https://github.com/alamb/datafusion/blob/alamb/prepare_52.2.0/dev/changelog/52.2.0.md I'll have to update this at least once more once we merge the other outstanding PRs: ```shell ./dev/release/generate-changelog.py 52.1.0 branch-52 52.2.0 > dev/changelog/52.2.0.md npx [email protected] -w dev/changelog/52.2.0.md ``` Note the security audit CI check will fail until we merge - #20546 ## Are these changes tested? By CI ## Are there any user-facing changes? <!-- If there are user-facing changes then we may require documentation to be updated before approving the PR. --> <!-- If there are any breaking changes to public APIs, please add the `api change` label. --> --------- Co-authored-by: Oleks V <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for da4014d - Browse repository at this point
Copy the full SHA da4014dView commit details
This comparison is taking too long to generate.
Unfortunately it looks like we can’t render this comparison for you right now. It might be too big, or there might be something weird with your repository.
You can try running this command locally to see the comparison on your machine:
git diff main...branch-52