[VL] Respect orc.force.positional.evolution#12234

Open
beliefer wants to merge 4 commits into apache:main from beliefer:12232

Conversation

@beliefer
Contributor

@beliefer beliefer commented Jun 4, 2026

What changes are proposed in this pull request?

This PR proposes to respect orc.force.positional.evolution while reading ORC files.
Fixes #12232
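
To illustrate why the mapping mode matters when reading ORC files, here is a minimal sketch. It is a toy model, not Gluten or Velox code: the object name, the physical column names `_col0`/`_col1`, the table columns `id`/`name`, and the sample row are all hypothetical. It shows how name-based matching can find nothing when the file's column names differ from the table schema, while positional matching still reads the data.

```scala
// Sketch only: a toy model of ORC column mapping, not the actual reader.
object OrcMappingSketch {
  // Physical file: column names and one row of values (hypothetical data).
  val fileColumns: Vector[String] = Vector("_col0", "_col1")
  val fileRow: Vector[Any] = Vector(1, "alice")

  // Table schema requested by the query (hypothetical).
  val tableColumns: Vector[String] = Vector("id", "name")

  // Name-based mapping: a table column is read only if the file
  // contains a column with exactly the same name.
  def readByName(): Vector[Option[Any]] =
    tableColumns.map { name =>
      val idx = fileColumns.indexOf(name)
      if (idx >= 0) Some(fileRow(idx)) else None
    }

  // Positional mapping: the i-th table column reads the i-th file column,
  // regardless of the names stored in the file.
  def readByPosition(): Vector[Option[Any]] =
    tableColumns.indices.map(i => fileRow.lift(i)).toVector
}
```

In this model, `readByName()` yields no values for either column, whereas `readByPosition()` returns the stored row.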

How was this patch tested?

Added new tests.
Manually tested in our production environment.

Was this patch authored or co-authored using generative AI tooling?

'Yes'.

@github-actions github-actions Bot added the CORE (works for Gluten Core) and VELOX labels Jun 4, 2026
@github-actions

github-actions Bot commented Jun 4, 2026

Run Gluten Clickhouse CI on x86

@beliefer beliefer requested a review from wForget June 4, 2026 12:03
@github-actions

github-actions Bot commented Jun 4, 2026

Run Gluten Clickhouse CI on x86

Comment thread gluten-substrait/src/main/scala/org/apache/gluten/config/GlutenConfig.scala Outdated
Member

@wForget wForget left a comment

Thanks @beliefer

Copilot AI review requested due to automatic review settings June 5, 2026 08:03
@github-actions

github-actions Bot commented Jun 5, 2026

Run Gluten Clickhouse CI on x86

Contributor

Copilot AI left a comment

Pull request overview

This PR updates Gluten’s Velox ORC read path so that the effective ORC column-mapping behavior respects the Hadoop/Spark config spark.hadoop.orc.force.positional.evolution (i.e., forces positional mapping even when orcUseColumnNames would otherwise be true), aligning behavior with vanilla Spark.

Changes:

  • Add spark.hadoop.orc.force.positional.evolution handling and, when it is enabled, force the Velox native session setting spark.gluten.sql.columnar.backend.velox.orcUseColumnNames to false.
  • Update VeloxConfig.orcUseColumnNames to incorporate orc.force.positional.evolution.
  • Add a regression test that checks the new effective setting through the resulting fallback behavior.
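
The override described in the list above can be sketched roughly as follows. This is not the PR's actual code: the object and method names are hypothetical, the plain `Map` stands in for the real Spark/Gluten config objects, and the default for `orcUseColumnNames` is an assumption passed in as a parameter. Only the two config keys come from the PR overview.

```scala
// Sketch only: models the effective-value rule, not Gluten's implementation.
object EffectiveOrcConfigSketch {
  // Config keys as named in the PR overview.
  val ForcePositionalKey = "spark.hadoop.orc.force.positional.evolution"
  val OrcUseColumnNamesKey =
    "spark.gluten.sql.columnar.backend.velox.orcUseColumnNames"

  // Parse a boolean-like config value; anything other than "true" is false.
  private def isTrue(value: String): Boolean =
    value.trim.equalsIgnoreCase("true")

  // Column-name mapping is used only when it is enabled AND positional
  // evolution is not forced. `defaultUseColumnNames` is an assumed default.
  def effectiveOrcUseColumnNames(
      conf: Map[String, String],
      defaultUseColumnNames: Boolean = true): Boolean = {
    val forcePositional = conf.get(ForcePositionalKey).exists(isTrue)
    val useColumnNames =
      conf.get(OrcUseColumnNamesKey).map(isTrue).getOrElse(defaultUseColumnNames)
    useColumnNames && !forcePositional
  }
}
```

With this rule, setting the force-positional key to `true` makes the effective value `false` even if `orcUseColumnNames` is explicitly `true`, which matches the behavior the overview describes.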

Reviewed changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated 2 comments.

| File | Description |
| --- | --- |
| gluten-substrait/src/main/scala/org/apache/gluten/config/GlutenConfig.scala | Adds orc.force.positional.evolution config constants and forces the native Velox session conf to positional mapping when enabled. |
| backends-velox/src/main/scala/org/apache/gluten/config/VeloxConfig.scala | Makes JVM-side orcUseColumnNames effectively false when positional evolution is forced. |
| backends-velox/src/test/scala/org/apache/gluten/execution/FallbackSuite.scala | Adds a test case to validate the new “force positional evolution” interaction with orcUseColumnNames. |

Comment thread gluten-substrait/src/main/scala/org/apache/gluten/config/GlutenConfig.scala Outdated
Comment thread backends-velox/src/test/scala/org/apache/gluten/execution/FallbackSuite.scala Outdated
Co-authored-by: Copilot Autofix powered by AI <[email protected]>
Copilot AI review requested due to automatic review settings June 5, 2026 08:18
@github-actions

github-actions Bot commented Jun 5, 2026

Run Gluten Clickhouse CI on x86

Contributor

Copilot AI left a comment

Pull request overview

Copilot reviewed 3 out of 3 changed files in this pull request and generated 1 comment.

Comment thread backends-velox/src/test/scala/org/apache/gluten/execution/FallbackSuite.scala Outdated
@github-actions

github-actions Bot commented Jun 5, 2026

Run Gluten Clickhouse CI on x86

Labels

CORE (works for Gluten Core), VELOX

Projects

None yet

Development

Successfully merging this pull request may close these issues.

[VL] Velox ORC reader ignores orc.force.positional.evolution, returning empty/wrong results for position-mapped ORC tables

3 participants