Thanks to visit codestin.com
Credit goes to github.com

Skip to content

Conversation

@pawel-big-lebowski
Copy link
Collaborator

@pawel-big-lebowski pawel-big-lebowski commented Jul 15, 2025

OpenLineage 1.27 introduced a change which aimed to optimise flattening RDDs to obtain datasets.
However, the change leaves the reference to the RDD after the chain of: isDefinedAt and apply method is called.

This PR merges two existing visitors: SqlExecutionRDDVisitor and LogicalRDDVisitor. Combining those two classes allows a solution where no reference to RDDs has to be stored as class property.

@pawel-big-lebowski pawel-big-lebowski added the full-tests Run full tests label Jul 15, 2025
@boring-cyborg boring-cyborg bot added area:integration/spark language:java Uses Java programming language labels Jul 15, 2025
@pawel-big-lebowski pawel-big-lebowski changed the title Make visitors stateless Make visitors stateless - avoid memory leak Jul 16, 2025
@pawel-big-lebowski pawel-big-lebowski marked this pull request as ready for review July 16, 2025 07:02
@pawel-big-lebowski pawel-big-lebowski requested a review from a team as a code owner July 16, 2025 07:02
@pawel-big-lebowski pawel-big-lebowski force-pushed the spark/potential-memory-leak branch from f03cb0c to b4fafed Compare July 16, 2025 07:20
@boring-cyborg boring-cyborg bot added area:documentation Improvements or additions to documentation area:tests Testing code labels Jul 16, 2025
Signed-off-by: Pawel Leszczynski <[email protected]>
@pawel-big-lebowski pawel-big-lebowski force-pushed the spark/potential-memory-leak branch from b4fafed to 658dd65 Compare July 16, 2025 07:32
@pawel-big-lebowski pawel-big-lebowski merged commit 551a7a4 into main Jul 16, 2025
54 checks passed
@pawel-big-lebowski pawel-big-lebowski deleted the spark/potential-memory-leak branch July 16, 2025 08:10
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

area:documentation Improvements or additions to documentation area:integration/spark area:tests Testing code full-tests Run full tests language:java Uses Java programming language

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants