Codestin Search App

srielau · 2025-12-28T03:43:57Z

What changes were proposed in this pull request?

Why are the changes needed?

Does this PR introduce any user-facing change?

How was this patch tested?

Was this patch authored or co-authored using generative AI tooling?

- Persistent functions now cached with unqualified keys for compatibility - Temporary functions use composite keys (session.funcName) - Both can coexist in the same registry - Views correctly exclude temp functions from resolution Issue: View test still failing - needs investigation into function builder resolution

- Persistent functions now stored with qualified keys (catalog.db.func) - Prevents conflicts when multiple databases have same function name - Temporary functions still use composite keys (session.func) Known issues: - View function resolution test still failing - Possible function listing regressions to investigate

Added extensive debug logging to understand why views capture wrong function class. Ready for detailed tracing.

**THE BUG:** In resolveBuiltinOrTempFunctionInternal, the 'isBuiltin' parameter was incorrectly checking if the temp/builtin identifier existed in the session registry, instead of checking the static FunctionRegistry.builtin. This caused lookupTempFuncWithViewContext to treat temp functions as builtins, bypassing view context checks and allowing temp functions created AFTER a view to incorrectly shadow the persistent function that the view should use. **THE FIX:** Changed the isBuiltin check to use FunctionRegistry.builtin.functionExists and TableFunctionRegistry.builtin.functionExists directly, matching master's behavior. **TEST RESULTS:** ✅ All 62 tests pass (PersistedViewTestSuite + FunctionQualificationSuite) ✅ SPARK-33692 view test now passes ✅ View correctly uses MyDoubleAvg and ignores temp MyDoubleSum

Removed leftover test scripts that were causing compilation errors: - test_simple_function.scala - test_view_function.scala All code now compiles cleanly.

Analysis covers: - Complete API surface (read/write operations) - Current architecture and memory usage - Three proposed optimization approaches - Detailed feasibility assessment KEY DISCOVERY: Internal functions already use separate static registry! - FunctionRegistry.internal contains ~20 ML/Pandas/Connect functions - Resolved directly, bypassing SessionCatalog - Proves composite registry pattern works in production - Validates proposed optimization approach Memory savings potential: 98% reduction for high-session deployments Implementation effort: 2-3 days coding + testing Risk: Low (pattern already proven with internal functions)

Comprehensive comparison covering: - Registry architecture (cloned vs static) - User-facing vs implementation details - Resolution paths and shadowing behavior - Examples and use cases - Historical context (Spark 4 separation) KEY FINDINGS: - Builtin: ~500 user-facing SQL functions, cloned per session - Internal: ~20 implementation functions for Connect/ML/Pandas, single global registry - Internal functions already use separate static registry pattern - Proves composite registry approach is production-ready This validates our proposed optimization approach for builtins.

Updated test to expect INVALID_TEMP_OBJ_QUALIFIER (AnalysisException) instead of INVALID_SQL_SYNTAX.CREATE_TEMP_FUNC_WITH_DATABASE (ParseException) for invalid temporary function qualifications. This aligns with the user's request to treat invalid temp function qualifications as semantic errors (42602 SQLSTATE) rather than syntax errors. Test cases updated: - CREATE TEMPORARY FUNCTION a.b() - now expects INVALID_TEMP_OBJ_QUALIFIER - CREATE TEMPORARY FUNCTION a.b.c() - now expects INVALID_TEMP_OBJ_QUALIFIER All tests pass.

These files were working notes created during development and should not be committed to the repository: - BUILTIN_VS_INTERNAL_FUNCTIONS.md - CURSOR_IMPLEMENTATION_SUMMARY.md - CURSOR_TEST_RESULTS.md - FUNCTION_QUALIFICATION_ANALYSIS.md - FUNCTION_QUALIFICATION_COMPLETE.md - FUNCTION_QUALIFICATION_SUMMARY.md - FUNCTION_REGISTRY_API_ANALYSIS.md - IMPLEMENTATION_COMPLETE.md - TABLE_FUNCTION_REGISTRY_ANALYSIS.md - UNIFIED_FUNCTION_NAMESPACE.md Only production code and tests should be in the repository.

These were working test files created during development: - test_both.sql - test_namespace.sql - test_range.sql They should not be committed to the repository.

…apsulation Refactoring #1: Extract Scalar/Table Function Duplication - Added handleViewContext() helper to centralize view resolution logic - Added lookupFunctionWithShadowing() generic helper for both scalar and table functions - Eliminated ~50 lines of duplicated shadowing and view context logic - Simplified lookupBuiltinOrTempFunction() and lookupBuiltinOrTempTableFunction() Refactoring #2: Unified Qualification Checker - Added isQualifiedWithNamespace() helper to check namespace qualifications - Refactored maybeBuiltinFunctionName() and maybeTempFunctionName() to use common helper - Eliminated ~15 lines of duplication - Prepares for future PATH implementation Documentation Improvements: - Added comprehensive scaladoc to TEMP_FUNCTION_DB explaining composite key pattern - Added scaladoc to tempFunctionIdentifier() and isTempFunctionIdentifier() - Added detailed comments to new helper methods Benefits: - Single source of truth for shadowing logic - Easier to maintain and test - Better encapsulation of view resolution context - More consistent code structure All tests pass: - FunctionQualificationSuite: 24/24 tests passed - SessionCatalogSuite: 100/100 tests passed

Naming Improvements: - Renamed 'useTempIdentifier' → 'lookupAsTemporary' for clarity - More semantic boolean parameter names Code Quality: - Made handleViewContext() functional (removed imperative 'return') - Removed unused 'tempIdentifier' and 'builtinIdentifier' variables - Extracted resolveFunctionWithFallback() to eliminate duplication - Used Option.filter for more idiomatic Scala Benefits: - More functional programming style - Clearer intent with better parameter names - No unused variables cluttering the code - Extracted common pattern reduces duplication by ~30 lines - More readable and maintainable All 24 tests in FunctionQualificationSuite pass.

Removed sql-function-qualifiers.sql and its golden files. All test coverage is comprehensively provided by FunctionQualificationSuite.scala. Rationale: - Eliminates duplication between SQL and Scala tests - Scala tests provide better error validation with checkError() - Scala tests are easier to maintain and debug - Scala tests have better test isolation - No loss of coverage: Scala suite has 24 tests covering all scenarios FunctionQualificationSuite.scala provides complete coverage: - 8 tests: Reference qualification (SELECT statements) - 9 tests: DDL (CREATE/DROP TEMPORARY FUNCTION) - 4 tests: Type mismatch errors - 3 tests: Integration scenarios All 24 tests pass.

… test bugs

Followup cleanup for function qualification PR

Resolve conflicts from search-path merge: - Analyzer: keep sessionOrder legacy doc and catalogPath.toSeq style - CheckAnalysis/Catalog: use (catalog +: currentNamespace).toSeq - FunctionResolution: use (name +: namespace).toSeq, drop NonFatal case, keep expandIdentifier then try and re-expand in catch - SessionCatalog: keep HEAD lookupTempFuncWithViewContext and PATH-based resolution - QueryCompilationErrors: keep tableOrViewNotFoundWithSearchPath - ResolveSessionCatalog: keep ResolvedTempView + DropViewInSessionCatalog, use operationNotAllowedOnBuiltinNamespaceError - SparkSqlParser: keep normalizeTempViewIdentifier for session/system.session qualifiers - Tests/sql.out: add queryContext to checkError and JSON outputs where master had them

srielau added 30 commits December 22, 2025 09:43

builtin/session fucntion qualifier.

9727d00

Table function support

9a232f9

Fixes

ad66ec0

Fix

f0d5b4e

Fix TF

dcc309e

More fixes

16c2cc2

Rework design

c59bed0

Redesign to use origunal clone

1e267c1

fix

7fc7f59

Fixes

8b17973

More fixes

1a213b8

Fix extension fucntions show fucntions

65119e8

Fix ML fucntions

4fff9c6

update golden files

c8173e2

more fixes

a0cb898

WIP: Add debug logging to trace view function resolution

897f218

Added extensive debug logging to understand why views capture wrong function class. Ready for detailed tracing.

Clean up temporary test files

fa70575

Removed leftover test scripts that were causing compilation errors: - test_simple_function.scala - test_view_function.scala All code now compiles cleanly.

Remove temporary test SQL files from git

3f41dd8

These were working test files created during development: - test_both.sql - test_namespace.sql - test_range.sql They should not be committed to the repository.

Refine design

8f38c3f

Fixes

62c6518

srielau added 9 commits February 28, 2026 13:22

Redesign "narrow waist"

3f3496d

More rework

7252bc9

scala style

5ae569f

Reowrk design

d85fa96

Fixes

d2acd9c

Fix internal functions

e5a744e

Unify table and scalar candidates

fb49e68

Introduce proper search path

8dd3c98

Ifix naming

f1ec0a5

srielau force-pushed the SPARK-54808-qualified-session-view branch from f22e093 to 3c2b3f5 Compare March 1, 2026 23:14

srielau and others added 9 commits March 1, 2026 16:18

Simplify design, generalize naming for views

14b229a

fix compile error

ef7a9d3

Still fixing

762f913

Scala style fix

88533da

Address comments by wenchen

241a125

Followup cleanup: remove Extension namespace, fix stale comments, fix…

f0643c4

… test bugs

Merge pull request #2 from cloud-fan/pr-53570-review

031a91a

Followup cleanup for function qualification PR

[SPARK-54808] Qualified temp view names (search path)

618381e

refresh to search-path latest

2d9fc86

srielau force-pushed the SPARK-54808-qualified-session-view branch from 3c2b3f5 to 2d9fc86 Compare March 2, 2026 16:31

srielau added 10 commits March 2, 2026 08:37

Address coments by Vlad

ce513df

Fix mismatches

38d989b

Align with search_path

e44265e

merge up to search-path

34178d3

Fixes

9ab3678

Cleanup bad merge

485c68a

more cleanup

df5bc6f

Fixes

3e825a1

fix compile bug

a1977d8

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[SPARK-54808] Qualified session view#53630

[SPARK-54808] Qualified session view#53630
srielau wants to merge 136 commits intoapache:masterfrom
srielau:SPARK-54808-qualified-session-view

srielau commented Dec 28, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

srielau commented Dec 28, 2025

What changes were proposed in this pull request?

Why are the changes needed?

Does this PR introduce any user-facing change?

How was this patch tested?

Was this patch authored or co-authored using generative AI tooling?

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants