[Feature | Orchestration] Optimize C call overhead away, update pipeline, optimize CPU transient residency #348
Conversation
- Apply `gpu` transformation by default on GPU backend
- Do NOT use memory pool for CPU
- Use `DaceExecutable` in orchestration
Bringing back to draft: the hashing system operates under the assumption that the types given to the orchestrated code are NDSL's OR trivially hashable. I'll introduce a system that deactivates the hashing if we encounter a type that is neither, and warns once about the performance degradation.
Done. Ready for review.
romanc
left a comment
Nice work! I think your assumptions are sound. Just a couple of nitpicks and questions inline.
```python
    stacklevel=2,
)
self.arguments = None  # Flush arguments to force recompute
self._skip_hash = True  # Skip future checks
```
If we call it once with non-hashable stuff and afterwards always with hashable arguments, `_skip_hash` never gets reset, right?
Correct, you are back in the safe zone: the cached arguments are always `None`, so you always do the marshalling.
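To make that behavior concrete, here is a minimal sketch of the caching strategy under discussion. The names `arguments` and `_skip_hash` follow the diff above; the class, `_marshal`, and the exact cache layout are hypothetical stand-ins, not the actual NDSL implementation:

```python
import warnings

class OrchestratedCallable:
    """Hypothetical sketch: cache marshalled arguments under a hash key;
    the first non-hashable input disables caching for good."""

    def __init__(self):
        self.arguments = None    # (hash, marshalled) cache, or None
        self._skip_hash = False  # set once non-hashable inputs are seen

    def _marshal(self, args):
        # Stand-in for the real Python-object-to-C-pointer marshalling.
        return tuple(repr(a) for a in args)

    def __call__(self, *args):
        if self._skip_hash:
            # Safe zone: the cache stays None, marshalling runs every call.
            return self._marshal(args)
        try:
            key = hash(args)
        except TypeError:
            warnings.warn(
                "Non-hashable arguments passed to orchestrated code: "
                "argument caching disabled, expect marshalling overhead.",
                stacklevel=2,
            )
            self.arguments = None   # Flush arguments to force recompute
            self._skip_hash = True  # Skip future checks
            return self._marshal(args)
        if self.arguments is None or self.arguments[0] != key:
            self.arguments = (key, self._marshal(args))
        return self.arguments[1]
```

This matches the review exchange: once `_skip_hash` is set, the cached arguments stay `None` and every subsequent call (hashable or not) goes through marshalling again.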
```python
if config.get_backend() == "dace:cpu_kfirst":
    passes.extend(
        [
            CleanUpScheduleTree(),
```
At some point it might make sense to write full pipelines for backends that can be fetched from just the backend name, but that is not for now.
Yeah, I think one of the things a better Backend concept would do is carry its default optimizations!
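As a rough sketch of that idea, the backend string could key into a registry of default pass pipelines. `CleanUpScheduleTree` comes from the diff above; the registry and the function are hypothetical illustrations, not existing API:

```python
class CleanUpScheduleTree:
    """Placeholder for the real schedule-tree clean-up pass."""

def default_pipeline(backend: str) -> list:
    """Hypothetical lookup: fetch a backend's default optimization
    passes from just the backend name."""
    pipelines = {
        "dace:cpu_kfirst": [CleanUpScheduleTree()],
        # other backends would carry their own default optimizations
    }
    return pipelines.get(backend, [])
```

The call site in the diff would then shrink to `passes.extend(default_pipeline(config.get_backend()))`, with no per-backend branching.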
Description
DaCe orchestration was plagued by slow integration due to the routine marshalling of Python objects (arrays passed as arguments, but also the closure) into C-binding-ready pointers for calling into the C library.
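For illustration, this is the kind of per-call marshalling in question, sketched with stdlib `ctypes` (the function and its signature are hypothetical, not the actual DaCe binding code):

```python
import ctypes

def marshal_doubles(values):
    """Hypothetical illustration: turn a Python list of floats into a
    C-ready double* plus length -- the per-call work that argument
    caching is meant to avoid repeating."""
    n = len(values)
    c_array = (ctypes.c_double * n)(*values)  # copies into C memory
    return ctypes.cast(c_array, ctypes.POINTER(ctypes.c_double)), n
```

Doing this for every array argument and every closure variable on every call is what made the original integration slow.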
We originally cached everything at first call, but that led to instability: any re-allocation or different argument passed to the same program silently failed, and we reverted it.
This PR introduces proper argument hashing that reduces the overhead to a negligible impact on runtime while keeping stability for changing arguments. The hypothesis goes as follows:
This PR also updates the orchestration pipeline:
- `dace:cpu_KJI` (feat[cartesian]: Layout & Schedule pairing for `dace:X` GridTools/gt4py#2426)
- `dace` auto-optimizer by default

How has this been tested?
Unit tests and on the microphysics benchmark conducted with GEOS.
Checklist