Issue #1536 reduce values calls splitting #1537

JoerivanEngelen · 2025-05-23T14:55:02Z

Description

Remove unnecessary calls to .values
Fix bug in _skip_if_datarray where the spatial dimension of unstructured grids was hardcoded
Refactor exchange creation logic to use xr.Dataset().to_dataframe(), this to carefully merge variables into a dataset with matching dimensions, then convert to pandas dataframe.

I can't verify yet if this improves things with Teun's example as I do not have his scripts, but this fixes a bug and will improve performance somewhat when using dask (as it reduces unnecessary loads into memory)

Checklist

Links to correct issue
Update changelog, if changes affect users
PR title starts with Issue #nr, e.g. Issue #737
Unit tests were added
If feature added: Added/extended example

…ove references to .values

…rge into dataset

Manangka · 2025-05-27T08:46:19Z

imod/tests/test_common/test_utilities/test_mask_util.py

+    with raise_if_dask_computes():
+        assert _skip_dataarray(grid) is False
+        assert _skip_dataarray(xr.DataArray(True)) is True
+        assert _skip_dataarray(layer_da) is True


Line 22 and 23 is independent of the cases provided to this test. You could put them in a separate test.
Maybe name the tests something along:
test_skip_dataarray_grids_types
test_skip_dataarray_non_grid_types

Good point, I decided to add separate cases for the layered constants and constants to GridCases, which I renamed to DataArrayCases.

imod/mf6/exchangebase.py

Manangka · 2025-05-27T08:48:52Z

imod/mf6/exchangebase.py

+                )
+
+        all_geometric_vars = ["ihc", "cl1", "cl2", "hwva", "angldegx", "cdist"]
+        for var in all_geometric_vars:


You can combine these line
for var in all_geometric_vars if var in in self.dataset.data_vars:

This doesn't work without a list/dict comprehension. So it would end up in this:

geometric_vars = ["ihc", "cl1", "cl2", "hwva", "angldegx", "cdist"] vars_to_render.update({ var: (index_dim, self.dataset[var].data) for var in geometric_vars if var in self.dataset.data_vars })

Which looks more complicated than currently.

Manangka · 2025-05-27T08:51:47Z

imod/mf6/exchangebase.py

+        for var in all_geometric_vars:
+            if var in self.dataset.data_vars:
+                vars_to_render[var] = (index_dim, self.dataset[var].data)
+        datablock = xr.merge([vars_to_render], join="exact").to_dataframe()


What does merge on a single object do?

xr.merge merges all dictionaries which map variable names to DataArrays into a single xr.Dataset. So with one element in the list, the dictionary is converted to a xr.Dataset. xr.merge doesn't support providing a single dictionary directly.

imod/mf6/exchangebase.py

sonarqubecloud · 2025-05-27T13:45:17Z

Quality Gate passed

Issues
0 New issues
0 Accepted issues

Measures
0 Security Hotspots
0.0% Coverage on New Code
0.0% Duplication on New Code

See analysis details on SonarQube Cloud

JoerivanEngelen added 5 commits May 23, 2025 15:37

Remove unnecessary calls to .values

db0a612

Refactor code to create pandas dataframe directly from xarray and rem…

ffe54b5

…ove references to .values

Add test for skip_dataarray not loaded into memory

5c1c1c0

Avoid using hardcoded unstructured grid dim

77a6547

Update changelog

e4ed9a6

JoerivanEngelen requested a review from Manangka May 23, 2025 14:55

JoerivanEngelen added 2 commits May 23, 2025 17:02

Update changelog

0b80853

Refactor further to avoid unnecessary dataset copies and carefully me…

cdcadfb

…rge into dataset

Manangka requested changes May 27, 2025

View reviewed changes

JoerivanEngelen added 3 commits May 27, 2025 13:00

Expand test cases

11eae02

Apply Manangka's suggestion

fe4214a

Merge branch 'master' into issue_#1536_reduce_values_calls_splitting

fc94852

Manangka approved these changes May 27, 2025

View reviewed changes

Add missing layer index

90a26d8

JoerivanEngelen enabled auto-merge May 27, 2025 11:37

JoerivanEngelen added 4 commits May 27, 2025 14:01

Fix call to non-existing method

551cb18

Merge branch 'master' into issue_#1536_reduce_values_calls_splitting

a7ffe1a

Properly determine is_structured

34d98ed

Take proper dim size regardless of name for is_structured

136eb9f

JoerivanEngelen added this pull request to the merge queue May 27, 2025

Merged via the queue into master with commit 4ff9db9 May 27, 2025
7 checks passed

JoerivanEngelen deleted the issue_#1536_reduce_values_calls_splitting branch May 27, 2025 14:54

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Issue #1536 reduce values calls splitting #1537

Issue #1536 reduce values calls splitting #1537

Uh oh!

JoerivanEngelen commented May 23, 2025 •

edited

Loading

Uh oh!

Manangka May 27, 2025

Uh oh!

JoerivanEngelen May 27, 2025

Uh oh!

Uh oh!

Manangka May 27, 2025

Uh oh!

JoerivanEngelen May 27, 2025

Uh oh!

Manangka May 27, 2025

Uh oh!

JoerivanEngelen May 27, 2025

Uh oh!

Uh oh!

sonarqubecloud bot commented May 27, 2025

Uh oh!

Uh oh!

Uh oh!

Issue #1536 reduce values calls splitting #1537

Issue #1536 reduce values calls splitting #1537

Uh oh!

Conversation

JoerivanEngelen commented May 23, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Checklist

Uh oh!

Manangka May 27, 2025

Choose a reason for hiding this comment

Uh oh!

JoerivanEngelen May 27, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Manangka May 27, 2025

Choose a reason for hiding this comment

Uh oh!

JoerivanEngelen May 27, 2025

Choose a reason for hiding this comment

Uh oh!

Manangka May 27, 2025

Choose a reason for hiding this comment

Uh oh!

JoerivanEngelen May 27, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

sonarqubecloud bot commented May 27, 2025

Quality Gate passed

Uh oh!

Uh oh!

Uh oh!

JoerivanEngelen commented May 23, 2025 •

edited

Loading