ENH: add Spatial Adaptive Agglomerative Aggregation (SA3) regionalisation algorithm #482

u3ks · 2025-04-18T13:32:06Z

Hi all,

We implemented an algorithm to delineate contiguous areas within cities that have identical characteristics and configurations of buildings and streets, but we thought it might be useful for other applications since the procedure is quite generic.

The idea is that is a kind of spatially restricted HDBSCAN, so there is only one parameter to specify - the minimum number of observations to form a cluster. The procedure basically consists of two steps: first, carrying out a full spatially, restricted sklearn.cluster.AgglometariveClustering clustering; and second, extracting clusters from the resulting linkage matrix, using density-clustering extraction algorithms - Excess of Mass or Leaf. This results in multiscale (clusters have varying ranges of internal similarity), contiguous clusters with noise (some observations are not attached to any clusters).

I try to explain more how it works, examples and advantages and disadvantages in the sa3.ipynb notebook.

spopt/region/sa3.py

martinfleis · 2025-05-06T13:37:34Z

spopt/region/sa3.py

+
+    The algorithm carries out ``sklearn.cluster.AgglometariveClustering``
+    per the specified parameters and extracts clusters from it, using density-clustering
+    extraction algorithms - Excess of Mass or Leaf. This results in multiscale,


Can we have some reference to what EoM and Leaf mean?

martinfleis · 2025-05-06T13:39:51Z

spopt/region/sa3.py

+
+from libpysal.graph import Graph
+from libpysal.weights import W
+from numpy import column_stack, full, unique, where, zeros


can you import numpy as np and use np.where etc?

martinfleis · 2025-05-06T13:40:01Z

spopt/region/sa3.py

+from libpysal.graph import Graph
+from libpysal.weights import W
+from numpy import column_stack, full, unique, where, zeros
+from pandas import Series, concat


Same as with numpy.

martinfleis · 2025-05-06T13:41:10Z

spopt/region/sa3.py

+        gdf,
+        w,
+        attrs_name,
+        min_cluster_size=15,


The default is meaningless without knowing the use case. Shall we maybe make it a required arg?

codecov · 2025-05-06T19:08:53Z

Codecov Report

Attention: Patch coverage is 92.30769% with 6 lines in your changes missing coverage. Please review.

Project coverage is 78.2%. Comparing base (13ca45e) to head (ed383c4).
Report is 9 commits behind head on main.

Files with missing lines	Patch %	Lines
spopt/region/sa3.py	92.2%	6 Missing ⚠️

Additional details and impacted files

@@           Coverage Diff           @@
##            main    #482     +/-   ##
=======================================
+ Coverage   77.8%   78.2%   +0.4%     
=======================================
  Files         27      28      +1     
  Lines       2638    2716     +78     
=======================================
+ Hits        2053    2125     +72     
- Misses       585     591      +6

Files with missing lines	Coverage Δ
spopt/region/__init__.py	`100.0% <100.0%> (ø)`
spopt/region/sa3.py	`92.2% <92.2%> (ø)`

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

martinfleis · 2025-05-06T19:10:14Z

@jGaboardi can we bump lipysal min req to 4.10 here?

jGaboardi · 2025-05-06T19:12:38Z

@jGaboardi can we bump lipysal min req to 4.10 here?

Let's open an issue for that and discuss with @knaaptime, @gegen07, @ljwolf. I think it will be OK, but want to get their inputs.

martinfleis

This is now fine by me! @u3ks I pushed some changes to future-proof the API (extraction keyword that is not a bool) and to clean the API (kwargs passed directly to sklearn rather than via a dedicated dictionary).

knaaptime · 2025-05-08T15:39:28Z

cool

gegen07

LGTM

…tion algorithm (pysal#482) * init * notebook * public extraction api * notebook * notebook * change cluster renumbering * formatting and docstrings * reorder imports * ci change * more ci changes * formatting * more formatting * test failures * typo * Update sa3.py * backwards compat * load data within setup_method * lint * better API * fix tests i broke --------- Co-authored-by: Martin Fleischmann <[email protected]>

u3ks added 9 commits March 21, 2025 14:24

init

865eacf

notebook

920dddc

public extraction api

5e70e76

notebook

a2990c4

notebook

247fbac

change cluster renumbering

c8dbb4d

formatting and docstrings

cdb0fed

Merge branch 'main' into sa3_algo

d816b3e

reorder imports

27c3335

martinfleis self-requested a review April 18, 2025 17:22

jGaboardi requested review from gegen07 and knaaptime April 19, 2025 16:38

jGaboardi assigned u3ks Apr 19, 2025

jGaboardi added enhancement New feature or request region labels Apr 19, 2025

u3ks added 2 commits May 6, 2025 15:22

ci change

8b4e685

Merge branch 'main' into sa3_algo

923985e

martinfleis reviewed May 6, 2025

View reviewed changes

u3ks and others added 6 commits May 6, 2025 16:09

more ci changes

8661ebb

formatting

26abaa3

more formatting

c219c47

test failures

9a10a15

typo

3994008

Update sa3.py

81703e5

backwards compat

70110b0

martinfleis mentioned this pull request May 7, 2025

Bump libpysal req to 4.10 #485

Closed

load data within setup_method

374b136

martinfleis added 3 commits May 7, 2025 10:56

lint

22fb0ae

better API

998d5e9

fix tests i broke

ed383c4

martinfleis changed the title ~~Spatial Adaptive Agglomerative Aggregation (SA3) clustering~~ ENH: add Spatial Adaptive Agglomerative Aggregation (SA3) regionalisation algorithm May 7, 2025

martinfleis approved these changes May 7, 2025

View reviewed changes

knaaptime approved these changes May 8, 2025

View reviewed changes

gegen07 approved these changes May 8, 2025

View reviewed changes

martinfleis merged commit 5e8fcad into pysal:main May 8, 2025
11 checks passed

jGaboardi mentioned this pull request Jun 11, 2025

add optional reqs needed from sa3 #489

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

ENH: add Spatial Adaptive Agglomerative Aggregation (SA3) regionalisation algorithm #482

ENH: add Spatial Adaptive Agglomerative Aggregation (SA3) regionalisation algorithm #482

Uh oh!

u3ks commented Apr 18, 2025 •

edited

Loading

Uh oh!

Uh oh!

martinfleis May 6, 2025

Uh oh!

martinfleis May 6, 2025

Uh oh!

martinfleis May 6, 2025

Uh oh!

martinfleis May 6, 2025

Uh oh!

codecov bot commented May 6, 2025 •

edited

Loading

Uh oh!

martinfleis commented May 6, 2025

Uh oh!

jGaboardi commented May 6, 2025

Uh oh!

martinfleis left a comment

Uh oh!

knaaptime commented May 8, 2025

Uh oh!

gegen07 left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

ENH: add Spatial Adaptive Agglomerative Aggregation (SA3) regionalisation algorithm #482

ENH: add Spatial Adaptive Agglomerative Aggregation (SA3) regionalisation algorithm #482

Uh oh!

Conversation

u3ks commented Apr 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

martinfleis May 6, 2025

Choose a reason for hiding this comment

Uh oh!

martinfleis May 6, 2025

Choose a reason for hiding this comment

Uh oh!

martinfleis May 6, 2025

Choose a reason for hiding this comment

Uh oh!

martinfleis May 6, 2025

Choose a reason for hiding this comment

Uh oh!

codecov bot commented May 6, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

martinfleis commented May 6, 2025

Uh oh!

jGaboardi commented May 6, 2025

Uh oh!

martinfleis left a comment

Choose a reason for hiding this comment

Uh oh!

knaaptime commented May 8, 2025

Uh oh!

gegen07 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

u3ks commented Apr 18, 2025 •

edited

Loading

codecov bot commented May 6, 2025 •

edited

Loading