Treat units info as nu, make nu a list rather than single function #17

ksunden · 2022-11-04T22:38:50Z

This is mostly a proof of concept/concrete implementation that can be discussed for the idea of allowing nu functions to actually be lists of functions applied successively.

As part of this, the treatment of units has been moved to be in the list of nu functions, which allows transformations to happen either pre- or post- unit conversion, based on the order in the list.

@tacaswell has stated "nu is a property of the artist, the unit convesion is a property of the host Axes", indicating that perhaps this is not the "correct" way to to treat unit conversions.

I do wish to challenge this thought at least a little bit: the actual call to convert units has a signature that is compatible with nu (although admittedly it is a bound method rather than a pure function).
I further wonder if the key to resolving unit inconsistencies is to handle the units separate from at least some of the existing units machinery and coax the existing machinery to work with the new system rather than the other way around. (Note, that is not to say I want users to have to change their code, but rather just that we don't tie ourselves down too early.)

Now... this implementation was pretty hastily put together and none of our currently implemented examples actually use units yet, so it's not even particularly well tested.
Additionally, the implementation conflicts with changes to nu proposed in #15.

story645 · 2022-11-06T00:28:50Z

"nu is a property of the artist, the unit convesion is a property of the host Axes

I'm interpreting that as each artist might need to do separate dunitization depending on what data is being passed in, but that all the conversions have to be consistent such that for multiple artists on one axes:

two artists taking the same unitized data set (for example categoricals) can't encode the values that data set in two different ways (so artists should have the same nu, both artists need to map apple to 0)
the inches and centimeter example works where the input is two different types (and therefore different nus on each artist) but the return unit type is the same

xref: I think @jklymak was trying to do some of this w/ matplotlib/matplotlib#9776

ETA: also this consistency probably needs to be true at the figure level - like apple shouldn't be mapped to 0 in one axes and 1 in another 'cause for example that could lead to different positions in small multiples.

I further wonder if the key to resolving unit inconsistencies is to handle the units separate from at least some of the existing units machinery and coax the existing machinery to work with the new system rather than the other way around

Um for what it's worth, I kinda thought this was the plan.

tacaswell · 2022-11-06T04:03:35Z

The reason that unit conversion belongs the to the Axis (in the Axes) is that there needs to be the matching transform on the tick labels (in the current system we have inverse functions and store the axis limits post-conversion so that the transform stack all "just works", but I think it should be possible to make the whole thing work with only forward functions, but that is a side point).

If we are going to put a tick on the axes and label it '2 in' (or 'apple' or a datetime), then the data at that level should be what the tick says. If we mix the unit conversion up in a list of functions (I think the list of functions is a good idea in general) than it becomes much harder to reach in and make sure it is consistent with all the other places that should have the same unit -> unitless conversion. There are a bunch of issues due to people who make plots with Pandas who install their version of the unitfull -> unitless converters and then people try to use our datetime formatters which expect a different unitless -> unitful conversion (if I recall correctly, in some cases pandas treats time ranges as ~ catagoricals, maps to integers, and then we interpret that as fractional days past an epoch (do not remember when day 0 is, it changed recently) which is in general very wrong).

Another interesting idea to play with is to push on how little the Airtst need to know about their parents. Currently we go through a bunch of work to make sure that the same Artist does not get put in more than one Axes (or Figure) because we store a ref to transData on the instances and you get really weird renderings if you try it (there are closed issues, check the history of why Arist.axes is a property). I think that units and transData are the only two critical pieces of information that the Artist needs and they in principle can be passed in at draw time.

I further wonder if the key to resolving unit inconsistencies is to handle the units separate from at least some of the existing units machinery and coax the existing machinery to work with the new system rather than the other way around.

Part of your job is to sort out the best path on this ;)

tacaswell · 2022-11-06T04:05:15Z

data_prototype/wrappers.py

        self._cache[cache_key] = data
        return data

-    def __init__(self, data, nus, **kwargs):
+    def __init__(self, data, nus, xunits: List[str] = [], yunits: List[str] = [], **kwargs):


Suggested change

def __init__(self, data, nus, xunits: List[str] = [], yunits: List[str] = [], **kwargs):

def __init__(self, data, nus, xunits: Tuple[str] = (,), yunits: Tuple[str] = (,), **kwargs):

tacaswell · 2022-11-06T04:24:17Z

My logic for sticking taping 'query' and 'transform' together is so that the caching logic could be implemented in one place [1]. Maybe the right way to change the signature is to

def _query_and_transform(self, renderer, *, pre_nu, unit_nu, post_nu, invalidate_cache=False):
     ...

(modulo defaults to make them optional) or maybe

def _query_and_transform(self, renderer, *nus, invalidate_cache=False):
     ...

where we can take as many as we want and up to the artist sub-class to order them correctly. I can also see a case for

def _query_and_transform(self, renderer, nus: Dict[str, List], *, invalidate_cache=False):
    ...

Thus we can stuff the linearization / lookup logic still all in one place and we have an outside control that the Artist knows the cache should be invalidated (because one of the three nu's changed).

[1] There are 2 hard problems in computer science:

naming things
cache invalidation
off-by-one bugs

ksunden · 2023-02-09T01:17:28Z

OKay, rebased... I think I got all of the functionality of #15 and this PR...

All of the examples seem to work, though I would like to make one with more unit behavior.

I also think some solution for ordering the units conversion within the nus is needed (currently it is always the last step).

tacaswell reviewed Nov 6, 2022

View reviewed changes

Treat units info as nu, make nu a list rather than single function

65b295b

ksunden force-pushed the list_nu branch from fc65076 to 65b295b Compare February 9, 2023 01:07

STY: blacken

baebd64

ksunden mentioned this pull request May 12, 2023

Ideas regarding "nu" #26

Closed

ksunden mentioned this pull request Jun 6, 2023

Conversion Node implementation of 'nu' #31

Merged

tacaswell closed this in #31 Jun 8, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Treat units info as nu, make nu a list rather than single function #17

Treat units info as nu, make nu a list rather than single function #17

Uh oh!

ksunden commented Nov 4, 2022

Uh oh!

story645 commented Nov 6, 2022 •

edited

Loading

Uh oh!

tacaswell commented Nov 6, 2022

Uh oh!

tacaswell Nov 6, 2022

Uh oh!

tacaswell commented Nov 6, 2022

Uh oh!

ksunden commented Feb 9, 2023

Uh oh!

Uh oh!

	def __init__(self, data, nus, xunits: List[str] = [], yunits: List[str] = [], **kwargs):
	def __init__(self, data, nus, xunits: Tuple[str] = (,), yunits: Tuple[str] = (,), **kwargs):

Treat units info as nu, make nu a list rather than single function #17

Treat units info as nu, make nu a list rather than single function #17

Uh oh!

Conversation

ksunden commented Nov 4, 2022

Uh oh!

story645 commented Nov 6, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

tacaswell commented Nov 6, 2022

Uh oh!

tacaswell Nov 6, 2022

Choose a reason for hiding this comment

Uh oh!

tacaswell commented Nov 6, 2022

Uh oh!

ksunden commented Feb 9, 2023

Uh oh!

Uh oh!

story645 commented Nov 6, 2022 •

edited

Loading