Speed up trim_zeros #16783

jonashaag · 2020-07-08T13:16:38Z

import numpy as np

a = np.hstack([
    np.zeros((100_000,)),
    np.random.uniform(size=(100_000,)),
    np.zeros((100_000,)),
])
np.trim_zeros(a)

Here the call to trim_zeros takes about 50ms.
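
For anyone who wants to reproduce the measurement, a minimal timing sketch might look like the following. The fixed seed, the strictly non-zero middle section, and the best-of-5 timeit setup are assumptions, not part of the original report; absolute numbers will vary with the machine and the NumPy version.

```python
import timeit

import numpy as np

# Fixed seed so the array is reproducible (an assumption; the original
# report used unseeded random data).
rng = np.random.default_rng(0)
a = np.hstack([
    np.zeros(100_000),
    rng.uniform(low=0.1, high=1.0, size=100_000),  # strictly non-zero middle
    np.zeros(100_000),
])

# Best of 5 single calls, reported in milliseconds.
best = min(timeit.repeat(lambda: np.trim_zeros(a), number=1, repeat=5))
trimmed = np.trim_zeros(a)
print(f"np.trim_zeros: {best * 1e3:.2f} ms, kept {trimmed.size} of {a.size} items")
```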

Looking at the implementation of trim_zeros, it is implemented in the most obvious and unoptimized way imaginable (a for loop looking at each item separately).

I think there should be a warning in the documentation about the fact that it's entirely unoptimized and may be horrendously slow, or we should strive to improve performance.

As an implementation idea to improve performance, I prototyped a "block-wise" trim function to be used before trim_zeros:

def fast_trim_zeros(filt, trim='fb'):
    filt = trim_zeros_block(filt, trim)
    return np.trim_zeros(filt, trim)


def trim_zeros_block(filt, trim='fb', block_size=1024):
    """Trim whole blocks of zeros; np.trim_zeros handles the remainder."""
    trim = trim.upper()
    first = 0
    if 'F' in trim:
        for i in range(0, len(filt), block_size):
            if np.any(filt[i:i+block_size] != 0.):
                first = i
                break
    last = len(filt)
    if 'B' in trim:
        # Start at len(filt), not len(filt) - 1, so that the final element
        # is inspected and can never be cut off.
        for i in range(len(filt), 0, -block_size):
            if np.any(filt[max(i - block_size, 0):i] != 0.):
                last = i
                break
    return filt[first:last]

A call to fast_trim_zeros takes about 2ms, so it is roughly 25x as fast.

Qiyu8 · 2020-07-09T02:21:50Z

Good to hear that. Can you provide a pull request and a corresponding benchmark test case?

jonashaag · 2020-07-09T07:40:08Z

Are you saying that I should submit a PR with benchmark code or also with the code I suggested above? If the latter, there are probably hundreds of ways to implement it and the code above is just the first thing that came to my mind; so why use exactly that code?

BvB93 · 2020-07-09T11:59:08Z

As an implementation idea to improve performance, I prototyped a "block-wise" trim function to be used before trim_zeros

How about converting the passed object into a boolean array and then using np.argmax() to find the first/last non-zero element?
With your previously defined example array I'm seeing an increase in execution speed of ~2 orders of magnitude (398 µs versus 37 ms).

import numpy as np

def trim_zeros(filt, trim='fb'):
    a = np.asanyarray(filt, dtype=bool)
    if a.ndim != 1:
        raise ValueError('trim_zeros requires an array of exactly one dimension')

    trim_upper = trim.upper()
    len_a = len(a)
    i = j = None
    
    if 'F' in trim_upper:
        i = a.argmax()
        if not a[i]:  # i.e. all elements of `filt` evaluate to `False`
            return filt[len_a:]

    if 'B' in trim_upper:
        j = len_a - a[::-1].argmax()
        if not a[j - 1]:  # i.e. all elements of `filt` evaluate to `False`
            return filt[len_a:]

    return filt[i:j]

eric-wieser · 2020-07-09T12:05:55Z

Does that code work without the if not ...s?

BvB93 · 2020-07-09T12:15:44Z

Does that code work without the if not ...s?

Without the if not ... it will fail if the input array consists entirely of zeros,
in which case argmax() will always return 0 and thus filt[i:j] == filt.

>>> import numpy as np

>>> a = np.zeros(10, dtype=bool)
>>> i = a.argmax()
>>> j = len(a) - a[::-1].argmax()

>>> print(i, j)
0 10

>>> print(np.all(a == a[i:j]))  # Uhoh, `a` is not being trimmed
True

BvB93 · 2020-07-09T12:18:08Z

Another option is to check with np.any() right at the beginning, though this appears to be a bit slower.
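
A minimal sketch of that np.any() variant, for comparison. The function name trim_zeros_any_first is hypothetical, and the structure simply mirrors the argmax-based code above with the per-branch all-zero checks replaced by a single early exit; it is not the code from any pull request.

```python
import numpy as np


def trim_zeros_any_first(filt, trim='fb'):
    """Hypothetical variant: one up-front np.any() check, then argmax."""
    a = np.asanyarray(filt, dtype=bool)
    if a.ndim != 1:
        raise ValueError('trim_zeros requires an array of exactly one dimension')

    trim_upper = trim.upper()
    front, back = 'F' in trim_upper, 'B' in trim_upper

    # Single early exit: covers all-zero and empty input whenever any
    # trimming was requested.
    if (front or back) and not a.any():
        return filt[len(a):]

    i = j = None
    if front:
        i = a.argmax()                   # index of the first non-zero item
    if back:
        j = len(a) - a[::-1].argmax()    # one past the last non-zero item
    return filt[i:j]


print(trim_zeros_any_first(np.array([0, 0, 1, 2, 0])))  # [1 2]
print(trim_zeros_any_first(np.zeros(4)))                # []
```

The early exit costs one extra pass over the boolean array, which fits the observation that this version appears to be a bit slower.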

eric-wieser · 2020-07-09T12:28:31Z

in which case argmax() will always return 0

Ah, I thought it might return len(a) - 1

BvB93 · 2020-07-11T17:07:34Z

Shall I create a pull request with the implementation as proposed above?

BvB93 · 2020-07-20T12:04:58Z

I've just created a pull request for the issue at #16911.

Qiyu8 added the 01 - Enhancement and 28 - Benchmark labels Jul 9, 2020

rossbar added the NumPy Sprint label Jul 11, 2020

rossbar removed the NumPy Sprint label Jul 14, 2020

BvB93 mentioned this issue Jul 20, 2020

ENH: Speed up trim_zeros #16911

Merged

mattip closed this as completed in #16911 Aug 4, 2020

mattip mentioned this issue Aug 27, 2020

BUG: revert trim_zeros changes from gh-16911 #17171

Merged
