BUG: Matrix multiplication in windows produces large numerical errors inconsistently in numpy >= 2 #27036
Comments
It does seem to be a problem with OpenBLAS and threading on Windows. NumPy 1.26.4 uses
I wonder how our CI passed
I don't think this is related to the kernels chosen. The test still fails when I set
What does 0.3.27.44 correspond to, git-wise? If post-0.3.27, any chance this could be related to the NaN handling changes in SCAL that I merged on June 29?
For what it's worth, sunpy has this test because 4 years ago we encountered intermittent failures with OpenBLAS and multithreading on macOS (see sunpy/sunpy#4290 (comment)). Those failures went away eventually with a subsequent release of OpenBLAS.
Importantly, there were (and maybe still are) bad interactions when a threaded program used OpenBLAS that was configured to use threads as well. That is why setting OpenBLAS to use a single thread "fixed" issues.
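For reference, pinning OpenBLAS to a single thread can be done either with an environment variable or with the third-party `threadpoolctl` package. The snippet below is a minimal sketch of both approaches; the array shapes are illustrative, not the sunpy test:

```python
import os
os.environ["OPENBLAS_NUM_THREADS"] = "1"   # must be set before numpy is first imported

import numpy as np
from threadpoolctl import threadpool_limits  # pip install threadpoolctl

a = np.random.default_rng(0).random((500_000, 2))
b = np.random.default_rng(1).random((2, 2))

# Alternatively, limit the BLAS thread pool only around the suspect call.
with threadpool_limits(limits=1, user_api="blas"):
    c = a @ b
```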
Windows gets a lot less attention, and there were a few PRs merged between 0.3.23 (whatever build your -293 is) and 0.3.27-whatever that (supposedly) improved multithreading on that platform. Notably OpenMathLib/OpenBLAS#4359 - not trying to blame anyone, just that this would be the most serious change (merged in December, so debuting in 0.3.26).
Is it possible to test this against an earlier (pre-0.3.26) OpenBLAS version to isolate whether this could be related to the Windows threading changes? @martin-frbg do we have existing validation tests that would/should show this issue? Does this show up for any NTHREADS > 1? Do we have a C repro for this yet? If not, I will attempt to translate the Python to C and see if I can reproduce/debug it.
I have just managed to set up a virtual environment for reproducing this (with a pip-installed numpy - I did not get conda to install anything other than mkl or netlib even if it claimed to have switched to openblas). It definitely happens with as few as 4 cores (though not on every run), but I have not yet gotten it to fail with just two. (Unfortunately I haven't figured out how to downgrade the openblas version; pip tells me there are no packages that match the version requirement.)
3 threads are definitely enough to reproduce it (on the fourth try); I still haven't seen a failure with 2 in about 20 runs so far.
The openblas implementation is in
Note I am close to uploading a newer scipy-openblas64 based on latest OpenBLAS HEAD. You can download a pre-release from https://anaconda.org/scientific-python-nightly-wheels/scipy-openblas64/files. You want scipy_openblas64-0.3.27.341.0-py3-none-win_amd64.whl.
@mattip thank you very much. No failures seen so far with 4 threads and the 0.3.24.95.1 version of the library copied over the original libscipy_openblas64_-fb1711452d4d8cee9f276fd1449ee5c7.dll in numpy.libs.
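For anyone else trying the same DLL swap, something along these lines works. This is a hedged sketch: the rebuilt-DLL path is hypothetical, and the hashed file name in `numpy.libs` differs between installations:

```python
import glob
import os
import shutil
import numpy

# numpy.libs usually sits next to the numpy package inside site-packages.
libs_dir = os.path.join(os.path.dirname(numpy.__file__), os.pardir, "numpy.libs")
rebuilt = r"C:\path\to\rebuilt\libscipy_openblas64_.dll"   # hypothetical path

for target in glob.glob(os.path.join(libs_dir, "libscipy_openblas64_-*.dll")):
    shutil.copyfile(rebuilt, target)   # overwrite the bundled OpenBLAS DLL
```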
@Siddharth-Latthe-07 is that a chatgpt-generated comment? Your comment does not add to the body of knowledge around the issue. We already know it is related to OpenBLAS, and that it is related to some thread-related problem inside OpenBLAS that only manifests itself on Windows, so your analysis of the problem is not helpful. Converting the wheels we ship to another BLAS/LAPACK implementation is not something we are considering at this time.
I've seen this happen with dtype = float32 as well. @martin-frbg, in validating that this doesn't happen with earlier versions, have you by chance been able to grab a log file with server tracing enabled?
How can I help provide a version with that enabled?
@mattip if OpenBLAS is built with SMP_DEBUG defined then the thread server will send debug logging to stderr. That may help hint at what is going on.
@mseminatore have not gotten around to using my own build in that context yet, sorry. Will try tonight.
@martin-frbg in looking over the Windows server code I note the merge of OpenMathLib/OpenBLAS#4577 in April, which says it changes how threads are allocated and introduces new functions. Not trying to shift the focus elsewhere, but it would be helpful to rule out that merge as a potential source of these issues. @mattip is there any sense for when this issue started? IOW, is the repro test case something that is run regularly or only when an issue is identified? As Martin pointed out, the most significant code change to Windows threading was my merge (4359) on Dec. 5th 2023, so that is a prime candidate and I will look over the code, but I am wondering if that long a latency in discovering the issue is expected.
#4577 was merged only after the 0.3.27 release, so probably/hopefully not in that build yet.
(conda issue is conda-forge/openblas-feedstock#160 (comment))
Thank you @martin-frbg that is helpful to know! I will continue to review the code and attempt to debug the repro in pursuit of a working theory and a potential fix. Please let me know should you decide that you would prefer reverting the code, given that this is not the first issue we've encountered and that this is having downstream impact. Regardless of whether we keep or revert the Windows code, I hope we can expand test coverage to allow us to catch similar issues earlier in the future.
I'll see if I can come up with anything tonight. We are already close to a month past my tentative release date for 0.3.28 due to various stability problems (including with my health), so I guess it would not be a problem to add another week in case it becomes obvious where this goes wrong.
Not sure if this output from a build with SMP_DEBUG is going to help (stopped the program after it reported the first couple of mismatches).
@mattip do you happen to know the magnitude of the errors? Is it just outside of epsilon or large? Trying to roughly characterize this as either lost work or error accumulation to guide the investigation.
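One way to answer that question from the Python side is a small helper like the one below. It is a hypothetical sketch, not code from this thread, and the tolerance is an arbitrary choice:

```python
import numpy as np

def report_mismatch(result, expected, rtol=1e-9):
    """Print how many entries disagree and how big the disagreement is."""
    diff = np.abs(result - expected)
    bad = diff > rtol * np.maximum(np.abs(expected), 1.0)   # crude mixed tolerance
    if bad.any():
        rel = diff[bad] / np.maximum(np.abs(expected[bad]), np.finfo(result.dtype).tiny)
        print(f"{bad.sum()} bad entries, max abs err {diff.max():.3e}, "
              f"max rel err {rel.max():.3e}")
    else:
        print("no mismatches")
```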
No, I am still seeing problems even with this.
If I shrink the size of
Sorry, I do not understand - does that mean your 0.3.27.44 happened to be OpenBLAS-0.3.27 exactly, or that its exact commit hash past 0.3.27 can no longer be determined? (de465ffd would appear to be shortly before 4577 landed, so I think that other thread-handling change should not have any bearing if the sunpy folks used "0.3.27.44")
Good catch, but that would be extra weird as mseminatore's work was entirely threading infrastructure not concerned with data sizes, and I do not recall any PR in the relevant timeframe that could have replaced a "blasint" with a standard int (rendering your regular INTERFACE64=1 build option useless). I had started bisecting, but ran into weird stability issues with the VM and/or compiler, so I need to redo everything...
Yes
The last issue was a bug in queue management where work submitted re-entrantly was lost. Given the way the join logic works, I am reviewing the log file that Martin captured to see if there are any hints there.
OK, looking over the log file, I am seeing the following pattern for work submission:
This suggests that each call to … I would expect the matrix to be converted into a number of sub-tiles/sub-tasks in a work_queue (such that the sub-tiles of A, B, and C all fit in the L2$), but I am not familiar with the OpenBLAS L3 work model. My math says that an MxN matrix of order (2, 500000) of float64 would be 8 MB. Therefore A, B, and C would be 8 MB, 32 B (2 * 2 * 8), and 8 MB, thus 16 MB in total, larger than any L2$ I've seen. As an example, my own BLAS code decomposes this GEMM call into 1954 sub-tasks in the work_queue. But I am dealing with a family medical situation, it's late, and I'm tired, so I may be thinking about it wrongly.
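The arithmetic quoted above checks out when written out explicitly; the shape assignment below is one plausible reading of the GEMM in question, not taken from the sunpy test:

```python
m, n, k = 500_000, 2, 2      # C(m, n) = A(m, k) @ B(k, n); assumed shapes
itemsize = 8                 # float64
a_bytes = m * k * itemsize   # 8_000_000 bytes -> ~8 MB
b_bytes = k * n * itemsize   # 32 bytes
c_bytes = m * n * itemsize   # 8_000_000 bytes -> ~8 MB
print((a_bytes + b_bytes + c_bytes) / 1e6, "MB total")   # ~16 MB, far larger than any L2 cache
```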
Saw this too, maybe it is the extreme imbalance of matrix dimensions (and OpenBLAS failing to split on N).
Sorry, so far I have only managed to trash my build environment - the ancient m2w64-gcc (5) from conda (that I am sure worked on another system on the weekend) suddenly throws internal compiler errors when building OpenBLAS.
Now I am getting myself confused. Rerunning the test with a 0.3.26 OpenBLAS fails, both when I use the DLL from the 0.3.26.4 wheel and also when I rebuild OpenBLAS v0.3.26. I am using this to rebuild OpenBLAS with the compilers from the rtools package. This installs gcc/gfortran 10.3.
Then I can copy the DLL over the one used by numpy. Unfortunately, the prefixing does not work correctly pre-0.3.26, so I will continue bisecting backwards from 0.3.26 using NumPy 1.26.4 (which does not have any
I have 0.3.26 building for NumPy 1.26.4 (using only
Interesting, that would mean I am in the clear with my GEMM thread count change from PR 4585. Thanks for the chocolatey pointer; looks like I'm up and running with that again now.
I have a good commit with v0.3.25. Now I can bisect.
Hmm. Good is at
Thanks - my bisect is four steps from completion; I am also building a current HEAD with 4359 manually reverted now, "just in case".
Indeed, HEAD with the changes from 4359 reverted appears to work fine - whether that means the problem is in 4359 or only uncovered by it remains to be seen. (Bisect still needs two steps but appears to be gravitating towards the same result.)
Thank you @mattip and @martin-frbg for the bisect work. I am sorry that I can't more actively participate (for personal reasons) but will continue inspecting the code to see if I can identify a cause and fix.
I think it is equally possible that the actual problem lies somewhere in driver/level3/level3_thread.c (though it has its own CriticalSection), but I lack the Windows developer experience to assess this.
I haven't looked very closely at that code, but I will take a look.
Any further thoughts about OpenMathLib/OpenBLAS#4835 and whether NumPy should build its OpenBLAS with that PR as a patch to move the NumPy 2.1 release forward?
Not sure if that counts as thoughts, but my current intention is to merge 4835 in order to get the 0.3.28 release out in the next couple of days (probably not today though).
I backported the changes in OpenBLAS's 4835 into the 0.3.27.44.4 version of scipy-openblas, and used those wheels in #27140. I also added the test to prevent this from regressing again. Closing. Please reopen or open a new issue if I missed something.
Describe the issue:
We discovered a simple test failing on Windows only: sunpy/sunpy#7754
I have investigated the symptoms of this in some detail but have not tried to find the cause. In short, it seems like matrix multiplications with largish numbers fail inconsistently on Windows, and when they do fail they will fail a few times in a row before returning to normal. I would suspect OpenBLAS, but the failure never occurs with numpy < 2.
Reproduce the code example:
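The original code block did not survive here; the following is a hedged reconstruction based on details mentioned elsewhere in this thread (an (N, 2) float64 product with a 2x2 matrix, N around 500,000, intermittent failures). The shapes, values, and transform matrix are illustrative assumptions, not the sunpy test itself.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 500_000
vectors = rng.uniform(-1000.0, 1000.0, size=(n, 2))
transform = np.array([[-10.0, 1.0],
                      [1.0, 0.0]])            # arbitrary 2x2 matrix for illustration

# Reference computed element-wise, so it never goes through the threaded BLAS path.
expected = vectors[:, [0]] * transform[0] + vectors[:, [1]] * transform[1]

for attempt in range(50):                     # the failure is intermittent, so repeat
    result = vectors @ transform              # dispatched to OpenBLAS dgemm
    if not np.allclose(result, expected):
        bad = np.unique(np.nonzero(~np.isclose(result, expected))[0])
        print(f"attempt {attempt}: {bad.size} bad rows, first at index {bad[0]}")
        break
else:
    print("no mismatch seen in 50 attempts")
```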
Error message:
This bug is absolutely wild by the way:
With numpy 2.0.1:
With numpy 2.0.0:
With 1.26.4 it passes.
So it is a numpy 2 issue and not a 2.0.1 issue... it is also random:
Inserting a return and a break on the first time the matmul fails:
In a given set of multiplications the errors always have the same values.
As you can see, they also have the weird symmetry where the first column is -10× the second column, which I guess makes some twisted sense because that's what the calculation is.
The errors are also always in the same region, i.e. with indices > 400k or so.

They also always fail for multiple indices in a row; e.g. you might have 20 or 50 or whatever failed calculations in a row, but then it starts working again.
e.g.
or
You can see in one of the plots that there has been the same failure for two separate sequences of indices.
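To make the "sequences of indices" pattern explicit, a helper like the following (hypothetical, not from the report) can group the failing rows into consecutive runs:

```python
import numpy as np

def failing_runs(result, expected):
    """Return (first, last) index pairs for each consecutive run of failing rows."""
    bad_rows = np.unique(np.nonzero(~np.isclose(result, expected))[0])
    if bad_rows.size == 0:
        return []
    breaks = np.where(np.diff(bad_rows) > 1)[0] + 1   # split where the index gap exceeds 1
    return [(int(run[0]), int(run[-1])) for run in np.split(bad_rows, breaks)]
```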
Python and NumPy Versions:
Windows 10; I cannot recreate this at all on Linux.
This also failed on our CI on a windows-latest runner.
https://github.com/sunpy/sunpy/actions/runs/10072095800/job/27843572893#step:10:5160
Runtime Environment:
Context for the issue:
Any matrix multiplication result could go wrong at any time on Windows.