buffer is not large enough when running pytorch on Mac M1 mps #77886

Closed

xiaoouwang opened this issue May 19, 2022 · 34 comments
Labels
module: memory usage PyTorch is using more memory than it should, or it is leaking memory module: mps Related to Apple Metal Performance Shaders framework triaged This issue has been looked at by a team member, and triaged and prioritized into an appropriate module

Comments

@xiaoouwang

xiaoouwang commented May 19, 2022

πŸ› Describe the bug

The bug seems related to #77851

To reproduce the bug:

```python
import torch
from transformers import AutoModel, AutoTokenizer

model_ckpt = "distilbert-base-uncased"
device = torch.device("mps")
model = AutoModel.from_pretrained(model_ckpt).to(device)
tokenizer = AutoTokenizer.from_pretrained(model_ckpt)

text = "this is a test"
inputs = tokenizer(text, return_tensors="pt")
inputs = {k: v.to(device) for k, v in inputs.items()}
with torch.no_grad():
    outputs = model(**inputs)
```

The error message:

```
/AppleInternal/Library/BuildRoots/8d3bda53-8d9c-11ec-abd7-fa6a1964e34e/Library/Caches/com.apple.xbs/Sources/MetalPerformanceShaders/MPSCore/Types/MPSNDArray.mm:782: failed assertion `[MPSNDArray, initWithBuffer:descriptor:] Error: buffer is not large enough. Must be 432 bytes
'
[1] 75519 abort python 02.py
/Users/xiaoou/opt/anaconda3/lib/python3.9/multiprocessing/resource_tracker.py:216: UserWarning: resource_tracker: There appear to be 1 leaked semaphore objects to clean up at shutdown
warnings.warn('resource_tracker: There appear to be %d '
```

Versions

1.12.0.dev20220519
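Until this is fixed, one workaround is to select `mps` only when the backend reports itself both built and available, and otherwise pin to CPU. A minimal sketch (the `pick_device` helper name is my own, not a PyTorch API; `torch.backends.mps` only exists in 1.12+/nightly builds):

```python
import importlib.util

def pick_device(prefer_mps: bool = True) -> str:
    """Return "mps" only when the backend is built and available, else "cpu"."""
    if prefer_mps and importlib.util.find_spec("torch") is not None:
        import torch
        mps = getattr(torch.backends, "mps", None)  # absent on torch < 1.12
        if mps is not None and mps.is_built() and mps.is_available():
            return "mps"
    return "cpu"

# While the buffer-size bug persists, force CPU explicitly:
print(pick_device(prefer_mps=False))  # cpu
```

With the repro above, `device = torch.device(pick_device(prefer_mps=False))` avoids the crash at the cost of running on CPU.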

@albanD albanD added module: memory usage PyTorch is using more memory than it should, or it is leaking memory triaged This issue has been looked at by a team member, and triaged and prioritized into an appropriate module module: mps Related to Apple Metal Performance Shaders framework labels May 19, 2022
@glenn-jocher

glenn-jocher commented May 20, 2022

Same issue with YOLOv5 on MPS noted in #77748 (comment). I see `buffer is not large enough. Must be 25600 bytes`.


@XinBow99

Same issue.

[Screenshot: 2022-05-22 8:21:14 PM]

@razarmehr
Collaborator

This issue is fixed in PR #78496 (nightly build 1.13.0.dev20220531 or later).

facebook-github-bot pushed a commit that referenced this issue Jun 1, 2022
…#78496)

Summary:
Fixes #78247, #77886

Pull Request resolved: #78496
Approved by: https://github.com/albanD, https://github.com/malfet

Test Plan: contbuild & OSS CI, see https://hud.pytorch.org/commit/pytorch/pytorch/017b0ae9431ae3780a4eb9bf6d8865dfcd02cd92

Reviewed By: seemethere

Differential Revision: D36784418

Pulled By: seemethere

fbshipit-source-id: 3558273f2fa3342f4e028fe1186c733bc5a370a8
@crayon7442

Same issue with YOLOv5 on device "mps"
MPSNDArray.mm:782: failed assertion `[MPSNDArray, initWithBuffer:descriptor:] Error: buffer is not large enough. Must be 19200 bytes' #78492

@crayon7442

> This issue is fixed in PR #78496 (nightly build 1.13.0.dev20220531 or later).

This issue has not been fixed.

@glenn-jocher

@jerjer1223 @GerardWalsh can you please reinstall nightly and see if this resolves ultralytics/yolov5#8102

malfet pushed a commit that referenced this issue Jun 7, 2022
@csmetzner

csmetzner commented Jun 8, 2022

Hello, I am still receiving this error. What do I have to do to resolve this issue? Thanks.

pytorch nightly version: torch-1.13.0.dev20220607

/AppleInternal/Library/BuildRoots/b6051351-c030-11ec-96e9-3e7866fcf3a1/Library/Caches/com.apple.xbs/Sources/MetalPerformanceShaders/MPSCore/Types/MPSNDArray.mm:782: failed assertion `[MPSNDArray, initWithBuffer:descriptor:] Error: buffer is not large enough. Must be 160000 bytes

@jerjer1223

jerjer1223 commented Jun 8, 2022

I'm also still receiving this error, a fix would be appreciated.

PyTorch version 1.13.0.dev20220607

Fusing layers...
YOLOv5s summary: 213 layers, 7225885 parameters, 0 gradients
/AppleInternal/Library/BuildRoots/b6051351-c030-11ec-96e9-3e7866fcf3a1/Library/Caches/com.apple.xbs/Sources/MetalPerformanceShaders/MPSCore/Types/MPSNDArray.mm:782: failed assertion `[MPSNDArray, initWithBuffer:descriptor:] Error: buffer is not large enough. Must be 25600 bytes

@GerardWalsh

GerardWalsh commented Jun 8, 2022

@glenn-jocher

> can you please reinstall nightly and see if this resolves ultralytics/yolov5#8102

No, it does not:

```
python yolov5/detect.py --source 0 --device='mps'
detect: weights=yolov5/yolov5s.pt, source=0, data=yolov5/data/coco128.yaml, imgsz=[640, 640], conf_thres=0.25, iou_thres=0.45, max_det=1000, device=mps, view_img=False, save_txt=False, save_conf=False, save_crop=False, nosave=False, classes=None, agnostic_nms=False, augment=False, visualize=False, update=False, project=yolov5/runs/detect, name=exp, exist_ok=False, line_thickness=3, hide_labels=False, hide_conf=False, half=False, dnn=False
YOLOv5 πŸš€ v6.1-246-g2dd3db0 Python-3.8.13 torch-1.13.0.dev20220607 MPS

Fusing layers... 
YOLOv5s summary: 213 layers, 7225885 parameters, 0 gradients
1/1: 0...  Success (inf frames 1280x720 at 30.00 FPS)

/AppleInternal/Library/BuildRoots/b6051351-c030-11ec-96e9-3e7866fcf3a1/Library/Caches/com.apple.xbs/Sources/MetalPerformanceShaders/MPSCore/Types/MPSNDArray.mm:782: failed assertion `[MPSNDArray, initWithBuffer:descriptor:] Error: buffer is not large enough. Must be 25600 bytes
'
zsh: abort      python yolov5/detect.py --source 0 --device='mps'
Collecting environment information...
PyTorch version: 1.13.0.dev20220607
Is debug build: False
CUDA used to build PyTorch: None
ROCM used to build PyTorch: N/A

OS: macOS 12.4 (arm64)
GCC version: Could not collect
Clang version: 13.1.6 (clang-1316.0.21.2.5)
CMake version: Could not collect
Libc version: N/A

Python version: 3.8.13 | packaged by conda-forge | (default, Mar 25 2022, 06:04:14)  [Clang 12.0.1 ] (64-bit runtime)
Python platform: macOS-12.4-arm64-arm-64bit
Is CUDA available: False
CUDA runtime version: No CUDA
GPU models and configuration: No CUDA
Nvidia driver version: No CUDA
cuDNN version: No CUDA
HIP runtime version: N/A
MIOpen runtime version: N/A
Is XNNPACK available: True

Versions of relevant libraries:
[pip3] numpy==1.22.4
[pip3] torch==1.13.0.dev20220607
[pip3] torchvision==0.14.0.dev20220608
[conda] numpy                     1.22.4           py38he1fcd3f_0    conda-forge
[conda] pytorch                   1.13.0.dev20220607         py3.8_0    pytorch-nightly
[conda] torchvision               0.14.0.dev20220603          pypi_0    pypi
```

@giuseppebrb

giuseppebrb commented Jun 12, 2022

Same issue when trying to run YOLOv5s with mps on an M1 Pro.

YOLOv5 πŸš€ 2022-6-12 Python-3.9.12 torch-1.13.0.dev20220612 MPS

/AppleInternal/Library/BuildRoots/b6051351-c030-11ec-96e9-3e7866fcf3a1/Library/Caches/com.apple.xbs/Sources/MetalPerformanceShaders/MPSCore/Types/MPSNDArray.mm:782: failed assertion [MPSNDArray, initWithBuffer:descriptor:] Error: buffer is not large enough. Must be 19200 bytes

@mclean-connor

mclean-connor commented Jun 13, 2022

Same issue when running with Stable-Baselines3 Contrib RecurrentPPO.

/AppleInternal/Library/BuildRoots/b6051351-c030-11ec-96e9-3e7866fcf3a1/Library/Caches/com.apple.xbs/Sources/MetalPerformanceShaders/MPSCore/Types/MPSNDArray.mm:782: failed assertion `[MPSNDArray, initWithBuffer:descriptor:] Error: buffer is not large enough. Must be 576 bytes
'

@Darkfeast

YOLOv5 πŸš€ v6.1-253-g75bbaa8 Python-3.10.4 torch-1.13.0.dev20220616 MPS


/AppleInternal/Library/BuildRoots/b6051351-c030-11ec-96e9-3e7866fcf3a1/Library/Caches/com.apple.xbs/Sources/MetalPerformanceShaders/MPSCore/Types/MPSNDArray.mm:782: failed assertion `[MPSNDArray, initWithBuffer:descriptor:] Error: buffer is not large enough. Must be 25600 bytes

glenn-jocher added a commit to ultralytics/yolov5 that referenced this issue Jul 2, 2022
Require explicit request for MPS, i.e.
```bash
python detect.py --device mps
```

Reverts #8210 for preferring MPS if available. 

Note that torch MPS is experiencing ongoing compatibility issues in pytorch/pytorch#77886
@glenn-jocher

glenn-jocher commented Jul 2, 2022

I confirm I'm experiencing the same YOLOv5 Apple MPS bug with torch 1.12 on MacBook M1: Error: buffer is not large enough. Must be 25600 bytes

```
glennjocher@Glenns-MacBook-Air yolov5 % python detect.py --device mps

detect: weights=yolov5s.pt, source=data/images, data=data/coco128.yaml, imgsz=[640, 640], conf_thres=0.25, iou_thres=0.45, max_det=1000, device=mps, view_img=False, save_txt=False, save_conf=False, save_crop=False, nosave=False, classes=None, agnostic_nms=False, augment=False, visualize=False, update=False, project=runs/detect, name=exp, exist_ok=False, line_thickness=3, hide_labels=False, hide_conf=False, half=False, dnn=False
YOLOv5 πŸš€ v6.1-386-g858a1a3 Python-3.9.13 torch-1.12.0 MPS

Fusing layers... 
YOLOv5s summary: 213 layers, 7225885 parameters, 0 gradients
/AppleInternal/Library/BuildRoots/b6051351-c030-11ec-96e9-3e7866fcf3a1/Library/Caches/com.apple.xbs/Sources/MetalPerformanceShaders/MPSCore/Types/MPSNDArray.mm:782: failed assertion `[MPSNDArray, initWithBuffer:descriptor:] Error: buffer is not large enough. Must be 25600 bytes
```

@glenn-jocher

glenn-jocher commented Jul 2, 2022

> This issue is fixed in PR #78496 (nightly build 1.13.0.dev20220531 or later).

@albanD @razarmehr is this error supposed to exist in torch 1.12? I see the fix was in a 1.13 nightly.

@albanD
Collaborator

albanD commented Jul 4, 2022

Hi,

I'm not sure if this made it into the release, no.
If you're using MPS a lot, I would recommend using the nightly, though, as we made quite a few fixes that didn't make it into 1.12.
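As a rough way to check whether a given build postdates the nightly that was reported to carry the fix (1.13.0.dev20220531, per the earlier comment), one can compare the dev-date suffix of the version string. A sketch only; the helper names are hypothetical, and as the thread shows, some later nightlies still exhibited the crash:

```python
def nightly_date(version: str):
    """Extract the YYYYMMDD suffix from a nightly version string such as
    '1.13.0.dev20220607'; return None for stable builds like '1.12.0'."""
    marker = ".dev"
    if marker not in version:
        return None
    return version.split(marker, 1)[1][:8]

def postdates_reported_fix(version: str) -> bool:
    """True if the build is a nightly dated 2022-05-31 or later (PR #78496)."""
    d = nightly_date(version)
    return d is not None and d >= "20220531"

print(postdates_reported_fix("1.13.0.dev20220607"))  # True
print(postdates_reported_fix("1.12.0"))              # False
```

The date comparison works as plain string comparison because the suffix is fixed-width YYYYMMDD.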

@malfet
Contributor

malfet commented Jul 4, 2022

> I'm not sure if this made it into the release, no. If you're using MPS a lot, I would recommend using the nightly, though, as we made quite a few fixes that didn't make it into 1.12.

Hmm #78496 was picked into release branch as e3e7531

@malfet malfet reopened this Jul 4, 2022
@malfet
Contributor

malfet commented Jul 4, 2022

Reopening to investigate whether it still crashes on trunk and, if it does not, why PyTorch 1.12 is still affected.

Shivvrat pushed a commit to Shivvrat/epic-yolov5 that referenced this issue Jul 12, 2022
Require explicit request for MPS, i.e.
```bash
python detect.py --device mps
```

Reverts ultralytics#8210 for preferring MPS if available. 

Note that torch MPS is experiencing ongoing compatibility issues in pytorch/pytorch#77886
@daniwnwd

I'm still getting the issue on 1.13.0.dev20220712

Is there any fix?

@DenisVieriu97
Collaborator

@glenn-jocher, @daniwnwd this should be fixed in the latest PyTorch nightly (1.13.0.dev20220722). Could you please let me know if you are still seeing the issue on your end?

@crayon7442

@DenisVieriu97
NotImplementedError: The operator 'aten::index.Tensor_out' is not current implemented for the MPS device. #82034

@glenn-jocher

@DenisVieriu97 I confirm that the original `buffer is not large enough` error is now resolved in the latest nightly.

YOLOv5 inference still fails on operator 'aten::index.Tensor_out' is not current implemented for the MPS device as mentioned by @crayon7442, but that's a separate issue, so I believe this issue can be closed now.
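For the separate `aten::index.Tensor_out` gap, nightlies of this era also offer a CPU-fallback escape hatch via the `PYTORCH_ENABLE_MPS_FALLBACK` environment variable. A minimal sketch; the flag must be in the environment before `torch` is imported:

```python
import os

# With this flag set, ops not yet implemented for MPS run on the CPU
# (with a performance warning) instead of raising NotImplementedError.
# It must be set before torch is imported.
os.environ.setdefault("PYTORCH_ENABLE_MPS_FALLBACK", "1")

# import torch  # only import torch after the flag is set
```

Equivalently from the shell: `PYTORCH_ENABLE_MPS_FALLBACK=1 python detect.py --device mps`.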

@DenisVieriu97
Collaborator

Thanks a lot @crayon7442 and @glenn-jocher for checking this!
`index.Tensor_out` is already part of https://github.com/kulinseth/pytorch, and we hope to get it into PyTorch master soon.

@glenn-jocher

@DenisVieriu97 awesome! Thanks for the update.

ctjanuhowski pushed a commit to ctjanuhowski/yolov5 that referenced this issue Sep 8, 2022
Require explicit request for MPS, i.e.
```bash
python detect.py --device mps
```

Reverts ultralytics#8210 for preferring MPS if available. 

Note that torch MPS is experiencing ongoing compatibility issues in pytorch/pytorch#77886
@astrowonk

astrowonk commented Oct 3, 2022

I'm on the latest nightly, torch-1.13.0.dev20221003, and still getting this error. Could it be because `aten::repeat_interleave.self_int` fell back to the CPU?

```
/usr/local/opt/miniforge3/lib/python3.9/site-packages/whisper/decoding.py:628: UserWarning: The operator 'aten::repeat_interleave.self_int' is not currently supported on the MPS backend and will fall back to run on the CPU. This may have performance implications. (Triggered internally at /Users/runner/work/pytorch/pytorch/pytorch/aten/src/ATen/mps/MPSFallback.mm:11.)
  audio_features = audio_features.repeat_interleave(self.n_group, dim=0)
/AppleInternal/Library/BuildRoots/a0876c02-1788-11ed-b9c4-96898e02b808/Library/Caches/com.apple.xbs/Sources/MetalPerformanceShaders/MPSCore/Types/MPSNDArray.mm:782: failed assertion `[MPSNDArray, initWithBuffer:descriptor:] Error: buffer is not large enough. Must be 201452 bytes
'
```

@glenn-jocher

@astrowonk that's not an error, that's a warning. As the message states, not all torch ops are fully converted to MPS yet.

@astrowonk

> @astrowonk that's not an error, that's a warning. As the message states, not all torch ops are fully converted to MPS yet.

I think you have to side-scroll to see everything, @glenn-jocher; the formatting wasn't great. "Error: buffer is not large enough. Must be 201452 bytes" is there on the last line.

And then the kernel dies (in IPython or Jupyter).

@kulinseth
Collaborator

@astrowonk , can you create a new issue with the network and command line and we will take a look. Thanks.

@glenn-jocher

@astrowonk ah yes, I stand corrected!

@astrowonk

> @astrowonk, can you create a new issue with the network and command line and we will take a look. Thanks.

@kulinseth Opened #86152

@joannercsheppard

I just had this issue and found the solution. It turns out the error occurred because my macOS was too old and not compatible with PyTorch MPS. If nothing else is working, try updating the operating system.
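Following up on the macOS-version point: PyTorch's MPS backend requires macOS 12.3 or newer, so a quick OS sanity check might look like the sketch below (the helper name is hypothetical; it returns False on non-Mac systems):

```python
import platform

def mps_os_supported() -> bool:
    """True when running on macOS 12.3+, the minimum for the MPS backend."""
    ver = platform.mac_ver()[0]  # e.g. "12.4"; empty string on non-Mac systems
    if not ver:
        return False
    parts = ver.split(".")
    major = int(parts[0])
    minor = int(parts[1]) if len(parts) > 1 else 0
    return (major, minor) >= (12, 3)

print(mps_os_supported())
```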
