Conversation

@kdashg commented Mar 26, 2020

spec/index.bs Outdated

Embed a copy of |size| bytes of |source|, starting at |sourceOffset|, into the {{GPUCommandEncoder}}.
Encode a command into the {{GPUCommandEncoder}} that copies |size| bytes of data from the embedded copy to offset |destinationOffset| of another {{GPUBuffer}} |destination|.
|size| must be less than or equal to 65536 bytes.
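A rough sketch of how the quoted command might be used from JavaScript; the method name `updateBuffer`, its parameter order, and the `device`/`uniformBuffer` objects are assumptions for illustration, not part of this PR's visible text:

```ts
declare const device: GPUDevice;
declare const uniformBuffer: GPUBuffer; // created with GPUBufferUsage.COPY_DST

// Hypothetical usage sketch of the proposed inline-update command. Only the
// semantics come from the quoted spec text: embed at most 65536 bytes into
// the encoder, then copy them into the destination buffer at the given
// offset when the command buffer executes.
const matrices = new Float32Array(32); // e.g. two 4x4 matrices, 128 bytes

const encoder = device.createCommandEncoder();
// Assumed signature: updateBuffer(destination, destinationOffset, source, sourceOffset, size)
(encoder as any).updateBuffer(uniformBuffer, 0, matrices.buffer, 0, matrices.byteLength);
device.queue.submit([encoder.finish()]);
```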
Possibly implementable with Metal's set{,Vertex,Fragment}Bytes which are limited to (or recommended to be under?) 4k.

setBytes is only transient. It's basically for setting uniforms OpenGL-style.

You're right. I guess those are only useful for a push-constant-like feature.

@Kangz commented Mar 27, 2020

Apart from being closer to Vulkan, I'm not sure what the improvement is compared to writeBuffer (sketched below):

  • writeBuffer is less constrained.
  • writeBuffer doesn't require allocating storage for the data for an unknown duration; the data can simply be streamed.
  • writeBuffer doesn't make GPUCommandEncoders hold on to data in addition to commands.
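For comparison, a minimal sketch of the writeBuffer path in the JS API; buffer creation is omitted, and `device` and `uniformBuffer` are assumed to exist:

```ts
declare const device: GPUDevice;
declare const uniformBuffer: GPUBuffer; // created with GPUBufferUsage.COPY_DST

// GPUQueue.writeBuffer copies the data at call time, so the caller can reuse
// or free the ArrayBuffer immediately and the implementation is free to
// stream the bytes however it likes.
const data = new Float32Array(16);
device.queue.writeBuffer(
  uniformBuffer, // destination buffer
  0,             // destination offset in bytes
  data           // source data; copied before writeBuffer returns
);
```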

@kvark commented Mar 27, 2020

That approach is very similar to #154, which we discussed at length at the San Diego F2F. The points about returning ArrayBuffer objects obviously don't apply to this one, but the concerns about lifetimes mentioned here by @Kangz still do.

@Kangz commented Apr 2, 2020

The WebGPU API we have at the moment has the nice property of increasing in complexity only as you try to extract the most from it, while working well enough if you stick to the simple things. writeBuffer is another step in that direction, but inlineUpdateBuffer isn't, in my opinion.

inlineUpdateBuffer can cause the application to break in the wild even though it worked on the developer's machine, because some data suddenly became big enough. @jdashg mentioned in the call that this is similar to the max texture size limits, but I think there are two differences: inlineUpdateBuffer doesn't have a limit, and texture sizes don't tend to vary as much as, say, the size of the model data that's put in an inlineUpdateBuffer.

Also, inlineUpdateBuffer is the simple option that developers will be looking to use, and the natural thing to do when they want to upload more data will be to call it multiple times, which will result in surprisingly bad performance. Sure, we can have console warnings, but for an important use case like this one it's just better to give developers an option that works well enough and doesn't have footguns.

Also inlineUpdateBuffer essentially forces 4 copies:

  • JS ArrayBuffer to shmem
  • shmem to a temporary allocation that lives next to the GPU-process object for a GPUCommandBuffer
  • that allocation to staging memory
  • staging memory to the final place

Due to its streaming and "instantaneous" nature, writeBuffer skips the temporary allocation, getting down to 3 copies, and the data can even be streamed into GPU-visible shmem, reducing it to 2 copies.
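For cases where even writeBuffer's remaining copy matters, one existing lower-copy path is to write directly into a mapped staging buffer and record a GPU-side copy. A minimal sketch, assuming `vertexBuffer` and `vertexData` exist and sizes are 4-byte aligned:

```ts
declare const device: GPUDevice;
declare const vertexBuffer: GPUBuffer;  // created with GPUBufferUsage.COPY_DST
declare const vertexData: Float32Array; // byteLength assumed to be a multiple of 4

// Write straight into CPU-visible mapped memory, then copy on the GPU timeline.
const staging = device.createBuffer({
  size: vertexData.byteLength,
  usage: GPUBufferUsage.COPY_SRC,
  mappedAtCreation: true,
});
new Float32Array(staging.getMappedRange()).set(vertexData);
staging.unmap();

const encoder = device.createCommandEncoder();
encoder.copyBufferToBuffer(staging, 0, vertexBuffer, 0, vertexData.byteLength);
device.queue.submit([encoder.finish()]);
```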

For example, updating model-view and projection matrices before or interleaved with rendering of a scene.
When these uploads are small, it's viable to inline the update data into the command buffer.
This does require more copies than other upload paths, but for small data sizes this overhead is negligible.
Implementations are expected to warn against using this for medium-to-large buffer updates (e.g. >64k).
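A minimal sketch of the use case the quoted note describes, reusing the hypothetical `updateBuffer` encoder method from above; the pipeline, bind group, and render-pass setup are assumed to exist:

```ts
declare const device: GPUDevice;
declare const mvpBuffer: GPUBuffer; // uniform buffer referenced by bindGroup
declare const pipeline: GPURenderPipeline;
declare const bindGroup: GPUBindGroup;
declare const renderPassDescriptor: GPURenderPassDescriptor;

// Per-frame: inline the freshly computed matrices into the command stream,
// then render with them.
const mvp = new Float32Array(16); // model-view-projection matrix for this frame

const encoder = device.createCommandEncoder();
(encoder as any).updateBuffer(mvpBuffer, 0, mvp.buffer, 0, mvp.byteLength); // hypothetical
const pass = encoder.beginRenderPass(renderPassDescriptor);
pass.setPipeline(pipeline);
pass.setBindGroup(0, bindGroup);
pass.draw(36); // e.g. a cube
pass.end();
device.queue.submit([encoder.finish()]);
```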
Where do we draw the line for "suboptimal" behavior? I.e., what if an application updates 64k across multiple different buffers? What if it updates 64k of data in the same buffer? Does it matter whether the updated range is the same? Etc.

For example, updating model-view and projection matrices before or interleaved with rendering of a scene.
When these uploads are small, it's viable to inline the update data into the command buffer.
This does require more copies than other upload paths, but for small data sizes this overhead is negligible.
Implementations are expected to warn against using this for medium-to-large buffer updates (e.g. >64k).
The problem with warnings here is that this size could easily be unknown at build/development time. Say the developer loads a mesh and updates some vertices using this function, and everything works on their machine. But then later a user loads a bigger mesh, and not only do they get warning spam, the app also animates suspiciously slowly because of how many copies the data needs to go through (i.e. 4 on this path, as estimated by @Kangz).


In Vulkan, this is similar to |vkCmdUpdateBuffer|.
In D3D12, implementations can leverage |ID3D12GraphicsCommandList2::WriteBufferImmediate|.
Metal might use |makeBuffer(bytesNoCopy:length:options:deallocator:)| around some section of shared command buffer serialization memory.
I wonder how this would work in practice. It requires page-size alignment for both the pointer and the size, and also:

The existing memory allocation must be covered by a single VM region, typically allocated with vm_allocate or mmap. Memory allocated by malloc is specifically disallowed.

@Kangz commented May 20, 2020

Closing now that #708 has landed.

@Kangz closed this May 20, 2020

@kdashg (author) commented May 20, 2020

We may still want this later.

@kdashg reopened this May 20, 2020
@kvark changed the base branch from master to main on June 23, 2020
@kainino0x added this to the Polish post-V1 milestone on Aug 25, 2022
@kainino0x marked this pull request as draft on Aug 25, 2022
ben-clayton pushed a commit to ben-clayton/gpuweb that referenced this pull request on Sep 6, 2022
@kainino0x modified the milestones: Polish post-V1, Milestone 2? on Aug 15, 2023
@kainino0x added the "api WebGPU API" label on May 21, 2024
@kainino0x modified the milestones: Polish post-V1, Milestone 1 on Jul 2, 2024